Slide 1

Slide 1 text

Machine Learning Experimentation DVC and VS Code Presented by: Gift Ojeabulu

Slide 2

Slide 2 text

Who am I? - Co-founder and community lead for Data Community Africa/DatafestAfrica - Content & Community Support Advocate at Iterative.ai - AWS ML Community Builder & MLOps Lagos Community Lead. - DevNetwork Advisory Board Member (Developer Advocate) - Facilitator of the African Data Community Newsletter. - Technical Writer, Public Speaker, Social Media Content creator, Machine Learning Thought Leader (Global AI Hub),Open-source & Community Advocate. Gift Ojeabulu Iterative.ai @GiftOjeabulu_

Slide 3

Slide 3 text

An Overture - What is MLOps - What is DVC? - CRISP-DM VS ASUM-DM Methodology - Experiment tracking tools. - VS Code Extensions for Machine learning.

Slide 4

Slide 4 text

Lesson Outline ⬥ What is experiment tracking? ⬥ Why is experiment tracking important in machine learning? ⬥ Experimenting workflow with DVC and VS Code ⬥ Project Setup and DVCLive ⬥ Conclusion

Slide 5

Slide 5 text

What is Experiment Tracking?

Slide 6

Slide 6 text

⬥Experiment tracking refers to the practice of systematically recording and managing information about the various experiments and iterations conducted during the development and training of machine learning models. ⬥It is a crucial aspect of managing the machine learning lifecycle and ensuring reproducibility, collaboration, and effective model management. What is Experiment tracking?

Slide 7

Slide 7 text

Why Experiment Tracking?

Slide 8

Slide 8 text

⬥ Experiment tracking is a best practice in machine learning because it promotes transparency, collaboration, and reproducibility. It helps data scientists and teams manage the complexity of machine learning projects, make informed decisions, and deliver more robust and reliable models. Why Experiment tracking?

Slide 9

Slide 9 text

Here are some specific examples of how experiment tracking can be used in machine learning: - A data scientist is developing a new model to predict customer churn. They track their experiments to compare different model architectures and hyperparameters. They also track their results to identify the best performing model (compare different experiment) - A team of data scientists is working on a project to develop a model to detect fraud. They use experiment tracking to share their results with each other and to collaborate on the development of the model (collaborate with others) - An ML Engineer is monitoring a model that is in production. They use experiment tracking to track the performance of the model over time and to identify any potential problems (Track progress overtime)

Slide 10

Slide 10 text

Machine learning without experiment tracking is like sailing without navigation. You might reach your destination, but you'll never know how you got there or how to improve the journey.

Slide 11

Slide 11 text

Experimenting Workflow with DVC and VS Code

Slide 12

Slide 12 text

Experimenting Workflow ⬥ Git: branching and tagging experiments a. create a separate branch & tag for each experiment b. keep failed experiments ⬥ DVC: run pipelines and keep states a. use dvc exp run b. clean up DVC cache for not important experiments ⬥ VS Code: visualize metrics and collaborate a. metrics visualization b. experiments comparison c. collaboration

Slide 13

Slide 13 text

Project Setup

Slide 14

Slide 14 text

Pip or conda DVC init DVC get data DVC add data

Slide 15

Slide 15 text

What Experiment tracking looks like with DVC on VS Code

Slide 16

Slide 16 text

No content

Slide 17

Slide 17 text

⬥ DVCLive is an auto-experiment tracking library that integrates seamlessly with DVC and the DVC VSCode extension. It is framework-specific, meaning it has separate tracking callbacks for many popular ML and DL frameworks

Slide 18

Slide 18 text

DVC Live

Slide 19

Slide 19 text

What have we learned?

Slide 20

Slide 20 text

Conclusion

Slide 21

Slide 21 text

In the world of MLOps, DVC is the secret sauce that makes everything taste better. It keeps our experiments reproducible, our data organized, and our models accountable

Slide 22

Slide 22 text

Links for further learning ◆ Community article on experiment tracking https://pub.towardsai.net/how-to-track-ml-experiments-with-dvc-inside-vsco de-to-boost-your-productivity-a654ace60bab ◆ Article - https://towardsdatascience.com/turn-vs-code-into-a-one-stop-shop-for-ml-exp eriments-49c97c47db27 ◆ Article - https://towardsdatascience.com/the-minimalists-guide-to-experiment-trackin g-with-dvc-f07e4636bdbb ◆ Youtube - https://www.youtube.com/@dvcorg8370/videos ◆ Newsletter - https://www.linkedin.com/pulse/dvc-august-23-community-updates-iterative -ai ◆ Discord - https://discord.gg/N5YzBBuFms

Slide 23

Slide 23 text

Webinar Alert!!!

Slide 24

Slide 24 text

/gift-ojabu /Gift Ojabu GiftOjeabulu_

Slide 25

Slide 25 text

Thank you! Gift Ojeabulu iterative.ai @GiftOjeabulu_