Data Version Control Blog

Insights and updates from the DVC team. Explore best practices in data versioning, machine learning workflows, and model management. Stay informed with our latest news, tutorials, and community highlights.
Tutorial: Scalable and Distributed ML Workflows with DVC and Ray on AWS (Part 2)
Need to setup DVC to work with Ray Cluster on AWS? This tutorial has you covered!
  • Mikhail Rozhkov
  • Mar 13, 202416 min read
Tutorial: Scalable and Distributed ML Workflows with DVC and Ray (Part 1)
This tutorial introduces you to integrating DVC (Data Version Control) with Ray, turning them into your go-to toolkit for creating automated, scalable, and distributed ML pipelines.
  • Mikhail Rozhkov
  • Mar 12, 202415 min read
Running DVC on a SLURM cluster
Learn how Exscientia uses DVC experiments on a cloud-deployed SLURM cluster to scale their ML experimentation.
  • Dom Miketa
  • Mar 11, 20248 min read
Tutorial: Automate Data Validation and Model Monitoring Pipelines with DVC and Evidently
Ensuring your machine learning models remain precise and efficient as time progresses, and verifying that your data consistently reflects the real-world scenario.
  • Mikhail Rozhkov
  • Jan 19, 202410 min read
Integrating DVC and Git LFS via libgit2 filters
Read about how we built a Python Git LFS client to support integrating projects which use Git LFS into your DVC workflow.
  • Peter Rowlands
  • Jan 03, 20245 min read
Turn Your Favorite IDE into a Full Machine Learning Experimentation Platform
DVC extension enables you to run, track and manage ML experiments without leaving VS Code.
  • Tapa Dipti Sitaula
  • Nov 16, 20232 min read
Leveraging LLMs in Chatbots: The DVC Approach
Read how DVC can optimize the development process for chatbots built on Large Language Models.
  • Ryan Turner
  • Sep 25, 20236 min read