Data Version Control Blog

Insights and updates from the DVC team. Explore best practices in data versioning, machine learning workflows, and model management. Stay informed with our latest news, tutorials, and community highlights.
Fine-Tuning Large Language Models with a Production-Grade Pipeline
This post describes a production ML pipeline for fine-tuning large language models using DVC, SkyPilot, HuggingFace Transformers, and quantization techniques.
  • Alex Kim
  • Sep 08, 2023
  • 10 min read
Automate model deployment to Amazon SageMaker with the DVC Model Registry
DVC provides a Git-based mechanism to automate model deployment from an intuitive web UI.
  • Tapa Dipti Sitaula
  • Aug 30, 2023
  • 6 min read
The DVC 3.0 Stack: Beyond the Command Line
DVC 3.0 introduces a stack of tools outside the command line to bring it closer to where you work (in code, IDE, web) while also focusing on DVC fundamentals.
  • Dave Berenbaum
  • Jun 14, 2023
  • 4 min read
Managing OpenFOAM Physical Simulations with DVC, CML, and Studio (Part 2)
In this second part, we discuss how to utilize cloud computing resources and visualize simulation data with CML and Iterative Studio.
  • Mikhail Rozhkov
  • May 10, 2023
  • 6 min read
Testing external contributions using GitHub Actions secrets
Learn how to securely test pull requests from open source contributors using GitHub Actions secrets.
  • Helio Machado
  • Apr 20, 2023
  • 2 min read
Managing OpenFOAM Physical Simulations with DVC, CML, and Studio (Part 1)
In the first part of the series, we learn how to use DVC for OpenFOAM simulation experiments and data management.
  • Mikhail Rozhkov
  • Apr 17, 2023
  • 14 min read
Automate Your ML Pipeline: Combining Airflow, DVC, and CML for a Seamless Batch Scoring Experience
This tutorial shows you how to supercharge your batch scoring workflow by harnessing the power of Airflow, DVC, and CML.
  • Mikhail Rozhkov
  • Mar 22, 2023
  • 10 min read
Organize Your Storage with DVC Cloud Versioning
DVC cloud versioning makes it easy to take full advantage of your cloud provider’s built-in versioning capabilities.
  • Dave Berenbaum
  • Feb 22, 2023
  • 5 min read
Real-time visualization of Computer Vision model training with DVC and Iterative Studio
Save time and resources by tracking your deep learning experiments in real-time with DVC and Iterative Studio.
  • Maxim Shmakov
  • Feb 13, 2023
  • 4 min read