Data Version Control, or DVC (http://dataversioncontrol.com), is an open-source project that makes data science and machine learning projects reproducible and shareable. It automatically builds a data dependency graph (a DAG) and shares code through Git and data through cloud storage (AWS S3, GCP), all within a single DVC environment.
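
To make the idea of a data dependency graph concrete, here is a minimal Python sketch. It is not DVC's implementation or API. The file names and the helpers `reproduction_order` and `stale_outputs` are hypothetical, and the sketch only assumes that each output records the inputs it was built from. It uses the standard library's `graphlib` (Python 3.9+) to order the steps and to find what must be rebuilt after a change.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical pipeline: each output maps to the set of inputs it depends on.
# These file names are illustrative and are not taken from DVC itself.
dependencies = {
    "data/clean.csv": {"data/raw.csv", "clean.py"},
    "model.pkl": {"data/clean.csv", "train.py"},
    "metrics.json": {"model.pkl", "evaluate.py"},
}


def reproduction_order(graph):
    """Return the derived outputs in an order that respects dependencies."""
    order = TopologicalSorter(graph).static_order()
    # Keep only nodes that are produced by a step (the keys of the graph);
    # source files such as scripts and raw data are inputs, not outputs.
    return [node for node in order if node in graph]


def stale_outputs(graph, changed):
    """Return every output that depends, directly or transitively, on `changed`."""
    stale = set()
    for node in reproduction_order(graph):
        # An output is stale if any input is the changed file or is itself stale.
        if graph[node] & ({changed} | stale):
            stale.add(node)
    return stale


print(reproduction_order(dependencies))
print(sorted(stale_outputs(dependencies, "train.py")))
```

In this sketch, changing `train.py` marks `model.pkl` and `metrics.json` as stale while leaving `data/clean.csv` untouched, which is the kind of selective reproduction a dependency graph makes possible.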