Data versioning

Research Data Management
Author

Esther Plomp

Published

October 31, 2022

This post provides information about versioning files.

Version control is important in research projects as it allows for tracking changes that were made and by whom these changes were made. This is especially helpful in collaborative projects where different people can work on the same files simultaneously.

Manual version control

  • Decide how many and which versions to keep

    • For example, milestones such as publishable files or files that can be reviewed by collaborators

    • Examples could be: draft, final

  • Decide on how long to keep these versions

  • Decide on how to organise these versions and how to name them (see file naming)

  • Record the changes made to files and by whom this is done

Automated version control

An automatic version control system stores a history of changes and who made them. This is stored within the same file, so that your directory is not filled up by all the previous versions. Automatic version control can save a lot of time and headaches compared to manual version control, but does need some more technical skill. Git is a tool to version control your work - see The Turing Way on how to get started with Git.

PhD comic on "Final". doc. Where the PhD candidate has a final.doc and then needs to update this several times after feedback from their supervisor. The file name is ending up in "Final_rev.22.comments49.corrections10#@*#WhyDidIComeToGradschool???.doc

More information