Yann Büchau :python: · @nobodyinperson
199 followers · 850 posts · Server fosstodon.org

Yesterday I submitted my first manuscript where I managed, synced and backup up all the data and files with :gitannex: and @datalad.

I use 's for across machines (data analysis and environments).

A very reassuring feeling being able to work on it anywhere and to know all files are securely backed up in a decentralized manner.

Also, reusing individual parts is possible. I'll use this workflow from now on 💪

#phdlife #gitannex #Apptainer #reproducibility #texlatex #DataLad #reproducibleresearch

Last updated 1 year ago

Patrick Anker :neovim: 🫔 · @psanker
115 followers · 696 posts · Server hachyderm.io

@winnie git-annex is pretty much just a symlink manager, but unlike git you don’t check the files themselves into version control (saving you the problem of checking out a multi-gigabyte repo if you just wanted to work on the code). However, you can set up retention policies e.g. there must be X remotes that have the actual data. Great system built to be fault tolerant.
Coupled with , you can manage all sorts of nested annexes with different ACLs. It’s great.

#DataLad

Last updated 2 years ago

@jcolomb Yes, is great: for those that don't know, it uses git-annex under the hood and makes it more usable (though I must say, I haven't used it before). It's also funded a lot of work on git-annex.

Should we recommend DataLad as the way to get started for researchers/RDM? When would someone need to go straight to git-annex?

datalad.org/

#DataLad

Last updated 2 years ago

Julien Colomb · @jcolomb
487 followers · 457 posts · Server nerdculture.de

@SciCompAalto @rkdarst

Hey cool information (especially the video), mostly for data managers and RSE who needs to understand the tool background, though.

For researchers, I do present how to work with datalad and/or GIN (set rules about what gets annexed, use simple commands later on).

Note that we want something like git, not only for large files, but also for many files: therefore the use of submodules in .

#DataLad

Last updated 2 years ago

Yann Büchau :python: · @nobodyinperson
133 followers · 452 posts · Server fosstodon.org

The above advantages make an ideal candidate for . It's directly supported by @datalad (see the `datalad-container` extension) and is a bliss to use compared to in my opinion. Synchronizing the images with or works very well. Overall very impressed how straight-forward can be! 👍

#Apptainer #reproducibleresearch #docker #gitannex #DataLad #containerization

Last updated 2 years ago

Lennart Wittkuhn · @lnnrtwttkhn
371 followers · 28 posts · Server fediscience.org

What are your favorite resources for learning with and @datalad?

We will soon start preparing teaching materials (which will all be shared openly) for a full-semester course on "Version control of code and data using Git and DataLad" at University of Hamburg (project with @nicoschuck) supported by a grant awarded by the Digital and Data Literacy in Teaching Lab 🎉

#ddlitlab #DataLad #git #versioncontrol

Last updated 2 years ago

Yann Büchau :python: · @nobodyinperson
111 followers · 393 posts · Server fosstodon.org

Looks like @datalad is the next thing I'll look into. Using for reproducible workflows, exactly what I need right now!

datalad.org/

#gitannex #science #dataanalysis #DataLad #reproducibleresearch #git

Last updated 2 years ago

Lennart Wittkuhn · @lnnrtwttkhn
87 followers · 6 posts · Server fediscience.org

@Russpoldrack For my talks on , , etc. (made with xaringan package), I set up a „gallery page“ (based on ) that separately displays slides of all previous talks. Past talks are „archived“ to a separate folder in the same repo. The CI that deploys to gh-pages builds all slide decks + the gallery page. Not super happy with this because I already had to fix broken links in past slides (still need to change it) lennartwittkuhn.com/talk-rdm/

#rmarkdown #rstats #DataLad #git #rdm

Last updated 2 years ago

John Goerzen · @jgoerzen
713 followers · 2218 posts · Server floss.social

@joeyh I just stumbled across . I guess it's built on git-annex. I can't find a description anywhere of what its purpose in life is, or how that differs from git-annex. Do you have any insight on that?

#DataLad

Last updated 2 years ago

Csepp 🌢 · @csepp
1523 followers · 9401 posts · Server merveilles.town

Stumbled upon while learning . Looks like it's built for big scientific datasets, but I wonder, could it be used by artists? 🤔

#gitannex #DataLad

Last updated 4 years ago