Hi, Catonano <[email protected]> skribis:
> They are part of a pipeline and they should be versioned too. And sometimes > a pipeline produces a dataset. So there could be packages producing > packages. > > There's this project, DAT, and it seems they are onto something, in this > domain. > > http://dat-data.com/ >From a quick look it seems to me that DAT is primarily focusing on efficient peer-to-peer data distribution, at least in its current form. In that sense, I would say that DAT and Guix would be complementary rather than overlapping in a reproducible science toolbox: Guix could be used to described data sources, build processes, and pipelines, while DAT would take care of retrieving data sets (DAT data sets could be described using ‘origin’ in Guix.) Thanks for sharing! Ludo’.
