Hi there Is there current "best practice" for maintaining a collection of reference genome data in a Galaxy server? I.e. filling in the dbkeys.loc and all_fasta.loc table and associated data. I know of the following discussions on this topic:
1. The data integration page: https://galaxyproject.org/admin/data-integration/ 2. The data managers page: https://galaxyproject.org/admin/tools/data-managers/ 3. This biostars answer: https://biostar.usegalaxy.org/p/7176/ 4. This page on custom genomes that seems to be user / history oriented: https://galaxyproject.org/learn/custom-genomes/ 5. This page on rsyncing from the Galaxy reference data collection: https://galaxyproject.org/admin/use-galaxy-rsync/ - quick test gives me rsync error though 6. This guide to using the human reference data: https://biostar.usegalaxy.org/p/14777/ 7. I hear the usegalaxy.org reference collection is available via cvmfs - I think this was discussed in a Galaxy Admins meetup at some point. None of this is comprehensive. There are 2 sets of questions for me (in my role as Galaxy admin): 1. If I want to make reference genomes and common indices (like for HISAT and BWA) available for e.g. human and mouse, what is the best way to do this? 2. If I want to add a genome for a non-model organism (e.g. M. tuberculosis or L. calcarifer) available, what is the best way to do this? Which data manager should or could I use? Thanks, Peter
___________________________________________________________ Please keep all replies on the list by using "reply all" in your mail client. To manage your subscriptions to this and other Galaxy lists, please use the interface at: https://lists.galaxyproject.org/ To search Galaxy mailing lists use the unified search at: http://galaxyproject.org/search/