Re: [galaxy-dev] Regarding galaxy wrappers: lumi, DESeq edgeR

2013-07-08 Thread Ross
(Sachit, please don't cross-post to the dev and user lists?)
There's a beta version of an edgeR/deseq/voom wrapper and a companion htseq
based count matrix maker in the test toolshed statistics section owned by
me - fubar. Definitely not quite ready for production but installable and
I'd appreciate feedback from anyone brave enough to try them out. Best
installed on a test instance you plan on destroying because the R 3.0
dependency is about to swap over to Bjorns better one - should be done
later this week when I get some time.

Be warned it takes a long time to auto-install - 10-15 minutes or so on my
laptop.


On Mon, Jul 8, 2013 at 3:36 PM, Sachit Adhikari sachit.techner...@gmail.com
 wrote:

 Hi,

 These are the R libraries. Have anyone written a wrapper for these
 libraries lumi, DESeq edgeR?

 Thanks,

 Sachit

 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   http://lists.bx.psu.edu/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/




-- 
Ross Lazarus MBBS MPH;
Head, Medical Bioinformatics, BakerIDI; Tel: +61 385321444
http://scholar.google.com/citations?hl=enuser=UCUuEM4J
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

[galaxy-dev] Galaxy Server Processes Dying?

2013-07-08 Thread Dan Sullivan
Hi, Galaxy Developers,

I have what I'm hoping is a fairly simple inquiry for the Galaxy community; 
basically, our production Galaxy server processes appear to be dying off over 
time.  Our production Galaxy instance implements apache web scaling features so 
I have a number of server processes, for example my apache Apache configuration 
has:

BalancerMember http://127.0.0.1:8080
BalancerMember http://127.0.0.1:8081
BalancerMember http://127.0.0.1:8082
BalancerMember http://127.0.0.1:8083
BalancerMember http://127.0.0.1:8084
BalancerMember http://127.0.0.1:8085

Nothing unconventional as I understand it.  Similarly, my galaxy config has 
matching [server:ws3], [server:ws2] configuration blocks for each of these 
processes.  When I restart Galaxy, everything is all fine and good.  I'll see a 
server listening on each one of these ports (if I do something like lsof -i TCP 
-P, for example).  What appears to be happening, is that for whatever reason, 
these server processes seem to die off over time (i.e eventually nothing is 
listening on ports 8080-8085).  This process can take days, and at the time 
when no servers are available, Apache will begin throwing 503 service 
unavailable errors.   I am fairly confident this process is gradual, for 
example I just checked now and the Galaxy was still available, however one 
server had died (the one on TCP port 8082).  I do do have a single separate job 
manager and two job handlers; at this point I believe this problem to be 
related to the servers only (i.e. the job manager and job handlers do not app!
 ear to be crashing).

Now, I believe that late last week I might have 'caught' the last server 
process dying, just by coincidence, although I am not 100% certain.  Here is 
the Traceback as it occurred:

galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:47:12,011 (6822/39485.sc01) PBS job 
state changed from Q to R
galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:54:36,565 (6822/39485.sc01) PBS job 
state changed from R to C
galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:54:36,566 (6822/39485.sc01) PBS job 
has completed successfully
galaxy.jobs DEBUG 2013-07-02 08:54:36,685 Tool did not define exit code or 
stdio handling; checking stderr for success
galaxy.datatypes.metadata DEBUG 2013-07-02 08:54:36,812 loading metadata from 
file for: HistoryDatasetAssociation 6046
galaxy.jobs DEBUG 2013-07-02 08:54:38,153 job 6822 ended
galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:54:49,130 (6812/39473.sc01) PBS job 
state changed from R to E
galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:54:52,267 (6812/39473.sc01) PBS job 
state changed from E to C
galaxy.jobs.runners.pbs ERROR 2013-07-02 08:54:52,267 (6812/39473.sc01) PBS job 
failed: Unknown error: -11
galaxy.jobs.runners ERROR 2013-07-02 08:54:52,267 (unknown) Unhandled exception 
calling fail_job
Traceback (most recent call last):
  File /group/galaxy/galaxy-dist/lib/galaxy/jobs/runners/__init__.py, line 
58, in run_next
method(arg)
  File /group/galaxy/galaxy-dist/lib/galaxy/jobs/runners/pbs.py, line 560, in 
fail_job
if pbs_job_state.stop_job:
AttributeError: 'AsynchronousJobState' object has no attribute 'stop_job'

Now, I have some questions regarding this issue;

1) It appears to me that although this is a sub-optimal solution, restarting 
Galaxy solves this problem (i.e. server processes will be listening after 
restarting Galaxy).   Is it possible, or safe, or sane to just restart a single 
server on a singe port?  Ideally I would actually like to fix the problem that 
is causing my server processes to crash, although I figured it wouldn't hurt to 
ask this question regardless.
2) Similar to the question above, is it possible to configure Galaxy in a way 
that server processes re-spawn  in a self-service manner (i.e. is this a 
feature of Galaxy, for example, because server processes dying regularly is 
either a known issue or expected and tolerable (but undesired) behaivor)?
3) To me, the error messages above aren't very meaningful, other than the 
Traceback appears to be PBS-related.  Would anybody be able comment on the 
problem above (i.e. have you seen something like this), or comment on Galaxy 
server processes dying in general?  I have done some brief searching of the 
Galaxy mailing list for server crashes and did not find anything suggesting 
this is a common problem.
4) I am not 100% confident at this point that the Traceback above is what 
killed the server process.  Does anybody know of a specific string I can search 
for (a literal) to identify when a server process actually dies?  I have done 
some basic review of log data (our Galaxy server generates lots of logs), and 
Traceback does not appear to be a valid string to uniquely identify a server 
crash (they occur too frequently).  I currently have logging configured at 
DEBUG.

In case this is relevant, I am using the following change set for Galaxy:
 hg parents
changeset:   9320:47ddf167c9f1
branch:  stable
tag: tip
user:Nate Coraor 

Re: [galaxy-dev] Regarding galaxy wrappers: lumi, DESeq edgeR

2013-07-08 Thread Sachit Adhikari
Sorry about that Ross. I would try your wrapper of edgeR and would
happily report you the bug. Can you make me the link of Galaxy Test
Toolshed?

Thanks,

Sachit


On Mon, Jul 8, 2013 at 11:51 AM, Ross ross.laza...@gmail.com wrote:

 (Sachit, please don't cross-post to the dev and user lists?)
 There's a beta version of an edgeR/deseq/voom wrapper and a companion
 htseq based count matrix maker in the test toolshed statistics section
 owned by me - fubar. Definitely not quite ready for production but
 installable and I'd appreciate feedback from anyone brave enough to try
 them out. Best installed on a test instance you plan on destroying because
 the R 3.0 dependency is about to swap over to Bjorns better one - should be
 done later this week when I get some time.

 Be warned it takes a long time to auto-install - 10-15 minutes or so on my
 laptop.


 On Mon, Jul 8, 2013 at 3:36 PM, Sachit Adhikari 
 sachit.techner...@gmail.com wrote:

 Hi,

 These are the R libraries. Have anyone written a wrapper for these
 libraries lumi, DESeq edgeR?

 Thanks,

 Sachit

 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   http://lists.bx.psu.edu/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/




 --
 Ross Lazarus MBBS MPH;
 Head, Medical Bioinformatics, BakerIDI; Tel: +61 385321444
 http://scholar.google.com/citations?hl=enuser=UCUuEM4J

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Regarding galaxy wrappers: lumi, DESeq edgeR

2013-07-08 Thread Ross
Hi Sachit,

It's packaged up so it installs from the test toolshed directly via the
admin interface of your test instance, but you'll need a good reliable
network as there is a lot that gets downloaded and installed.

Like other toolshed tools, it would be possible but not advisable to
download and unpack the archive at
http://testtoolshed.g2.bx.psu.edu/view/fubar/edger_test and install
manually but your instance will need R 3.0.1 with
bioconductor/edgeR/deseq/limma installed - if it's your first try, it will
probably take some time and effort to read all the relevant docs to do that
or the R/BioC mailing lists can provide assistance if you need it.

The edgeR tool requires a count matrix (reads per sample per contig) as
input which the htseq tool
http://testtoolshed.g2.bx.psu.edu/view/fubar/htseq_bams_to_count_matrix will
generate from any number of aligned bam/sam files and a GTF file with the
appropriate gene model. Again, automatic installation from the toolshed is
highly recommended, otherwise you'll need to manually install the tool and
the dependencies.

Note that edgeR requires replicate samples in each comparison group so the
absolute minimum is 4 bam files as input to the count matrix maker. p
values from analyses without replicates are not interpretable in any
biologically meaningful way and are meaningless in terms of the usual
frequentist interpretation in my opinion.


On Mon, Jul 8, 2013 at 4:28 PM, Sachit Adhikari sachit.techner...@gmail.com
 wrote:

 Sorry about that Ross. I would try your wrapper of edgeR and would
 happily report you the bug. Can you make me the link of Galaxy Test
 Toolshed?

 Thanks,

 Sachit


 On Mon, Jul 8, 2013 at 11:51 AM, Ross ross.laza...@gmail.com wrote:

 (Sachit, please don't cross-post to the dev and user lists?)
 There's a beta version of an edgeR/deseq/voom wrapper and a companion
 htseq based count matrix maker in the test toolshed statistics section
 owned by me - fubar. Definitely not quite ready for production but
 installable and I'd appreciate feedback from anyone brave enough to try
 them out. Best installed on a test instance you plan on destroying because
 the R 3.0 dependency is about to swap over to Bjorns better one - should be
 done later this week when I get some time.

 Be warned it takes a long time to auto-install - 10-15 minutes or so on
 my laptop.


 On Mon, Jul 8, 2013 at 3:36 PM, Sachit Adhikari 
 sachit.techner...@gmail.com wrote:

 Hi,

 These are the R libraries. Have anyone written a wrapper for these
 libraries lumi, DESeq edgeR?

 Thanks,

 Sachit

 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   http://lists.bx.psu.edu/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/




 --
 Ross Lazarus MBBS MPH;
 Head, Medical Bioinformatics, BakerIDI; Tel: +61 385321444
 http://scholar.google.com/citations?hl=enuser=UCUuEM4J





-- 
Ross Lazarus MBBS MPH;
Head, Medical Bioinformatics, BakerIDI; Tel: +61 385321444
http://scholar.google.com/citations?hl=enuser=UCUuEM4J
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] [galaxy-user] Regarding galaxy wrappers: lumi, DESeq edgeR

2013-07-08 Thread Björn Grüning
Hi Sachit,

we are working currently on a DESeq2 wrapper. Not yet finised sorry.
There is also a edgeR Test repository from Ross Lazarus under:

http://testtoolshed.g2.bx.psu.edu/view/fubar/edger_test

Work is in progress, help is welcome!
Ciao,
Bjoenr

 Hi,
 
 
 These are the R libraries. Have anyone written a wrapper for these
 libraries lumi, DESeq edgeR? 
 
 
 Thanks,
 
 
 Sachit
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:
 
   http://lists.bx.psu.edu/listinfo/galaxy-dev
 
 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:
 
   http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
 
   http://galaxyproject.org/search/mailinglists/



___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] [galaxy-user] Regarding galaxy wrappers: lumi, DESeq edgeR

2013-07-08 Thread Sachit Adhikari
Thank you Björn  and Ross. I will install the tools from the admin
interface.


On Mon, Jul 8, 2013 at 12:35 PM, Björn Grüning 
bjoern.gruen...@pharmazie.uni-freiburg.de wrote:

 Hi Sachit,

 we are working currently on a DESeq2 wrapper. Not yet finised sorry.
 There is also a edgeR Test repository from Ross Lazarus under:

 http://testtoolshed.g2.bx.psu.edu/view/fubar/edger_test

 Work is in progress, help is welcome!
 Ciao,
 Bjoenr

  Hi,
 
 
  These are the R libraries. Have anyone written a wrapper for these
  libraries lumi, DESeq edgeR?
 
 
  Thanks,
 
 
  Sachit
  ___
  The Galaxy User list should be used for the discussion of
  Galaxy analysis and other features on the public server
  at usegalaxy.org.  Please keep all replies on the list by
  using reply all in your mail client.  For discussion of
  local Galaxy instances and the Galaxy source code, please
  use the Galaxy Development list:
 
http://lists.bx.psu.edu/listinfo/galaxy-dev
 
  To manage your subscriptions to this and other Galaxy lists,
  please use the interface at:
 
http://lists.bx.psu.edu/
 
  To search Galaxy mailing lists use the unified search at:
 
http://galaxyproject.org/search/mailinglists/




___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Galaxy Server Processes Dying?

2013-07-08 Thread Ross
Hi Dan,
That's old code. Updating will probably help.
Logging level just takes disk space, but just in case you haven't followed
http://wiki.galaxyproject.org/Admin/Config/Performance/ProductionServer?action=showredirect=Admin%2FConfig%2FPerformance
leaving
debug = True uncommented used to fill all available server process RAM
eventually AFAIK.
If not already done, try
# Debug enables access to various config options useful for development and
# debugging: use_lint, use_profile, use_printdebug and use_interactive.  It
# also causes the files used by PBS/SGE (submission script, output, and
error)
# to remain on disk after the job is complete.  Debug mode is disabled if
# commented, but is uncommented by default in the sample config.
# debug = True

Hope this helps?


On Mon, Jul 8, 2013 at 4:26 PM, Dan Sullivan dansulli...@gmail.com wrote:

 Hi, Galaxy Developers,

 I have what I'm hoping is a fairly simple inquiry for the Galaxy
 community; basically, our production Galaxy server processes appear to be
 dying off over time.  Our production Galaxy instance implements apache web
 scaling features so I have a number of server processes, for example my
 apache Apache configuration has:

 BalancerMember http://127.0.0.1:8080
 BalancerMember http://127.0.0.1:8081
 BalancerMember http://127.0.0.1:8082
 BalancerMember http://127.0.0.1:8083
 BalancerMember http://127.0.0.1:8084
 BalancerMember http://127.0.0.1:8085

 Nothing unconventional as I understand it.  Similarly, my galaxy config
 has matching [server:ws3], [server:ws2] configuration blocks for each of
 these processes.  When I restart Galaxy, everything is all fine and good.
  I'll see a server listening on each one of these ports (if I do something
 like lsof -i TCP -P, for example).  What appears to be happening, is that
 for whatever reason, these server processes seem to die off over time (i.e
 eventually nothing is listening on ports 8080-8085).  This process can take
 days, and at the time when no servers are available, Apache will begin
 throwing 503 service unavailable errors.   I am fairly confident this
 process is gradual, for example I just checked now and the Galaxy was still
 available, however one server had died (the one on TCP port 8082).  I do do
 have a single separate job manager and two job handlers; at this point I
 believe this problem to be related to the servers only (i.e. the job
 manager and job handlers do not app!
  ear to be crashing).

 Now, I believe that late last week I might have 'caught' the last server
 process dying, just by coincidence, although I am not 100% certain.  Here
 is the Traceback as it occurred:

 galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:47:12,011 (6822/39485.sc01)
 PBS job state changed from Q to R
 galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:54:36,565 (6822/39485.sc01)
 PBS job state changed from R to C
 galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:54:36,566 (6822/39485.sc01)
 PBS job has completed successfully
 galaxy.jobs DEBUG 2013-07-02 08:54:36,685 Tool did not define exit code
 or stdio handling; checking stderr for success
 galaxy.datatypes.metadata DEBUG 2013-07-02 08:54:36,812 loading metadata
 from file for: HistoryDatasetAssociation 6046
 galaxy.jobs DEBUG 2013-07-02 08:54:38,153 job 6822 ended
 galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:54:49,130 (6812/39473.sc01)
 PBS job state changed from R to E
 galaxy.jobs.runners.pbs DEBUG 2013-07-02 08:54:52,267 (6812/39473.sc01)
 PBS job state changed from E to C
 galaxy.jobs.runners.pbs ERROR 2013-07-02 08:54:52,267 (6812/39473.sc01)
 PBS job failed: Unknown error: -11
 galaxy.jobs.runners ERROR 2013-07-02 08:54:52,267 (unknown) Unhandled
 exception calling fail_job
 Traceback (most recent call last):
   File /group/galaxy/galaxy-dist/lib/galaxy/jobs/runners/__init__.py,
 line 58, in run_next
 method(arg)
   File /group/galaxy/galaxy-dist/lib/galaxy/jobs/runners/pbs.py, line
 560, in fail_job
 if pbs_job_state.stop_job:
 AttributeError: 'AsynchronousJobState' object has no attribute 'stop_job'

 Now, I have some questions regarding this issue;

 1) It appears to me that although this is a sub-optimal solution,
 restarting Galaxy solves this problem (i.e. server processes will be
 listening after restarting Galaxy).   Is it possible, or safe, or sane to
 just restart a single server on a singe port?  Ideally I would actually
 like to fix the problem that is causing my server processes to crash,
 although I figured it wouldn't hurt to ask this question regardless.
 2) Similar to the question above, is it possible to configure Galaxy in a
 way that server processes re-spawn  in a self-service manner (i.e. is this
 a feature of Galaxy, for example, because server processes dying regularly
 is either a known issue or expected and tolerable (but undesired) behaivor)?
 3) To me, the error messages above aren't very meaningful, other than the
 Traceback appears to be PBS-related.  Would anybody be able comment on the
 problem above (i.e. have you 

Re: [galaxy-dev] gzipped fastq reader

2013-07-08 Thread Peter Cock
On Thu, Jul 4, 2013 at 9:49 PM, Robert Baertsch
robert.baert...@gmail.com wrote:
 Dan,
 Do these readers support gzip files?

reader = fastqVerboseErrorReader
 reader = fastqReader

Presumably you are writing a Python script using this library?
The answer is a qualified yes. Instead of passing them a normal
file handle using open(example.fastq) you instead use
gzip.open(example.fastq) via import gzip.

 Do I have to define a special type in galaxy for gzipped files or will the 
 fastq type be ok?


This needs a special file format - but you are not the first person to
look at this, some groups have defined custom gzipped variants of
the FASTQ formats within their own Galaxy instances. I've not
done this but there should be some useful emails in the archive.

Note you'd also need to modify any tool definitions to that they
can accept a gzipped FASTQ file.

 Ideally, I would like to keep my files zipped and not have galaxy unzip them, 
 since they triple in size when unzipped.

 I'm happy to do a push request if you don't support this but I want to make 
 sure I'm in line with your roadmap.

Personally I would like a more general system in Galaxy for
potentially any file type to be held compressed in a range of
formats (e.g. using gzip, bgzf, xy, bz2, etc), with exclusions
for things like BAM which are already compressed. This way
naive tools would get the gzipped file file uncompressed to a
temporary folder before use (i.e. no change for the tool wrapper),
but if a tool accepts a gzipped file it will get that (less disk IO
and CPU usage, but requires updating tool wrappers).

That idea is quite ambitious through ;)

 I have written a simple tool to convert Illumina fastq to mapsplice fastq. 
 Does that already exist already somewhere?


I don't know.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] [GSoC2013] Week 2 Accomplishments and Week 3 Plans

2013-07-08 Thread Saket Choudhary
Week 3 and Week 4 Plans:
http://galaxy-gsoc2013.blogspot.in/2013/07/week-3-accomplishments-and-week-4-plans.html

On 6 July 2013 16:46, Saket Choudhary sake...@gmail.com wrote:
 Here is a short post explaining the data model. Not ver elaborately
 written, so incase you you find a mistake or something poorly
 explained , please drop a comment.

 http://galaxy-gsoc2013.blogspot.in/2013/07/understanding-datamodel-of-galaxy.html

 On 1 July 2013 16:06, Saket Choudhary sake...@gmail.com wrote:
 Not much accomplished this week. I plan to write one more blogspot
 detailing the datamodel I learnt about galaxy.

 http://galaxy-gsoc2013.blogspot.com/2013/07/week-2-accomplishments-and-week-3-plans.html

 On 24 June 2013 14:11, Saket Choudhary sake...@gmail.com wrote:
 Hi Everyone,


 Just wrote a blog post for my Week1 tasks and Week 2 to-dos:

 http://galaxy-gsoc2013.blogspot.in/2013/06/week-1-accomplishments-and-week-2-plans.html


 I am maintaining a work log in this doc :
 https://docs.google.com/document/d/1VCKCpKW6C_Tt1DYWi43GAN_Jo-nebkkj8Ctq7WLIZFM/edit


 If you have any suggestions for the VCF4.1 issue or otherwise. please do let
 me know.

 Thanks

 Saket Choudhary
 Undergraduate
 Dept. of Chemical Engineering
 IIT Bombay

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] CloudMan Error

2013-07-08 Thread greg
Hi guys,

Any thoughts on this?  I'm kind of stuck.

(Even some pointers on where to look for more clues would be extremely helpful.)

Thanks,

Greg

On Fri, Jul 5, 2013 at 11:10 AM, greg margeem...@gmail.com wrote:
 Hi guys,

 I'm hitting an error using CloudMan using the Share-an-Instance
 option.  It says:

 Error creating volume from shared cluster's snapshot
 '['snap-cfa775ba']': 'filesystems'.

 Also disk stats says 0 /0 and the Applications light is yellow while
 the data light is green.

 I'm using the share string
 cm-808d863548acae7c2328c39a90f52e29/shared/2012-09-17--19-47

 It's always worked in the past.

 Thanks,

 Greg

 Here's the full log:

 14:58:18 - Master starting
 14:58:20 - Completed the initial cluster startup process. This is a
 new cluster; waiting to configure the type.
 14:58:24 - Migration service prerequisites OK; starting the service
 14:58:24 - SGE service prerequisites OK; starting the service
 14:58:31 - Setting up SGE...
 14:58:51 - HTCondor service prerequisites OK; starting the service
 14:58:51 - HTCondor config file /etc/condor/condor_config not found!
 14:58:59 - Hadoop service prerequisites OK; starting the service
 14:59:48 - Done adding Hadoop service; service running.
 15:01:45 - Error creating volume from shared cluster's snapshot
 '['snap-cfa775ba']': 'filesystems'
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] RFC: Cheetah template in tool_dependencies.xml

2013-07-08 Thread James Taylor
It probably makes sense to specify the template type in this case.
Maybe an additional attribute, language='cheetah' ?

On Sun, Jul 7, 2013 at 1:19 PM, Björn Grüning
bjoern.gruen...@pharmazie.uni-freiburg.de wrote:
 action type=template_command
 #if env.get('X11_LIB_DIR', False) and env.get('X11_INCLUDE_DIR', False):
./configure --prefix=$env.INSTALL_DIR
 --x-includes=$env.X11_INCLUDE_DIR --x-libraries=$env.X11_LIB_DIR
#else:
   ./configure --prefix=$env.INSTALL_DIR
#end if
 /action




--
James Taylor, Assistant Professor, Biology/CS, Emory University

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] RFC: Cheetah template in tool_dependencies.xml

2013-07-08 Thread Björn Grüning
Hi James,

 It probably makes sense to specify the template type in this case.
 Maybe an additional attribute, language='cheetah' ?

My first suggestion was action type=cheetah_command to reflect the
template engine. I'm fine with both. 

 On Sun, Jul 7, 2013 at 1:19 PM, Björn Grüning
 bjoern.gruen...@pharmazie.uni-freiburg.de wrote:
  action type=template_command
  #if env.get('X11_LIB_DIR', False) and env.get('X11_INCLUDE_DIR', False):
 ./configure --prefix=$env.INSTALL_DIR
  --x-includes=$env.X11_INCLUDE_DIR --x-libraries=$env.X11_LIB_DIR
 #else:
./configure --prefix=$env.INSTALL_DIR
 #end if
  /action
 
 
 
 
 --
 James Taylor, Assistant Professor, Biology/CS, Emory University



___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] How were liftover and nib files generated?

2013-07-08 Thread James Taylor
liftOver files come from the UCSC genome browser, derived from
pairwise alignments.

A variety of tools can create nib files from fasta, however we prefer
the use of the twoBit format now, which can be created using
faToTwoBit from the UCSC utilities.

--
James Taylor, Assistant Professor, Biology/CS, Emory University


On Sat, Jul 6, 2013 at 9:19 PM, Perez, Ricardo ricky_...@neo.tamu.edu wrote:
 Dear all,

 When I rsync files from let say the following command:
 rsync -avzP rsync://datacache.g2.bx.psu.edu/indexes/dm3 .

 The system will sync files that are related to genome aligments, liftover, 
 and nibs.
 How did the Galaxy team generated this files?
 The reason I ask is because I would lather will like to generate similar 
 files to custom genomes to be used in galaxy.

 Thank you for your time,
 --Ricardo Perez
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   http://lists.bx.psu.edu/

 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] RFC: Cheetah template in tool_dependencies.xml

2013-07-08 Thread Greg Von Kuster
I'll soon be committing a slightly altered version of your patch which includes 
a language attribute for the template_command action type.

On Jul 8, 2013, at 10:02 AM, Björn Grüning 
bjoern.gruen...@pharmazie.uni-freiburg.de wrote:

 Hi James,
 
 It probably makes sense to specify the template type in this case.
 Maybe an additional attribute, language='cheetah' ?
 
 My first suggestion was action type=cheetah_command to reflect the
 template engine. I'm fine with both. 
 
 On Sun, Jul 7, 2013 at 1:19 PM, Björn Grüning
 bjoern.gruen...@pharmazie.uni-freiburg.de wrote:
 action type=template_command
#if env.get('X11_LIB_DIR', False) and env.get('X11_INCLUDE_DIR', False):
   ./configure --prefix=$env.INSTALL_DIR
 --x-includes=$env.X11_INCLUDE_DIR --x-libraries=$env.X11_LIB_DIR
   #else:
  ./configure --prefix=$env.INSTALL_DIR
   #end if
 /action
 
 
 
 
 --
 James Taylor, Assistant Professor, Biology/CS, Emory University
 
 
 
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/
 


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


[galaxy-dev] Error Uploading Directory of Files

2013-07-08 Thread Nicholas Kline
Hi,

Our lab recently installed a local version of Galaxy on a mid-2012 Mac
Pro computer. We can access the Galaxy server and sign in as an
administrator. Today we tried creating a Data Library, adding a
dataset to it, and uploading a directory of files. We followed this
Galaxy documentation (
http://wiki.galaxyproject.org/Admin/DataLibraries/UploadingLibraryFiles?action=showredirect=Admin%2FData+Libraries%2FUploading+Library+Files
) to setup this feature:

- Admin  Data Library  Add datasets  Upload directory of files
- file format was set to auto-detect
- and we chose the option to link to files instead of copying them

Galaxy confirmed that the files were successfully uploaded. However,
in the data library, under the Message column, is a message in red
saying Job error (click name for more info). Clicking on one of the
uploaded files displays a page with this information:

Date uploaded: 2013-06-28

File size: 7.5 GB

Data type: auto

Build: sacCer2

Miscellaneous information:
Traceback (most recent call last): File
/Users/administrator/galaxy-dist/tools/data_source/upload.py, line
386, in __main__() File
/Users/administrator/galaxy-dist/tools/data_source/upload.py, line
357, in __main__ output_paths =

Job Standard Error
Traceback (most recent call last):
File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
line 386, in
__main__()
File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
line 357, in __main__
output_paths = parse_outputs( sys.argv[4:] )
File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
line 64, in parse_outputs
id, files_path, path = arg.split( ':', 2 )
ValueError: need more than 1 value to unpack
error

Database/Build: sacCer2
Number of data lines: None
Disk file: /Volumes/G-SPEED Q/data/Person 2012 project/DCP2 mef.fastq

Questions:
1. Should we be concerned about this error?
2. If so, what is the right way to fix it?
3. If not, how do we remove the red error message next to each file:
Job error (click name for more info) ?

Thank you
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] Error Uploading Directory of Files

2013-07-08 Thread Peter Cock
On Mon, Jul 8, 2013 at 3:36 PM, Nicholas Kline nx...@case.edu wrote:
 Hi,

 Our lab recently installed a local version of Galaxy on a mid-2012 Mac
 Pro computer. We can access the Galaxy server and sign in as an
 administrator. Today we tried creating a Data Library, adding a
 dataset to it, and uploading a directory of files. We followed this
 Galaxy documentation (
 http://wiki.galaxyproject.org/Admin/DataLibraries/UploadingLibraryFiles?action=showredirect=Admin%2FData+Libraries%2FUploading+Library+Files
 ) to setup this feature:

 - Admin  Data Library  Add datasets  Upload directory of files
 - file format was set to auto-detect
 - and we chose the option to link to files instead of copying them

 Galaxy confirmed that the files were successfully uploaded. However,
 in the data library, under the Message column, is a message in red
 saying Job error (click name for more info). Clicking on one of the
 uploaded files displays a page with this information:

 Date uploaded: 2013-06-28

 File size: 7.5 GB

 Data type: auto

 Build: sacCer2

 Miscellaneous information:
 Traceback (most recent call last): File
 /Users/administrator/galaxy-dist/tools/data_source/upload.py, line
 386, in __main__() File
 /Users/administrator/galaxy-dist/tools/data_source/upload.py, line
 357, in __main__ output_paths =

 Job Standard Error
 Traceback (most recent call last):
 File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 386, in
 __main__()
 File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 357, in __main__
 output_paths = parse_outputs( sys.argv[4:] )
 File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 64, in parse_outputs
 id, files_path, path = arg.split( ':', 2 )
 ValueError: need more than 1 value to unpack
 error

 Database/Build: sacCer2
 Number of data lines: None
 Disk file: /Volumes/G-SPEED Q/data/Person 2012 project/DCP2 mef.fastq


I have a hunch that the problem could be related to the spaces
in the input filename. In this case perhaps the upload.py script
is being called without the filename being quoted or escaped.
Could you retry from a nice space free import directory?

Also does the path where you have installed Galaxy have
spaces? Even if Galaxy itself copes, it is highly likely that
one or more underlying tools wrapped for use within Galaxy
would fail.

[I've had minor problems with this with various tools on the
Mac outside of Galaxy usage]

Regards,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


[galaxy-dev] ToolShed dependencies KeyError: 'invalid_repository_dependencies'

2013-07-08 Thread Peter Cock
Hi Greg,

Previously my workflows had hard coded dependencies on the main
Tool Shed, which currently is a bit of a problem on the Test Tool Shed.

Therefore I'm update my Tool Shed workflows using a
repository_dependencies.xml with implicit repository and revision fields
(i.e. the latest revision within the current ToolShed).

It worked for one example but not the other - this turned out to be my
own error, but the Tool Shed did not handle it gracefully. Perhaps
this class of error could be caught instead? Thanks!



This worked, secreted_protein_workflow, Revision: 2:aecb871dfe4c
http://testtoolshed.g2.bx.psu.edu/view/peterjc/secreted_protein_workflow

Input:

?xml version=1.0?
repositories description=This requires my SignalP and TMHMM wrapers,
and my FASTA filtering tool.
!-- Revision 15:6abd809cefdd on the main tool shed is v0.2.4, the
current latest - but older should be OK --
repository name=tmhmm_and_signalp owner=peterjc /
!-- Revision 2:abdd608c869b on the main tool shed is v0.0.5, the
current latest - but older should be OK --
repository name=seq_filter_by_id owner=peterjc /
/repositories



This failed, rxlr_venn_workflow, Revision: 3:65d174589251
http://testtoolshed.g2.bx.psu.edu/view/peterjc/rxlr_venn_workflow

On uploading the tar-ball I got this error in the browser:

quote
Internal Server Error
Galaxy was unable to sucessfully complete your request

URL: 
http://testtoolshed.g2.bx.psu.edu/upload/upload?repository_id=20a9f7389ced5696
Module galaxy.web.framework.middleware.error:149 in __call__
  app_iter = self.application(environ, sr_checker)
Module paste.debug.prints:106 in __call__
  environ, self.app)
Module paste.wsgilib:543 in intercept_output
  app_iter = application(environ, replacement_start_response)
Module paste.recursive:84 in __call__
  return self.application(environ, start_response)
Module paste.httpexceptions:633 in __call__
  return self.application(environ, start_response)
Module galaxy.web.framework.base:132 in __call__
  return self.handle_request( environ, start_response )
Module galaxy.web.framework.base:190 in handle_request
  body = method( trans, **kwargs )
Module galaxy.web.framework:98 in decorator
  return func( self, trans, *args, **kwargs )
Module galaxy.webapps.tool_shed.controllers.upload:219 in upload
  invalid_repository_dependencies_message = 
 repository_dependency_util.generate_message_for_invalid_repository_dependencies(
  metadata_dict )
Module tool_shed.util.repository_dependency_util:252 in
generate_message_for_invalid_repository_dependencies
  invalid_repository_dependencies = invalid_repository_dependencies_dict[ 
 'invalid_repository_dependencies' ]
KeyError: 'invalid_repository_dependencies'
extra data

full traceback

text version

This may be an intermittent problem due to load or other unpredictable
factors, reloading the page may address the problem.
/quote

Input repository_dependencies.xml where for the tmhmm_and_signalp entry
I am missing the 'n' in name:

?xml version=1.0?
repositories description=This requires my SignalP and TMHMM wrapers,
and my FASTA filtering tool.
!-- Revision 15:6abd809cefdd on the main tool shed is v0.2.4, the
current latest - but older should be OK --
repository ame=tmhmm_and_signalp owner=peterjc /
!-- Revision 2:abdd608c869b on the main tool shed is v0.0.5, the
current latest - but older should be OK --
repository name=seq_filter_by_id owner=peterjc /
!-- Revisiion 2:c96bef0643dc on the main tool shed is v0.0.3, the
current latest --
repository name=venn_list owner=peterjc /
/repositories

Output repository_dependencies.xml with mangled tmhmm_and_signalp entry:

?xml version=1.0?
repositories description=This requires my SignalP and TMHMM wrapers,
and my FASTA filtering tool.
!-- Revision 15:6abd809cefdd on the main tool shed is v0.2.4, the
current latest - but older should be OK --
repository ame=tmhmm_and_signalp owner=peterjc
toolshed=http://testtoolshed.g2.bx.psu.edu; /
!-- Revision 2:abdd608c869b on the main tool shed is v0.0.5, the
current latest - but older should be OK --
repository changeset_revision=66d1ca92fb38
name=seq_filter_by_id owner=peterjc
toolshed=http://testtoolshed.g2.bx.psu.edu; /
!-- Revisiion 2:c96bef0643dc on the main tool shed is v0.0.3, the
current latest --
repository changeset_revision=51fe47a5a803 name=venn_list
owner=peterjc toolshed=http://testtoolshed.g2.bx.psu.edu; /
/repositories

I corrected the ame/name error, and re-uploaded a fresh tar-ball which worked,
Revision: 4:25e0ffd4f762

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified 

Re: [galaxy-dev] RFC: Cheetah template in tool_dependencies.xml

2013-07-08 Thread Greg Von Kuster
Hello Björn,

Thanks very much for the patch - I've committed alsightly alterered verson of 
it in changetset 10204:afec613d77fa which is currently running on the test tool 
shed.

Greg Von Kuster

On Jul 7, 2013, at 6:59 PM, Björn Grüning 
bjoern.gruen...@pharmazie.uni-freiburg.de wrote:

 Hi Greg,
 
 bitbucket has a downtime so I have attached a patch. I will create
 tomorrow a pull request if I find time.
 
 It will work as described and we have now a new variable (__is64bit__)
 that we can use to determine if python is running under 64bit (assuming
 that the rest is also 64bit-ish).
 
 Attached is a ported version of emboss as test case (Dave B wrapper).
 
 Have a nice evening!
 Bjoern
 
 
 Hi Björn,
 
 
 This is great.  A pull request or a patch is fine.  Whatever one is
 most convenient for you.  The code doesn't have to be pretty, just
 functional.  We can always pretty it up later.
 
 
 Thanks very much for doing this!
 
 
 Greg
 
 On Jul 7, 2013, at 1:19 PM, Björn Grüning
 bjoern.gruen...@pharmazie.uni-freiburg.de wrote:
 
 Hi Greg,
 
 Hello Björn,
 
 This would be great!  I'm not sure about the type attribute - action 
 type=cheetah_command may be ok, but maybe a more generic type like 
 action type=template_command would be better.  
 I'm not quite sure, so go with what you feel is best.
 I come up with the following:
 
 action type=template_command
#if env.get('X11_LIB_DIR', False) and env.get('X11_INCLUDE_DIR',
 False):
   ./configure --prefix=$env.INSTALL_DIR --x-includes=
 $env.X11_INCLUDE_DIR --x-libraries=$env.X11_LIB_DIR
   #else:
  ./configure --prefix=$env.INSTALL_DIR
   #end if
 /action
 
 $env is the global access point to the env variables and its working
 fine for the few repositories I tested it.
 I had hoped to get it a little bit prettier but fabric and cheetah
 doesn't work well together.
 
 If that is still satisfying I would clean up my code and create a
 pull request [1]. 
 What do you think? 
 
 Cheers,
 Bjoern
 
 [1] or patch, I heard rumours that you do not like pull
 requests ;)? 
 
 Thanks for looking into this!
 
 Greg Von Kuster
 
 
 On Jul 7, 2013, at 5:50 AM, Björn Grüning 
 bjoern.gruen...@pharmazie.uni-freiburg.de wrote:
 
 Hi,
 
 I was thinking how to tackle the build failures like the atlas library
 on OS-X or the ones from xlib and ease the writing of the shell_command
 a little bit.
 
 Dave B did some fancy shell scripting in 'package_emboss_5_0_0' and I
 was also sometimes forced to use some bash tricks.
 
 What about to make that more easy and powerful, and as a plus more
 consistent with the tool-command tag, and use cheetah for that kind of
 work.
 
 Something like that could work easily:
 
 #if ${os.environ}.getVar('X11_LIB_DIR', None):
   ./configure ... --x-includes=
 #else:
   ./configure
 #end if
 
 Also we could define some reserved variables like $__is64__ and cheetah
 evaluate that. $INSTALL_DIR and $REPOSITORY_DIR can also be a cheetah
 variable and we do not need to replace it later manually.
 
 What do you think any comments?
 If no one has any objections I can try to implement an action
 type=cheetah_command.
 
 Thanks,
 Björn
 
 
 
 
 
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
 http://galaxyproject.org/search/mailinglists/
 
 
 
 
 tool_dependencies.xmltemplate_command.patch___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


[galaxy-dev] Setting up reference genomes for GATK Genotyper

2013-07-08 Thread Marco Ocana
Hi,

I am running an instance of Galaxy that I setup using Cloudman.
We are trying to use the GATK Unified Genotyper, but we are missing the
reference genomes.
I saw in your website instructions on how to setup reference genomes for
some other tools, but not for the Unified Genotyper.
Where can I find instructions on how to setup reference genomes for this
tool?

Thanks

Marco
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Error Uploading Directory of Files

2013-07-08 Thread Björn Grüning
Hi Nicholas,

as pointed out by Peter and here:

http://www.biostars.org/p/76021/

please check your white spaces and provide a little bit more
information.

Btw. what is wrong with biostar? Why the double post?
Cheers,
Bjoern 

 Hi,
 
 Our lab recently installed a local version of Galaxy on a mid-2012 Mac
 Pro computer. We can access the Galaxy server and sign in as an
 administrator. Today we tried creating a Data Library, adding a
 dataset to it, and uploading a directory of files. We followed this
 Galaxy documentation (
 http://wiki.galaxyproject.org/Admin/DataLibraries/UploadingLibraryFiles?action=showredirect=Admin%2FData+Libraries%2FUploading+Library+Files
 ) to setup this feature:
 
 - Admin  Data Library  Add datasets  Upload directory of files
 - file format was set to auto-detect
 - and we chose the option to link to files instead of copying them
 
 Galaxy confirmed that the files were successfully uploaded. However,
 in the data library, under the Message column, is a message in red
 saying Job error (click name for more info). Clicking on one of the
 uploaded files displays a page with this information:
 
 Date uploaded: 2013-06-28
 
 File size: 7.5 GB
 
 Data type: auto
 
 Build: sacCer2
 
 Miscellaneous information:
 Traceback (most recent call last): File
 /Users/administrator/galaxy-dist/tools/data_source/upload.py, line
 386, in __main__() File
 /Users/administrator/galaxy-dist/tools/data_source/upload.py, line
 357, in __main__ output_paths =
 
 Job Standard Error
 Traceback (most recent call last):
 File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 386, in
 __main__()
 File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 357, in __main__
 output_paths = parse_outputs( sys.argv[4:] )
 File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 64, in parse_outputs
 id, files_path, path = arg.split( ':', 2 )
 ValueError: need more than 1 value to unpack
 error
 
 Database/Build: sacCer2
 Number of data lines: None
 Disk file: /Volumes/G-SPEED Q/data/Person 2012 project/DCP2 mef.fastq
 
 Questions:
 1. Should we be concerned about this error?
 2. If so, what is the right way to fix it?
 3. If not, how do we remove the red error message next to each file:
 Job error (click name for more info) ?
 
 Thank you
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
   http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/



___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] ToolShed dependencies KeyError: 'invalid_repository_dependencies'

2013-07-08 Thread Greg Von Kuster
Hello Peter,

Thanks for reporting this - I have committed a fix in changeset 
10206:86efa5ac1fae, which is currently running on the test tool shed.

Greg Von Kuster

On Jul 8, 2013, at 11:35 AM, Peter Cock p.j.a.c...@googlemail.com wrote:

 Hi Greg,
 
 Previously my workflows had hard coded dependencies on the main
 Tool Shed, which currently is a bit of a problem on the Test Tool Shed.
 
 Therefore I'm update my Tool Shed workflows using a
 repository_dependencies.xml with implicit repository and revision fields
 (i.e. the latest revision within the current ToolShed).
 
 It worked for one example but not the other - this turned out to be my
 own error, but the Tool Shed did not handle it gracefully. Perhaps
 this class of error could be caught instead? Thanks!
 
 
 
 This worked, secreted_protein_workflow, Revision: 2:aecb871dfe4c
 http://testtoolshed.g2.bx.psu.edu/view/peterjc/secreted_protein_workflow
 
 Input:
 
 ?xml version=1.0?
 repositories description=This requires my SignalP and TMHMM wrapers,
 and my FASTA filtering tool.
!-- Revision 15:6abd809cefdd on the main tool shed is v0.2.4, the
 current latest - but older should be OK --
repository name=tmhmm_and_signalp owner=peterjc /
!-- Revision 2:abdd608c869b on the main tool shed is v0.0.5, the
 current latest - but older should be OK --
repository name=seq_filter_by_id owner=peterjc /
 /repositories
 
 
 
 This failed, rxlr_venn_workflow, Revision: 3:65d174589251
 http://testtoolshed.g2.bx.psu.edu/view/peterjc/rxlr_venn_workflow
 
 On uploading the tar-ball I got this error in the browser:
 
 quote
 Internal Server Error
 Galaxy was unable to sucessfully complete your request
 
 URL: 
 http://testtoolshed.g2.bx.psu.edu/upload/upload?repository_id=20a9f7389ced5696
 Module galaxy.web.framework.middleware.error:149 in __call__
 app_iter = self.application(environ, sr_checker)
 Module paste.debug.prints:106 in __call__
 environ, self.app)
 Module paste.wsgilib:543 in intercept_output
 app_iter = application(environ, replacement_start_response)
 Module paste.recursive:84 in __call__
 return self.application(environ, start_response)
 Module paste.httpexceptions:633 in __call__
 return self.application(environ, start_response)
 Module galaxy.web.framework.base:132 in __call__
 return self.handle_request( environ, start_response )
 Module galaxy.web.framework.base:190 in handle_request
 body = method( trans, **kwargs )
 Module galaxy.web.framework:98 in decorator
 return func( self, trans, *args, **kwargs )
 Module galaxy.webapps.tool_shed.controllers.upload:219 in upload
 invalid_repository_dependencies_message = 
 repository_dependency_util.generate_message_for_invalid_repository_dependencies(
  metadata_dict )
 Module tool_shed.util.repository_dependency_util:252 in
 generate_message_for_invalid_repository_dependencies
 invalid_repository_dependencies = invalid_repository_dependencies_dict[ 
 'invalid_repository_dependencies' ]
 KeyError: 'invalid_repository_dependencies'
 extra data
 
 full traceback
 
 text version
 
 This may be an intermittent problem due to load or other unpredictable
 factors, reloading the page may address the problem.
 /quote
 
 Input repository_dependencies.xml where for the tmhmm_and_signalp entry
 I am missing the 'n' in name:
 
 ?xml version=1.0?
 repositories description=This requires my SignalP and TMHMM wrapers,
 and my FASTA filtering tool.
!-- Revision 15:6abd809cefdd on the main tool shed is v0.2.4, the
 current latest - but older should be OK --
repository ame=tmhmm_and_signalp owner=peterjc /
!-- Revision 2:abdd608c869b on the main tool shed is v0.0.5, the
 current latest - but older should be OK --
repository name=seq_filter_by_id owner=peterjc /
!-- Revisiion 2:c96bef0643dc on the main tool shed is v0.0.3, the
 current latest --
repository name=venn_list owner=peterjc /
 /repositories
 
 Output repository_dependencies.xml with mangled tmhmm_and_signalp entry:
 
 ?xml version=1.0?
 repositories description=This requires my SignalP and TMHMM wrapers,
 and my FASTA filtering tool.
!-- Revision 15:6abd809cefdd on the main tool shed is v0.2.4, the
 current latest - but older should be OK --
repository ame=tmhmm_and_signalp owner=peterjc
 toolshed=http://testtoolshed.g2.bx.psu.edu; /
!-- Revision 2:abdd608c869b on the main tool shed is v0.0.5, the
 current latest - but older should be OK --
repository changeset_revision=66d1ca92fb38
 name=seq_filter_by_id owner=peterjc
 toolshed=http://testtoolshed.g2.bx.psu.edu; /
!-- Revisiion 2:c96bef0643dc on the main tool shed is v0.0.3, the
 current latest --
repository changeset_revision=51fe47a5a803 name=venn_list
 owner=peterjc toolshed=http://testtoolshed.g2.bx.psu.edu; /
 /repositories
 
 I corrected the ame/name error, and re-uploaded a fresh tar-ball which worked,
 Revision: 

[galaxy-dev] Connecting to new Galaxy Cloudman by FTP

2013-07-08 Thread Mohammad Heydarian
Hi,
I am having trouble setting up a FTP connection with the recently released
version of Galaxy Cloudman (ami-118bfc78).

I have instantiated the new version of Galaxy Cloudman with
CloudLaunchhttp://usegalaxy.org/cloudlaunch and
also through the AWS EC2 wizard (using the same security group settings as
the previous versions) and neither instance will connect to my FTP
connection.

Has anyone else had this problem? Does anyone know what is preventing the
FTP connection?

Any help would be greatly appreciated.


Cheers,
Mo Heydarian
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] gzipped fastq reader

2013-07-08 Thread Peter Cock
On Mon, Jul 8, 2013 at 10:24 PM, Robert Baertsch
robert.baert...@gmail.com wrote:
 Peter and Dan,
 I like the idea of replacing all open() with galaxy_open() in all tools. You
 can tell the format by looking at the first 4 byes (see C code below from
 the UCSC browser team). Is there some pythonic way of overriding open?

There is monkey patching (replace the current 'open' function with
your modified version), but that is not a good idea in general.

In any case, this would only affect the small number of Python
tools which happen to use the Galaxy parsing libraries - which
is a very small fraction of the tools in Galaxy. Most of the tools
in Galaxy are compiled programs and are entirely separate.

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] gzipped fastq reader

2013-07-08 Thread Peter Cock
On Mon, Jul 8, 2013 at 11:21 PM, Robert Baertsch rbaer...@ucsc.edu wrote:
 I respectfully disagree,  If you want an extensible system, you should
 always wrap primitive system level calls.

 Any tools that opens a file that could be compressed would be affected.
 That is a huge number of tools. Do you really want a cottage industry of
 tools that have different methods of dealing with compression?

But defining a Python helper function within the Galaxy Python
libraries doesn't achieve that.

Are you talking about patching the OS level POSIX open functions
or something? The tools available in Galaxy are written in a range
of languages including C, Perl, R, etc. Yes, some are in Python,
but of those most are independent of Galaxy and can be used
separately from Galaxy.

 Encoding the gzip status in the datatype will create an explosion of
 datatypes. Compression is not actually a datatype, it tells you nothing
 about the content data that is stored in the file.

What we'd previously discussed was a dual system, holding
the file type as now (e.g. FASTA, SAM, GFF3, etc) and any
compression (e.g., None, normal GZIP, BGZF which is a
GZIP variant, BZIP2, etc).

Galaxy tool wrappers currently define input files with a list
of file types - they'd also have to give a list of supported
compression types (defaulting to none). Likewise for any
output files - if they are already compressed the XML for
the tool wrapper would have to tell Galaxy this.

 It is up to the galaxy team to provide a standard way to interact
 with compressed files.

That is my preference too - although this could be driven by
the Galaxy community rather than the core team? I see
defining new datatypes like 'gzippedfastq' as a stop gap
special case (but a very practical route for now).

 My proposed solution, is a very small change that could
 be phased in over time. Any tools that uses open would not support
 compressed files, but they would not break on uncompressed files.

 Do others have an opinion?

Either I don't understand your plan, or it would only help in
a tiny minority of cases.

Regards,

Peter
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/