[galaxy-dev] ulimit problems after update

2014-11-10 Thread Ido Tamir
Hi,
I updated to the latest Galaxy distribution (after one year), and now
every job fails with:
/home/imba/solexa/.profile.sh: line 118: ulimit: virtual memory: cannot modify 
limit: Operation not permitted

The limit is ridiculously high:
 ulimit -v 6000

It's just there to prevent some badly programmed in-house Galaxy tools from
crashing the server.
I think the problem happens after the set_metadata stage.
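For what it's worth, the semantics behind that error can be sketched with Python's resource module (the limit value is illustrative): an unprivileged process may lower its soft virtual-memory limit freely, but trying to raise it above the hard limit is what produces "Operation not permitted", for example when the profile is re-sourced inside a job that already runs under a tighter hard limit.

```python
# Sketch of what "ulimit -v" does, via Python's resource module.
# An unprivileged process may move its soft limit freely below the hard
# limit, but attempting to exceed the hard limit fails with EPERM
# (the shell reports it as "cannot modify limit: Operation not permitted").
import resource

soft, hard = resource.getrlimit(resource.RLIMIT_AS)

# choose a soft limit that can never exceed the current hard limit
target = 6_000_000 * 1024  # ~6 GB, illustrative
if hard != resource.RLIM_INFINITY:
    target = min(target, hard)

resource.setrlimit(resource.RLIMIT_AS, (target, hard))
print(resource.getrlimit(resource.RLIMIT_AS)[0] == target)
```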

Any advice?

thank you very much,
ido
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] Jobs stuck in new state - Data Library datasets to blame?

2014-11-10 Thread Ido Tamir
Did you check whether the metadata on the input was set correctly, or at all?
This happens sometimes in our Galaxy instance: metadata is not set correctly,
and the jobs run once the metadata is set by hand.
No re-upload necessary.

best,
ido

On 06 Nov 2014, at 17:13, Lance Parsons lpars...@princeton.edu wrote:

 I've run into this same issue again (just with some other Data Library 
 datasets).  This time, there are a few users involved with quite a few 
 stuck jobs.  Does anyone have any advice on pushing these jobs through?  
 Maybe even a pointer to the relevant code?  I'm running latest_2014.08.11.  
 Thanks in advance.
 
 Lance
 
 Lance Parsons wrote:
 Thanks, that was the first thing I checked.  However, restarting the handler 
 didn't help.  Downloading the offending data and re-uploading as a new data 
 set and then rerunning using the new dataset as input did work.  Also, all 
 other jobs continued to run fine.
 
 Lance
 
 Kandalaft, Iyad wrote:
 I’ve had jobs get stuck in the new state when one of the handler servers 
 crashes.  If you have dedicated handlers, check to make sure they are still 
 running.
 Restart the handler to see if the jobs get resumed automatically.
  
  
  
 Iyad Kandalaft
 
  
 From: galaxy-dev-boun...@lists.bx.psu.edu 
 [mailto:galaxy-dev-boun...@lists.bx.psu.edu] On Behalf Of Aaron Petkau
 Sent: Wednesday, October 01, 2014 5:32 PM
 To: Lance Parsons
 Cc: galaxy-dev@lists.bx.psu.edu
 Subject: Re: [galaxy-dev] Jobs stuck in new state - Data Library datasets 
 to blame?
  
 Are you attempting to upload datasets to a Data Library, and then copy to a 
 history and run jobs on them right away?  I've run into issues before where 
 if I attempt to run a job on a dataset in a library before it is finished 
 being uploaded and processed, then the job gets stuck in a queued state and 
 never executes.
 
 Aaron
  
 On Wed, Oct 1, 2014 at 2:51 PM, Lance Parsons lpars...@princeton.edu 
 wrote:
 Recently, I updated our Galaxy instance to use two processes (one for web, 
 the other as a job handler).  This has been working well, except in a few 
 cases.  I've noticed that a number of jobs get stuck in the new status.
 
 In a number of cases, I've resolved the issue by downloading and uploading 
 one of the input files and rerunning the job using the newly uploaded file. 
  In at least one of these cases, the offending input file was one that was 
 copied from a Data Library.
 
 Can anyone point me to something to look for in the database, etc. that 
 would cause a job to think a dataset was not ready for use as a job input?  
 I'd very much like to fix these datasets since having to re-upload data 
 libraries would be very tedious.
 
 Thanks in advance.
 
 -- 
 Lance Parsons - Scientific Programmer
 134 Carl C. Icahn Laboratory
 Lewis-Sigler Institute for Integrative Genomics
 Princeton University
 
  
 
 -- 
 Lance Parsons - Scientific Programmer
 134 Carl C. Icahn Laboratory
 Lewis-Sigler Institute for Integrative Genomics
 Princeton University
 
 
 -- 
 Lance Parsons - Scientific Programmer
 134 Carl C. Icahn Laboratory
 Lewis-Sigler Institute for Integrative Genomics
 Princeton University
 




Re: [galaxy-dev] A proposed modules extension for toolshed wrappers

2014-04-25 Thread Ido Tamir
Hi,
Has anything changed in Galaxy in this regard?
Is there any way to modify the environment before a tool is run?

I now have a tool relying on R-devel and bioconductor devel, both of which I 
can load in a module.
The tool comes from the toolshed with xml like:

<command interpreter="Rscript">
…
</command>

I don't want to hack around in the tool itself, but simply load the necessary 
R-version.

thank you very much,
ido


On Sep 13, 2013, at 3:23 AM, Guest, Simon simon.gu...@agresearch.co.nz 
wrote:

 Just been reading a bit more about the Galaxy packaging system.  Here's a 
 slight modification to what I was suggesting that might fit in a bit better.  
 Apologies for not being more familiar with the existing system before 
 proposing extensions.
 
 Recall that my goal is to support using a system-installed (native) package, 
 at a defined version, which I aim to achieve by loading the appropriate 
 environment module before running a tool.
 
 We still have tool_dependencies.xml defining a package at a particular 
 version, but rather than download and build the source code, there's just a 
 directive that says how to pick up the correct program version at runtime, 
 e.g. which environment module to load.
 
 So instead of the tool_dependencies.xml fragment:
 <tool_dependency>
     <package name="bwa" version="0.6.2">
         <install version="1.0">
             <actions>
                 <action type="download_by_url">http://downloads.sourceforge.net/project/bio-bwa/bwa-0.6.2.tar.bz2</action>
                 <action type="shell_command">make</action>
                 <action type="move_file">
                     <source>bwa</source>
                     <destination>$INSTALL_DIR/bin</destination>
                 </action>
                 <action type="set_environment">
                     <environment_variable name="PATH" action="prepend_to">$INSTALL_DIR/bin</environment_variable>
                 </action>
             </actions>
         </install>
     </package>
 </tool_dependency>
 
 We have something like this (NB: element and attribute names are for 
 illustrative purposes only):
 
 <tool_dependency>
     <package name="bwa" version="0.6.2">
         <use_native>
             <actions>
                 <action type="module_load">bwa/0.6.2</action>
             </actions>
         </use_native>
     </package>
 </tool_dependency>
 
 This causes the right thing (module load bwa/0.6.2) to be stuck into the 
 dependencies env.sh file when this package is installed from the toolshed.  
 We could call this toolshed tool native_package_bwa_0_6_2, to avoid confusion 
 with the existing download-and-make one.
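 Under this proposal the generated env.sh might contain little more than the module load itself; a sketch (file contents and the guard are illustrative, assuming an environment-modules installation):

```shell
# Hypothetical env.sh generated for native_package_bwa_0_6_2; Galaxy
# sources this file before running any tool that requires bwa 0.6.2.
# Guarded so that sourcing it on a host without environment modules
# does not abort the job script.
if command -v module >/dev/null 2>&1; then
    module load bwa/0.6.2
fi
```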
 
 We might want a bit of flexibility on what actions are supported (in case we 
 want to support Software Collections, for example).
 
 What do you think?
 
 cheers,
 Simon
 
 PS: In case it wasn't already clear, solving this problem well is quite 
 important to us here at AgResearch.  ;-)
 
 
 ===
 Attention: The information contained in this message and/or attachments
 from AgResearch Limited is intended only for the persons or entities
 to which it is addressed and may contain confidential and/or privileged
 material. Any review, retransmission, dissemination or other use of, or
 taking of any action in reliance upon, this information by persons or
 entities other than the intended recipients is prohibited by AgResearch
 Limited. If you have received this message in error, please notify the
 sender immediately.
 ===
 




Re: [galaxy-dev] A proposed modules extension for toolshed wrappers

2014-04-25 Thread Ido Tamir
Dear  Björn,

Maybe I could change the tool (which is not mine, and which I don't want to
maintain) to use a specific R version that is already available on our cluster,
and which I can put into my path with module load R/3.1.0-devel
(http://modules.sourceforge.net/).

There is even a requirements section in the wrapper which is not fulfilled
(these versions are not available at the moment), and still it installed
without problems.
  <requirements>
      <requirement type="R-module" version="3.5.27">edgeR</requirement>
      <requirement type="R-module" version="3.18.13">limma</requirement>
  </requirements>

This requirements tag also looks rather inflexible to me. With an additional
level of user-configurable indirection it would be possible to make the tools
fit different infrastructures without having to use binaries provided by
somebody else, taking up space for just one tool, etc.

Currently for jobs that are not run locally there is a general 
environment_setup_file. There could be one optional environment_setup_file for 
every job (or destination).

In the end I created yet another wrapper.

best,
ido


On Apr 25, 2014, at 2:34 PM, Björn Grüning bjoern.gruen...@gmail.com wrote:

 Hi Ido,
 
 I do not get your question in all detail, but it is possible to define a 
 tool_dependencies.xml with a specific R version, + R libraries and use only 
 that specific version from your tool with requirement tags.
 
 For an example please see various R tools from:
 
 https://github.com/bgruening/galaxytools
 
 Cheers,
 Bjoern
 
 Am 25.04.2014 14:25, schrieb Ido Tamir:
 Hi,
 Has anything changed in Galaxy in this regard?
 Is there any way to modify the environment before a tool is run?
 
 I now have a tool relying on R-devel and bioconductor devel, both of which I 
 can load in a module.
 The tool comes from the toolshed with xml like:
 
 <command interpreter="Rscript">
 …
 </command>
 
 
 I don't want to hack around in the tool itself, but simply load the 
 necessary R-version.
 
 thank you very much,
 ido
 
 
 On Sep 13, 2013, at 3:23 AM, Guest, Simon simon.gu...@agresearch.co.nz 
 wrote:
 
 Just been reading a bit more about the Galaxy packaging system.  Here's a 
 slight modification to what I was suggesting that might fit in a bit 
 better.  Apologies for not being more familiar with the existing system 
 before proposing extensions.
 
 Recall that my goal is to support using a system-installed (native) 
 package, at a defined version, which I aim to achieve by loading the 
 appropriate environment module before running a tool.
 
 We still have tool_dependencies.xml defining a package at a particular 
 version, but rather than download and build the source code, there's just a 
 directive that says how to pick up the correct program version at runtime, 
 e.g. which environment module to load.
 
 So instead of the tool_dependencies.xml fragment:
 <tool_dependency>
     <package name="bwa" version="0.6.2">
         <install version="1.0">
             <actions>
                 <action type="download_by_url">http://downloads.sourceforge.net/project/bio-bwa/bwa-0.6.2.tar.bz2</action>
                 <action type="shell_command">make</action>
                 <action type="move_file">
                     <source>bwa</source>
                     <destination>$INSTALL_DIR/bin</destination>
                 </action>
                 <action type="set_environment">
                     <environment_variable name="PATH" action="prepend_to">$INSTALL_DIR/bin</environment_variable>
                 </action>
             </actions>
         </install>
     </package>
 </tool_dependency>
 
 We have something like this (NB: element and attribute names are for 
 illustrative purposes only):
 
 <tool_dependency>
     <package name="bwa" version="0.6.2">
         <use_native>
             <actions>
                 <action type="module_load">bwa/0.6.2</action>
             </actions>
         </use_native>
     </package>
 </tool_dependency>
 
 This causes the right thing (module load bwa/0.6.2) to be stuck into the 
 dependencies env.sh file when this package is installed from the toolshed.  
 We could call this toolshed tool native_package_bwa_0_6_2, to avoid 
 confusion with the existing download-and-make one.
 
 We might want a bit of flexibility on what actions are supported (in case 
 we want to support Software Collections, for example).
 
 What do you think?
 
 cheers,
 Simon
 
 PS: In case it wasn't already clear, solving this problem well is quite 
 important to us here at AgResearch.  ;-)
 
 

Re: [galaxy-dev] A proposed modules extension for toolshed wrappers

2014-04-25 Thread Ido Tamir

On Apr 25, 2014, at 3:55 PM, John Chilton jmchil...@gmail.com wrote:

 Additionally (perhaps more pertinently), I have an open pull request
 (created just this week) to allow setting environment variables on a
 per destination basis - this is sort of what you are getting at and
 might be a better way to go -
 https://bitbucket.org/galaxy/galaxy-central/pull-request/378/allow-specification-of-environment.
 With this modification you could hack up the path to reflect the
 changes made by the module load. That is a bit of a hack but would work
 for this case. That said I think you make a good point about the
 environment file, I will modify the new env tags to allow a file
 attribute and just source that file. This will give you your per
 destination environment files - hopefully this is satisfactory.
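 For reference, a per-destination environment under this scheme might be declared roughly like this in job_conf.xml (a sketch; tag and attribute names depend on how the pull request was merged, so verify against your Galaxy version):

```xml
<!-- Hypothetical destination: source a site-specific environment file and
     set a plain variable before each job sent to this destination. -->
<destination id="r_devel_cluster" runner="drmaa">
    <env file="/opt/site/env/r-devel.sh" />
    <env id="R_LIBS">/opt/R/devel/library</env>
</destination>
```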
 
 

That sounds great!

best,
ido

 
 
 On Fri, Apr 25, 2014 at 8:22 AM, Ido Tamir ta...@imp.ac.at wrote:
 Dear  Björn,
 
 maybe I could change the tool, which is not mine, and which I don't want to 
 maintain to use a specific R-version that is already available on our 
 cluster and
 which I can put into my path with module load R/3.1.0-devel ( 
 http://modules.sourceforge.net/)
 
 there even is a requirements in the wrapper which is not fulfilled (these 
 versions are not available at the moment) and still it installed without 
 problems.
  <requirements>
      <requirement type="R-module" version="3.5.27">edgeR</requirement>
      <requirement type="R-module" version="3.18.13">limma</requirement>
  </requirements>
 
 This requirements tags looks also rather inflexible to me. With an 
 additional level of user configurable indirection it would be possible to 
 make the tools
 fit to different infrastructures without having to use binaries provided by 
 somebody else, taking up space for just one tool etc...
 
 Currently for jobs that are not run locally there is a general 
 environment_setup_file. There could be one optional environment_setup_file 
 for every job (or destination).
 
 In the end I created yet another wrapper.
 
 best,
 ido
 
 
 On Apr 25, 2014, at 2:34 PM, Björn Grüning bjoern.gruen...@gmail.com wrote:
 
 Hi Ido,
 
 I do not get your question in all detail, but it is possible to define a 
 tool_dependencies.xml with a specific R version, + R libraries and use only 
 that specific version from your tool with requirement tags.
 
 For an example please see various R tools from:
 
 https://github.com/bgruening/galaxytools
 
 Cheers,
 Bjoern
 
 Am 25.04.2014 14:25, schrieb Ido Tamir:
 Hi,
 Has anything changed in Galaxy in this regard?
 Is there any way to modify the environment before a tool is run?
 
 I now have a tool relying on R-devel and bioconductor devel, both of which 
 I can load in a module.
 The tool comes from the toolshed with xml like:
 
 <command interpreter="Rscript">
 …
 </command>
 
 
 I don't want to hack around in the tool itself, but simply load the 
 necessary R-version.
 
 thank you very much,
 ido
 
 
 On Sep 13, 2013, at 3:23 AM, Guest, Simon simon.gu...@agresearch.co.nz 
 wrote:
 
 Just been reading a bit more about the Galaxy packaging system.  Here's a 
 slight modification to what I was suggesting that might fit in a bit 
 better.  Apologies for not being more familiar with the existing system 
 before proposing extensions.
 
 Recall that my goal is to support using a system-installed (native) 
 package, at a defined version, which I aim to achieve by loading the 
 appropriate environment module before running a tool.
 
 We still have tool_dependencies.xml defining a package at a particular 
 version, but rather than download and build the source code, there's just 
 a directive that says how to pick up the correct program version at 
 runtime, e.g. which environment module to load.
 
 So instead of the tool_dependencies.xml fragment:
 <tool_dependency>
     <package name="bwa" version="0.6.2">
         <install version="1.0">
             <actions>
                 <action type="download_by_url">http://downloads.sourceforge.net/project/bio-bwa/bwa-0.6.2.tar.bz2</action>
                 <action type="shell_command">make</action>
                 <action type="move_file">
                     <source>bwa</source>
                     <destination>$INSTALL_DIR/bin</destination>
                 </action>
                 <action type="set_environment">
                     <environment_variable name="PATH" action="prepend_to">$INSTALL_DIR/bin</environment_variable>
                 </action>
             </actions>
         </install>
     </package>
 </tool_dependency>
 
 We have something like this (NB: element and attribute names are for 
 illustrative purposes only):
 
 <tool_dependency>
     <package name="bwa" version="0.6.2">
         <use_native>
             <actions>
                 <action type="module_load">bwa/0.6.2</action>
             </actions>
         </use_native>
     </package>
 </tool_dependency>
 
 This causes the right thing (module load bwa/0.6.2) to be stuck into the 
 dependencies env.sh file when this package is installed from the 
 toolshed.  We could call this toolshed tool native_package_bwa_0_6_2, to 
 avoid

[galaxy-dev] select both chromosome size and mappable genome size

2014-04-17 Thread Ido Tamir
Hi,
my tool needs both a file with chromosome lengths and the total mappable genome
size.
I would like to hardcode the mappable genome size per build somewhere.
However, the user should only have to use one dropdown to select both. How can
I achieve this?

e.g. currently:
<param name="FAI" type="select" label="Select chromosome sizes">
    <options from_file="chrsize.loc">
        <column name="name" index="0"/>
        <column name="value" index="1"/>
    </options>
</param>

with chrsize.loc:
mm9 /groups/csf-ngs/wfsys/galaxy/tool-data/shared/ucsc/chrom/mm9.len
mm10/groups/csf-ngs/wfsys/galaxy/tool-data/shared/ucsc/chrom/mm10.len
hg19/groups/csf-ngs/wfsys/galaxy/tool-data/shared/ucsc/chrom/hg19.len
dm3 /groups/csf-ngs/wfsys/galaxy/tool-data/shared/ucsc/chrom/dm3.len

Currently I multiply the genome size from the fai file by 0.8 to get the 
mappable size within my tool,
but I would like to use more exact numbers per build.
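One possible approach (a sketch, not a confirmed recipe for this Galaxy version) is to add a third tab-separated column to chrsize.loc holding the mappable genome size per build, and declare it as an extra column of the same select:

```xml
<!-- Sketch only: the mappable_size column name is illustrative. Each
     chrsize.loc row would gain a third tab-separated field with the
     mappable genome size for that build. -->
<param name="FAI" type="select" label="Select chromosome sizes">
    <options from_file="chrsize.loc">
        <column name="name" index="0"/>
        <column name="value" index="1"/>
        <column name="mappable_size" index="2"/>
    </options>
</param>
```

The command template could then reference both values from the single selection, e.g. $FAI and $FAI.fields.mappable_size, assuming your Galaxy version exposes named columns of dynamic selects that way.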

thank you very much,
ido




Re: [galaxy-dev] jobs stuck in new state

2014-03-31 Thread Ido Tamir
In my case it was incomplete metadata on one of the input files.
(But maybe it was not the new state but something else?)

HTH,
ido

On Mar 26, 2014, at 5:25 PM, David Hoover hoove...@helix.nih.gov wrote:

 I have many jobs stuck in the 'new' state on our local Galaxy instance.  The 
 jobs can't be stopped using the Admin-Manage jobs tool.  First, does anyone 
 know why a job would get stuck in the 'new' state for weeks?  I have cleaned 
 things up by manually setting their states to 'error' in the MySQL database.  
 Is there a better way of dealing with 'new' jobs?
 
 BTW, our Galaxy instance was updated about two weeks ago.
 
 Wondering,
 David Hoover
 Helix Systems Staff




Re: [galaxy-dev] universe_wsgi.ini question for Cistrome

2014-02-28 Thread Ido Tamir
I have CEAS as a tool; I wrapped the standalone CEAS version. I tried adding a
binary datatype for the annotation db, but even with the binary datatype the
upload somehow did not work (IIRC Galaxy tried to convert it to ASCII).
I then put the database as a selectable parameter in a dropdown list, because
users would not create and upload their own annotation db anyway.

best,
ido

On Feb 27, 2014, at 7:13 AM, ruiwang.sz ruiwang...@gmail.com wrote:

 Hi Guys,
 
 Are there anyone who is using Cistrome?
 
 I tried to merge Cistrome's setting into our own galaxy instance. A new entry 
 is
 
 # Path to the static library files for assembly, ceaslib, chromLen, 
 conservation, liftOver and MAT-lib
 # Then in other tool configuration xml file, we can use 'from galaxy import 
 config' then '$config.Configuration().cistrome_static_library_path'
 # default would be in tool-data/ folder
 cistrome_static_library_path = 
 /home/bioinfoadmin/app/cistrome-apps/cistrome_library
 
 However, I got an error when I tried CEAS: Enrichment on chromosome and 
 annotation:
  
 *
 Dataset 23: CEAS: Enrichment on chromosome and annotation on data 16 and data 
 13
 The Galaxy framework encountered the following error while attempting to run 
 the tool:
  
 Traceback (most recent call last):
   File 
 /home/bioinfoadmin/app/galaxy-dist/lib/galaxy/jobs/runners/__init__.py, 
 line 121, in prepare_job
 job_wrapper.prepare()
   File /home/bioinfoadmin/app/galaxy-dist/lib/galaxy/jobs/__init__.py, line 
 707, in prepare
 config_filenames = self.tool.build_config_files( param_dict, 
 self.working_directory )
   File /home/bioinfoadmin/app/galaxy-dist/lib/galaxy/tools/__init__.py, 
 line 2609, in build_config_files
 f.write( fill_template( template_text, context=param_dict ) )
   File /home/bioinfoadmin/app/galaxy-dist/lib/galaxy/util/template.py, line 
 9, in fill_template
 return str( Template( source=template_text, searchList=[context] ) )
   File 
 /home/bioinfoadmin/app/galaxy-dist/eggs/Cheetah-2.2.2-py2.7-linux-x86_64-ucs4.egg/Cheetah/Template.py,
  line 1004, in __str__
 return getattr(self, mainMethName)()
   File cheetah_DynamicallyCompiledCheetahTemplate_1391037616_15_43258.py, 
 line 274, in respond
 NotFound: cannot find 'cistrome_static_library_path' while searching for 
 '__app__.config.cistrome_static_library_path'
  
 Tool execution generated the following error message:
  
 failure preparing job
  
 
 
 I'm quite confused since '__app__.config.cistrome_static_library_path' should 
 be defined? or right now __app__ is dropped
 as Galaxy evolves? Besides, what does the comment mean by
 
 # Then in other tool configuration xml file, we can use 'from galaxy import 
 config' then '$config.Configuration().cistrome_static_library_path'
 
 Does that mean that we should not be using __app__.config...but how to use 
 this? I tried to insert 
 
 'from galaxy import config' 
 
 into the xml file but only got error...
 
 I'll appreciate any input.
 
 Thanks,
 Rui
 
 




Re: [galaxy-dev] Security vulnerability in Galaxy filtering tools

2013-11-05 Thread Ido Tamir
This seems to happen often e.g. 
http://wiki.galaxyproject.org/DevNewsBriefs/2012_10_23#Compute_Tool_Security_Fix

a) Are there general guidelines in the wiki on how to avoid these problems when
creating tools?
b) Is there a way to check automatically whether all input fields are correctly
escaped in a tool?

A search for "security" in the wiki brings up:
• Admin/Data Libraries/Library Security
0.0k - rev: 1 (current) last modified: 2013-01-02 23:54:33
• Admin/DataLibraries/LibrarySecurity
19.2k - rev: 4 (current) last modified: 2013-01-03 00:12:36
• HelpOnConfiguration/SecurityPolicy
1.9k - rev: 1 (current) last modified: 0
• Learn/Security Features
7.0k - rev: 3 (current) last modified: 2011-09-13 16:52:08
• News/2013_04_08_Galaxy_Security_Release
1.3k - rev: 3 (current) last modified: 2013-04-08 16:56:41

A search for "escape" does not bring up anything.

thank you very much,
ido


On Nov 5, 2013, at 12:45 AM, Nate Coraor n...@bx.psu.edu wrote:

 A security vulnerability was recently discovered by John Chilton with 
 Galaxy's "Filter data on any column using simple expressions" and "Filter on 
 ambiguities in polymorphism datasets" tools that can allow for arbitrary 
 execution of code on the command line.
 
 The fix for these tools has been committed to the Galaxy source.  The timing 
 of this commit coincides with the next Galaxy stable release (which has also 
 been pushed out today).
 
 To apply the fix and simultaneously update to the new Galaxy stable release, 
 ensure you are on the stable branch and upgrade to the latest changeset:
 
 % hg branch
 stable
 
 % hg pull -u
 
 For Galaxy installations that administrators are not yet ready to upgrade to 
 the latest release, there are three workarounds.
 
 First, for Galaxy installations running on a relatively new version of the 
 stable release (e.g. release_2013.08.12), Galaxy can be updated to the 
 specific changeset that contains the fix.  This will include all of the 
 stable (non-feature) commits that have been accumulated since the 8/12 
 release plus any new features included with (and prior to) the 8/12 release, 
 but without all of the new features included in the 11/4 release.  Ensure you 
 are on the stable branch and then upgrade to the specific changeset:
 
 % hg pull -u -r e094c73fed4d
 
 Second, the patch can be downloaded and applied manually:
 
 % wget -O security.patch 
 https://bitbucket.org/galaxy/galaxy-central/commits/e094c73fed4dc66b589932edb83412cb8b827cd3/raw/
 
 and then:
 
 % hg patch security.patch
 
 or:
 
 % patch -p1 < security.patch
 
 Third, the tools can be completely disabled by removing them from the tool 
 configuration file (by default, tool_conf.xml) and restarting all Galaxy 
 server processes.  The relevant lines in tool_conf.xml are:
 
   <tool file="stats/dna_filtering.xml" />
   <tool file="stats/filtering.xml" />
 
 The full 11/4 Galaxy Distribution News Brief will be available later today 
 and will contain details of changes since the last release.
 
 --nate
 Galaxy Team




Re: [galaxy-dev] Security vulnerability in Galaxy filtering tools

2013-11-05 Thread Ido Tamir

On Nov 5, 2013, at 6:28 PM, Nate Coraor n...@bx.psu.edu wrote:

 Hi Ido,
 
 Thanks for the feedback.  Replies below.
 
 On Nov 5, 2013, at 9:54 AM, Ido Tamir wrote:
 
 This seems to happen often e.g. 
 http://wiki.galaxyproject.org/DevNewsBriefs/2012_10_23#Compute_Tool_Security_Fix
 
 I'm not sure I'd agree that it's often - we've had 4 or 5 vulnerabilities 
 over the life of the project.  2 allowed arbitrary code execution, the others 
 were less severe.
 
But these were written by experts, not by people like me who don't know what
the Galaxy framework really does or does not do with the input, so I guess I
make many more mistakes.

 a) are there general guidelines in the wiki on how to avoid these problems 
 when creating tools?
 
 The guidelines for writing a Galaxy tool are no different from best practices 
 for writing secure code.  In specific for this vulnerability, execution of 
 user input should be handled with extreme care, and this tool had some gaps 
 in its input validation and sanitization.  For what it's worth, the filter 
 tool (on which the other vulnerable tools were based) is one of the few tools 
 surviving from the very early days of Galaxy, and would not be implemented 
 the same way if written today.

I think it would be nice to have a small outline on the wiki of what Galaxy
does with the input data and how it could affect a tool: what sanitisation is
there by default so I don't have to worry about it, and what could happen if I
don't check for or sanitise characters like ', |, or ..., maybe with examples.
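As one possible example of the kind of sanitisation in question (a sketch, not the actual Galaxy fix): an expression-filter tool can whitelist the AST node types it is willing to evaluate instead of handing user input to eval or the shell verbatim. Names below are illustrative.

```python
# Whitelist-based validation of a user-supplied filter expression.
# Anything outside the allowed node types (function calls, attribute
# access, imports, ...) is rejected before evaluation.
import ast

ALLOWED = (
    ast.Expression, ast.BoolOp, ast.And, ast.Or, ast.Compare, ast.BinOp,
    ast.Add, ast.Sub, ast.Mult, ast.Div, ast.Name, ast.Load, ast.Constant,
    ast.Eq, ast.NotEq, ast.Lt, ast.LtE, ast.Gt, ast.GtE,
)

def safe_filter(expression, row):
    tree = ast.parse(expression, mode="eval")
    for node in ast.walk(tree):
        if not isinstance(node, ALLOWED):
            raise ValueError("disallowed syntax: %s" % type(node).__name__)
    # empty __builtins__ so even a missed node type cannot reach os.system
    return eval(compile(tree, "<filter>", "eval"), {"__builtins__": {}}, row)

print(safe_filter("c1 > 2 and c2 == 5", {"c1": 3, "c2": 5}))  # True

# a code-injection attempt contains a Call node and is rejected:
# safe_filter("__import__('os').system('touch /tmp/pwned')", {})  -> ValueError
```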

 b) is there a way to check automatically if all input fields are correctly 
 escaped in a tool?
 
 I am not sure how Galaxy could do this.  Galaxy sanitizes the command line so 
 that input fields passed to a tool as command line arguments cannot be 
 crafted to exploit the shell's parsing rules.  
That's good.

best,
ido


 What the tool itself does with its inputs are out of Galaxy's control.
 
 --nate
 
 
 A search for security in the wiki brings up:
  • Admin/Data Libraries/Library Security
 0.0k - rev: 1 (current) last modified: 2013-01-02 23:54:33
  • Admin/DataLibraries/LibrarySecurity
 19.2k - rev: 4 (current) last modified: 2013-01-03 00:12:36
  • HelpOnConfiguration/SecurityPolicy
 1.9k - rev: 1 (current) last modified: 0
  • Learn/Security Features
 7.0k - rev: 3 (current) last modified: 2011-09-13 16:52:08
  • News/2013_04_08_Galaxy_Security_Release
 1.3k - rev: 3 (current) last modified: 2013-04-08 16:56:41
 
 escape does not bring up anything.
 
 thank you very much,
 ido
 
 
 On Nov 5, 2013, at 12:45 AM, Nate Coraor n...@bx.psu.edu wrote:
 
 A security vulnerability was recently discovered by John Chilton with 
 Galaxy's "Filter data on any column using simple expressions" and "Filter 
 on ambiguities in polymorphism datasets" tools that can allow for arbitrary 
 execution of code on the command line.
 
 The fix for these tools has been committed to the Galaxy source.  The 
 timing of this commit coincides with the next Galaxy stable release (which 
 has also been pushed out today).
 
 To apply the fix and simultaneously update to the new Galaxy stable 
 release, ensure you are on the stable branch and upgrade to the latest 
 changeset:
 
 % hg branch
 stable
 
 % hg pull -u
 
 For Galaxy installations that administrators are not yet ready to upgrade 
 to the latest release, there are three workarounds.
 
 First, for Galaxy installations running on a relatively new version of the 
 stable release (e.g. release_2013.08.12), Galaxy can be updated to the 
 specific changeset that contains the fix.  This will include all of 
 the stable (non-feature) commits that have been accumulated since the 8/12 
 release plus any new features included with (and prior to) the 8/12 
 release, but without all of the new features included in the 11/4 release.  
 Ensure you are on the stable branch and then upgrade to the specific 
 changeset:
 
 % hg pull -u -r e094c73fed4d
 
 Second, the patch can be downloaded and applied manually:
 
 % wget -O security.patch 
 https://bitbucket.org/galaxy/galaxy-central/commits/e094c73fed4dc66b589932edb83412cb8b827cd3/raw/
 
 and then:
 
 % hg patch security.patch
 
 or:
 
 % patch -p1 < security.patch
 
 Third, the tools can be completely disabled by removing them from the tool 
 configuration file (by default, tool_conf.xml) and restarting all Galaxy 
 server processes.  The relevant lines in tool_conf.xml are:
 
 <tool file="stats/dna_filtering.xml" />
 <tool file="stats/filtering.xml" />
 
 The full 11/4 Galaxy Distribution News Brief will be available later today 
 and will contain details of changes since the last release.
 
 --nate
 Galaxy Team

Re: [galaxy-dev] Test Toolshed Biopython package dependency Atlas fails to install (Was: Re: UnboundLocalError: local variable 'prior_installation_required' referenced before assignment)

2013-09-20 Thread Ido Tamir
Yes, this tarball is broken, at least on OS X. 
Other people have the same issue:
http://code.google.com/p/libarchive/issues/detail?id=299



On Sep 20, 2013, at 10:41 AM, Bjoern Gruening bjoern.gruen...@gmail.com wrote:

 Hi Carlos,
 
 Hi Peter and Carlos,
 
 On Mon, Sep 16, 2013 at 8:57 PM, Carlos Borroto
 carlos.borr...@gmail.com wrote:
 I did an extra test. Started with a clean 'galaxy-dist'. This time
 both repositories fail with the same error. I guess before something
 was cached for the repository with version 0.1.4.
 
 I used biopython repository as a guide to write my tool dependency 
 definition:
 http://testtoolshed.g2.bx.psu.edu/view/biopython/package_biopython_1_61
 
 I can confirm the biopython repository is failing to install for me with
 exactly the same error. I wonder if a recent addition in the test
 toolshed broke the treatment of prior_installation_required.
 
 Thanks,
 Carlos
 
 Could be - note that Biopython isn't currently
 installing properly on the Test Tool Shed due to ATLAS
 failing (a requirement of NumPy, which is a requirement
 of Biopython). Dave and Bjoern are, I think, looking at this
 already...
 
 I can't do much. I tested it again and for me it's working fine on the
 computers I have at hand ... sorry.
 
 
 In case it helps, this is how the INSTALLATION.log file ends on OS X 10.8.4:
 $ tail -n 3 
 ~/src/shed_tools_dependencies.central/atlas/3.10.1/iuc/package_atlas_3_10/3508de0ebae1/INSTALLATION.log
 x ATLAS/tune/lapack/lanbsrch.c
 tar: Error exit delayed from previous errors.
 #
 
 This is the relevant part I can find in Galaxy's log:
 [localhost] local: tar xfvj atlas3.10.1.tar.bz2
 
 Warning: local() encountered an error (return code 1) while executing
 'tar xfvj atlas3.10.1.tar.bz2'
 
 After noticing this I got what I'm guessing is the original file from
 sourceforge:
 http://downloads.sourceforge.net/project/math-atlas/Stable/3.10.1/atlas3.10.1.tar.bz2
 
 I can confirm that trying to untar this file fails with the exact same
 error. However, on Ubuntu 13.04 untarring this file works just fine.
 
 That is new to me. How can that happen? Can anyone with OS X confirm
 that?
 
 On Ubuntu 13.04 the error I see is:
 STDERR
 It appears you have cpu throttling enabled, which makes timings
 unreliable and an ATLAS install nonsensical.  Aborting.
 See ATLAS/INSTALL.txt for further information
 #
 
 Björn, you said cpu throttling can be easily disabled on Ubuntu. Can
 you comment how? Do I need to disable it completely, or should the
 install script do it when installing?
 
 ATLAS (once untarred ;-)) needs cpu throttling to be disabled to
 optimize its library. If it is not disabled, ATLAS compilation will
 fail. For OS X I found this:
 
 http://apple.stackexchange.com/questions/41045/how-can-i-disable-cpu-throttling-and-cpu-disabling
 
 Sorry, I have never touched OS X. Nevertheless, if it's not disabled, it is
 supposed to fail silently and downstream packages will not be affected.
 But if it's crashing during untarring, I can't do much. What I can do is
 repack the tarball and host it elsewhere. Which brings me to:
 
 http://gmod.827538.n3.nabble.com/RFC-Storing-of-tarballs-and-patches-for-tool-dependencies-to-enable-reproducibility-td4036591.html
 
 Bad news for a Friday morning, sorry :(
 Bjoern
 
 
 Thanks,
 Carlos
 


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] Test Toolshed Biopython package dependency Atlas fails to install (Was: Re: UnboundLocalError: local variable 'prior_installation_required' referenced before assignment)

2013-09-20 Thread Ido Tamir
tar xvfj atlas3.11.11.tar.bz2
shows no errors on OS X and creates one ATLAS folder.

best,
ido
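
Independent of which platform tar is installed, Python's tarfile module gives a
portable sanity check of a suspect tarball. A sketch, using a tiny in-memory
archive as a stand-in for atlas3.11.11.tar.bz2 (the member path is just
illustrative):

```python
import io
import tarfile

def archive_members(fileobj):
    """Walk every member of a .tar.bz2; raises tarfile.TarError if the archive is corrupt."""
    with tarfile.open(fileobj=fileobj, mode="r:bz2") as tar:
        return [member.name for member in tar]

# Build a tiny in-memory .tar.bz2 standing in for the real ATLAS tarball.
raw = io.BytesIO()
with tarfile.open(fileobj=raw, mode="w:bz2") as tar:
    data = b"/* placeholder */\n"
    info = tarfile.TarInfo("ATLAS/tune/lapack/lanbsrch.c")
    info.size = len(data)
    tar.addfile(info, io.BytesIO(data))
raw.seek(0)

print(archive_members(raw))  # ['ATLAS/tune/lapack/lanbsrch.c']
```

If this walk succeeds on a machine where the system tar fails, the problem is in
the platform's tar/libarchive rather than in the tarball itself.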

On Sep 20, 2013, at 11:17 AM, Bjoern Gruening bjoern.gruen...@gmail.com wrote:

 Hi Ido and Carlos,
 
 can you check if that tarball is working?
 
 http://downloads.sourceforge.net/project/math-atlas/Developer%20%28unstable%29/3.11.11/atlas3.11.11.tar.bz2
 
 The chance is low, but if it's working for you I will consider creating a new 
 version for it.
 Thanks,
 Bjoern
 


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Towards Galaxy Linux (not) ? [was [RFC] Storing of tarballs and patches for tool_dependencies to enable reproducibility]

2013-09-19 Thread Ido Tamir
If I might chime in: I am a bit worried about all the automatic installation 
going on in Galaxy, and it seems that the trend is to enhance this.
A small R or Python script calling into well-known libraries that come from 
well-known repositories (Bioconductor etc.) I can check.
(Of course I install too much stuff from GitHub, Bioconductor etc. without 
checking.)
 
 I'm not sure it is comparable to a entire Linux distribution, its more
 like an Appstore, like pypi, bioconductor or gems, and yes that is

The app stores are checked by Apple or Google for malicious code, and the apps are 
sandboxed.
There are many eyes on Python packages, Bioconductor packages and gems because many 
more people interact with
them directly compared to Galaxy tools.

 Sorry, maybe I was misleading. I only want a central storage for
 binaries/tarballs where the source cannot be trusted for the long term.
 'Long term' and 'trusted' need to be defined in such a discussion here.
 I do not think we should copy python packages that are stored in pypi.
 We should make it as easy as possible to install them in our repository. If
 you do not trust pypi, we can offer a mirror. Same goes for gems.

Trusted for me means I trust the source not to contain dangerous code. I trust PyPI
more than some mirror, Bioconductor base packages more than some freshly 
published package that few people have used, tools from Galaxy core developers 
more than tools from the Tool Shed, etc.
I know this is not the type of trust you were talking about.

best,
ido
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] tools installed from a toolshed repository displayed in an arbitrary order

2013-09-16 Thread Ido Tamir
IIRC you can change the order of tools manually by moving the 
individual tool entries around in shed_tool_conf.xml.
In my case it builds the integrated_tool_panel.xml in reverse order within the 
section.

I still have labels between the sections defined in the tool_conf.xml,
but no subsections.
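
For illustration, a hypothetical shed_tool_conf.xml fragment (the section name,
paths and guids are made up): the panel shows tools in element order, so moving
these lines around reorders the section:

```xml
<section id="ngs_tools" name="NGS Tools">
    <!-- Display order follows element order; move a <tool> line to move the tool. -->
    <tool file="../shed_tools/toolshed.example.org/repos/owner/repo/abc123/tool_a.xml"
          guid="toolshed.example.org/repos/owner/repo/tool_a/1.0.0"/>
    <tool file="../shed_tools/toolshed.example.org/repos/owner/repo/abc123/tool_b.xml"
          guid="toolshed.example.org/repos/owner/repo/tool_b/1.0.0"/>
</section>
```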

best,
ido

On Sep 16, 2013, at 6:40 AM, yulia.arzha...@csiro.au wrote:

 Hello guys,
  
 After porting our Galaxy tools into a tool shed repository the tool sections 
 got lost. I know there is a possibility to assign a repository to a section 
 inside the tools panel when the repository is getting installed. However, we 
 would still like to have sections inside the repository – that makes 
 navigating the tools much neater. Also, the way shed_tool_conf.xml is 
 populated with tools during repository installation seems arbitrary to me. Is 
 there any way to control this, to have repository tools sorted/displayed in a 
 certain order in the Tools panel, at the very least alphabetically? I've got 
 many tools in my repository (and no, it is not possible to divide them into 
 several repositories), and the way they are sorted or divided into sections 
 makes navigating the tools much easier for a user.
 Thanks,
 Yulia
  


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] Galaxy Release Cycle Length

2013-08-21 Thread Ido Tamir
Why the dislike for quick turnover? Could somebody present the arguments for 
those of us who were not at the BOF?

People don't have to upgrade - unless there are breaking changes, e.g. ones that 
disable the possibility to download from the public toolshed, which forced me to upgrade.

The alternative to frequent releases is not better tested and documented 
releases. I think it is impossible for the Galaxy team to test on all the diverse 
configurations Galaxy is deployed on. With infrequent releases you really have 
accumulated many bugs, feedback from users comes in all at once, creating 
coordination overhead for developers who all go into bugfixing mode for features 
they worked on months ago, etc. The fixes for these bugs then also come only with 
the next release, which is still many months away, etc. This makes it necessary to 
split development into major releases (say once a year) and minor bugfix releases 
(1-2 months after the major) - where is the gain for the users?

I would not even split things between breaking changes and minor changes. I 
think this slows down development of the platform, and what concerns
people most, the tools, are developed independently of the platform and can be 
upgraded at any time.

To give an example, the job_conf is now much better than before, and it is good 
that I did not have to wait months from its development to its deployment at our 
site.
Now small additional features like setting the number of threads dynamically 
are being suggested, and then I would have to wait many, many months until the 
next release.

Upgrading Galaxy was OK, and while it is unfortunate to have to learn new 
settings and remember the old ones from sometimes not-so-good (but improving!) 
documentation, there is no way around it. I think I will now upgrade more often, 
because it went so well. I like release early, release often.

best,
ido

On Aug 20, 2013, at 8:36 PM, Dave Clements cleme...@galaxyproject.org wrote:

 Hello all,
 
 At one of the GCC2013 Birds of a Feather sessions the group was very clear 
 that they would like to see less frequent releases of Galaxy.  We're 
 currently aiming to do a release every 2 months and have been pretty 
 successful at making that target.  In the past, we have tried doing releases 
 more often and less often.
 
 Is there a sweet spot for the time between releases?
 
 Please reply to the group.  We are interested in a discussion.
 
 Thanks,
 
 Dave C
 
 -- 
 http://galaxyproject.org/GCC2013
 http://galaxyproject.org/
 http://getgalaxy.org/
 http://usegalaxy.org/
 http://wiki.galaxyproject.org/


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] shall us specify which tool run in cluster?

2013-07-31 Thread Ido Tamir

On Jul 31, 2013, at 8:52 AM, shenwiyn shenw...@gmail.com wrote:

 Hi Thon Deboer ,
 I am new to Galaxy. I installed my Galaxy with Torque 2.5.0, and Galaxy 
 uses the pbs module to interface with TORQUE. But I have some questions about 
 job_conf.xml:
 1.) In your job_conf.xml you use regularjobs, longjobs, shortjobs... to run 
 different jobs. How does Galaxy know which tool belongs to regularjobs or 
 longjobs? And what is the meaning of nativeSpecification?

By specifying, as Thon did, the id of the tool and its destination in the tools 
section - those are the settings.
The nativeSpecification parameter lets you set additional parameters that are 
passed with the submission call,
e.g. -pe smp 4 tells the grid engine to use the parallel environment smp with 
4 cores.

 2.) Should we use a tools collection with <tool id="bwa_wrapper" 
 destination="multicorejobs4"/> to specify bwa? Does it mean that bwa belongs to 
 multicorejobs4 and runs on the cluster?

exactly

 3.) Does every tool need us to specify which destination it belongs to?
  I saw http://wiki.galaxyproject.org/Admin/Config/Jobs about this, but I am 
 not sure. Could you help me please?
  

Fortunately, there is a default destination. 
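
Putting the three answers together, a minimal sketch of the relevant
job_conf.xml pieces (ids and queue names here are illustrative, not a
recommendation):

```xml
<destinations default="regularjobs">
    <!-- Jobs whose tools are not listed under <tools> fall back to this default. -->
    <destination id="regularjobs" runner="drmaa">
        <param id="nativeSpecification">-V -q long.q -pe smp 1</param>
    </destination>
    <destination id="multicorejobs4" runner="drmaa">
        <!-- "-pe smp 4": request the smp parallel environment with 4 slots. -->
        <param id="nativeSpecification">-V -q long.q -pe smp 4</param>
    </destination>
</destinations>
<tools>
    <!-- Only tools that need a non-default destination are listed here. -->
    <tool id="bwa_wrapper" destination="multicorejobs4"/>
</tools>
```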


 shenwiyn
  
 From: Thon Deboer
 Date: 2013-07-18 14:31
 To: galaxy-dev
 Subject: [galaxy-dev] Jobs remain in queue until restart
 Hi,
  
 I have noticed that from time to time the job queue seems to be “stuck” and 
 can only be unstuck by restarting Galaxy.
 The jobs seem to be in the queued state, the python job handler processes 
 are hardly ticking over, and the cluster is empty.
  
 When I restart, the startup procedure realizes all jobs are in a “new” 
 state and it then assigns a job handler, after which the jobs start fine….
  
 Any ideas?
  Torque
  
 Thon
  
 P.S. I am using the June version of Galaxy and I DO set limits on my users in 
 job_conf.xml, like so. (Maybe it is related? Before it went into dormant mode, 
 this user had started lots of jobs and may have hit the limit, but I assumed 
 this limit was the number of running jobs at one time, right?)
  
 <?xml version="1.0"?>
 <job_conf>
     <plugins workers="4">
         <!-- "workers" is the number of threads for the runner's work queue.
              The default from <plugins> is used if not defined for a plugin. -->
         <plugin id="local" type="runner" load="galaxy.jobs.runners.local:LocalJobRunner" workers="2"/>
         <plugin id="drmaa" type="runner" load="galaxy.jobs.runners.drmaa:DRMAAJobRunner" workers="8"/>
         <plugin id="cli" type="runner" load="galaxy.jobs.runners.cli:ShellJobRunner" workers="2"/>
     </plugins>
     <handlers default="handlers">
         <!-- Additional job handlers - the id should match the name of a
              [server:id] in universe_wsgi.ini. -->
         <handler id="handler0" tags="handlers"/>
         <handler id="handler1" tags="handlers"/>
         <handler id="handler2" tags="handlers"/>
         <handler id="handler3" tags="handlers"/>
         <!-- <handler id="handler10" tags="handlers"/>
         <handler id="handler11" tags="handlers"/>
         <handler id="handler12" tags="handlers"/>
         <handler id="handler13" tags="handlers"/>
         -->
     </handlers>
     <destinations default="regularjobs">
         <!-- Destinations define details about remote resources and how jobs
              should be executed on those remote resources. -->
         <destination id="local" runner="local"/>
         <destination id="regularjobs" runner="drmaa" tags="cluster">
             <!-- These are the parameters for qsub, such as queue etc. -->
             <param id="nativeSpecification">-V -q long.q -pe smp 1</param>
         </destination>
         <destination id="longjobs" runner="drmaa" tags="cluster,long_jobs">
             <!-- These are the parameters for qsub, such as queue etc. -->
             <param id="nativeSpecification">-V -q long.q -pe smp 1</param>
         </destination>
         <destination id="shortjobs" runner="drmaa" tags="cluster,short_jobs">
             <!-- These are the parameters for qsub, such as queue etc. -->
             <param id="nativeSpecification">-V -q short.q -pe smp 1</param>
         </destination>
         <destination id="multicorejobs4" runner="drmaa" tags="cluster,multicore_jobs">
             <!-- These are the parameters for qsub, such as queue etc. -->
             <param id="nativeSpecification">-V -q long.q -pe smp 4</param>
         </destination>
 
         <!-- <destination id="real_user_cluster" runner="drmaa">
             <param id="galaxy_external_runjob_script">scripts/drmaa_external_runner.py</param>
             <param id="galaxy_external_killjob_script">scripts/drmaa_external_killer.py</param>
             <param id="galaxy_external_chown_script">scripts/external_chown_script.py</param>
         </destination> -->
 
         <destination id="dynamic" runner="dynamic">
             <!-- A destination that represents a method in the dynamic runner. -->
             <param id="type">python</param>
             <param id="function">interactiveOrCluster</param>
         </destination>
     </destinations>
     <tools>
         <!-- Tools can be configured to use specific destinations or 

[galaxy-dev] job_conf.xml workers

2013-07-24 Thread Ido Tamir
Hi,

what is the relationship between workers defined in universe and job_conf.xml? 
e.g.:

[server:handler0]
use = egg:Paste#http
port = 9092
host = 127.0.0.1
use_threadpool = true
threadpool_workers = 5


<plugins workers="4">
    <!-- "workers" is the number of threads for the runner's work queue.
         The default from <plugins> is used if not defined for a plugin. -->
    <plugin id="local" type="runner" load="galaxy.jobs.runners.local:LocalJobRunner"/>
    <plugin id="drmaa" type="runner" load="galaxy.jobs.runners.drmaa:DRMAAJobRunner" workers="10"/>
</plugins>

a) do I have to specify the sum of the workers in job_conf.xml in universe?
e.g. 4 local, 10 drmaa = 14 threadpool_workers per handler? Or is universe 
overridden by job_conf.xml?

b)
How many concurrent jobs can I then expect? 14 per handler?


thank you very much,
ido




___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] job_conf.xml workers

2013-07-24 Thread Ido Tamir
Ah yes,
this was local_job_queue_workers and cluster_job_queue_workers in universe.ini

thank you very much,
ido
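
For reference, these legacy settings live in universe_wsgi.ini; a sketch with
illustrative values (the handler block is copied from the question above, and
the two queue-worker options are the ones named in this reply):

```ini
# universe_wsgi.ini - legacy job-worker settings, superseded by
# the <plugins workers="..."> attributes in job_conf.xml.
local_job_queue_workers = 5
cluster_job_queue_workers = 3

[server:handler0]
use = egg:Paste#http
port = 9092
host = 127.0.0.1
# Web-request threads only - unrelated to job running (per James's reply).
use_threadpool = true
threadpool_workers = 5
```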

On Jul 24, 2013, at 5:10 PM, James Taylor ja...@jamestaylor.org wrote:

 use_threadpool and threadpool_workers specify the number of threads
 for handling web requests and have nothing to do with job running.
 
 --
 James Taylor, Assistant Professor, Biology/CS, Emory University
 
 


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


[galaxy-dev] job_conf.xml questions

2013-07-23 Thread Ido Tamir
Hi,
I am working my way through job_conf.xml and have a couple of questions:

a)
In your advanced example you have:

<destination id="remote_cluster" runner="drmaa" tags="longjobs"/>
<destination id="real_user_cluster" runner="drmaa">
    <!-- TODO: The real user options should maybe not be considered runner params. -->
    <param id="galaxy_external_runjob_script">scripts/drmaa_external_runner.py</param>
    <param id="galaxy_external_killjob_script">scripts/drmaa_external_killer.py</param>
    <param id="galaxy_external_chown_script">scripts/external_chown_script.py</param>
</destination>

Does this mean that remote_cluster jobs cannot be killed unless I add the three 
scripts to this destination?
Or does this have to do with running jobs as the real user, which I don't need 
currently? I am using the galaxy user for all jobs for the moment.

To phrase it differently: do I need these three scripts?

b)
Can I set a wall time limit only for my local runners?

like:
<destination id="local" runner="local">
    <param id="Resource_List">walltime=72:00:00</param>
</destination>

When I set it in limits, will it work for local runners, and do I then have to 
override it in
every destination?
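
On the walltime question, a hedged sketch: a <limits> entry is enforced by
Galaxy itself, so it also covers local-runner jobs, whereas a per-destination
DRM parameter like Resource_List only makes sense for destinations whose runner
understands it (the local runner does not interpret PBS resource lists).
Assuming the job_conf.xml limits syntax:

```xml
<limits>
    <!-- Enforced Galaxy-side, so it applies to local-runner jobs too. -->
    <limit type="walltime">72:00:00</limit>
</limits>
```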


thank you very much,
ido





___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


[galaxy-dev] no metadata for tools in toolshed

2013-07-10 Thread Ido Tamir
Hi,
I grabbed some XML files from Hannon's website for his fastx toolkit (improved 
over Galaxy's default version).
Then I created a repository and uploaded the XML files,
but the toolshed complains that it could not find the tools for 
creating metadata in the XML files.

Could somebody please tell me what the problem is with these files?
They look like valid tool.xml files to me.

http://toolshed.g2.bx.psu.edu/view/idot/fastx_toolkit2
http://i...@toolshed.g2.bx.psu.edu/repos/idot/fastx_toolkit2

thank you very much,
ido




___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] deleted_new kills handlers

2013-01-22 Thread Ido Tamir
No, the problem is still there, but we have not updated since the beginning of 
December.

maybe this is related:
http://gmod.827538.n3.nabble.com/segfault-in-libdrmaa-gt-galaxy-front-end-failure-td4027349.html

best,
ido

On Jan 21, 2013, at 5:45 AM, Derrick Lin wrote:

 Hi Ido,
 
 I just found your post, and I think we are having a similar issue to yours 
 (I posted it to the mailing list yesterday; you can find it there for the 
 details of my problem).
 
 I am wondering if you were able to fix your problem?
 
 Cheers,
 Derrick
 
 
 
 
 On Tue, Dec 11, 2012 at 12:13 AM, Ido Tamir ta...@imp.ac.at wrote:
 Dear galaxy maintainers,
 
 we have the problem that killing jobs running on DRMAA before they actually run 
 (leading to a deleted_new state) kills our handlers (sometimes?). The job gets 
 sent to the handler, but the handler never acknowledges jobs (local and drmaa), 
 and all jobs submitted afterwards stay in the submitted state.
 
 Normally submission works fine, running also.
 
 Is such a problem known to anybody else?
 
 in my universe_wsgi.ini:
 enable_job_recovery = False
 I don't know which other settings could affect this.
 
 Could it be an old DRMAA library, maybe because SGE was updated
 after the Galaxy installation? Other ideas?
 
 
 thank you very much,
 ido
 
 
 


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] Job handler keeps crashing

2013-01-22 Thread Ido Tamir
Is it possible to backport this onto the latest distribution?
Yes, I'm lazy, but there are also others who are still updating
within the next weeks and will have problems without being
aware of this fix.

best,
ido


On Jan 21, 2013, at 9:50 PM, Nate Coraor wrote:

 Hi all,
 
 The commit[1] that fixes this is not in the January 11 distribution.  It'll 
 be part of the next distribution.
 
 --nate
 
 [1] 
 https://bitbucket.org/galaxy/galaxy-central/commits/c015b82b3944f967e2c859d5552c00e3e38a2da0
 
 On Jan 21, 2013, at 3:10 PM, Anthonius deBoer wrote:
 
 I have seen this same issue exactly. Python just dies without any errors in 
 the log. Using the latest galaxy-dist
 
 Sent from my iPhone
 
 On Jan 20, 2013, at 8:35 PM, Derrick Lin klin...@gmail.com wrote:
 
 Updating to the 11 Jan 2013 dist does not help with this issue. :(
 
 I checked the database and had a look at the job entries that handler0 
 tried to stop before shutting down (NULL and housekeeping columns omitted):
 
 | id   | create_time         | update_time         | user_id | tool_id                                                         | version | state       | info                                             | job_runner_name                  | handler  |
 | 3088 | 2013-01-03 14:25:38 | 2013-01-03 14:27:05 | 531     | toolshed.g2.bx.psu.edu/repos/kevyin/homer/homer_findPeaks/0.1.2 | 0.1.2   | deleted_new | Job output deleted by user before job completed. | drmaa://-V -j n -R y -q intel.q/ | handler0 |
 | 3091 | 2013-01-04 10:52:19 | 2013-01-07 09:14:34 | 531     | toolshed.g2.bx.psu.edu/repos/kevyin/homer/homer_findPeaks/0.1.2 | 0.1.2   | deleted_new | Job output deleted by user before job completed. | drmaa://-V -j n -R y -q intel.q/ | handler0 |
 | 3093 | 2013-01-07 22:02:21 | 2013-01-07 22:16:27 | 531     | toolshed.g2.bx.psu.edu/repos/kevyin/homer/homer_pos2bed/1.0.0   | 1.0.0   | deleted_new | Job output deleted by user before job completed. | drmaa://-V -j n -R y -q intel.q/ | handler0 |
 
 So basically the job table has several of these entries that are assigned to 
 handler0 and marked as deleted_new. When handler0 comes up, it starts 
 stopping these jobs; after the first job has been stopped, handler0 
 crashes and dies, but that job is then marked as deleted.
 
 I think if I manually change the job state from deleted_new to deleted 
 in the db, handler0 will become fine. I am just concerned about how 
 these jobs were created in the first place (assigned to a handler but marked as 
 deleted_new).
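 
 Derrick's proposed manual fix can be sketched with SQL. This is only an illustration: it uses an in-memory SQLite database as a stand-in for the Galaxy database (a real instance may run PostgreSQL or MySQL, though the UPDATE is the same), and the table and column names here are simplified, not Galaxy's exact schema. Stop the handler and back up the database before trying anything like this for real.

```python
import sqlite3

# Stand-in for the Galaxy database; table/columns simplified from Galaxy's
# "job" table for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE job (id INTEGER PRIMARY KEY, state TEXT, handler TEXT)")
conn.executemany(
    "INSERT INTO job (id, state, handler) VALUES (?, ?, ?)",
    [(3088, "deleted_new", "handler0"),
     (3091, "deleted_new", "handler0"),
     (3093, "deleted_new", "handler0"),
     (3100, "ok", "handler0")],
)

# The manual fix: mark the stuck jobs as fully deleted so the handler
# stops trying to terminate them on startup.
cur = conn.execute(
    "UPDATE job SET state = 'deleted' "
    "WHERE state = 'deleted_new' AND handler = 'handler0'"
)
conn.commit()
print(cur.rowcount)  # prints 3 (jobs flipped from deleted_new to deleted)
```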
 
 Cheers,
 D
 
 
 On Mon, Jan 21, 2013 at 1:49 PM, Derrick Lin klin...@gmail.com wrote:
 I had a close look at the code in 
 
 galaxy-dist / lib / galaxy / jobs / handler.py
 galaxy-dist / lib / galaxy / jobs / runners / drmaa.py
 
 and found that stopping deleted and deleted_new jobs seems to be a normal 
 routine for the job handler. I could not find any exception that caused the 
 shutdown.
 
 I do notice that in the galaxy-dist on bitbucket there is one commit with the 
 comment Fix shutdown on python >= 2.6.2 by calling setDaemon when creating 
 threads (these are still..., which seems relevant?
 
 I will do the update to 11 Jan release and see if it fixes the issue.
 
 D
 
 
 On Fri, Jan 18, 2013 at 4:03 PM, Derrick Lin klin...@gmail.com wrote:
 Hi guys,
 
 We have updated our galaxy to 20 Dec 2012 release. Recently we found that 
 some submitted jobs could not start (stay gray forever).
 
 We found that it was caused by the job manager sending jobs to a handler 
 (handler0) whose python process had crashed and died.
 
 From the handler log we found the last messages right before the crash:
 
 galaxy.jobs.handler DEBUG 2013-01-18 15:00:34,481 Stopping job 3032:
 galaxy.jobs.handler DEBUG 2013-01-18 15:00:34,481 stopping job 3032 in 
 drmaa runner
 
 We restarted the galaxy, handler0 is up for few seconds then died again 
 with the same error messages except the job number moved to the next one.
 
 We observed that the jobs it was trying to stop are all previous jobs whose 
 status is either deleted or deleted_new.
 
 We have never seen this in the past, so I am wondering if there is a bug in the 
 new release?
 
 Cheers,
 Derrick
 
 

Re: [galaxy-dev] In which Encryption form does Galaxy store password?

2013-01-07 Thread Ido Tamir
It's hashed (SHA-1) and you cannot reverse the hash. Storing passwords in a 
reversible manner is bad practice. 
http://galaxy-central.readthedocs.org/en/latest/_modules/galaxy/util/hash_util.html

You could maybe authenticate your users from the other program with the same 
method against the hashed passwords.
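
Ido's suggestion could look something like the following. This is a minimal sketch, assuming the stored value is a plain SHA-1 hex digest as produced by galaxy.util.hash_util at the time; the function name `check_password` is made up for illustration. (Note that unsalted SHA-1 is itself weak by modern standards.)

```python
import hashlib

def check_password(candidate: str, stored_hash: str) -> bool:
    """Compare a candidate password against a stored SHA-1 hex digest.

    Sketch only: assumes the stored value is an unsalted SHA-1 hex
    digest, matching what galaxy.util.hash_util produced at the time.
    """
    return hashlib.sha1(candidate.encode("utf-8")).hexdigest() == stored_hash

# Demo with a locally computed hash standing in for the database value.
stored = hashlib.sha1(b"secret").hexdigest()
print(check_password("secret", stored))  # True
print(check_password("wrong", stored))   # False
```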

best,
ido

On Jan 7, 2013, at 12:43 PM, Sachit Adhikari wrote:

 Hi,
 
 In which encryption does Galaxy store the password? It's really urgent. I want to 
 decode the password and use it in the other program. Thanks


Re: [galaxy-dev] let galaxy use more hardware resouses

2012-12-10 Thread Ido Tamir
If you did what the previously posted links suggest (incl. parallelization of 
tools, multiple server processes),
there is not much more you can do.

Galaxy mostly just calls the tools, so it's the tool itself that you have to change.
Genomic tools are often also I/O bound, so a faster storage system could lead to 
better CPU utilization - if that is the bottleneck.
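
For reference, the "multiple server processes" setup mentioned above was configured roughly like this in the old-style universe_wsgi.ini. This is a sketch from the Galaxy scaling docs of that era: section and key names may differ between releases, and the ports here are made up.

```ini
; universe_wsgi.ini -- sketch of a multi-process setup (hypothetical ports)
[server:web0]
use = egg:Paste#http
port = 8080

[server:handler0]
use = egg:Paste#http
port = 8090

[server:handler1]
use = egg:Paste#http
port = 8091

[app:main]
; route jobs away from the web process to dedicated handlers
job_manager = web0
job_handlers = handler0,handler1
```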

HTH,
ido

On Dec 10, 2012, at 9:51 AM, 泽 蔡 wrote:

 Hi all,
  
 I run a local instance of galaxy. Today I took a look at the system monitor and 
 found that galaxy uses just 4% of one CPU and a little bit of the memory of the 
 PC. But the PC has twelve CPUs and 126G of memory. So, how do I configure 
 galaxy to use more hardware resources to calculate quickly?

Re: [galaxy-dev] binary data types get corrupted on upload

2012-12-07 Thread Ido Tamir
The specification does not mention zip, gzip or bz2 [1], and
I am pretty confident that it does not additionally compress
its content with zip etc.
It starts with the header SQLite format 3\000,
which is visible in my file with less.

ido

[1]http://www.sqlite.org/fileformat.html
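
The header check Ido describes, and the compressed-type check John asks about below, both come down to sniffing magic bytes at the start of the file. A minimal sketch (the `sniff` helper is made up for illustration, not Galaxy's actual upload.py code; the magic values themselves are the standard ones):

```python
import os
import tempfile

# Standard magic bytes; the SQLite value is the 16-byte header from the
# file-format spec, the others are the usual compression signatures.
MAGICS = {
    "sqlite": b"SQLite format 3\x00",
    "gzip": b"\x1f\x8b",
    "zip": b"PK\x03\x04",
    "bz2": b"BZh",
}

def sniff(path: str) -> str:
    """Return the name of the first magic matching the file's head, else 'unknown'."""
    with open(path, "rb") as fh:
        head = fh.read(16)
    for name, magic in MAGICS.items():
        if head.startswith(magic):
            return name
    return "unknown"

# Demo: a file starting with the SQLite header is not mistaken for a
# compressed type.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"SQLite format 3\x00" + b"\x00" * 100)
print(sniff(tmp.name))  # sqlite
os.unlink(tmp.name)
```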

On Dec 7, 2012, at 3:50 PM, John Chilton wrote:

 Hi Ido,
 
 Is .sqlite a common compressed file type (i.e. zip, gzip, or bz2), just
 with a different extension?
 
 If it is, I think upload.py is always going to try to decompress it.
 That has long been a behavior I would like to be able to disable
 on a per-file-type basis. If this is the problem and the galaxy team
 agrees to accept a patch to address the issue, I would be happy to
 try to implement the fix.
 
 -John
 
 On Fri, Dec 7, 2012 at 8:39 AM, Ido Tamir ta...@imp.ac.at wrote:
 
 from http://wiki.galaxyproject.org/Admin/Datatypes/Adding%20Datatypes
 I thought that adding subclass=True in datatypes_conf.xml would be enough 
 to register my new
 type, but it seems I still have to add an entry in binary.py for it to 
 be recognized. And it's still corrupted.
 
 thank you very much,
 ido
 
 --
 Tool: Upload File
 Name:   hg19.sqlite
 Created:Dec 07, 2012
 Filesize:   11.1 MB
 Dbkey:  hg19
 Format: sqlite




[galaxy-dev] from shed to shed

2012-12-04 Thread Ido Tamir
Hi,
how do I get tools from one local toolshed (or the main galaxy toolshed)
into another tool-shed? Is it possible to clone the content of a complete 
tool-shed - all repositories
at once - into another tool-shed?
Can I then import all the tools at once into a local galaxy instance?

We have a group with their own tool-shed. Unfortunately, I accessed it directly
without a proxy on its port, which leads to the problem described in issue 825 in 
galaxy-central.

I want to 
a) clone the tool-shed as it is into my own tool-shed (rsync?)
b) selectively or bulk install these tools into my galaxy server.

thank you very much,
ido


