Re: [galaxy-dev] Implementing blend4j

2013-07-15 Thread Eric Kuyt
Nice, this works! Maybe after some testing we can try to get the package in
the central maven repositories.

I think for now all the functionality I need is in your version of blend4j.
But if I think of something new I'll try to participate.

Thanks


On 13 July 2013 08:58, John Chilton chil...@msi.umn.edu wrote:

 So you have packaged blend4j and all of its dependencies into one jar
 file? I don't have any experience doing that. I could imagine
 potentially some files conflicting and only one version being written
 and so jersey-json breaks? Is this a possibility, some of the jar
 metadata sort of files?

 I have used blend4j a few different ways (in a war, as a Java web
 start application, and as a standalone application) but always with
 applications where it is in its own jar file and all of its
 dependencies are in their own and original jar files. So I would
 recommend using it in that mode, if that is not possible however I
 would search the web for others having problems repackaging
 jersey-json.

 As for development, you can use my artifactory server to grab the
 canonical blend4j, by adding the following to your pom.xml file:

   repositories
 repository
   idmsi-artifactory/id
   nameMSI Artifactory/name
   urlhttp://artifactory.msi.umn.edu/libs-snapshot/url
 /repository
   /repositories

   dependencies
 dependency
   groupIdcom.github.jmchilton.blend4j/groupId
   artifactIdblend4j/artifactId
   version0.1-SNAPSHOT/version
 /dependency
 

 If instead you want to test your own changes to blend4j, I believe you
 can just do a mvn install or something like that from the blend4j
 fork directory on your development server to install a local copy into
 your maven cache. blend4j is the first project I have used maven with,
 so some of my terminology could be wrong there, but hopefully the idea
 makes sense. Also happy to look at pull requests if it helps to push
 changes upstream.

 Sorry if this response is not more helpful, let me know if there is
 anything else I can do or answer.

 -John

 On Thu, Jul 11, 2013 at 4:37 AM, Eric Kuyt eric.ku...@wur.nl wrote:
  Hello all,
 
  I'm trying to implement blend4j in a java application. for this I cloned
 the
  source, fetched the dependencies and added  a maven assembly plugin to
  assemble a jar file with dependencies.
  This jar I put in my classpath.
 
  Now fetching histories works, but creating a new one fails. Online I see
 a
  lot of jersey users have this problem and it seems like jersey-json is
 not
  available, but jersey-json is packaged in the jar.
 
  Anyone else have this problem?
 
  @John maybe you have better practices? I would like to have blend4j in my
  maven dependencies, but not build my own nexus.
 
  Thanks,
 
  Eric
 
 
  --
  Central Veterinary Institute of Wageningen UR (CVI)
  Department of Infection Biology
  PO box 65, 8200 AB Lelystad, NL
  Visiting address: ASG, Edelhertweg 15, 8219 PH Lelystad
 
  Tel:  +31-(0)320-293391
  Fax: +31-(0)320-238153
  E-mail: eric.ku...@wur.nl
  Web: http://www.cvi.wur.nl




-- 
Central Veterinary Institute of Wageningen UR (CVI)
Department of Infection Biology
PO box 65, 8200 AB Lelystad, NL
Visiting address: ASG, Edelhertweg 15, 8219 PH Lelystad

Tel:  +31-(0)320-293391
Fax: +31-(0)320-238153
E-mail: eric.ku...@wur.nl
Web: http://www.cvi.wur.nl
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] No samtools build after building index through Data Manager.

2013-07-15 Thread graham etherington (TSL)
Hi Dan,
Thanks for your reply.
I don't handle the updates, but it was updated on 25 Jun 13. I'm pretty
sure I can rule out the version number as the BWA builds, created using
the Data Manager, work fine.

Here is the pertinent contents of shed_tool_data_table_conf.xml

?xml version=1.0?
tables
table comment_char=# name=all_fasta
columnsvalue, dbkey, name, path/columns
file 
path=/home/galaxy/software/galaxy-central/tool-data/testtoolshed.g2.bx.psu
.edu/repos/blankenberg/data_manager_bwa_index_builder/fe6508204acc/all_fast
a.loc /

tool_shed_repositorytool_shedtesttoolshed.g2.bx.psu.edu/tool_shedrep
ository_namedata_manager_bwa_index_builder/repository_namerepository_ow
nerblankenberg/repository_ownerinstalled_changeset_revisionfe6508204ac
c/installed_changeset_revision/tool_shed_repository/table

table comment_char=# name=all_fasta
columnsvalue, dbkey, name, path/columns
file 
path=/home/galaxy/software/galaxy-central/tool-data/testtoolshed.g2.bx.psu
.edu/repos/blankenberg/data_manager_sam_fa_index_builder/926e50397b83/all_f
asta.loc /

tool_shed_repositorytool_shedtesttoolshed.g2.bx.psu.edu/tool_shedrep
ository_namedata_manager_sam_fa_index_builder/repository_namerepository
_ownerblankenberg/repository_ownerinstalled_changeset_revision926e5039
7b83/installed_changeset_revision/tool_shed_repository/table


table comment_char=# name=sam_fa_indexes
columnsline_type, value, path/columns
file 
path=/home/galaxy/software/galaxy-central/tool-data/testtoolshed.g2.bx.psu
.edu/repos/blankenberg/data_manager_sam_fa_index_builder/926e50397b83/sam_f
a_indices.loc /

tool_shed_repositorytool_shedtesttoolshed.g2.bx.psu.edu/tool_shedrep
ository_namedata_manager_sam_fa_index_builder/repository_namerepository
_ownerblankenberg/repository_ownerinstalled_changeset_revision926e5039
7b83/installed_changeset_revision/tool_shed_repository/table

table comment_char=# name=all_fasta
columnsvalue, dbkey, name, path/columns
file 
path=/home/galaxy/software/galaxy-central/tool-data/testtoolshed.g2.bx.psu
.edu/repos/blankenberg/data_manager_fetch_genome_all_fasta/ca8b3709309e/all
_fasta.loc /

tool_shed_repositorytool_shedtesttoolshed.g2.bx.psu.edu/tool_shedrep
ository_namedata_manager_fetch_genome_all_fasta/repository_namereposito
ry_ownerblankenberg/repository_ownerinstalled_changeset_revisionca8b37
09309e/installed_changeset_revision/tool_shed_repository/table
/tables






in ${GALAXY_HOME}/tool_data_table_conf.xml.sample the entry for sam_fa
reads as so:

!-- Location of SAMTools indexes and other files --
table name=sam_fa_indexes comment_char=#
columnsline_type, value, path/columns
file path=tool-data/sam_fa_indices.loc /
/table



The file tool-data/sam_fa_new_indices.loc (and .sample) does not exist.



If I keep the manually inserted builds listed in
tool-data/sam_fa_indices.loc and restart Galaxy, then I get the following
(abridged) entries in the paster.log:
galaxy.tools.data DEBUG 2013-07-15 09:51:11,109 Loaded tool data table
'all_fasta'
galaxy.tools.data DEBUG 2013-07-15 09:51:11,115 Loaded tool data table
'bwa_indexes'
galaxy.tools.data DEBUG 2013-07-15 09:51:11,116 Loaded tool data table
'bwa_indexes_color'
galaxy.tools.data DEBUG 2013-07-15 09:51:11,167 Loaded tool data table
'sam_fa_indexes'
...


galaxy.tools.data DEBUG 2013-07-15 09:51:11,324 Loading another instance
of data table 'all_fasta', attempting to merge content.

galaxy.tools.data DEBUG 2013-07-15 09:51:11,340 Loading another instance
of data table 'bwa_indexes', attempting to merge content.
galaxy.tools.data DEBUG 2013-07-15 09:51:11,348 Loading another instance
of data table 'bwa_indexes_color', attempting to merge content.
galaxy.tools.data DEBUG 2013-07-15 09:51:11,410 Loading another instance
of data table 'all_fasta', attempting to merge content.
galaxy.tools.data DEBUG 2013-07-15 09:51:11,422 Loading another instance
of data table 'sam_fa_indexes', attempting to merge content.
galaxy.tools.data ERROR 2013-07-15 09:51:11,422 Attempted to add fields
(['index', 'cfraxinea_s1v1',
'/home/galaxy/software/galaxy-central/tool-data/cfraxinea_s1v1/sam_index/cf
raxinea_s1v1/c_fraxinea_s1v1.fa']) to data table 'sam_fa_indexes', but
this entry already exists and allow_duplicates is False.
galaxy.tools.data ERROR 2013-07-15 09:51:11,422 Attempted to add fields
(['index', 'b_distachyon',
'/home/galaxy/software/galaxy-central/tool-data/b_distachyon/sam_index/b_di
stachyon/b_distachyon.fa']) to data table 'sam_fa_indexes', but this entry
already exists and allow_duplicates is False.
galaxy.tools.data ERROR 2013-07-15 09:51:11,423 Attempted to add fields
(['index', 'n_sylvestris',
'/home/galaxy/software/galaxy-central/tool-data/n_sylvestris/sam_index/n_sy
lvestris/n_sylvestris.fa']) to data table 'sam_fa_indexes', but this entry
already exists and allow_duplicates is False.
galaxy.tools.data ERROR 2013-07-15 09:51:11,423 Attempted to add fields
(['index', 

[galaxy-dev] Display data in browser

2013-07-15 Thread Gromobir

Hello list,
I'm currently trying to create a new tool, which has an .xhtml file as 
its output.
After finishing the job, I would like to be able to click on the eye 
icon aka Display data in browser
and display the resulting .xhtml file in the frame to the left of my 
tool history. Unfortunately I have no idea yet

how I could accomplish this. Any help is highly appreciated.

Best regards,
Gromobir
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
 http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
 http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] CloudMan Error

2013-07-15 Thread greg
Thanks for getting back to me, Enis.

I went ahead and started a new cluster instance.  I'll try to leave it
running today in case there's anything we want to check.

Here's my whole process:

Start Screen Setup:
--
http://snag.gy/DMIeC.jpg

Entering my share string:

http://snag.gy/wKFLy.jpg


Main Page Text and log:

Cluster name:

MSGGREG

Disk status:

0 / 0 (0%)

Worker status:

Idle: 0 Available: 0 Requested: 0

Service status:

Applications

 Data


Cluster status log

14:08:32 - Master starting
14:08:34 - Completed the initial cluster startup process. This is a
new cluster; waiting to configure the type.
14:08:50 - Migration service prerequisites OK; starting the service
14:08:50 - SGE service prerequisites OK; starting the service
14:08:58 - Setting up SGE...
14:09:13 - HTCondor service prerequisites OK; starting the service
14:09:21 - Hadoop service prerequisites OK; starting the service
14:09:38 - Done adding Hadoop service; service running.
14:11:53 - Error creating volume from shared cluster's snapshot
'['snap-cfa775ba']': 'filesystems'



Admin Page CloudMan Log: (unfortunately nothing is jumping out at me?)
---

CloudMan from Galaxy
Admin | Report bugs | Wiki | Screencast
The entire log file (paster.log) is shown. Show latest | Back to admin view
Python version:  (2, 7)
Image configuration suports: {'apps': ['cloudman', 'galaxy']}
2013-07-15 14:08:32,406 DEBUGapp:68   Initializing app
2013-07-15 14:08:32,407 DEBUGec2:121  Gathering instance
zone, attempt 0
2013-07-15 14:08:32,410 DEBUGec2:127  Instance zone is 'us-east-1d'
2013-07-15 14:08:32,410 DEBUGec2:45   Gathering instance
ami, attempt 0
2013-07-15 14:08:32,412 DEBUGapp:71   Running on 'ec2'
type of cloud in zone 'us-east-1d' using image 'ami-118bfc78'.
2013-07-15 14:08:32,412 DEBUGapp:89   Getting pd.yaml
2013-07-15 14:08:32,412 DEBUGec2:338  No S3 Connection,
creating a new one.
2013-07-15 14:08:32,413 DEBUGec2:342  Got boto S3 connection.
2013-07-15 14:08:32,452 DEBUG   misc:212  Checking if bucket
'cm-0479bd75a331acc874033e98b2e1e03e' exists... it does not.
2013-07-15 14:08:32,452 DEBUG   misc:583  Bucket
'cm-0479bd75a331acc874033e98b2e1e03e' does not exist, did not get
remote file 'persistent_data.yaml'
2013-07-15 14:08:32,452 DEBUGapp:96   Setting
deployment_version to 2
2013-07-15 14:08:32,453 INFO app:103  Master starting
2013-07-15 14:08:32,453 DEBUG master:55   Initializing console
manager - cluster start time: 2013-07-15 14:08:32.453182
2013-07-15 14:08:32,453 DEBUG   comm:42   AMQP Connection
Failure:  [Errno 111] Connection refused
2013-07-15 14:08:32,453 DEBUG master:791  Trying to discover
any worker instances associated with this cluster...
2013-07-15 14:08:32,454 DEBUGec2:317  Establishing boto
EC2 connection
2013-07-15 14:08:32,535 DEBUGec2:305  Got region as
'RegionInfo:us-east-1'
2013-07-15 14:08:32,777 DEBUGec2:326  Got boto EC2
connection for region 'us-east-1'
2013-07-15 14:08:33,022 DEBUG   misc:574  Retrieved file
'snaps.yaml' from bucket 'cloudman' on host 's3.amazonaws.com' to
'cm_snaps.yaml'.
2013-07-15 14:08:33,035 DEBUGec2:286  Got region name as 'us-east-1'
2013-07-15 14:08:33,035 DEBUG master:226  Loaded default
snapshot data: [{'snap_id': 'snap-adad90fc', 'name': 'galaxy',
'roles': 'galaxyTools,galaxyData'}, {'snap_id': 'snap-5b030634',
'name': 'galaxyIndices', 'roles': 'galaxyIndices'}]
2013-07-15 14:08:33,035 DEBUGec2:81   Gathering instance
id, attempt 0
2013-07-15 14:08:33,037 DEBUGec2:87   Instance ID is 'i-5346d733'
2013-07-15 14:08:33,125 DEBUGec2:360  Adding tag
'clusterName:MSGGREG' to resource 'i-5346d733'
2013-07-15 14:08:33,307 DEBUGec2:360  Adding tag
'role:master' to resource 'i-5346d733'
2013-07-15 14:08:33,554 DEBUGec2:360  Adding tag
'Name:master: MSGGREG' to resource 'i-5346d733'
2013-07-15 14:08:33,744 DEBUG master:246  ud at manager start:
{'region_name': u'us-east-1', 'region_endpoint': u'ec2.amazonaws.com',
'ec2_port': None, 'deployment_version': 2, 'cloud_name': u'Amazon',
'boot_script_name': 'cm_boot.py', 'is_secure': True, 'password':
'', 'access_key': 'redacted!', 's3_port': None, 'cloud_type':
u'ec2', 'cloudman_home': '/mnt/cm', 'cluster_name': u'MSGGREG',
'freenxpass': u'', 'bucket_default': 'cloudman', 'role': 'master',
'bucket_cluster': 'cm-0479bd75a331acc874033e98b2e1e03e',
'boot_script_path': '/tmp/cm', 'secret_key': u'redacted!',
's3_conn_path': u'/', 's3_host': u's3.amazonaws.com', 'ec2_conn_path':
u'/'}
2013-07-15 14:08:33,744 DEBUG master:1858 Generating root
user's public key...
2013-07-15 14:08:33,763 DEBUG

[galaxy-dev] [GSoC2013] Week 4 Accomplishments and Week 5 Plans

2013-07-15 Thread Saket Choudhary
Hi All,

Week 4 blog post :

http://galaxy-gsoc2013.blogspot.com/2013/07/week-4-accomplishments-and-week-5-plans.html

Do drop in your comments , if any.

Thanks

Saket
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] Error Uploading Directory of Files

2013-07-15 Thread Nate Coraor
On Jun 28, 2013, at 12:35 PM, Nicholas Kline wrote:

 Hi,
 
 Our lab recently installed a local version of Galaxy on a mid-2012 Mac
 Pro computer. We can access the Galaxy server and sign in as an
 administrator. Today we tried creating a Data Library, adding a
 dataset to it, and uploading a directory of files. We followed the
 Galaxy documentation at
 http://wiki.galaxyproject.org/Admin/DataLibraries/UploadingLibraryFiles?action=showredirect=Admin%2FData+Libraries%2FUploading+Library+Files
 to setup this feature:
 
 - Admin  Data Library  Add datasets  Upload directory of files
 - file format was set to auto-detect
 - and we chose the option to link to files instead of copying them
 
 Galaxy confirmed that the files were successfully uploaded. However,
 in the data library, under the Message column, is a message in red
 saying Job error (click name for more info). Clicking on one of the
 uploaded files displays a page with this information:
 
 
 Date uploaded: 2013-06-28
 
 File size: 7.5 GB
 
 Data type: auto
 
 Build: sacCer2
 
 Miscellaneous information:
 Traceback (most recent call last): File
 /Users/administrator/galaxy-dist/tools/data_source/upload.py, line
 386, in __main__() File
 /Users/administrator/galaxy-dist/tools/data_source/upload.py, line
 357, in __main__ output_paths =
 
 Job Standard Error
 Traceback (most recent call last):
  File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 386, in
__main__()
  File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 357, in __main__
output_paths = parse_outputs( sys.argv[4:] )
  File /Users/administrator/galaxy-dist/tools/data_source/upload.py,
 line 64, in parse_outputs
id, files_path, path = arg.split( ':', 2 )
 ValueError: need more than 1 value to unpack
 error
 
 Database/Build: sacCer2
 Number of data lines: None
 Disk file: /Volumes/G-SPEED Q/data/Person 2012 project/DCP2 mef.fastq

Hi Nicholas,

Could you try re-adding the data to the library with the spaces removed from 
all path components?  I thought this had been fixed a long time ago but it's 
possible that the bug reappeared at some point.

--nate

 
 
 Questions:
 1. Should we be concerned about this error?
 2. If so, what is the right way to fix it?
 3. If not, how do we remove the red error message next to each file:
 Job error (click name for more info) ?
 
 Thank you
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


[galaxy-dev] Galaxy Server Questions

2013-07-15 Thread Jennifer Jackson

Hi Zain,

The minimum requirements for Galaxy are pretty much any recent model 
computer. A basic install will load and run within a very short amount 
of time, following the instructions at:

http://getgalaxy.org

Advanced tools come from the Tool Shed. These are installed separately. 
No programming is needed, but some very basic unix skills are required, 
as they would be for ongoing server maintenance. The Tool Shed is one of 
the best documented parts of Galaxy, so you should be able to find all 
the answers here, or this is the right list to use for questions about 
local installs (galaxy-...@bx.psu.edu):

http://wiki.galaxyproject.org/Tool%20Shed

However, running a production instance that is intended to run compute 
intensive tools (such as Tophat2), and where you have some throughput 
goals, will require more substantial resources. This is always a 
difficult question to answer since so much depends on the tools used and 
the data volume. But in general, minimum requirements are about around 
the same as what those underlying tools would require on their own, if 
run line-command. So for Tophat or Tophat2, and really the entire Tuxedo 
RNA-seq tool package, you might be able to get by on 8G memory and 2 or 
4 cores. But, it will probably be slow and if you are running replicates 
through Cuffdiff at the end, you might run out of memory if the files 
are large and the genome is large (such as human). And if you are are 
hosting a web Galaxy at the same time with visualizations and such, 
well, this is why systems are often set up with clusters. With all of 
these will be competing for the same resources, going low can work, but 
this will be something that will have to be managed/tested, and it will 
change through time as tools upgrade.


You could test this out by setting up a cloud instance with the hardware 
you plan to use, loading some of your data, and running your workflow to 
see how this benchmarks. Have users on the instance while you are doing 
this - to judge performance. Cloud installations come with many of the 
advanced tools and data indexes already installed/configured, so this 
would be less investment than buying the hardware first and finding out 
later it was not not enough.

http://usegalaxy.org/cloud

And of course Slipstream Galaxy is an option. The whole intention here 
is to make a complete package with tools  data already configured, in a 
system that has enough compute capability to do the work, for 
scientists/labs who do not want to deal with administrative tasks or 
work in the cloud (for whatever reason).

http://bioteam.net/slipstream/galaxy-edition/

Good luck with your decision! Other are welcomed to comment about how 
they have set up their system.


Jen
Galaxy team

On 7/15/13 7:14 AM, Zain A Alvi wrote:

Hi Jen,

I hope this reaches you well. I have a small question in regards to 
setting up a galaxy server. My mentors and I are looking into buying a 
server for doing NGS analysis through the use of Galaxy. We saw that 
slipstream's specifications for hardware to be the following:


CPU: 2x Intel Xeon Processor E5-2690, 8 core (16 cores total)
RAM Memory: 12x 32GB RDIMM (384 GB) with option to upgrade it to 512 GB
Storage (Hard Drive space): 7x 3TB SAS 6 Gbps (16 TB usable) with 1 x 
100GB Solid State Disk

Power: Dual Pedundant Power Supplies
Network: Dual Gigabit Network Adapter

We are wondering what can be the minimum server hardware 
specifications that Galaxy be run on.


Our second question is if we install Galaxy on the server, do all the 
tools currently available on Galaxy come pre-installed on it or do we 
have to program (Via Perl and Python) and install each of those tool 
sets ourselves.  If we have to install those tools ourselves, is there 
a guide that we can do so? Lastly, how can we upgrade the tool sets 
such as Tophat 1.44 to Tophat 2 on this server. I was wondering about 
the last question as Tophat 1.44 is available on the main Galaxy 
server whereas Tophat 2 is available on the test Galaxy server.


Sorry for so many questions. Thank you again for all the great help.

Sincerely,

Zain


--
Jennifer Hillman-Jackson
Galaxy Support and Training
http://galaxyproject.org

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Connecting to new Galaxy Cloudman by FTP

2013-07-15 Thread Nate Coraor
On Jul 8, 2013, at 3:50 PM, Mohammad Heydarian wrote:

 Hi,
 I am having trouble setting up a FTP connection with the recently released 
 version of Galaxy Cloudman (ami-118bfc78). 
 
 I have instantiated the new version of Galaxy Cloudman with CloudLaunch and 
 also through the AWS EC2 wizard (using the same security group settings as 
 the previous versions) and neither instance will connect to my FTP connection.
 
 Has anyone else had this problem? Does anyone know what is preventing the FTP 
 connection? 
 
 Any help would be greatly appreciated.

Hey Mo,

This may be a case of the new password hashing algorithm's incompatibility with 
the provided ProFTPD config.  Could you try the following:

1. Set 'use_pbkdf2 = False' in universe_wsgi.ini anywhere in the [app:main] 
section
2. Restart Galaxy
3. Reset your password in the Galaxy UI
4. Test FTP again

--nate

 
  
 Cheers, 
 Mo Heydarian
 
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/
 
 To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


[galaxy-dev] plain text output format

2013-07-15 Thread mark.rose
Hi All

I'm trying to format one of the three outputs of my galaxy tool as plain text.  
Thus far no matter how I specify my output format in the tool xml file I have 
the same problem.  When I click the eye icon it opens a save dialog rather that 
displaying the text output in the central frame.  The preview shows the text 
output and if I follow through and save the output and open with wordpad it is 
fine.  Also if I edit the attributes of the output and change it to txt 
(which is one of the formats I had tried specifying in the xml)  it will render 
in the central frame but I would rather not have to do that.  Both of my other 
outputs (a vcf and a tab-delimited file are rendered in the central frame 
properly).  What can I do to fix this?

Thanks

Mark




This message may contain confidential information. If you are not the 
designated recipient, please notify the sender immediately, and delete the 
original and any copies. Any use of the message by you is prohibited. 
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

[galaxy-dev] Upstart script to manage a multi instance load balanced installation

2013-07-15 Thread Seth Sims
After some work i've created an Upstart script which can manage a load
balanced galaxy configuration as described in the wiki. I thought that I
would put it on this list for other people to use. The script parses
universe_wsgi.ini just like run.sh and spawns all of the servers it finds.
It comes in two pieces galaxy.conf and galaxy-worker.conf. Once you place
them both in /etc/init and make the proper edits for the environment a
server can be started with sudo start galaxy. The configuration starts
the server at boot time and puts all of the instances under the management
of upstart which deals with pids, logging to syslog and respawning if an
instance crashes.
I have just gotten this working reasonably well but have done basically no
testing so there are bugs to be found. Any comments are welcome if anyone
knows a better way to do something here.

- Seth

*galaxy.conf*

author Seth Sims seth.s...@gmail.com
version 0.0.1 test
description galaxy master process. Fetches eggs and spawns all of the
servers it finds configured

start on started network-services

# make sure that any eggs we download are at least owned by the galaxy
group.
# we cannot use setuid in this script because only root can issue the
start galaxy-worker
# command. But this way the galaxy instances should still be able to use
their eggs.
setgid galaxy

# put galaxy root directory here
chdir /srv/galaxy-dist/

pre-start script
date
echo checking python version
python ./scripts/check_python.py
[ $? -ne 0 ]  exit 1

echo pre-fetching tossing out expired eggs
python ./scripts/check_eggs.py -q
if [ $? -eq 0 ]; then
echo Some eggs are out of date, attempting to fetch...
python ./scripts/fetch_eggs.py
if [ $? -eq 0 ]; then
echo Fetch Successful.
else
echo Fetch failed.
fi
fi

echo starting servers
SERVERS=`sed -n 's/^\[server:\(.*\)\]/\1/  p' universe_wsgi.ini | xargs
echo`
for SERVER in ${SERVERS} ; do
echo starting server ${SERVER}
start galaxy-worker SERVER_NAME=${SERVER}
done
end script

post-stop script
SERVERS=`sed -n 's/^\[server:\(.*\)\]/\1/  p' universe_wsgi.ini | xargs
echo`
date
echo stopping galaxy servers.
for SERVER in ${SERVERS} ; do
echo stopping ${SERVER}
stop galaxy-worker SERVER_NAME=${SERVER}
done
end script
---
*galaxy-worker*
author Seth Sims seth.s...@gmail.com
version 0.0.1 test
description Starts a galaxy server instance. This is run from the
galaxy.conf file

instance $SERVER_NAME

#make sure we are running as the galaxy user
setuid galaxy
setgid galaxy

#put the galaxy root directory here
chdir /srv/galaxy-dist/

#having multiple instances of galaxy using the same egg directory was
causing a race
#condition that was stopping the instances from starting correctly. So give
each instance
#its own directory under /tmp
env PYTHON_EGG_CACHE=/tmp/${SERVER_NAME}_egg/

respawn

script
exec python ./scripts/paster.py serve universe_wsgi.ini
--server-name=${SERVER_NAME}
end script
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Implementing blend4j

2013-07-15 Thread John Chilton
Fantastic. blend4j definitely should be in a public repository, just don't
know how and have not had the time. Any help in that process would be most
welcome.

-John
On Jul 15, 2013 10:43 AM, Eric Kuyt eric.ku...@wur.nl wrote:

 Nice, this works! Maybe after some testing we can try to get the package
 in the central maven repositories.

 I think for now all the functionality I need is in your version of
 blend4j. But if I think of something new I'll try to participate.

 Thanks


 On 13 July 2013 08:58, John Chilton chil...@msi.umn.edu wrote:

 So you have packaged blend4j and all of its dependencies into one jar
 file? I don't have any experience doing that. I could imagine
 potentially some files conflicting and only one version being written
 and so jersey-json breaks? Is this a possibility, some of the jar
 metadata sort of files?

 I have used blend4j a few different ways (in a war, as a Java web
 start application, and as a standalone application) but always with
 applications where it is in its own jar file and all of its
 dependencies are in their own and original jar files. So I would
 recommend using it in that mode, if that is not possible however I
 would search the web for others having problems repackaging
 jersey-json.

 As for development, you can use my artifactory server to grab the
 canonical blend4j, by adding the following to your pom.xml file:

   repositories
 repository
   idmsi-artifactory/id
   nameMSI Artifactory/name
   urlhttp://artifactory.msi.umn.edu/libs-snapshot/url
 /repository
   /repositories

   dependencies
 dependency
   groupIdcom.github.jmchilton.blend4j/groupId
   artifactIdblend4j/artifactId
   version0.1-SNAPSHOT/version
 /dependency
 

 If instead you want to test your own changes to blend4j, I believe you
 can just do a mvn install or something like that from the blend4j
 fork directory on your development server to install a local copy into
 your maven cache. blend4j is the first project I have used maven with,
 so some of my terminology could be wrong there, but hopefully the idea
 makes sense. Also happy to look at pull requests if it helps to push
 changes upstream.

 Sorry if this response is not more helpful, let me know if there is
 anything else I can do or answer.

 -John

 On Thu, Jul 11, 2013 at 4:37 AM, Eric Kuyt eric.ku...@wur.nl wrote:
  Hello all,
 
  I'm trying to implement blend4j in a java application. for this I
 cloned the
  source, fetched the dependencies and added  a maven assembly plugin to
  assemble a jar file with dependencies.
  This jar I put in my classpath.
 
  Now fetching histories works, but creating a new one fails. Online I
 see a
  lot of jersey users have this problem and it seems like jersey-json is
 not
  available, but jersey-json is packaged in the jar.
 
  Anyone else have this problem?
 
  @John maybe you have better practices? I would like to have blend4j in
 my
  maven dependencies, but not build my own nexus.
 
  Thanks,
 
  Eric
 
 
  --
  Central Veterinary Institute of Wageningen UR (CVI)
  Department of Infection Biology
  PO box 65, 8200 AB Lelystad, NL
  Visiting address: ASG, Edelhertweg 15, 8219 PH Lelystad
 
  Tel:  +31-(0)320-293391
  Fax: +31-(0)320-238153
  E-mail: eric.ku...@wur.nl
  Web: http://www.cvi.wur.nl




 --
 Central Veterinary Institute of Wageningen UR (CVI)
 Department of Infection Biology
 PO box 65, 8200 AB Lelystad, NL
 Visiting address: ASG, Edelhertweg 15, 8219 PH Lelystad

 Tel:  +31-(0)320-293391
 Fax: +31-(0)320-238153
 E-mail: eric.ku...@wur.nl
 Web: http://www.cvi.wur.nl

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] [galaxy-user] Connecting to new Galaxy Cloudman by FTP

2013-07-15 Thread Mohammad Heydarian
Hey Nate,
Thanks for the response and instructions.

I understand the last three steps of your protocol, but the first step is
difficult for me to understand. I'm guessing that, 1. Set 'use_pbkdf2 =
False' in universe_wsgi.ini anywhere in the [app:main] section, is telling
me to change a setting of Cloudman by command line. This is generally where
we get stuck in using Cloudman, because we aren't familiar with command
line (we get cold sweats and light palpitations) and are weary of making
changes to the Cloudman code.

I would ask our IT guys for help, but their expertise ends at updating
Office tools. I would bother a programmer or bioinformatician, but most of
them are so busy you need a formal collaboration to get on their radar. I
would ask people who vaguely know command line for help, but most of the
time their knowledge of command line is just higher than mine and we end up
troubleshooting an issue neither of us can really grasp.

So, is there a webcast, or video, or slideshow, that can show a newbie how
to command line into Cloudman and make the changes you outline in step 1?



Cheers,
Mo Heydarian




On Mon, Jul 15, 2013 at 4:22 PM, Nate Coraor n...@bx.psu.edu wrote:

 On Jul 8, 2013, at 3:50 PM, Mohammad Heydarian wrote:

  Hi,
  I am having trouble setting up a FTP connection with the recently
 released version of Galaxy Cloudman (ami-118bfc78).
 
  I have instantiated the new version of Galaxy Cloudman with CloudLaunch
 and also through the AWS EC2 wizard (using the same security group settings
 as the previous versions) and neither instance will connect to my FTP
 connection.
 
  Has anyone else had this problem? Does anyone know what is preventing
 the FTP connection?
 
  Any help would be greatly appreciated.

 Hey Mo,

 This may be a case of the new password hashing algorithm's incompatibility
 with the provided ProFTPD config.  Could you try the following:

 1. Set 'use_pbkdf2 = False' in universe_wsgi.ini anywhere in the
 [app:main] section
 2. Restart Galaxy
 3. Reset your password in the Galaxy UI
 4. Test FTP again

 --nate

 
 
  Cheers,
  Mo Heydarian
 
  ___
  Please keep all replies on the list by using reply all
  in your mail client.  To manage your subscriptions to this
  and other Galaxy lists, please use the interface at:
   http://lists.bx.psu.edu/
 
  To search Galaxy mailing lists use the unified search at:
   http://galaxyproject.org/search/mailinglists/


 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

   http://lists.bx.psu.edu/

 To search Galaxy mailing lists use the unified search at:

   http://galaxyproject.org/search/mailinglists/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

[galaxy-dev] Galaxy-less Tool Installing

2013-07-15 Thread John Chilton
  One of my goals for the GCC was to sell the idea that tool shed
repositories need to be installable without a database present. I
talked with James Taylor and Enis Afgan about this idea briefly and they seemed
to believe this was a good idea - I kept meaning to discuss it with Greg but I
never got a good opportunity. Though in past Greg has made this sound
potentially doable and has never objected to the goal overtly.

  I have two specific use cases in mind (CloudBioLinux and LWR), but perhaps the
higher-level justification is something along the lines that a lot of effort
from Greg and others (Dave, Bjorn, Peter, Nate) has gone into building a modular
dependency system that could very easily be leveraged by applications other than
Galaxy, so the extra steps that could be taken to make this possible should to
make the codebase as broadly useful and to encourage adoption. The Galaxy
community could benefit from other applications potentially utilizing and
populating the tool shed and Galaxy tool developers would be further incentized
to write good, modular dependencies and publish them to the tool shed.

  A high-level task decomposition would be something like this:

  1. Rework installing tool shed repositories to not require a database. A kind
of messy way to do this might be adding a use_database flag throughout. A
cleaner way might be to use allow the core functionality to work with callbacks
or plugins that performed the database interactions.

  2. Separate the core functionality out of the Galaxy code base entirely into
a reusable, stand-alone library.

  I would love buy in from the Galaxy team on item 2 above, but it is not
strictly needed for my goals - I imagine I could write a script to pull it out
Galaxy and build the library automatically or even just have the Galaxy codebase
present when using Galaxy-less tool shed dependencies.

  Buy in on item 1 by the Galaxy team (specifically Greg and Dave B.)
however is needed, are there any objections to this idea? Do you have any broad
advice on how to approach this to ensure the changes make sense, work with your
long term vision, and end up in Galaxy?

  Of all the things on my TODO list for the next year, this is probably the most
potentially broadly interesting to this weeks BOSC codefest attendees, so I was
going to attempt to sell this as something to work on. The sales pitch would
include building a little tool shed version of the module command -
http://linux.die.net/man/1/module to demonstrate this work and have something
immediately useful produced.

  The idea would be to create a command-line tool for utilizing tool shed
dependencies.

  # Unlike standard module, install procedure is available. Probably could
  # default to main tool shed and latest installable revision

  % tsmodule repo:install galaxyp/tint
  % tsmodule repo:install toolshed.g2.bx.psu.edu/galaxyp/tint/ab43b5ba7a4e

  # module lets you list packages, I guess tool shed version would need
  # repository and package listings:

  % tsmodule repo:list
  toolshed.g2.bx.psu.edu/galaxyp/tint/ab43b5ba7a4e
  % tsmodule package:list
  tint_proteomics_scripts/1.19.19/galaxyp/tint/ab43b5ba7a4e

  # Finally, a use command would source the env.sh script and make dependency
  # available in the command-line (might require starting new shell?):

  % tsmodule package:use tint_proteomics_scripts
  % tsmodule package:use tint_proteomics_scripts/1.19.19
  % tsmodule package:use
int_proteomics_scripts/1.19.19/galaxyp/tint/ab43b5ba7a4e

  # use apps that would be available to tools with valid requirements tags.
  % iQuantCLI

  This would be different from using the API scripts because there would be no
API, Galaxy instance, or Galaxy database involved - just the Galaxy code. If
this was able to split into its own Python library, one could imagine even
allowing something like tsmodule to be installable right from pip and
recursively fetch a toolshed_client library or something like that.
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/


Re: [galaxy-dev] Galaxy-less Tool Installing

2013-07-15 Thread Michael Cotterell
 This would be different from using the API scripts because there would be
 no
 API, Galaxy instance, or Galaxy database involved - just the Galaxy code.
 If
 this was able to split into its own Python library, one could imagine even
 allowing something like tsmodule to be installable right from pip and
 recursively fetch a toolshed_client library or something like that.


I especially like this idea. I wasn't at GCC, but my boss was, so you may
have seen some of the stuff I'm working on with Web Services.

Right now, the transition to the tool shed it awesome, except for when it
comes time to actually run the tools for adding web services. You see right
now, the tool my colleagues and I have developed dynamically creates tool
config files based on WSDLs and WADLs. The problem is, at the moment, our
tool actually edits the tools_config.xml file directly in order to add the
web service operation tools to galaxy. Your idea, if I understand it
correctly, would mean that it might be possible for my tool to utilize this
other method of installation in order to add/remove the generated tools to
Galaxy.

I'd definitely like to hear more about this.

-- 
Sincerely,
Michael E. Cotterell

Ph.D. Student in Computer Science, University of Georgia
Graduate RA  TA, University of Georgia
mepcotter...@gmail.com
mepc...@uga.edu
m...@cs.uga.edu
http://michaelcotterell.com/
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

Re: [galaxy-dev] Galaxy-less Tool Installing

2013-07-15 Thread Greg Von Kuster
Hi John,

It's really too bad that we didn't find time to discuss this in person at the 
GCC.  Until now, I've not heard from anyone that installation from the tool 
shed without requiring a Galaxy database is important, so I'm lacking some 
context on this (I assume your statement without a database present refers to 
the Galaxy database).

Some concerns that immediately pop into my mind are the following - this is not 
a complete list.

1) How do you ensure dependency relationships between installed repositories?

2) How do you manage locating installed repository contents?

3) How do you manage the current state of an installed repostiroy and determine 
if it can be updated?

4) How do you manage tool version lineage for installed repositories that 
contain tools (this plays an importnt role in ensuring reproducibility)?

5) How do you maintain the state of an installed repository, enabling it to be 
repaired if it or any of it's dependencies are in an error state?

Since I've never considered re-engineering the tool shed installation process 
so that it would function in an environment without a Galaxy database, I'm not 
sure how much effort would need to go into doing so, or where to start.  I'll 
have to think about this for a while.

Greg Von Kuster

On Jul 15, 2013, at 7:27 PM, John Chilton chil...@msi.umn.edu wrote:

 One of my goals for the GCC was to sell the idea that tool shed
 repositories need to be installable without a database present. I
 talked with James Taylor and Enis Afgan about this idea briefly and they 
 seemed
 to believe this was a good idea - I kept meaning to discuss it with Greg but I
 never got a good opportunity. Though in past Greg has made this sound
 potentially doable and has never objected to the goal overtly.
 
 I have two specific use cases in mind (CloudBioLinux and LWR), but perhaps the
 higher-level justification is something along the lines that a lot of effort
 from Greg and others (Dave, Bjorn, Peter, Nate) has gone into building a 
 modular
 dependency system that could very easily be leveraged by applications other 
 than
 Galaxy, so the extra steps that could be taken to make this possible should to
 make the codebase as broadly useful and to encourage adoption. The Galaxy
 community could benefit from other applications potentially utilizing and
 populating the tool shed and Galaxy tool developers would be further 
 incentized
 to write good, modular dependencies and publish them to the tool shed.
 
 A high-level task decomposition would be something like this:
 
 1. Rework installing tool shed repositories to not require a database. A kind
 of messy way to do this might be adding a use_database flag throughout. A
 cleaner way might be to use allow the core functionality to work with 
 callbacks
 or plugins that performed the database interactions.
 
 2. Separate the core functionality out of the Galaxy code base entirely into
 a reusable, stand-alone library.
 
 I would love buy in from the Galaxy team on item 2 above, but it is not
 strictly needed for my goals - I imagine I could write a script to pull it out
 Galaxy and build the library automatically or even just have the Galaxy 
 codebase
 present when using Galaxy-less tool shed dependencies.
 
 Buy in on item 1 by the Galaxy team (specifically Greg and Dave B.)
 however is needed, are there any objections to this idea? Do you have any 
 broad
 advice on how to approach this to ensure the changes make sense, work with 
 your
 long term vision, and end up in Galaxy?
 
 Of all the things on my TODO list for the next year, this is probably the most
 potentially broadly interesting to this weeks BOSC codefest attendees, so I 
 was
 going to attempt to sell this as something to work on. The sales pitch would
 include building a little tool shed version of the module command -
 http://linux.die.net/man/1/module to demonstrate this work and have something
 immediately useful produced.
 
 The idea would be to create a command-line tool for utilizing tool shed
 dependencies.
 
 # Unlike standard module, install procedure is available. Probably could
 # default to main tool shed and latest installable revision
 
 % tsmodule repo:install galaxyp/tint
 % tsmodule repo:install toolshed.g2.bx.psu.edu/galaxyp/tint/ab43b5ba7a4e
 
 # module lets you list packages, I guess tool shed version would need
 # repository and package listings:
 
 % tsmodule repo:list
 toolshed.g2.bx.psu.edu/galaxyp/tint/ab43b5ba7a4e
 % tsmodule package:list
 tint_proteomics_scripts/1.19.19/galaxyp/tint/ab43b5ba7a4e
 
 # Finally, a use command would source the env.sh script and make dependency
 # available in the command-line (might require starting new shell?):
 
 % tsmodule package:use tint_proteomics_scripts
 % tsmodule package:use tint_proteomics_scripts/1.19.19
 % tsmodule package:use
 int_proteomics_scripts/1.19.19/galaxyp/tint/ab43b5ba7a4e
 
 # use apps that would be available to tools with valid requirements tags.
 % iQuantCLI