Hi,

I am working on creating a workflow that gets all of the nucleotide fasta sequences for species of mayflies and conducts alignment and phylogenetic analyses on them.

In my local database I entered all of the species that I want to analyze and I do an esearch query for each one: Ephemera strigata"[Organism]

I am using this webservice: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/soap/eutils.wsdl
run_eSearch

This gives me back several lists with genes for each species.  The problem is that the result does not show me the name of the gene in any standardized way.  I may see 18s ribosomal mentioned in the description but I don't see a standardized way of knowing which of these genes are homologous across species.
My goal is to find all of the genes that appear in say 3 or more of the species I am analyzing and construct a dataset with those.  Is there another database that I should be querying if I want to find a list homologous genes for a list of species?  Or, is there some way to match these up to find out which are homologous?
------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
taverna-users mailing list
[email protected]
[email protected]
Web site: http://www.taverna.org.uk
Mailing lists: http://www.taverna.org.uk/taverna-mailing-lists/

Reply via email to