Hi Julian,
Futhermore, For URL linking jsut as Ensembl does with BioMart, you can
use the format exaplained and examples below. By default it supports
HTML format only.


====== URL BASED QUERY FORMAT

?VIRTUALSCHEMANAME=<VSName>
&ATTRIBUTES=<datasetInternalName>.<Interface>.<AttributePageInternalName>.<AtrributeInternalName>."<OPTIONAL:
 comma separated list of values-this deals with attributeFilters such upstream 
downstream Sequence Flanks>"
&FILTERS=<datasetInternalName>.<Interface>.<FilterPageInternalName>.<FilterInternalName>."<comma
 separated list of values-this deals with attributeFilters such upstream 
downstream Sequence Flanks>"

====== NOTES

- All params must be supplied unless they are OPTIONAL.
- &FILTERS: is OPTIONAL
- When using FilterInternalName for Container type filters 
        e.g DropDownMenu with Only/exculded choice  [ENSEMBL HUMAN ID LIST
FILTERS]
        OR DropDownMenu with TEXTField for values   [ENSEMBL HUMAN ID list
limit]
        the FilterInternalName should be the internalName of option in dropdown
and 
        value in double quotes would be internal name only/excluded OR any
value
        should you wish to appear in text field. This is just as in XML based
queries.
- incase you don't have a virtualSchema in registry, then use default
- incase yow don't have any interface for a dataset, use default


======= EXAMPLES

?VIRTUALSCHEMANAME=default&ATTRIBUTES=msd.default.feature_page.pdb_id&FILTERS=msd.default.filters.experiment_type."Electron
 diffraction,Electron microscopy"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=msd.default.feature_page.pdb_id|
msd.default.feature_page.experiment_type|
msd.default.feature_page.assembly_code&FILTERS=msd.default.filters.experiment_type."Electron
 diffraction,Electron microscopy"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.chromosome_name."4"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=mmusculus_gene_ensembl.default.feature_page.strand|mmusculus_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=mmusculus_gene_ensembl.default.filters.chromosome_name."Y"


?VIRTUALSCHEMANAME=default&ATTRIBUTES=msd.default.feature_page.assembly_code|hsapiens_gene_ensembl.default.structure.biotype|msd.default.feature_page.pdb_id|hsapiens_gene_ensembl.default.structure.exon_stable_id|hsapiens_gene_ensembl.default.structure.gene_stable_id|msd.default.feature_page.experiment_type&FILTERS=msd.default.filters.experiment_type."Electron
 diffraction,Electron microscopy"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=msd.default.feature_page.assembly_code|hsapiens_gene_ensembl.default.feature_page.strand|msd.default.feature_page.pdb_id|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id|msd.default.feature_page.experiment_type&FILTERS=msd.default.filters.experiment_type."Electron
 diffraction,Electron 
microscopy"|hsapiens_gene_ensembl.default.filters.chromosome_name."4"


------------- with AttributeFilters Upstream downstream flanks

?VIRTUALSCHEMANAME=default&ATTRIBUTES=msd.default.feature_page.assembly_code|hsapiens_gene_ensembl.default.sequences.gene_flank|hsapiens_gene_ensembl.default.sequences.upstream_flank."140"|msd.default.feature_page.pdb_id|hsapiens_gene_ensembl.default.sequences.biotype|hsapiens_gene_ensembl.default.sequences.gene_stable_id|msd.default.feature_page.experiment_type

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.sequences.gene_flank|msd.default.feature_page.assembly_code|hsapiens_gene_ensembl.default.sequences.upstream_flank."140"|msd.default.feature_page.pdb_id|hsapiens_gene_ensembl.default.sequences.biotype|hsapiens_gene_ensembl.default.sequences.gene_stable_id|msd.default.feature_page.experiment_type&FILTERS=hsapiens_gene_ensembl.default.filters.chromosome_name."4"


?VIRTUALSCHEMANAME=default&ATTRIBUTES=msd.default.feature_page.assembly_code|hsapiens_gene_ensembl.default.sequences.cdna|hsapiens_gene_ensembl.default.sequences.exon_stable_id|hsapiens_gene_ensembl.default.sequences.gene_stable_id


------------- with peptide

?VIRTUALSCHEMANAME=default&ATTRIBUTES=msd.default.feature_page.assembly_code|hsapiens_gene_ensembl.default.sequences.peptide|msd.default.feature_page.pdb_id|hsapiens_gene_ensembl.default.sequences.exon_stable_id|hsapiens_gene_ensembl.default.sequences.gene_stable_id


Queries with CONTAINER TYPE FILTERS filter

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.with_ccds."only"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.with_ipi."only"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.with_ipi."excluded"|hsapiens_gene_ensembl.default.filters.chromosome_name."4"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.biotype."miRNA,Mt_rRNA"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.band_start."p36.33"|hsapiens_gene_ensembl.default.filters.band_end."p31.3"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.encode_region."16:1:500000"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.chromosome_name."Y"


?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.start."4000"


?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.chromosome_name."4"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.pathology_term."neoplasia"


?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.pathology_term."neoplasia"|hsapiens_gene_ensembl.default.filters.chromosome_name."4"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id|msd.default.feature_page.assembly_code&FILTERS=hsapiens_gene_ensembl.default.filters.chromosome_name."4"|msd.default.filters.has_cath."only"

?VIRTUALSCHEMANAME=default&ATTRIBUTES=msd.default.feature_page.assembly_code|hsapiens_gene_ensembl.default.feature_page.strand|hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id&FILTERS=hsapiens_gene_ensembl.default.filters.chromosome_name."4"|msd.default.filters.has_cath."only"


Feel free to write back if you need further assistance.

Kind Regards
Syed



On Mon, 2007-03-12 at 09:56 +0000, Arek Kasprzyk wrote:
> On 12 Mar 2007, at 03:48, Julian Catchen wrote:
> 
> > Hello,
> >
> > Does anyone have any example XML queries they are using to poll the  
> > Ensembl biomart interface? I have gotten some simple examples working  
> > that pull down lists of ensembl IDs by using examples from the  
> > documentation. However, I can't seem to find any examples of how to  
> > query for FASTA formatted cDNA sequences or translations. Also, I  
> > can't find any documentation of how to request gzipped data.
> >
> 
> Hi Julian,
> please go  to www.biomart.org/biomart/martview, create your favourite  
> query using MView and click
>   XML button. This will give you the exact xml format required for your  
> web service query. In principle anything
> that you can do with MView you should be also able to do with  
> webservice XML. If not, that we need
> to fix it.
> 
> In order to invoke a formatter you simply need to add  
> 'formatter="FASTA' to your xml query.
> 
> For example the below query will give you peptides from chromosome 22  
> in FASTA format:
> 
> 
> <?xml version="1.0" encoding="UTF-8"?>
> <!DOCTYPE Query>
> <Query  virtualSchemaName = "default" Header = "1" count = ""  
> softwareVersion = "0.5" formatter="FASTA" >
> 
>          <Dataset name = "hsapiens_gene_ensembl" interface = "default" >
>                  <Attribute name = "peptide" />
>                  <Attribute name = "str_chrom_name" />
>                  <Attribute name = "gene_stable_id" />
>                  <Attribute name = "biotype" />
>                  <Filter name = "chromosome_name" value = "22"/>
>          </Dataset>
> </Query>
> 
> 
> 
> you can run this query using the webExample.pl script:
> 
> 
> http://cvs.sanger.ac.uk/cgi-bin/viewcvs.cgi/biomart-perl/scripts/ 
> webExample.pl?view=markup
> 
> 
> 
> 
> > I used to have all of these automated through simple URL GET queries  
> > that no longer seem to work with Ensembl post version 41.
> >
> 
> URL GET query still work but the format has changed. We have not yet  
> documented it properly.
> 
> I am cc-ing your email to mart-dev so someone from there will send you  
> a few examples
> 
> hope that helps,
> a.
> 
> 
> > Any pointers to examples or documentation for XML queries would be  
> > appreciated.
> >
> > Thanks,
> >
> > julian
> >
> 
> 
> ------------------------------------------------------------------------ 
> -------
> Arek Kasprzyk
> EMBL-European Bioinformatics Institute.
> Wellcome Trust Genome Campus, Hinxton,
> Cambridge CB10 1SD, UK.
> Tel: +44-(0)1223-494606
> Fax: +44-(0)1223-494468
> ------------------------------------------------------------------------ 
> -------
> 
> 
-- 
======================================
Syed Haider.
EMBL-European Bioinformatics Institute
Wellcome Trust Genome Campus, Hinxton,
Cambridge CB10 1SD, UK.
======================================

Reply via email to