Hi Nathaniel,

please scroll down to see how you can address these problems,

Nathaniel Street wrote:
Hi

I am trying to setup a custom mart db etc and have run into some problems that I can't find solutions to in the documentation. Any help would be much appreciated - even just pointers of what to read and where or links to see others config files etc.

My mart install can be accessed at http://130.239.72.85:9002/biomart/martview

1) I have uploaded sequence data for my genome (one 'column' for CDS and one for protein). Using MartEditor, how do I specify that I want this exported using the FASTA formatter? Also, how do I control what goes in the header? Using XML examples with WebExample.pl works but only if the sequence is the first attribute in the list.

Firstly, I would recommend you to look at ensembl mart and sequence mart, which are publicly available. Connect through MartEditor using the credentials:
databaseType: mysql
host: martdb.ensembl.org
port: 5316
use anonymous
pass: <leave it empty>
database & schema: ensembl_mart_50 OR sequence_mart_50

With ensembl, the sequence itself comes from sequence_mart_50 and the coordinates are used from ensembl_mart_50. This is all handled by Exportables and Importables. If you have CDS or Proteins already processed and stored in once column, then I am afraid you see the right behaviour from software point of view that sequence should be the first attribute in user selection. If you want to hack this based on the name of the attribute e.g if (attName == CDS), feel free to do this in lib <http://cvs.sanger.ac.uk/cgi-bin/viewcvs.cgi/biomart-perl/lib/?root=BioMart#dirlist> / BioMart <http://cvs.sanger.ac.uk/cgi-bin/viewcvs.cgi/biomart-perl/lib/BioMart/?root=BioMart#dirlist> / Formatter <http://cvs.sanger.ac.uk/cgi-bin/viewcvs.cgi/biomart-perl/lib/BioMart/Formatter/?root=BioMart#dirlist> / FASTA.pm

You can also write a custom formatter, and leave it in this directory. That will be your own formatter.



2) When I use the XML generated by MartView, none of the filters I have work. I just get a blank file returned (XML pasted below).
How are you using the MartView generated XML where the filters ar'nt working. Please send me your client.


3) Is there any extra info I can read about importables and exportables as I don't really understand form what is in the user manual (I'm a biologist trying to learn all this, so a newbie to db etc.).
Ensembl mart example would help you on this. Exp/Imps are matching pairs which establish linking between two local or remote datasets which may belong to same or different marts. Let me know if you need more details on this.

4) I only have filters available for the __main table, not any of the __dm tables when I use MartEditor. Any common mistakes that can lead to this?
sorry, are you getting some errors around this issue ? unless its not point 2 you mentioned above.


5) Is it possible to view the public marts in MArtEditor so I can see how others have things working that I want? I'm particularly interested in GRAMENE GENES as they have everything working that I currently want.

Yes certainly possible with MartEditor. With Gramene, I am not sure if you can. Gramene is derived from Ensembl data-model anyways. You can look at ensembl. cc'ing to Will and Richard, between them, they manage Gramene.

A lot of questions I know. Help with any would be great.

Hope this helps.

Cheers
Syed

Thanks

Nat Street

-- XML with non-working filters (remove filter lines and it works) --
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE Query>
<Query virtualSchemaName = "default" formatter = "TSV" header = "0" uniqueRows = "0" count = "" datasetConfigVersion = "0.6" > <Dataset name = "locations" interface = "default" >
        <Filter name = "strand_103" value = "+"/>
        <Filter name = "lg_103" value = "LG_I"/>
        <Attribute name = "gm_101" />
        <Attribute name = "lg_103" />
        <Attribute name = "start_103" />
        <Attribute name = "stop_103" />
        <Attribute name = "protid_101" />
        <Attribute name = "strand_103" />
    </Dataset>
</Query>


======================================
Syed Haider.
EMBL-European Bioinformatics Institute
Wellcome Trust Genome Campus, Hinxton,
Cambridge CB10 1SD, UK.
======================================


Reply via email to