Hi Mike,
Unfornately, i've got any error message. All the job run without 
complains about such a problem.
But thank's you for investigate with this special case.


Mike Cariaso wrote:
> The problem occurs when a workers sees that no GIs apply to it, and then 
> decides that it should do no GI filtering. Nodes which match some GIs, 
> stil respect the filter. The problem occurs in the ncbi toolbox, not the 
> pure mpiBLAST code, since it assumes that running on a single node with 
> a filter that matches none of the GIs is illogical.
> 
> Please confirm that your runs produce this message on STDERR:
> BlastCreateVirtualOIDList: Missing oidlists to attach virtual_oidlist
> 
> If so the problem occurs in ncbi/tools/blastool.c at line 2727
>     /* attach mask to appropriate place */
>     if (oidlist_forall_rdfp && real_ngis > 0) {
>         rdfp_chain->oidlist = virtual_oidlist;
>     } else {
>         if (mask_rdfp)
>             mask_rdfp->oidlist = virtual_oidlist;
>         else {
>             /* Should never happen */
>             ErrPostEx(SEV_ERROR, 0, 0, "BlastCreateVirtualOIDList: Missing "
>                     "oidlists to attach virtual_oidlist");
>             OIDListFree(virtual_oidlist);
>             return NULL;
>         }
>     }
> 
> 
> 
> Unfortunately, simply replacing the  /* Should never happen */ block 
> with another
>              mask_rdfp->oidlist = virtual_oidlist;
>    or
>              mask_rdfp->oidlist = NULL;
> seems to result in crashes when there are no GIs for a particular worker.
> 
> 
> Presumably the worker can be told to look for this case, and return a 
> correct "No results found" response to the master. Doing so, is a bit 
> beyond my abilities tonight, and before I attempt anything of the sort 
> I'd welcome input from others. While the problem is most visible in 
> Jerome's extreme case, this could be an issue any time the filter 
> removes more GIs than are present on a single worker. That pathological 
> case is certainly rare, but not impossible.
>           
> 
> 
> 
>  
> --
> Mike Cariaso * Bioinformatics Software * http://www.cariaso.com
> 
> 
> ----- Original Message ----
> From: Mike Cariaso <[EMAIL PROTECTED]>
> To: [email protected]
> Sent: Wednesday, March 14, 2007 8:57:46 PM
> Subject: Re: [Mpiblast-users] Strange issue in mpiBlast limiting by "Gi"
> 
> 
> By having so few GIs, it becomes likely that at least one of the workers 
> is given 0 sequences to analyze. This seems like a rather special case, 
> and I would imagine it is where the bug hunt should begin. I'll try to 
> do a few tests this evening.
> 
>  
> --
> Mike Cariaso * Bioinformatics Software * http://www.cariaso.com
> 
> 
> ----- Original Message ----
> From: Jerome <[EMAIL PROTECTED]>
> To: [email protected]
> Sent: Wednesday, March 14, 2007 8:43:23 PM
> Subject: Re: [Mpiblast-users] Strange issue in mpiBlast limiting by "Gi"
> 
> Hi Mike,
> Yes, i understand our remark, and you are rigth in the way that the
> number of sequence's reference is very big.
> But in my special case, the end user are using genome sequence for
> bacteria, that have just one reference (or two for ribozomal ciclic
> sequence) for all of the genome. And, searching for just two or three
> special bacterias, i found this issue.
> With the other databases as nt, and i've never use filter of 2 or 3 Gi.
> Take my question more like a desie to understand why this problem.
> Best regards.
> 
> Mike Cariaso wrote:
>  > Jerome,
>  >
>  > What you are describing may be true (if so, its worth pursing), but my
>  > your usage sounds very strange to me.
>  >
>  > Having 7 fragments would be reasonable for a small cluster with 7 nodes.
>  > But for your GI filter file to only have 7 lines, would mean that you
>  > would only be searching 7 sequences. When I search a database such as
>  > NCBI's 'nt', my filterfile usually has more than 4,000,000 GIs.
>  >
>  > The GI filter is used to include/exclude individual sequences when
>  > searching your blastable database. This is done with the "-l
>  > filterfilename "(lowercase L) command line switch, where filterfilename
>  > is (usually) a text file of GI #s, one per line. Typically a blastable
>  > database will have thousands to millions of sequences, and therefore
>  > thousands to millions of GIs.
>  >
>  > The number of fragments refers to the number of pieces you split your
>  > blastable database into. Commonly this will be the same size as the
>  > number of workers in your cluster, or perhaps a multiple such as 2x or
>  > 3x larger.
>  >
>  > In most cases the number of sequences in the database is many thousands
>  > of times larger than the number of fragments. Otherwise each worker
>  > would have very little work to do, and mpiBLAST probably wouldn't make
>  > much sense.
>  >
>  > Does it sound like you and I are describing the same thing?
>  >
>  > --
>  > Mike Cariaso * Bioinformatics Software * http://www.cariaso.com
>  >
>  >
>  > ----- Original Message ----
>  > From: Jerome <[EMAIL PROTECTED]>
>  > To: [email protected]
>  > Sent: Wednesday, March 14, 2007 7:55:28 PM
>  > Subject: [Mpiblast-users] Strange issue in mpiBlast limiting by "Gi"
>  >
>  > Hi all,
>  > ive just instal a Mpiblast to running on our cluster. And it's a verya
>  > fast program that i can thank's you!
>  > For justa few day, some user want's to limit the search in a database of
>  > bacterians genome. And i could notice that, if the number of "gi"
>  > entries in the filter file is less or equivalent of the number of
>  > fragments of the initial database, i don't matter about this filter file.
>  > I mean that, for example, if i format the databases in 7 fragments, and
>  > ask with a filter file size of >7 lines, i don't use it and answer the
>  > similar response as if i don't use this filter.
>  > Could someone help me in this issue?
>  >
>  > --
>  > -- Jérôme
>  > Le vase donne une forme au vide, et la musique au silence.
>  >     (Georges Braque)
>  >

-- 
-- Jérôme
En essayant continuellement, on finit par réussir. Donc:
Plus ça rate, plus on a de chance que ça marche.
        (Devise Shadok)

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Mpiblast-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/mpiblast-users

Reply via email to