Re: [Bioc-sig-seq] `+` for GenomeData and coverage from several lanes

Simon Anders Tue, 30 Jun 2009 10:51:37 -0700

Hi

Patrick Aboyoun wrote:

Simon,
Could you provide some profiling information to show where thebottlenecks are?

I don't know if there is really a clear bottleneck. 9 minutes tocalculate the coverage of 29 mio reads is 20 seconds per mio reads; thisis probably what the coverage function always needed. So, in the codegiven in my mail, the summing up of the GenomeData objects is justawkward but not a performance penalty.


> I am also wondering if I should be building up the
> functionality for RleList, which could have `+` and other Math
> operations. We have a lot of classes in the Sequence space and it is
> not clear yet which classes are going to be part of the winning
> solution.

I'd say that this is the main issue. I discover new classes every day.You just mentioned 'RleList', Michael mentions 'GenomeDataList', andMartin has another way to go again.

I'm sorry to say that, at least for me, this has become hopelesslyconfusing, and I imagine that many other users fell the same. You writethat "it is not clear yet which classes are going to be part of thewinning solution" and I completely agree that it makes more sense tohave a few good classes rather than adding functionality to any class ondemand. So, maybe don't bother with a `+` operation for now.


Best regards
  Simon

_______________________________________________
Bioc-sig-sequencing mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

Re: [Bioc-sig-seq] `+` for GenomeData and coverage from several lanes

Reply via email to