Re: [Bioc-sig-seq] large BAM files and large BED files

Michael Lawrence Fri, 16 Sep 2011 14:11:44 -0700

It sounds like you're trying to use BED as an alternative to BAM? Probably
not a good idea, especially at this scale. Why are you aiming for a
GenomeData? A GappedAlignments might be more appropriate. See
GenomicRanges::readGappedAlignments() for bringing a BAM into a
GappedAlignments.


This page might help:
http://bioconductor.org/help/workflows/high-throughput-sequencing/#sequencing-resources

But it could really be improved.

Michael

On Fri, Sep 16, 2011 at 1:44 PM, Rene Paradis <rene.para...@genome.ulaval.ca
> wrote:

> Hello,
>
> I am experiencing a problem regarding the load in memory of bed files of
> 30 GB. my function read.table unleash the error : Error in unique(x) :
> length xxxxxx is too large for hashing.
>
> this is generated by the function MKsetup of the unique.c file. Even by
> increasing by 10 000x the value, the error persists. I believe the
> function pushes more data in ram, but I am not sure this is the good way
> to focus on.
>
> Ultimately, I would like to produce a GenomeData object from either a
> BAM file or a bed file.
>
> has someone ever worked with very very big BAM files (about 30 GB)
>
> thanks
>
> Rene paradis
>
> _______________________________________________
> Bioc-sig-sequencing mailing list
> Bioc-sig-sequencing@r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing
>

        [[alternative HTML version deleted]]

_______________________________________________
Bioc-sig-sequencing mailing list
Bioc-sig-sequencing@r-project.org
https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing

Re: [Bioc-sig-seq] large BAM files and large BED files

Reply via email to