Hi Sebastien,
     They were Illumina 2x100 PE reads. However we did switch to a 
higher kmer for that assembly (63, I think.)
     We do have a SOLiD, though. SAET doesn't convert color space reads 
to sequence space. It examines all the reads to find errors that are 
correctable.
     Generally to convert a color space sequence to sequence space you 
need a "key" base prefixing the sequence.
Phillip
PS The newer SOLiDs -- the 5500XLs have a new chemistry, "ECC", that 
will allow direct conversion of the reads to sequence space without loss 
of accuracy. But those are just getting installed now.

On 6/22/2011 1:10 PM, Sébastien Boisvert wrote:
> Quick question:
>
> By looking at your CoverageDistribution.txt file, I can see that there
> are numerous low-coverage erroneous k-mers.
>
> I would say that these reads are not Illumina reads and not 454 reads
> although I might be wrong.
>
> Therefore, are these color-space reads that you converted to fasta or
> fastq using SAET from SOLiD ?
>
>
> Just asking.
>
> Also, if you happen to work with color space data, do you know a way to
> convert color-space contigs into nucleotide-space contigs ?
>
>
>                         Sébastien
>
> On Mon, 2011-06-20 at 06:59 -0400, Phillip San Miguel wrote:
>> Hi Sébastien,
>> Here it is:
>>
>> http://pastebin.com/Mv5ziDVk
>>
>> Regards,
>> Phillip
>>
>> On 6/17/2011 12:50 PM, Sébastien Boisvert wrote:
>>> What is the content of the CoverageDistribution.txt file with the
>>> smoothing code ?
>>>
>>>
>>>
>>> On Thu, 2011-06-16 at 11:32 -0400, Rick Westerman wrote:
>>>> Sebastian:
>>>>
>>>>      Now that I look at the source closer, the version number in the 
>>>> v.1.6.1-rc1 link is set to 1.6.0 however the code itself looks like you 
>>>> have put in smoothing therefore I am confident that we have the latest and 
>>>> greatest.  I've recompiled a couple of times, including a time when I 
>>>> changed the version number just to be 100% sure that we are running the 
>>>> latest code.  However our over-all problem of short contigs continues:
>>>>
>>>>     The 'LibraryStatistics' file looks like:
>>>>
>>>> -------------------
>>>>
>>>>    File: ../FastQ/000617_TL3360_both.fastq
>>>>     NumberOfSequences: 13001302
>>>>
>>>> Total: 13001302
>>>>
>>>> NumberOfPairedLibraries: 1
>>>>
>>>> LibraryNumber: 0
>>>>    InputFormat: Interleaved,Paired
>>>>    DetectionType: Automatic
>>>>    File: ../FastQ/000617_TL3360_both.fastq
>>>>     NumberOfSequences: 13001302
>>>>    AverageOuterDistance: 1018
>>>>    StandardDeviation: 963
>>>>    DetectionFailure: Yes
>>>>
>>>> -------------------
>>>>
>>>>
>>>>      The old file looks like:
>>>>
>>>> -------------------
>>>>
>>>>    File: ../FastQ/000617_TL3360_both.fastq
>>>>     NumberOfSequences: 13001302
>>>>
>>>> Total: 13001302
>>>>
>>>> NumberOfPairedLibraries: 1
>>>>
>>>> LibraryNumber: 0
>>>>    InputFormat: Interleaved,Paired
>>>>    DetectionType: Automatic
>>>>    File: ../FastQ/000617_TL3360_both.fastq
>>>>     NumberOfSequences: 13001302
>>>>    AverageOuterDistance: 385
>>>>    StandardDeviation: 628
>>>>    DetectionFailure: Yes
>>>>
>>>> -------------------
>>>>
>>>>     So obviously we are getting a different (and perhaps better) distance 
>>>> distribution.
>>>>
>>>>     I am going to try some other parameters out but any suggestions would 
>>>> be useful
>>>>
>>>> Thanks,
>>>> -- Rick
>>>>
>>>>
>>>>
>>>> ----- Original Message -----
>>>>> Sebastian:
>>>>>
>>>>> The download link for v1.6.1-rc1 brings up 1.6.0 code. At least that
>>>>> what the version says. We are also still having problems with our
>>>>> sequences. Could you look into this problem. Perhaps create another
>>>>> download?
>>>>>
>>>>> Thanks,
>>>>>
>>>>> --
>>>>> Rick Westerman
>>>>> [email protected]
>>>>>
>>>>> Bioinformatics specialist at the Genomics Facility.
>>>>> Phone: (765) 494-0505 FAX: (765) 496-7255
>>>>> Department of Horticulture and Landscape Architecture
>>>>> 625 Agriculture Mall Drive
>>>>> West Lafayette, IN 47907-2010
>>>>> Physically located in room S049, WSLR building
>>>
>
>


------------------------------------------------------------------------------
Simplify data backup and recovery for your virtual environment with vRanger.
Installation's a snap, and flexible recovery options mean your data is safe,
secure and there when you need it. Data protection magic?
Nope - It's vRanger. Get your free trial download today.
http://p.sf.net/sfu/quest-sfdev2dev
_______________________________________________
Denovoassembler-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/denovoassembler-users

Reply via email to