[aroma.affymetrix] Exception Error when computing non-paired CBS: Exception: .. Too many non-finite values ......

sean nj Fri, 18 May 2012 11:17:42 -0700

Hi, it's me again :)
 
I have some level 2 TCGA agilent CGH 244A data, the text files of log2 
ratios. I followed vignettes ( <http://aroma-project.org/node/43>
http://aroma-project.org/node/43 <http://aroma-project.org/node/43 
and><http://aroma-project.org/node/43 and>
 and http://www.aroma-project.org/node/88) and created UGP files and data 
files (asb files) containing log2 CN ratios. But I got the error when I ran 
CBS computing by  > fit(sm, verbose=-10). Basically it complianed that 
there are too many non-finite values on chromosome 1 of assay sample 1:
      Array #1 ('MSK_00001_251469322729') of 6 on chromosome 1...
      Error in getRawCnData.CopyNumberChromosomalModel(this, ceList = 
ceList,  : 
  
      [2012-05-18 14:02:13] Exception: Something is wrong with the 
copy-number ratios of sample 
'MSK_00001_251469322729,S01_CGH-v4_91__GCN_V3,log2ratio,total' relative to 
reference '      
.average-signals-median-mad,78c9a8b043bc37fd29db00f3f674295c' on chromosome 
1. Too many non-finite values: 6721 (34.9% > 20.0%) out of 19246.
I also have one more question regarding CBS. Once I finish CBS computing, I 
end up with a text file such as:
 
         sample chromosome start stop mean count
      1 NA06991 1 61736 106013377 -0.002 65870
      2 NA06991 1 106019206 106022376 -1.675 1
      3 NA06991 1 106024056 149036525 -0.002 11462
      4 NA06991 1 149040066 149256692 -0.443 141
      5 NA06991 1 149259417 149436843 -0.144 36
 
Is there a way to map the original probe ID or gene IDs to segmentation 
result? I want to end up with a table that has a column for original probe 
ID, a column for the related CN ratio mean from CBS, a column for 
chromosome, and a column for chromosome start position.
 
Thanks a lot for the help!
 
Ying
 
 
> library(aroma.core)
> library(aroma.cn)
> chipType <- "HG-CGH-244A,TCGA_GBM_MSK"
> unf <- TextUnitNamesFile$byChipType(chipType)
> ugp <- getAromaUgpFile(unf)
> dataSet <- "TCGA_GBM_MSKCC"
> ds <- AromaUnitTotalCnBinarySet$byName(dataSet, chipType=chipType)
> print(ds)
AromaUnitTotalCnBinarySet:
Name: TCGA_GBM_MSKCC
Tags: 
Full name: TCGA_GBM_MSKCC
Number of files: 6
Names: MSK_00001_251469322729, MSK_00003_251469322731, 
MSK_00006_251469322732, ..., MSK_00010_251469322627 [6]
Path (to the first file): rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK
Total file size: 5.40 MB
RAM: 0.01MB
> sm <- CbsModel(ds)
Loading required package: DNAcopy
> print(sm)
CbsModel:
Name: TCGA_GBM_MSKCC
Tags: 
Chip type (virtual): HG-CGH-244A
Path: cbsData/TCGA_GBM_MSKCC/HG-CGH-244A
Number of chip types: 1
Sample & reference file pairs:
Chip type #1 ('HG-CGH-244A') of 1:
Sample data set:
AromaUnitTotalCnBinarySet:
Name: TCGA_GBM_MSKCC
Tags: 
Full name: TCGA_GBM_MSKCC
Number of files: 6
Names: MSK_00001_251469322729, MSK_00003_251469322731, 
MSK_00006_251469322732, ..., MSK_00010_251469322627 [6]
Path (to the first file): rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK
Total file size: 5.40 MB
RAM: 0.01MB
Reference data set/file:
<average across arrays>
RAM: 0.00MB
> fit(sm, verbose=-10)
Building tuple of reference sets...
 No reference available.
 Calculating average copy-number signals...
  Retrieving average unit signals across 6 arrays...
   Searching for an existing file...
    Paths:
    [1] "rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK"
    Filename: 
.average-signals-median-mad,78c9a8b043bc37fd29db00f3f674295c.asb
    Searching path #1 of 1...
     Path: rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK
     Pathname: 
rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK/.average-signals-median-mad,78c9a8b043bc37fd29db00f3f674295c.asb
     Found an existing file.
    Searching path #1 of 1...done
    Located pathname: 
rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK/.average-signals-median-mad,78c9a8b043bc37fd29db00f3f674295c.asb
   Searching for an existing file...done
   Loading existing data file...
   Loading existing data file...done
   AromaUnitTotalCnBinaryFile:
   Name: .average-signals-median-mad
   Tags: 78c9a8b043bc37fd29db00f3f674295c
   Full name: .average-signals-median-mad,78c9a8b043bc37fd29db00f3f674295c
   Pathname: 
rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK/.average-signals-median-mad,78c9a8b043bc37fd29db00f3f674295c.asb
   File size: 921.57 kB (943689 bytes)
   RAM: 0.00 MB
   Number of data rows: 235834
   File format: v1
   Dimensions: 235834x1
   Column classes: double
   Number of bytes per column: 4
   Footer: <createdOn>20120518 13:04:05 
EDT</createdOn><platform>Agilent</platform><chipType>HG-CGH-244A</chipType><srcDetails><nbrOfFiles>6</nbrOfFiles><checkSum>5629a7bc8e75693ad556a0d8c9f3417c</checkSum></srcDetails><params><meanName>median</meanName><sdName>mad</sdName></params>
   Platform: Agilent
   Chip type: HG-CGH-244A
   Number of units to be updated: 7
   Processing chunk...
     chr "Indices in chunk:"
     int [1:7] 93672 120223 160149 216186 234895 234902 234936
    Reading data...
    Reading data...done
    Estimating averages and standard deviations...
    Estimating averages and standard deviations...done
    Writing estimates...
    Writing estimates...done
   Processing chunk...done
  Retrieving average unit signals across 6 arrays...done
 Calculating average copy-number signals...done
Building tuple of reference sets...done
Using reference tuple:
AromaUnitTotalCnBinarySetTuple:
Name: TCGA_GBM_MSKCC
Tags: 
Chip types: HG-CGH-244A
AromaUnitTotalCnBinarySet:
Name: TCGA_GBM_MSKCC
Tags: 
Full name: TCGA_GBM_MSKCC
Number of files: 6
Names: .average-signals-median-mad, .average-signals-median-mad, 
.average-signals-median-mad, ..., .average-signals-median-mad [6]
Path (to the first file): rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK
Total file size: 5.40 MB
RAM: 0.01MB
RAM: 0.00MB
Extract DataFileMatrix...
 Array: 1
 Test data sets:
 AromaUnitTotalCnBinarySetTuple:
 Name: TCGA_GBM_MSKCC
 Tags: 
 Chip types: HG-CGH-244A
 AromaUnitTotalCnBinarySet:
 Name: TCGA_GBM_MSKCC
 Tags: 
 Full name: TCGA_GBM_MSKCC
 Number of files: 6
 Names: MSK_00001_251469322729, MSK_00003_251469322731, 
MSK_00006_251469322732, ..., MSK_00010_251469322627 [6]
 Path (to the first file): rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK
 Total file size: 5.40 MB
 RAM: 0.01MB
 RAM: 0.00MB
 Reference data sets:
 AromaUnitTotalCnBinarySetTuple:
 Name: TCGA_GBM_MSKCC
 Tags: 
 Chip types: HG-CGH-244A
 AromaUnitTotalCnBinarySet:
 Name: TCGA_GBM_MSKCC
 Tags: 
 Full name: TCGA_GBM_MSKCC
 Number of files: 6
 Names: .average-signals-median-mad, .average-signals-median-mad, 
.average-signals-median-mad, ..., .average-signals-median-mad [6]
 Path (to the first file): rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK
 Total file size: 5.40 MB
 RAM: 0.01MB
 RAM: 0.00MB
 Test data files:
 $`HG-CGH-244A`
 AromaUnitTotalCnBinaryFile:
 Name: MSK_00001_251469322729
 Tags: S01_CGH-v4_91__GCN_V3,log2ratio,total
 Full name: MSK_00001_251469322729,S01_CGH-v4_91__GCN_V3,log2ratio,total
 Pathname: 
rawCnData/TCGA_GBM_MSKCC/HG-CGH-244A,TCGA_GBM_MSK/MSK_00001_251469322729,S01_CGH-v4_91__GCN_V3,log2ratio,total.asb
 File size: 921.40 kB (943513 bytes)
 RAM: 0.00 MB
 Number of data rows: 235834
 File format: v1
 Dimensions: 235834x1
 Column classes: double
 Number of bytes per column: 4
 Footer: <createdOn>20120518 11:04:29 
EDT</createdOn><platform>Agilent</platform><chipType>HG-CGH-244A</chipType>
 Platform: Agilent
 Chip type: HG-CGH-244A
 
 attr(,"class")
 [1] "AromaUnitTotalCnBinaryFileList" "GenericDataFileList"            
"list"                          
Extract DataFileMatrix...done
Genomic-signal tags: S01_CGH-v4_91__GCN_V3,log2ratio,total
Reference tags: 216b0999d2e57831383f208942c5d3c6
Array #1 ('MSK_00001_251469322729') of 6 on chromosome 1...
Error in getRawCnData.CopyNumberChromosomalModel(this, ceList = ceList,  : 
  
[2012-05-18 14:02:13] Exception: Something is wrong with the copy-number 
ratios of sample 
'MSK_00001_251469322729,S01_CGH-v4_91__GCN_V3,log2ratio,total' relative to 
reference '.average-signals-median-mad,78c9a8b043bc37fd29db00f3f674295c' on 
chromosome 1. Too many non-finite values: 6721 (34.9% > 20.0%) out of 19246.
  at #06. getRawCnData.CopyNumberChromosomalModel(this, ceList = ceList, 
              refList = rfList, chromosome = chromosome, ..., estimateSd = 
FALSE, 
              verbose = less(verbose))
          - getRawCnData.CopyNumberChromosomalModel() is in environment 
'aroma.core'
  at #05. getRawCnData(this, ceList = ceList, refList = rfList, chromosome 
= chromosome, 
              ..., estimateSd = FALSE, verbose = less(verbose))
          - getRawCnData() is in environment 'aroma.core'
  at #04. extractRawCopyNumbers.CopyNumberChromosomalModel(this, array = 
array, 
              chromosome = chr, maxNAFraction = maxNAFraction)
          - extractRawCopyNumbe
In addition: Warning messages:
1: In log(M, base = 2) : NaNs produced
2: In log(A, base = 2) : NaNs produced
Array #1 ('MSK_00001_251469322729') of 6 on chromosome 1...done
> 
> sessionInfo()
R version 2.15.0 (2012-03-30)
Platform: x86_64-pc-mingw32/x64 (64-bit)
locale:
[1] LC_COLLATE=English_United States.1252  LC_CTYPE=English_United 
States.1252    LC_MONETARY=English_United States.1252
[4] LC_NUMERIC=C                           LC_TIME=English_United 
States.1252    
attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     
other attached packages:
 [1] aroma.cn_1.0.0         DNAcopy_1.30.0         aroma.affymetrix_2.5.0 
affxparser_1.28.0      aroma.apd_0.2.2       
 [6] R.huge_0.3.0           aroma.core_2.5.0       aroma.light_1.24.0     
matrixStats_0.5.0      R.rsp_0.7.5           
[11] R.cache_0.6.2          R.filesets_1.1.5       digest_0.5.2           
R.utils_1.12.1         R.oo_1.9.3            
[16] R.methodsS3_1.2.2     
>


-- 
When reporting problems on aroma.affymetrix, make sure 1) to run the latest 
version of the package, 2) to report the output of sessionInfo() and 
traceback(), and 3) to post a complete code example.


You received this message because you are subscribed to the Google Groups 
"aroma.affymetrix" group with website http://www.aroma-project.org/.
To post to this group, send email to aroma-affymetrix@googlegroups.com
To unsubscribe and other options, go to http://www.aroma-project.org/forum/

[aroma.affymetrix] Exception Error when computing non-paired CBS: Exception: .. Too many non-finite values ......

Reply via email to