Hi Mark, Thanks for posting your code. Following exactly what you did(the same code and the same custom CDF), I can reproduce what you got. However, next I put the version 10 of the U133A Refseq custom CDF (binary file) into the relevant locations (For version 10, there is supposed to be 17599 units) I used the same code. At the command fit(plm, verbose=verbose), I got the error message " Fitting chunk #1 of 1... Units: int [1:19760] 1 2 3 4 5 6 7 8 9 10 ... Error in readCdfCellIndices(pathname, ...) : Argument 'units' contains an element out of range: 17600 Fitting chunk #1 of 1...done Fitting model of class RmaPlm:...done " I then changed fit(plm, verbose=verbose) to fit(plm, units=NULL, verbose=verbose) and resolved the issue.
Finally the dimension I got is 19760 instead of 17599, which is for v12 but not for v10 I now remember the "11979" I kept getting before is from some custom cdf I used earlier. As I used the same code as yours, I only posted the directory, cdf infor and sessionInfo from my run > print(cdf) AffymetrixCdfFile: Path: annotationData/chipTypes/HG-U133A Filename: HG-U133A,Hs_REFSEQ.cdf Filesize: 10.04MB Chip type: HG-U133A,Hs_REFSEQ RAM: 0.00MB File format: v4 (binary; XDA) Dimension: 712x712 Number of cells: 506944 Number of units: 17599 Cells per unit: 28.81 Number of QC units: 10 >dir("rawData/test/HG-U133A/") [1] "GSM65316,.CEL" "GSM65317,.CEL" "GSM65318,.CEL" "GSM65319,.CEL" R version 2.9.0 (2009-04-17) i386-apple-darwin8.11.1 > sessionInfo() locale: en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8 attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] preprocessCore_1.6.0 Biobase_2.4.1 aroma.affymetrix_1.1.1 aroma.apd_0.1.6 affxparser_1.16.0 R.huge_0.1.8 aroma.core_1.1.2 aroma.light_1.12.2 [9] matrixStats_0.1.6 R.rsp_0.3.4 R.filesets_0.5.2 digest_0.3.1 R.cache_0.1.7 R.utils_1.1.7 R.oo_1.4.8 affy_1.22.0 [17] R.methodsS3_1.0.3 loaded via a namespace (and not attached): [1] affyio_1.12.0 Thanks. Yiwen On Aug 14, 3:11 pm, Mark Robinson <mrobin...@wehi.edu.au> wrote: > Hi Yiwen. > > Well, I'm having trouble reproducing this problem. I downloaded > version 12 of the U133A Refseq custom CDF from: > > http://brainarray.mbni.med.umich.edu/Brainarray/Database/CustomCDF/12... > > and converted it to binary using convertCdf(), then put the files in > the relevant locations. The commands below generate a matrix of chip > effects with the correct dimensions. > > -------------------- > cdf <- AffymetrixCdfFile$fromChipType("HG-U133A",tag="Hs_REFSEQ") > cs <- AffymetrixCelSet$fromName("test", cdf=cdf) > > bc <- RmaBackgroundCorrection(cs); > csBC <- process(bc, verbose=verbose); > > qn <- QuantileNormalization(csBC, typesToUpdate="pm"); > csN <- process(qn, verbose=verbose); > > plm <- RmaPlm(csN); > fit(plm, verbose=verbose); > ces <- getChipEffectSet(plm) > > dm <- extractMatrix(ces) > > > dir("rawData/test/HG-U133A/") > > [1] "E-TABM-26-raw-cel-518987776.cel" "E-TABM-26-raw-cel-518987804.cel" > > [3] "E-TABM-26-raw-cel-518987862.cel" "E-TABM-26-raw-cel-518987878.cel" > > > dir("annotationData/chipTypes/HG-U133A/") > > [1] "HG-U133A,Hs_REFSEQ,monocell.CDF" "HG-U133A,Hs_REFSEQ.cdf" > > > dim(dm) > > [1] 19760 4 > > > print(cdf) > > AffymetrixCdfFile: > > Path: annotationData/chipTypes/HG-U133A > > Filename: HG-U133A,Hs_REFSEQ.cdf > > Filesize: 11.32MB > > Chip type: HG-U133A,Hs_REFSEQ > > RAM: 0.00MB > > File format: v4 (binary; XDA) > > Dimension: 712x712 > > Number of cells: 506944 > > Number of units: 19760 > > Cells per unit: 25.66 > > Number of QC units: 10 > > > sessionInfo() > > R version 2.9.1 (2009-06-26) > > i386-apple-darwin8.11.1 > > locale: > > en_CA.UTF-8/en_CA.UTF-8/C/C/en_CA.UTF-8/en_CA.UTF-8 > > attached base packages: > > [1] stats graphics grDevices utils datasets methods base > > other attached packages: > > [1] preprocessCore_1.6.0 Biobase_2.4.1 > aroma.affymetrix_1.1.0 > > [4] aroma.apd_0.1.6 R.huge_0.1.8 aroma.core_1.1.3 > > [7] aroma.light_1.13.2 matrixStats_0.1.7 R.rsp_0.3.5 > > [10] R.filesets_0.5.3 digest_0.3.1 R.cache_0.1.8 > > [13] R.utils_1.1.7 R.oo_1.4.8 affy_1.22.0 > > [16] R.methodsS3_1.0.3 affxparser_1.15.6 > > loaded via a namespace (and not attached): > > [1] affyio_1.12.0 > > -------------------- > > Perhaps you can attach a complete code example and your sessionInfo(). > > Cheers, > Mark > > On 15-Aug-09, at 2:04 AM, tea wrote: > > > > > > > To test other possibilities, I modified the following function > > theta <- extractMatrix(ces, returnUgcMap=TRUE) > > to > > theta <- extractMatrix(ces, returnRefcMap=TRUE) > > It gave the same result. > > > On Aug 14, 11:57 am, tea <ywcu...@gmail.com> wrote: > >> Hi Mark, > > >> The following are the outputs:> cdf<- getCdf(cs) > >>> print(cdf) > > >> AffymetrixCdfFile: > >> Path: annotationData/chipTypes/HG-U133A > >> Filename: HG-U133A.cdf > >> Filesize: 41.16MB > >> Chip type: HG-U133A > >> RAM: 0.00MB > >> File format: v3 (text; ASCII) > >> Dimension: 712x712 > >> Number of cells: 506944 > >> Number of units: 17599 > >> Cells per unit: 28.81 > >> Number of QC units: 10> print(getMonoCell(cdf)) > > >> AffymetrixCdfFile: > >> Path: annotationData/chipTypes/HG-U133A > >> Filename: HG-U133A,monocell.CDF > >> Filesize: 3.11MB > >> Chip type: HG-U133A,monocell > >> RAM: 0.00MB > >> File format: v4 (binary; XDA) > >> Dimension: 147x146 > >> Number of cells: 21462 > >> Number of units: 17599 > >> Cells per unit: 1.22 > >> Number of QC units: 10> print(getCdf(ces)) > > >> AffymetrixCdfFile: > >> Path: annotationData/chipTypes/HG-U133A > >> Filename: HG-U133A,monocell.CDF > >> Filesize: 3.11MB > >> Chip type: HG-U133A,monocell > >> RAM: 0.00MB > >> File format: v4 (binary; XDA) > >> Dimension: 147x146 > >> Number of cells: 21462 > >> Number of units: 17599 > >> Cells per unit: 1.22 > >> Number of QC units: 10 > > >> On Aug 14, 1:51 am, Mark Robinson <mrobin...@wehi.edu.au> wrote: > > >>> Hi Yiwen. > > >>> Can you check a couple quick things for me? > > >>> What does the output of these give: > > >>> cdf <- getCdf(cs) > > >>> print(cdf) > > >>> print(getMonoCell(cdf)) > > >>> print(getCdf(ces)) > > >>> Cheers, > >>> Mark > > >>> On 14-Aug-09, at 1:44 PM, tea wrote: > > >>>> Hi, > > >>>> I used the script in the following page: > >>>>http://groups.google.com/group/aroma-affymetrix/web/redundancy- > >>>> tests > >>>> to process a dataset of 189 arrays using Human custom refseq CDF > >>>> version 10, HGU133A_Hs_REFSEQ > >>>> After I finished > >>>> fit(plm, verbose=verbose) > >>>> I got the following message: > >>>> " > >>>> Total time for all units across all 189 arrays: 532.37s == 8.87min > >>>> Total time per unit across all 189 arrays: 0.04s/unit > >>>> Total time per unit and array: 0.235ms/unit & array > >>>> Total time for one array (17599 units): 0.07min = 0.00h > >>>> " > >>>> When I finished up to the code > >>>> ces <- getChipEffectSet(plm); > >>>> theta <- extractMatrix(ces, returnUgcMap=TRUE); > >>>> theta <- log2(theta); > >>>> ugcMap <- attr(theta, "unitGroupCellMap"); > >>>> rownames(theta) <- getUnitNames(getCdf(ces), ugcMap[,"unit"] > >>>> and type > > >>>> dim(theta) > >>>> I found that he total number of probesets is only 11979. > >>>> "11979 189" > > >>>> If I used affy(rma) and tested on a small number of CEL file from > >>>> the > >>>> same dataset, the total number of probesets is 17599. > > >>>> From the custom cdf document the total number of probeset of this > >>>> cdf > >>>> is 17531, so with the "AFFX" probes(68), > >>>> the total number should be 17599. I was wondering what parameter > >>>> should be used to get right number of probesets from cdf file for > >>>> refseq, entrezg, ug respectively > > >>>> Thank you very much. > > >>>> -Yiwen Chen > > >>> ------------------------------ > >>> Mark Robinson, PhD (Melb) > >>> Epigenetics Laboratory, Garvan > >>> Bioinformatics Division, WEHI > >>> e: m.robin...@garvan.org.au > >>> e: mrobin...@wehi.edu.au > >>> p: +61 (0)3 9345 2628 > >>> f: +61 (0)3 9347 0852 > >>> ------------------------------ > > ------------------------------ > Mark Robinson, PhD (Melb) > Epigenetics Laboratory, Garvan > Bioinformatics Division, WEHI > e: m.robin...@garvan.org.au > e: mrobin...@wehi.edu.au > p: +61 (0)3 9345 2628 > f: +61 (0)3 9347 0852 > ------------------------------ --~--~---------~--~----~------------~-------~--~----~ When reporting problems on aroma.affymetrix, make sure 1) to run the latest version of the package, 2) to report the output of sessionInfo() and traceback(), and 3) to post a complete code example. You received this message because you are subscribed to the Google Groups "aroma.affymetrix" group. To post to this group, send email to aroma-affymetrix@googlegroups.com To unsubscribe from this group, send email to aroma-affymetrix-unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/aroma-affymetrix?hl=en -~----------~----~----~----~------~----~------~--~---