Hi Henrik, Thanks for the answer and also the ACS file. I have one more question regarding the UFL file generation.
I tried it by using the NettAffx and I got the following error: R> ufl <- AromaUflFile$allocateFromCdf(cdf, nbrOfEnzymes=2, tags=c("na30", "RI20100601")) R> csv <- AffymetrixNetAffxCsvFile$byChipType(chipType, tags=".na30"); R> units <- importFrom(ufl, csv); Warning messages: 1: In updateDataColumn.AromaTabularBinaryFile(this, .con = con, .hdr = hdr, : Values to be assigned were out of range [-32767,32768] and therefore censored to fit the range. 2: In updateDataColumn.AromaTabularBinaryFile(this, .con = con, .hdr = hdr, : Values to be assigned were out of range [-32767,32768] and therefore censored to fit the range. R> csv <- AffymetrixNetAffxCsvFile$byChipType(chipType, tags=".cn.na30"); R> units <- importFrom(ufl, csv); Warning messages: 1: In updateDataColumn.AromaTabularBinaryFile(this, .con = con, .hdr = hdr, : Values to be assigned were out of range [-32767,32768] and therefore censored to fit the range. 2: In updateDataColumn.AromaTabularBinaryFile(this, .con = con, .hdr = hdr, : Values to be assigned were out of range [-32767,32768] and therefore censored to fit the range. And the summary produce the following R> summary(ufl) length length.02 Min. :-32768 Min. :-32768 1st Qu.: 614 1st Qu.: 541 Median : 1146 Median : 997 Mean : 1601 Mean : 1466 3rd Qu.: 2195 3rd Qu.: 2000 Max. : 22095 Max. : 30002 NA's :230775 NA's :230775 Would you be so kind and build also the UFL and UGP files? Best Regards Robert On May 30, 7:27 pm, Henrik Bengtsson <h...@stat.berkeley.edu> wrote: > Hi. > > On Wed, May 26, 2010 at 3:24 PM, Ivanek, Robert <robert.iva...@fmi.ch> wrote: > > Dear Sir or Madam, > > > I would like to analyse the copy number variation data from Affymetrix > > Mouse Diversity Array. I have not found any information on your website > > about this particular array. > > I have created page for this: > > http://aroma-project.org/chipTypes/MOUSEDIVm520650 > > > I have tried to build the annotation files > > which are required by aroma but without success. I have few questions > > regarding that: > > > 1: Is aroma.affymetrix able to analyse the "Mouse Diversity Array" ? > > Yes, because there should be no reason why it shouldn't - it uses a > standard CDF etc. As you've noted, UGP (and UFL) files have not been > created by anyone yet. > > For CN analysis, at least the UGP (genome positions) annotation data > file needs to be there. > > > > > > > 2: I tried to build the "UGP" file directly from NetAffx annotation > > files using the code on your website, however I am getting the following > > error. > > > ## > > library("aroma.affymetrix") > > ## > > ## create UGP from NetAffx files > > cdf <- AffymetrixCdfFile$byChipType("MOUSEDIVm520650") > > ## > > ## Creates an empty UGP file for the CDF, if missing. > > ugp <- AromaUgpFile$allocateFromCdf(cdf, tags=c("na30", "RI20100526")) > > ## > > ## Import NetAffx unit position data > > csv <- AffymetrixNetAffxCsvFile$byChipType("MOUSEDIVm520650", > > otags=".na30") > > > Error in list(`AffymetrixNetAffxCsvFile$byChipType("MOUSEDIVm520650", > > tags = ".na30")` = <environment>, : > > > [2010-05-26 15:11:00] Exception: File format error of the tabular file > > ('annotationData/chipTypes/MOUSEDIVm520650/NetAffx/MOUSEDIVm520650.na30.annot.csv'): > > \ > > line 1 did not have 12 elements > > at throw(Exception(...)) > > at throw.default("File format error of the tabular file ('", > > getPathname(this), "'): ", ex$message) > > at throw("File format error of the tabular file ('", > > getPathname(this), "'): ", ex$message) > > at value[[3]](cond) > > at tryCatchOne(expr, names, parentenv, handlers[[1]]) > > at tryCatchList(expr, classes, parentenv, handlers) > > at tryCatch({ > > at verify.TabularTextFile(this, ...) > > at verify(this, ...) > > at this(...) > > at newInstance.Class(clazz, ...) > > at newInstance(clazz, ...) > > at newInstance.Object(static, pathname) > > at newInstance(static, pathname) > > at method(static, ...) > > at AffymetrixNetAffxCsvFile$byChipType("MOUSEDIVm520650", tags = > > ".na30") > > In addition: Warning message: > > In read.table(3L, header = TRUE, colClasses = c(NA_character_, > > NA_character_, : > > not all columns named in 'colClasses' exist > > I had a look at the MOUSEDIVm520650.na30.annot.csv file. The line > containing column names, that is: > > "Probe Set ID","dbSNP RS ID","Chromosome","Physical > Position","Strand","Cytoband","Allele A","Allele B","Associated > Gene","Genetic Map","Fragment Enzyme Type Length Start Stop", > > contains a trailing comma (,) that shouldn't be there ("file format > error"). This cause R to think there should be 12 and not 11 columns > in the data set. Open the file in an editor and remove that trailing > comma and any whitespace after "Fragment Enzyme Type Length Start > Stop". Then save the file. That should solve the problem. > > The other CSV file - MOUSEDIVm520650.cn.na30.annot.csv - does not have > this problem. > > > > > 3. I tried it also by using the "manual" approach using the > > tab=delimited file, however it seems to me that the mitochondria probes > > are skipped (NA values in ugp[,1] but valid values in ugp[,2]). > > The Affymetrix NetAffx CSV files use s "M" for the mitochondria > chromosome. In aroma we encode this by integer 25. > > > Another > > problem is that some positions for other chromosomes are not loaded in > > properly (valid values in ugp[,1] but NA values in ugp[,2]). > > You don't show how you read the data "manually", so it is hard to say > what you are doing wrong here. But note that there are quite a few > arguments in read.table() that you need to set correctly in order to > read Affymetrix NetAffx CSV files (it doesn't make easier that > Affymetrix changes the file format once in a while and have stray > erroneous symbols such as the above comma). > > Also, search our forum for 'MOUSEDIVm520650', because about a year ago > David Rosenberg disscussed this chip type and I think he did create > various annotation data files for the chip type. This was before the > chip type was publicly announced by Affymetrix. > > /Henrik > > > > > Here is the sessionInfo: > > > R version 2.11.0 (2010-04-22) > > x86_64-unknown-linux-gnu > > > locale: > > [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C > > LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8 LC_MONETARY=C > > [6] LC_MESSAGES=en_US.UTF-8 LC_PAPER=en_US.UTF-8 LC_NAME=C > > LC_ADDRESS=C LC_TELEPHONE=C > > [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C > > > attached base packages: > > [1] stats graphics grDevices utils datasets methods base > > > other attached packages: > > [1] aroma.affymetrix_1.6.0 aroma.apd_0.1.7 affxparser_1.20.0 > > R.huge_0.2.0 aroma.core_1.6.0 aroma.light_1.16.0 > > [7] matrixStats_0.2.1 R.rsp_0.3.6 R.cache_0.3.0 > > R.filesets_0.8.1 digest_0.4.2 R.utils_1.4.0 > > [13] R.oo_1.7.2 R.methodsS3_1.2.0 RColorBrewer_1.0-2 > > > loaded via a namespace (and not attached): > > [1] tools_2.11.0 > > > Best Regards > > > Robert > > > -- > > Robert Ivanek > > Postdoctoral Fellow Schuebeler Group > > Friedrich Miescher Institute > > Maulbeerstrasse 66 > > 4058 Basel / Switzerland > > Office phone: +41 61 697 6100 > > > -- > > When reporting problems on aroma.affymetrix, make sure 1) to run the latest > > version of the package, 2) to report the output of sessionInfo() and > > traceback(), and 3) to post a complete code example. > > > You received this message because you are subscribed to the Google Groups > > "aroma.affymetrix" group with websitehttp://www.aroma-project.org/. > > To post to this group, send email to aroma-affymetrix@googlegroups.com > > To unsubscribe and other options, go tohttp://www.aroma-project.org/forum/ -- When reporting problems on aroma.affymetrix, make sure 1) to run the latest version of the package, 2) to report the output of sessionInfo() and traceback(), and 3) to post a complete code example. You received this message because you are subscribed to the Google Groups "aroma.affymetrix" group with website http://www.aroma-project.org/. To post to this group, send email to aroma-affymetrix@googlegroups.com To unsubscribe and other options, go to http://www.aroma-project.org/forum/