Dear Henrik
Thank you very much for your reply and your scripts.
First, the differences between na26 and na27 are as follows:
Header:
- line 11: changed annotation date format from July 21 2008 to
2008-12-01
Data:
- line 17: added one new column, i.e. last column is now %GC
- the data for Probe Set ID,Chromosome,Physical Position are
identical for na26 and na27.
This means that in principle I can still use na26.
Second, when I compare your results for HindIII, XbaI and NspI for
na26 to my results with na26 and na27, I get identical summary results
for both ufl and ugp files,as you may have realized when adding your
results for na26.
In contrast, for StyI I my results for both na26 and na27 are still:
snp cnp affxSnp other total
enzyme1-only 144868 0 0 0 144868
missing 93436 0 074 93510
total238304 0 074 238378
This means I get 93436 missing snps vs your 607 missing snps. (See the
partial output for importFrom below!!!)
Even downloading the na26 file and the cdf file from Affymetrix again
did not change the result, which is strange since for the other 3
chiptypes the results agree with your results.
BTW, I have downloaded 250k_sty_libraryfile_rev4.zip. However, there
exists also an older version 250k_sty_libraryfile_rev3.zip. Which
version have you used for creating the ufl file?
Third, trying to run your script Mapping250K_Sty,UFL,na26.R gave the
following error at line
units - importFrom(ufl, csv, enzymes=enzyme, verbose=log);
Error in list(importFrom(ufl, csv, enzymes = enzyme, verbose = -50)
= environment, :
[2009-01-15 12:33:30] Exception: Argument 'enzymes' contains 1 NA value
(s).
at throw(Exception(...))
at throw.default(sprintf(Argument '%s' contains %d NA value
(s)., .name, sum(
at throw(sprintf(Argument '%s' contains %d NA value(s)., .name, sum
(is.na(x)
at getNumerics.Arguments(static, ..., asMode = integer, disallow =
disallow)
at getNumerics(static, ..., asMode = integer, disallow = disallow)
at getIntegers.Arguments(static, ..., range = range)
at getIntegers(static, ..., range = range)
at method(static, ...)
at Arguments$getIndices(enzymes, range = c(1, 10))
at readDataUnitFragmentLength.AffymetrixNetAffxCsvFile(csv, enzymes
= enzymes,
at readDataUnitFragmentLength(csv, enzymes = enzymes, rows =
keep, ..., verbos
at importFromAffymetrixNetAffxCsvFile.AromaUflFile(this, src, ...)
at importFromAffymetrixNetAffxCsvFile(this, src, ...)
at importFrom.AromaUnitTabularBinaryFile(ufl, csv, enzymes = enzyme,
verbose =
at importFrom(ufl, csv,
In addition: Warning message:
In eval(expr, envir, enclos) : NAs introduced by coercion
Importing (unit name, fragment length+) data from
AffymetrixNetAffxCsvFile...done
This is strange since I have tested that argument enzymes returns
correctly StyI.
Maybe this has to do with my version of aroma.affymetrix_0.9.4.
Here are the partial outputs for importFrom(ufl, csv, verbose=-50);
for NspI and StyI:
1. Running importFrom(ufl, csv, verbose=-50) for NspI has the
following partial output (see ==):
Reading AffymetrixNetAffxCsvFile...done
Extracting fragment lengths from ([enzyme], lengths, start, stop)...
Inferring if enzyme names are specified...
Has enzyme names: TRUE
Inferring if enzyme names are specified...done
Identifying number of enzymes...
nbrOfEnzymes
1 2 3
==131564 130698 2
Max number of enzymes: 3
Identifying number of enzymes...done
Splitting into subunits and padding with NAs...
Splitting into subunits and padding with NAs...done
Extracting enzyme names...
Identified enzymes: NspI, StyI
Extracting enzyme names...done
Identifying the location of the fragment lengths...
Offset: 3
Identifying the location of the fragment lengths...done
Extracting fragment lengths...
Summary of *all* fragment lengths:
Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
19.0526.0755.0842.9 1049.0 2000.0 394527.0
Extracting fragment lengths...done
Sorting data by enzyme...
V1 V2 V3
Min. : 100.0 Min. :19 Min. :NA
1st Qu.: 476.0 1st Qu.: 937 1st Qu.:NA
Median : 644.0 Median : 1280 Median :NA
Mean : 648.9 Mean : 1231 Mean : NaN
3rd Qu.: 816.0 3rd Qu.: 1614 3rd Qu.:NA
Max. :1480.0 Max. : 2000 Max. :NA
== NA's : 701.0 NA's :131564 NA's :262264
Sorting data by enzyme...done
int [1:262264, 1] 574 700 580 631 666 1060 798 842 822 608 ...
Extracting fragment lengths from ([enzyme], lengths, start,
stop)...done
Reading (unitName, fragmentLength+) from file...done
2. Running importFrom(ufl, csv, verbose=-50) for StyI has the
following partial output:
Reading AffymetrixNetAffxCsvFile...done
Extracting fragment lengths from ([enzyme], lengths, start, stop)...
Inferring