hi, thanks of the compliments to the package, i'm happy to hear you
liked it! i must acknowledge that part of the design of the package is
the result of conversations i had with Martin, Marc and specially
Valerie during the review process.
i only got to know about VRanges once the package was nearly finished in
its current form and it is in my mind to try to adapt it to that data
structure for the reasons you comment. i haven't explored
'ReportingTools' but i'll give a look at it.
i'll be also very glad if you use it for teaching at Brixen but
beware that this is the first version that entered the release, so it
may have bugs that haven't been discovered yet (a list of known
shortcomings is at the end of the vignette). do not hesitate to report
any problem or feature request that may help using it during the course
and i'll try to fix or add it asap. i'm coming to BioC in Boston, we can
discuss there further directions for a better integration of
VariantFiltering with the rest of the BioC infrastructure.
cheers,
robert.
On 06/18/2014 03:43 PM, Michael Lawrence wrote:
Wow, VariantFiltering is awesome. It's impressive how it integrates all
of the annotation resources so seamlessly. And the shiny app looks very
useful. This should be a model package for where we want to bring
Bioconductor.
You should look into using VRanges instead of GRanges for the return
value of filteredVariants(). It can record the provenance of the filters
if you are using the FilterRules framework. And ReportingTools may be
useful for generating semi-interactive HTML reports.
I'm going to use this in the course next week at Brixen. My tutorial is
already based on the CEU trio, so the vignette integrates perfectly.
Nice work,
Michael
On Wed, Jun 18, 2014 at 6:20 AM, Robert Castelo <robert.cast...@upf.edu
<mailto:robert.cast...@upf.edu>> wrote:
hi Michael,
this souns like if you had a list of variants where you have
annotated their Entrez Gene IDs, which sometimes are NA because
those variance do not overlap any gene and sometimes are repeated
Entrez Gene IDs when two or more of those variants overlap the same
gene :)
at least is the situation i had when programming the
VariantFiltering package, i also could not find a one-liner solution
but you might want to look to what i ended up doing there, in case
it might be also useful for you.
you'll find it in the method "annotateVariants" that dispatches
"OrgDb" objects (i.e., gene-centric annotation packages), within
VariantFiltering/R/__annotationEngine.R
if you take a look at it, do not hesitate to comment if you have any
suggestion to improve this. i also look forward to the
annotation-gurus feedback on this question :)
cheers,
robert.
On 06/18/2014 03:03 PM, Michael Lawrence wrote:
Let's say I have a vector of gene IDs where some are NA, and are
some are
repeated, and I want to map them to gene symbols, where I get
NAs for the
NA IDs or IDs without a symbol. What is the best way to do this?
I tried select() but it gave me a table with unique entries; not
very
convenient. It also does not handle NAs. And totally breaks with
duplicates
using the GENEID key type (kind of works with ENTREZID):
select(Homo.sapiens, GENEID, "SYMBOL", "GENEID")
Error in `[[<-`(`*tmp*`, name, value = list(GENEID = c("245938",
"245939",
:
269 elements in value to replace 1312 elements
Also tried the venerable mget(GENEID, org.Hs.egSYMBOL,
ifnotfound=NA), but
this returns a list and fails with NAs.
What would be nice is something like:
map(Homo.sapiens, GENEID, "SYMBOL", "GENEID", OneToOneOrNone)
where OneToOneOrNone is an assertion that I expect the mappings
to be
one-to-one, so it will unlist() or whatever and throw an error
if the
assertion fails. It should return NA for anything not found, and
for any NA
GENEID. Does something like this already exist?
Michael
[[alternative HTML version deleted]]
_________________________________________________
Bioc-devel@r-project.org <mailto:Bioc-devel@r-project.org>
mailing list
https://stat.ethz.ch/mailman/__listinfo/bioc-devel
<https://stat.ethz.ch/mailman/listinfo/bioc-devel>
--
Robert Castelo, PhD
Associate Professor
Dept. of Experimental and Health Sciences
Universitat Pompeu Fabra (UPF)
Barcelona Biomedical Research Park (PRBB)
Dr Aiguader 88
E-08003 Barcelona, Spain
telf: +34.933.160.514 <tel:%2B34.933.160.514>
fax: +34.933.160.550 <tel:%2B34.933.160.550>
--
Robert Castelo, PhD
Associate Professor
Dept. of Experimental and Health Sciences
Universitat Pompeu Fabra (UPF)
Barcelona Biomedical Research Park (PRBB)
Dr Aiguader 88
E-08003 Barcelona, Spain
telf: +34.933.160.514
fax: +34.933.160.550
_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel