Re: NeuroNames [was: slides for the UMLS presentation]

William Bug Tue, 06 Jun 2006 07:43:15 -0700


Hi All,

Sorry - I thought I'd already subscribed to this list, but apparently not - until now.

The need for a mereotopologically-sound neuroanatomical ontology is quite pressing across the community of neuroscientists involved in neuroinformatics projects, most of which include a neuroimaging component. Generally there is only one thing neuroscientists are interested in when analyzing images at whatever resolution, from the macromolecular (EM) on up to the macroscopic - i.e., identifying biologically relevant shapes. In order for these shapes to have any meaning in a context where one attempts to pool data and perform relevant data reduction operations, the shapes must exist within a shared coordinate space of some sort. For instance, if two separate labs are examining the change in the size of the Substantia Nigra during the course of Parkinsonian neurodegeneration, in order for them to compare their observations, they require several data integration/semantic frameworks:

        - a shared neuroanatomical terminology
        - a shared coordinate space (to place the shapes from their images in a comparable coordinate framework)
        - a shared, well-founded anatomical ontology which encapsulates mereotopological knowledge about shapes in - at least - 3D space.

Other knowledge resources can be helpful in supplementing this array of tools, but, generally, these are the absolute minimum.

[NOTE: Wikipedia has a moderately clear definition of mereotopology (http://en.wikipedia.org/wiki/Mereotopology). Basically, it combines a formal, ontological theory of parts, wholes, and boundaries (mereology) with the mathematics of topology, with the goal of providing a computational formalism to support applying logical operations to objects in space. As has been pointed out by others, a great deal of the work in this field of applied biomedical mereotopology derives from related work in the GIS field. Use of mereotopology by geographers has been going on for quite some time and is much more advanced. Work from GIS can be adapted for use in the biomedical domain, but it must be done with great care, as many of the assumptions behind the way researchers represent space, and the kind of information being represented, can differ significantly across these disciplines.]
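
To make the note above a little more concrete, here is a minimal Python sketch of the kind of logical operation a mereotopological framework is meant to support. Everything in it is an assumption made for illustration: the handful of part-of and adjacency assertions are a toy set, not taken from NeuroNames, the FMA, or any other resource, and the function names are hypothetical. It covers only transitive parthood plus a crude notion of connection, which is far less than a well-founded ontology would provide.

```python
# Toy part-of assertions (child -> immediate containing region).
# Hypothetical illustration only; not drawn from NeuroNames or the FMA.
PART_OF = {
    "substantia nigra pars compacta": "substantia nigra",
    "substantia nigra": "midbrain",
    "midbrain": "brainstem",
    "pons": "brainstem",
    "brainstem": "brain",
}

# Toy adjacency assertions (regions that share a boundary).
ADJACENT = {frozenset(("midbrain", "pons"))}


def ancestors(region):
    """Return every region that contains `region`, nearest first."""
    chain = []
    while region in PART_OF:
        region = PART_OF[region]
        chain.append(region)
    return chain


def is_part_of(part, whole):
    """True if `part` is a proper part of `whole` (parthood is transitive)."""
    return whole in ancestors(part)


def smallest_common_whole(a, b):
    """Return the smallest region containing both `a` and `b`, or None."""
    containers_of_b = set([b] + ancestors(b))
    for region in [a] + ancestors(a):
        if region in containers_of_b:
            return region
    return None


def connected(a, b):
    """True if the regions coincide, one contains the other, or they are adjacent."""
    if a == b or is_part_of(a, b) or is_part_of(b, a):
        return True
    return frozenset((a, b)) in ADJACENT


if __name__ == "__main__":
    print(is_part_of("substantia nigra pars compacta", "brain"))
    print(smallest_common_whole("substantia nigra", "pons"))
    print(connected("midbrain", "pons"))
```

With a structure like this, an annotation made against a fine-grained region can be rolled up to any containing region, which is the sort of inference a purely terminological resource does not guarantee.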

The same is true as you scale this problem up to field-wide projects such as BIRN or The NeuroCommons.

As several have mentioned in this thread, there are already existing resources that can begin to fill this need.


1) NeuroNames

Kei, Olivier, Peter Mork, and others have already given sufficient references on NeuroNames in this thread, so that others can dig deeper into the specifics if they like.

Having worked with Doug Bowden, Mark Dubach, and their colleagues over the last year or so in an advisory capacity on the specific issue of using NeuroNames for semantically-based, neuroanatomical data set integration, I can add a few important qualifying points:

a) Doug et al. have been working on the extremely difficult task of unifying neuroanatomical terminologies across mammalian species for 20 years now. Embedded in NeuroNames & BrainInfo, there is a wealth of hard-won empirical knowledge related to how one achieves this end. I think it would be ill-advised to try to duplicate their effort, as the myriad scientific problems related to this effort would surely present themselves again, and they need only be worked out once.

b) Doug et al. are extremely collegial and quite receptive to feedback and collaboration - within the bounds of their limited resources.

c) NeuroNames is a terminological resource - not a well-founded, spatial ontology of brain anatomy capable of supporting mereotopological reasoning. As with most research-based terminologies, there are many semantically-based relations embedded in the NeuroNames graphs, but as the primary goal of NN is to disambiguate and integrate across the neuroanatomical lexicon, the embedded semantic information can often lead to a logical dead end. For instance, many neuroanatomical terms critical to specifying location in the rodent brain have been placed in the NN category "ancillary terms," as they don't fit into the core hierarchy in an unambiguous way. This can make use of NN for annotating mouse brain gene & protein expression patterns (e.g., GENSAT, the Allen Brain Atlas, various BIRN projects) extremely problematic.

d) The NN primary structures (http://braininfo.rprc.washington.edu/indexabout.html) provide the closest thing to an ontology in NN. As Peter Mork pointed out, there has been an effort in the past to unite this core NN hierarchy with the FMA, which does provide a mereotopologically sound framework for anatomy. Barry Smith (a formal ontologist who has worked for over a decade on problems in biomedical ontology - most especially, though hardly exclusively, in the area of mereotopological reasoning) and his colleagues have worked closely with Cornelius Rosse and his colleagues at the FMA project to create, in association with the work started in the FMA, a foundational ontology for biomedicine (the Ontology of Biomedical Reality) that is becoming increasingly important to all of the ontologies being monitored by NCBO and incorporated into the OBO site and the emerging OBO Foundry (http://obofoundry.org/).

e) Doug and his colleagues have worked closely with Jack Park (a consulting scientist to SRI's AI Center - http://www.ai.sri.com/) to represent NN as a TopicMap (XTM). As many on this list may know, there has been a moderate amount of effort to integrate and/or reconcile XTM with RDF here at the W3C (search on "TopicMaps" at the main RDF page - http://www.w3.org/RDF/). I'm not certain how this effort will ultimately make NN more "semantic web" compliant, but the bottom line is a great deal of effort has already been expended to express NN in a semantically well-grounded formalism.

f) As Don points out, neuroanatomical representations are likely to evolve significantly over the coming decades, as large-scale gene & protein expression characterization studies focused on the brain continue to accumulate. Having said that, the "conventional" view of neuroanatomy will likely remain relevant for a long while to come, not only because it has been used to characterize findings in the literature for the last 125+ years, but also because it did derive from a wealth of empirical observation which is likely to remain valid in many domains of neuroanatomical study. I would also modify Don's well-informed comment regarding the derivation of "conventional" views of neuroanatomy. To a large extent they are related to functional studies of the brain - as well as lesion-based studies of functional deficits dating back to the 19th century (think "Broca's Area") - but they are also very much based on a study of the morphology of the brain - both the external surface morphology (sulci, gyri, and lobes) and histological examination of internal structures. Many of these studies of structure in space are likely to stay with us for some time to come (and are well-founded in reality), though as Tim Clark & Don have pointed out in this thread, nomenclature is still a very significant problem even in this very "old" field.

g) Licensing of NN - Doug et al. formerly had a completely open policy on distributing NN. The only reason a license was instituted was that, at some point about 5 years back, another group sucked down the entirety of NN, reworked a lot of what was there - probably with very practical goals directed toward making NN more "correct" and effective in their problem domain - then "republished" their product as "NeuroNames". This led to a great deal of confusion. The fact they chose to do this on the sly also meant the work they did was not necessarily compatible with the work done by Doug et al. In order to avoid this happening again, it was decided a license would be established to discourage this sort of behavior. As anyone who has developed a terminology and/or ontology knows, it is absolutely essential there remain a single curating authority, if the value of the resource is to remain intact. The "vetting" performed by the central authority - as is extensively done by the curators of the Gene Ontology, for instance - is absolutely essential to guaranteeing the integrity of the knowledge resource. This is not a "closed" or proprietary process, just a highly controlled one. Unfortunately, Doug Bowden's resources are MUCH MUCH smaller than those available to the curators/developers of GO, so the NN curation effort necessarily moves at a slower pace.


2) Working with the Neuroscience community

As Kei, Don, and others have stated, it would be unwise to proceed in creating an "open source" neuroanatomical ontology without interacting with the researchers who've already put a lot of effort into this problem over the past decade or so. With this in mind, I have several suggestions:

        a) The 5 ways of knowing neuroanatomy:

This is a pitch I've been making which I think helps to sum up the current ways various sub-fields have attempted to identify/label/collate brain morphology:

                i) Terminologies - e.g., NN, BrainLex
                ii) Ontologies - e.g., Neuro-FMA (the project Peter Mork referred to)
                iii) Literature Informatics (CocoMac, BrainMap, NeuroScholar, BAMS, ArrowSmith, etc.):

These are very mature projects. Some include their own mereotopological reasoning systems (e.g., CocoMac and BrainMap) in order to be able to pool and compare the relatedness of structures and connectivity across different studies in the literature. The goal in this category is to perform large-scale semantic mining of the literature to confirm/refute current knowledge and uncover new correlations - very much along the lines of what The NeuroCommons Project expects to achieve via use of semantic web technologies. Some researchers in this category are actually participating in The NeuroCommons Project (e.g., Gully Burns, who developed NeuroScholar).

                iv) voxel/pixel analysis:

This approach applies computer vision algorithms to automatically - or semi-automatically - identify 2D & 3D shapes in digital anatomical images. This field is also extremely mature, though there are many significant caveats to exactly how much of this work can be effectively automated.

                v) parameterized models:

Often these are derived from - or used to drive - the voxel/pixel based analysis described in 'iv' - though the spatial modeling is definitely a distinct approach from the pure voxel/pixel approach.

None of the studies you'd fit into these categories focus exclusively on their technique/tool alone without some aspect of the other "ways of knowing neuroanatomy" playing a role in what they do. However, it is clear much fundamental work in this area primarily focuses on one technique over the others.

Having said that, when the neuroscience community makes use of this work to examine a specific biological problem, they will often draw significant tools and resources from more than one of these domains.

        b) NCBO/NCOR sponsored meeting focused on mereotopology in neuroanatomy:

Barry Smith is working to bring together researchers working in the 5 domains described above. There is a very pressing need in large-scale, field-wide neuroinformatics projects such as what is being done in the BIRN project to have these 5 domains converge and work more cooperatively. Right now, a lot of manual effort has to be expended to bring them together. This is something BIRN has been pursuing. In the last 6 months, we have received a great deal of support and guidance on this effort from NCBO. Daniel Rubin interacts directly with the BIRN Ontology Task Force, and the work Barry Smith has been doing with FMA, OBO, FuGO, and PATO has very much begun to create a much more well-founded and computable path toward performing large-scale annotation of neuroimaging data.

This meeting is on the NCBO/NCOR slate for 2007, but in the interim I hope to see more effort invested in the coming year across the 5 communities listed above toward the goal of integrating across these "ways of knowing" now that the need has been recognized.

                        
3) Microarrays:

Just as Don, Kei, Alan R., and others have pointed out, high-throughput assays - microarrays, BAC-based IHC, in situ studies using the GenePaint technology employed by the Allen Institute for Brain Science to construct the Allen Brain Atlas of gene expression in the brain - are going to transform our understanding of neuroanatomy over the coming decades. This is just a given. There is a pressing need to derive a means to integrate spatially-mapped studies of gene & protein expression into a neuroimaging setting. The spatial resolution may be very coarse - e.g., "whole brain" - but such studies still provide sufficient spatial information to be usable in the context of a neuroanatomical coordinate system.

We are working in the BIRN project to create a means for researchers to integrate these distinct approaches to studying the brain. As Alan R. pointed out, FuGO is working to put description of microarray experiments on a solid, formal footing, and I would expect one aspect of that will be to represent microarray data in RDF/OWL. This is not a trivial problem, given that much of the available data is merely MIAME-compliant - MIAME not even being a data format, but just a collection of minimal data requirements. One need only look at the great complexity of the data submission process at the NCBI GEO site to get an appreciation for how difficult this problem can be. A great deal of effort is being invested in the microarray field to come up with a better means to handle this issue, and the FuGO effort will be a critical clearinghouse for this work.

The important thing to remember when it comes to field-wide data pooling and re-analysis is that it may sometimes be necessary to get right back to the microarray primary image files so as to reapply different criteria when performing the statistical tests and reductions on pooled data. Given this requirement - one we also see in the neuroimaging domain - I believe it is very important to proceed in a well-reasoned manner when seeking to integrate across microarray datasets using semantic web technologies. Alan R. and I - and possibly others on this list - are on the FuGO Coordinators Committee, so hopefully we can help to keep those lines of communication open.
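
As a rough illustration of why pooled re-analysis needs access to primary data rather than only each lab's reduced calls, consider the following Python sketch. It is an assumption-laden toy: the lab names, gene names, intensity values, and the mean-intensity threshold are all invented, and real microarray analysis starts from image files and uses far more careful statistics. The only point is that a shared criterion can be reapplied across labs when the unreduced values are still available.

```python
from statistics import mean

# Hypothetical unreduced spot intensities retained for each lab.
# All names and numbers are invented for illustration.
RAW = {
    "lab_a": {"geneX": [120.0, 135.0, 110.0], "geneY": [40.0, 38.0, 45.0]},
    "lab_b": {"geneX": [90.0, 95.0, 105.0], "geneY": [60.0, 72.0, 66.0]},
}


def expressed_genes(raw, threshold):
    """Reduce one lab's raw intensities to the genes whose mean meets `threshold`."""
    return {gene for gene, values in raw.items() if mean(values) >= threshold}


def pooled_calls(raw_by_lab, threshold):
    """Apply a single shared criterion to every lab's primary data."""
    return {lab: expressed_genes(raw, threshold) for lab, raw in raw_by_lab.items()}


if __name__ == "__main__":
    # The same primary data reduced under two different criteria.
    print(pooled_calls(RAW, threshold=100.0))
    print(pooled_calls(RAW, threshold=50.0))
```

If only each lab's own "expressed / not expressed" calls had been kept, changing the threshold after pooling would not be possible.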

Sorry to go on so, but this is a topic on which I've labored quite intensively over the past year. There is a lot being done on this issue, and I think all efforts will get much further more quickly - and in a way that will carry more street cred with practicing neuroscientists - if we all try to work together.


Cheers,
Bill

Bill Bug
Senior Analyst/Ontological Engineer

Laboratory for Bioimaging  & Anatomical Informatics
www.neuroterrain.org
Department of Neurobiology & Anatomy
Drexel University College of Medicine
2900 Queen Lane
Philadelphia, PA    19129
215 991 8430 (ph)
610 457 0443 (mobile)
215 843 9367 (fax)


Please Note: I now have a new email - [EMAIL PROTECTED]







