Hi: i am not sure i understand your question. can you give more details about your application?
your data is a list of classes and some faculty info. so what's the structure or schema of these data? does the data come from database? If all of ur data is web pages, here is my hint: use some kind of text extraction technology(regular expression, XPath,...) to extract structured data from web pages and store into database. then u can get exact match by sql. PS: this is about Information Extraction. You can google it to find many pointers. good luck. yanky 2009/3/31 bruce <[email protected]> > Hi... > > looking to play with lucene/nutch... but i have an issue i'm trying to > solve, or get additional information on... i'm in a fog, as to where to > start, so i'm looking for serious pointers... > > i've got an app where i get a list of classes. part of the returned data is > the faculty/prof. i can do a "faculty search", but i might get multiple (or > one, or none) replies from the search for the faculty name. > > my question... what tools/research/open source/etc.. can i research to see > how to "match" the returned "faculty" data/link to attempt to match the > correct faculty with the class.. > > i would assume that this is a kind of content analysis, text/phrase > matching > issue, statistical text pattern matching, etc... not sure if my 'net > searches are in the right areas... > > what terms would be useful to search for to try to solve this issue... > > pointers, or references to open source projects, websites, personal > references to faculty/companies in this space would be great!!!! > > > > thanks > > >
