Hi, Freeling seems to have some of the features that you are looking for in Russian:
Tokenization
Sentence splitting
Number detection
Date detection
Morphological dictionary
Basic named entity detection
Quantity detection
PoS tagging

Thanks,
Thomas

---------------------------------------------------------------------------
Dr. Thomas Plagwitz | Director, Language Resource Center
UNC Charlotte | Dept. of Languages and Culture Studies
9201 University City Blvd. | Charlotte, NC 28223
Phone: 704-687-8762 | Fax: 704-687-3496
http://plagwitz.org | http://lrc.uncc.edu
---------------------------------------------------------------------------

-----Original Message-----
From: Александр Крылов [mailto:[email protected]]
Sent: Friday, July 13, 2012 5:53 AM
To: Torsten Zesch
Cc: [email protected]
Subject: Re: Using Apache UIMA for processing russian texts

OK, thank you for your answer!
I will look at the DKPro Core Framework today.

I would also like to ask: can I use external resources, libraries, APIs,
etc. in my annotators? (These may be keyword and entity extractors, filters,
rubricators, Russian morphology, detectors, etc.) I already have these
libraries (for example aot.ru, Alexey Sokirko's morphology projects, the
greatest Russian morphology). But the top level of the project will be
Apache UIMA, with all my logic encapsulated in annotators written by me.
Is this possible?

Yours faithfully,
Alexander

2012/7/12 Torsten Zesch <[email protected]>

> Redirected the request to UIMA userlist ...
>
> Hi Alexander,
>
> In addition to what you have already found, the DKPro Core Framework
> http://code.google.com/p/dkpro-core-asl/
> has a POS Tagger (TreeTagger) that comes with a Russian model.
>
> I am not aware of Russian components for detecting dates, regions etc.
>
> -Torsten
>
> > -----Original Message-----
> > From: Александр Крылов [mailto:[email protected]]
> > Sent: Wednesday, July 11, 2012 11:17 AM
> > To: [email protected]
> > Subject: Using Apache UIMA for processing russian texts
> >
> > Hello!
> >
> > Sorry for my English, it is bad.
> > I would like to use Apache UIMA annotators and other UIMA tools for
> > processing Russian-language texts: searching for statistical terms,
> > dates, and regions in text documents.
> > In the examples I found only English (and some other) languages, but no
> > Russian.
> > But the Apache UIMA web site says that the Snowball Annotator supports
> > the Russian language.
> > So I would like to ask: which annotators support the Russian language?
> > Can I use external Russian morphology systems in annotators created with
> > Apache UIMA?
> >
> > Thank you.
> > Yours faithfully,
> > Alexander
