Re: Stemming in Nutch

2010-03-10 Thread kanimesh
timizePhrase. The other change > I mentioned previously was adding the PorterStemFilter > to NutchDocumentAnalysis.tokenStream. > > If anyone is interested in the changes, let me know > and I'll send them to you. Or maybe it's worth slapping > onto the Wiki. > >

Re: Stemming issues

2010-03-10 Thread kanimesh
;> >> >> Otis >> -- >> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch >> >> >> >> - Original Message >> > From: RanjithStar >> > To: nutch-user@lucene.apache.org >> > Sent: Wednesday, December 17, 2008

Re: Stemming issues

2008-12-17 Thread David Jashi
; > To: nutch-user@lucene.apache.org > > Sent: Wednesday, December 17, 2008 2:29:07 AM > > Subject: Re: Stemming issues > > > > > > Hi, > > Thanks for your reply. I can do stemming. So 'flowers' will be stemmed as > > 'flower' and Lucene will i

Re: Stemming issues

2008-12-17 Thread Otis Gospodnetic
gt; To: nutch-user@lucene.apache.org > Sent: Wednesday, December 17, 2008 2:29:07 AM > Subject: Re: Stemming issues > > > Hi, > Thanks for your reply. I can do stemming. So 'flowers' will be stemmed as > 'flower' and Lucene will index it as 'flower

AW: Stemming issues

2008-12-16 Thread Koch Martina
lto:ranjith2...@gmail.com] Gesendet: 17 December 2008 08:29 An: nutch-user@lucene.apache.org Betreff: Re: Stemming issues Hi, Thanks for your reply. I can do stemming. So 'flowers' will be stemmed as 'flower' and Lucene will index it as 'flower' itself. But the problem is, i

Re: Stemming issues

2008-12-16 Thread RanjithStar
Hi, Thanks for your reply. I can do stemming. So 'flowers' will be stemmed as 'flower' and Lucene will index it as 'flower' itself. But the problem is, if I search for 'flowers', it won't give any result. How can we tackle this? If we perform a search

Re: Stemming issues

2008-12-16 Thread Otis Gospodnetic
Hi, Yes, if you want flowers to match flower you will want to apply stemming. You can use the Snowball for English. I don't have any code handy, but you can see how it's done if you look at Lucene's unit test for Snowball Analyzer. Otis -- Sematext -- http://sematext.com/ -

Stemming issues

2008-12-16 Thread RanjithStar
ow can it be accomplished? Also, which stemmer can I use? snowball? Can any one please add some code snippet for how to use it? -- View this message in context: http://www.nabble.com/Stemming-issues-tp21035261p21035261.html Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Lost regrading Stemming in nutch

2008-10-31 Thread David Jashi
I managed to connect Nutch 0.9 to my stemming machine. Don't know if my approach would work on 0.8.1 On Wed, Oct 29, 2008 at 10:56 PM, jcze <[EMAIL PROTECTED]> wrote: > > Hi, i'm using nutch 0.8.1, I'm lost about the stemming of nutch, tried the > wiki on MultiLingua

Lost regrading Stemming in nutch

2008-10-29 Thread jcze
Hi, i'm using nutch 0.8.1, I'm lost about the stemming of nutch, tried the wiki on MultiLingual Support. coz it said that it could stem the words.. hmm.. but I'm lost because it said that I need to modify the IndexSegment class which i couldnt find.. =( Anywayz, i tried the stemm

Re: Stemming / Summary issue

2008-05-08 Thread ogjunk-nutch
gt; From: Sathyam Y <[EMAIL PROTECTED]> > To: nutch-user@lucene.apache.org > Sent: Thursday, May 8, 2008 12:16:07 PM > Subject: Stemming / Summary issue > > I am trying to integrate PorterStemming with Nutch and was able to > successfully follow the changes suggested a

Stemming / Summary issue

2008-05-08 Thread Sathyam Y
I am trying to integrate PorterStemming with Nutch and was able to successfully follow the changes suggested at http://wiki.apache.org/nutch/Stemming?highlight=%28stemming%29 The search results are working well with stemmed words, but I am having difficulty getting correct summaries. I am

stemming / summary problem

2008-05-07 Thread Sathyam Y
I am trying to integrate PorterStemming with Nutch and was able to successfully follow the changes suggested at http://wiki.apache.org/nutch/Stemming?highlight=%28stemming%29 The search results are working well with stemmed words, but I am having difficulty getting correct summaries

Re: Solr Integration/Stemming?

2008-05-07 Thread Sathyam Y
All, I am trying to integrate PorterStemming with Nutch and was able to successfully follow the changes suggested at http://wiki.apache.org/nutch/Stemming?highlight=%28stemming%29 The search results are working well with stemmed words, but I am having difficulty getting correct

Stemming plugin problem.

2008-04-01 Thread Lyndon Maydwell
Hi list, I'm trying to get stemming working on nutch-1.0-dev using the instructions found on the wiki for version 0.8 ( http://wiki.apache.org/nutch/Stemming ). I've set up everything pretty much how it was outlined in the walkthrough, but I'm getting errors when I try to

Re: Solr Integration/Stemming?

2008-02-11 Thread Nick Tkach
;. Specifically, I'd forgotten to make the change (as they have on the wiki) to my nutch-default.xml, in the value for plugin.includes replacing "query-(basic|site|url)" with "query-(stemmer|site|url)". Howie Wang wrote: It sounds like the query parser is not stemming

RE: Solr Integration/Stemming?

2008-02-11 Thread Howie Wang
It sounds like the query parser is not stemming for you. Make sure that you activate the new stemming query filter is activated in the Nutch directory under your app server. Check the nutch-*.xml files under WEB-INF/classes to make sure that your new query filter is included. Howie > Date:

Solr Integration/Stemming?

2008-02-11 Thread Nick Tkach
First of all, a question on stemming. We've tried applying the patches from the main wiki ( http://wiki.apache.org/nutch/Stemming ) and that seems to work fine for the most part. We are seeing one kind of strange result though. If we index a series of pages (web crawl of 2 of our sites

Re: Stemming with Nutch

2007-06-28 Thread Enis Soztutar
Doğacan Güney wrote: On 6/28/07, Robert Young <[EMAIL PROTECTED]> wrote: Hi, Are the Nutch Stemming modifications available as a patch? I can't seem to find anything on issue.apache.org There is some sort of stemming for German and French languages (available as plugin anal

Re: Stemming with Nutch

2007-06-28 Thread Doğacan Güney
On 6/28/07, Robert Young <[EMAIL PROTECTED]> wrote: Hi, Are the Nutch Stemming modifications available as a patch? I can't seem to find anything on issue.apache.org There is some sort of stemming for German and French languages (available as plugin analysis-de and analysis-fr). I

Stemming with Nutch

2007-06-28 Thread Robert Young
Hi, Are the Nutch Stemming modifications available as a patch? I can't seem to find anything on issue.apache.org Thanks Rob

Re[4]: Problems stemming

2007-06-19 Thread Scam
Hello Ronny, Tuesday, June 19, 2007, 14:38, you wrote: NR> Is'nt English default for nutch? It must as I understand but I do not see any stemming for English. -- Best regards, Scammailto:[EMAIL PROTECTED]

Re: Re[2]: Problems stemming

2007-06-19 Thread Naess, Ronny
Is'nt English default for nutch? Regards, Ronny -Opprinnelig melding- Fra: Scam [mailto:[EMAIL PROTECTED] Sendt: 19. juni 2007 11:54 Til: shinta himura Emne: Re[2]: Problems stemming Hello shinta, Monday, June 18, 2007, 23:23, you wrote: sh> I resolved this problem : Thank

Re[2]: Problems stemming

2007-06-19 Thread Scam
Hello shinta, Monday, June 18, 2007, 23:23, you wrote: sh> I resolved this problem : Thank you for the answer! But your solution is for languages FR and DE. I need to turn analysis for English language and I do not see anylisis-en at all in the plugins directory. May be you have some ideas about

Re: Problems stemming

2007-06-18 Thread Naess, Ronny
I am also interested in the stemming part :-) Regards, Ronny -Opprinnelig melding- Fra: Scam [mailto:[EMAIL PROTECTED] Sendt: 18. juni 2007 18:04 Til: shinta himura Emne: Re: Problems stemming Hello shinta, Wednesday, June 13, 2007, 12:36, you wrote: sh> I have some problems w

RE: Problems stemming

2007-06-18 Thread shinta himura
> nutch-user@lucene.apache.org> Subject: Re: Problems stemming> > Hello > shinta,> > Wednesday, June 13, 2007, 12:36, you wrote:> > sh> I have some > problems with Nutch's stemmer. I don't manage to get> sh> it working.. I use > Nucht 0.9. Could you explain

Re: Problems stemming

2007-06-18 Thread Scam
tch-default.xml. I have the same problem. Analysis plugin is included (analysis-(fr|en|de)) but stemming does not work at all. -- Best regards, Scammailto:[EMAIL PROTECTED]

Problems stemming

2007-06-13 Thread shinta himura
Hi, I have some problems with Nutch's stemmer. I don't manage to get it working. I use Nucht 0.9. Could you explain me what I have to do in way to activate it. I already add necessary plugins in file conf/nutch-default.xml. Regards, Damien.

Re : exact matches and stemming

2007-01-29 Thread Aïcha
: Re: exact matches and stemming Maybe you could store in your index both the stemmed word and the original one. Although it will increment the size of your index. Another posibllity could be to develop a WildcardQuery plugin or a FuzzyQuery plugin, because lucene comes with this capabilities,

Re: exact matches and stemming

2007-01-26 Thread Alvaro Cabrerizo
Maybe you could store in your index both the stemmed word and the original one. Although it will increment the size of your index. Another posibllity could be to develop a WildcardQuery plugin or a FuzzyQuery plugin, because lucene comes with this capabilities, and avoid stemming task. But it is

exact matches and stemming

2007-01-24 Thread Aïcha
Hello, I want to use the FrenchAnalyzer for stop word and stemming treatment but I want to still be able to do exact search, the problem is that the FrenchAnalyzer remove characters from the terms when the indexing is made so it isn't possible to have only exact matches from an index in

Re: Stemming and Synonyms

2006-09-19 Thread Richard Braman
pawns a great deal of controversy as > to weather it can be considered "intelligent" or just very good at > smalltalk. :) > >> Having said that, I'd like to make my users search experience as good as >> possible. To do that, I need to solve two little "problems

Re: Stemming and Synonyms

2006-09-19 Thread Tomi NA
ing and spawns a great deal of controversy as to weather it can be considered "intelligent" or just very good at smalltalk. :) Having said that, I'd like to make my users search experience as good as possible. To do that, I need to solve two little "problems" : -

Stemming and Synonyms

2006-09-19 Thread Gonçalo Gaiolas
Hi everyone! I’m using version 7.2 of Nutch and I’m very happy with it. Want to send a big thumbs up for you guys behind it! Having said that, I’d like to make my users search experience as good as possible. To do that, I need to solve two little “problems” : - Stemming – in my

no accent and stemming problem

2006-08-24 Thread aicha BEN
at we have to replace the accented chars before indexing, and also to do it on the query string. Could you detailed for me how to use the class AccentReplacer . - second point, I want to make French stemming, could somebody helps me? Thanks in advance. Aîcha

Re: stemming - RESOLVED

2006-07-31 Thread Matthew Holt
We could, although other than readability, it won't make any difference. [EMAIL PROTECTED] wrote: Hi, Matthew I think we should use fieldName instead of field, or not... ===stemming code begin=== public TokenStream tokenStream(String field, Reader r

Re[2]: stemming - RESOLVED

2006-07-29 Thread bb300
Hi, Matthew I think we should use fieldName instead of field, or not... ===stemming code begin=== public TokenStream tokenStream(String field, Reader reader) { Analyzer analyzer; if ("anchor".equals(field)) { analyzer = ANCHO

Re: stemming - RESOLVED

2006-07-28 Thread Matthew Holt
Howie, Thanks for all the help configuring your stemming addon for version 0.8. I compared query-basic and query-stemmer and the only new feature that was added is a "host" boost. I made the changes and everything works perfect. I uploaded the code to the wiki for both version

Re: stemming

2006-07-27 Thread Matthew Holt
his version of stemming everything works. Also the pagination is realized too. The best way is to develop Eugen's code - this is my opinion. I think that Jerome Charron also interested in that code - because of highlighting of results. What is Your opinion about aforesaid? to Eugene: Can You

Re[2]: stemming

2006-07-27 Thread bb300
Hi, I think we should wait when Eugen can share his code. In his version of stemming everything works. Also the pagination is realized too. The best way is to develop Eugen's code - this is my opinion. I think that Jerome Charron also interested in that code - because of highlighting of re

Re: stemming

2006-07-27 Thread Howie Wang
quot; doesnt, even though thats the word thats actually on the page). I tried a different approach and removed the query-stemmer value from nutch-site.xml to attempt to disable the plugin. I reran the crawl and it didn't load the plugin. However, it still had the same stemming functionalit

Re: stemming

2006-07-27 Thread Matthew Holt
an the crawl and it didn't load the plugin. However, it still had the same stemming functionality. I'm guessing this is due to editing the main files such as CommonGrams.java and NutchDocumentAnalyzer.java. Should I attempt too copy the needed methods into StemmerQueryFilter.java and try

Re: stemming

2006-07-26 Thread Howie Wang
ay, someone recently told me that they were able to put all the stemming code into an indexing filter without touching any of the main code. All they did was to copy some of the code that is being done in NutchDocumentAnalyzer and CommonGrams into their custom index filter. Haven't tried it myse

Re: stemming

2006-07-26 Thread Matthew Holt
gine "interview", the stemming takes place and the page with the word "interviews" is returned. However, if I type in the word "interviews" no page is returned. (The page with the word interviews on it should be returned). Any ideas?? Matt Dima Mazmanov wrote: Hi,

Re: stemming

2006-07-16 Thread bb300
d.equals("title")) { ts = new LowerCaseFilter(ts); return new PorterStemFilter(ts); } else { return ts; P.S. May be I miss something - because I can't make the my last nutch buld to crawl. Regards Alexey. Hi, . I've gotten a couple of questions offlist about stemm

Re: stemming

2006-06-29 Thread bb300
Hi, Dima Thanks for Your contribution. I'll try it on this sunday. > Hi, . > > I've gotten a couple of questions offlist about stemming > so I thought I'd just post here with my changes. Sorry that > some of the changes are in the main code and not in a plug

stemming

2006-06-29 Thread Dima Mazmanov
Hi, . I've gotten a couple of questions offlist about stemming so I thought I'd just post here with my changes. Sorry that some of the changes are in the main code and not in a plugin. It seemed that it's more efficient to put in the main analyzer. It would be nice if later rel

Re: Stemming in Nutch 0.7.2 issue

2006-06-29 Thread Jayant Kumar Gandhi
otels" miss out of the results. Also because of stemming my fielded searching on custom fields has stopped working. Implemented on the lines of http://wiki.apache.org/nutch/WritingPluginExample If I search for "rating:3" it gets modified to "rate 3" and hence I don

Re: Stemming in Nutch 0.7.2 issue

2006-06-29 Thread Jérôme Charron
I am using the code as given at http://www.nabble.com/RE%3A-Nutch-does-not-use-stemmers--p249520.html Deactivate the basic query filter and it should work. Jérôme -- http://motrech.free.fr/ http://www.frutch.org/

Re: Stemming in Nutch 0.7.2 issue

2006-06-29 Thread Jayant Kumar Gandhi
I am using the code as given at http://www.nabble.com/RE%3A-Nutch-does-not-use-stemmers--p249520.html On 6/29/06, Jérôme Charron <[EMAIL PROTECTED]> wrote: Yes, that's what stemming is supposed to do. But take a look at your query (that I have cut and paste in my previous mail): bot

Re: Stemming in Nutch 0.7.2 issue

2006-06-29 Thread Jérôme Charron
yeah that page had both hotel and hotels, but shouldn't it have been all pages that contain hotel or hotels or both. thats what stemming is supposed to do. Yes, that's what stemming is supposed to do. But take a look at your query (that I have cut and paste in my previous mail): both

Re: Stemming in Nutch 0.7.2 issue

2006-06-29 Thread Jayant Kumar Gandhi
yeah that page had both hotel and hotels, but shouldn't it have been all pages that contain hotel or hotels or both. thats what stemming is supposed to do. I have 2 pages that contain 'groves' and no page containing 'grove', I get no result when stemmer plugin is e

Re: Stemming in Nutch 0.7.2 issue

2006-06-29 Thread Jérôme Charron
I need stemming in my search engine based on Nutch 0.7.2, the stemming query is being created but I am not getting appropriate results. If I search for hotel, I get 11 results, but if I search for hotels, I get 1 result. You got one result that contains both hotel and hotels ... no

Stemming in Nutch 0.7.2 issue

2006-06-29 Thread Jayant Kumar Gandhi
Hey, I need stemming in my search engine based on Nutch 0.7.2, the stemming query is being created but I am not getting appropriate results. If I search for hotel, I get 11 results, but if I search for hotels, I get 1 result. Any thoughts? I have implemented stemming using the code in the mail

Re: stemming

2006-06-28 Thread Andrzej Bialecki
Eugen Kochuev wrote: P.P.S Why not to develop efficient technique to fight near-duplicates and SE spam? This is absolutely necessary if build Internet search Why not, indeed? ;) The answer is that it is very difficult. There are simple methods that Nutch uses (MD5 and "text profile"), but g

Re[4]: stemming

2006-06-28 Thread bb300
Hi,Eugen I think that is right way. --- Regards, Alexey > P.P.S Why not to develop efficient technique to fight near-duplicates > and SE spam? This is absolutely necessary if build Internet search > engine based on nutch. Another "must have" is variable refetch time > for pages (this

Re: Re[3]: stemming

2006-06-26 Thread bb300
Hi,Eugene Thanks a lot! --- Regards Alexey > Sorry for the delay answering you. I will definitely share my code > with nutch community, but currently I'm on vacation, away from my > sources, so I will share them as soon as my vacation ends ;-)

Re[3]: stemming

2006-06-26 Thread Eugen Kochuev
Alexey, Sorry for the delay answering you. I will definitely share my code with nutch community, but currently I'm on vacation, away from my sources, so I will share them as soon as my vacation ends ;-) P.S. Nutch is great I and I hope that my efforts will help to make it better. P.P.S Why not t

Re[2]: stemming

2006-06-26 Thread bb300
Hi,Jerome I think that the best way is to ask Eugene to share his code. I hope he will comply our request... :) I want to believe that his answer will be positive! if not, then I will share my "BAD code" to You. --- Regards Alexey na> I don't know. na> Could you please send me off li

Re: Re[2]: stemming

2006-06-26 Thread Jérôme Charron
I succeeded in implementing Russian stemming for nutch 0.8. Here's the example http://j1.lan23.net:8080/?query=%D1%81%D0%B0%D0%B9%D1%82&hitsPerPage=10 Everything is working fine, including highlighting. Eugen, don't want to share your code with the community? ;-) -- http://m

Re: stemming

2006-06-26 Thread Jérôme Charron
What is my mistake in wrapping of lucene's russian analyzer? As I understand lucene works well with russian (I read about it in the lucene users and developers mail lists). I don't know. Could you please send me off list your code. Jérôme

Re[2]: stemming

2006-06-25 Thread bb300
Hi, Eugen! Could You help me with russian stemming and highlighting! How did You do that? --- Regards Alexey

Re: stemming

2006-06-25 Thread bb300
Hi, Eugen! Could You help me with russian stemming and highlighting! How did You do that? --- Regards Alexey

Re[2]: stemming

2006-06-25 Thread Eugen Kochuev
Alexey, I succeeded in implementing Russian stemming for nutch 0.8. Here's the example http://j1.lan23.net:8080/?query=%D1%81%D0%B0%D0%B9%D1%82&hitsPerPage=10 Everything is working fine, including highlighting. -- Best regards, Eugenmailto:[EMAIL PROTECTED]

Re: stemming

2006-06-24 Thread bb300
to Jerome What is my mistake in wrapping of lucene's russian analyzer? As I understand lucene works well with russian (I read about it in the lucene users and developers mail lists). --- Regards Alexey

Re: stemming

2006-06-21 Thread bb300
erstand how it works. Yes, I'am trying to wrap lucene russian stemming into nutch. Russian language, in my opinion, more powerfull "big language" in the world - One tning could be told by many ways (using only stemming of the words or so...). --- Regards Alexey

RE: stemming

2006-06-21 Thread Teruhiko Kurosaka
> For example: The page contain next words (in a different > forms)(text is in russian): Russian? I don't see a Russian Analyzer in the Trunk. Did you write your own analyzer for Russian? Are you using org.apache.lucene.analysis.ru.RussianAnalyzer? The source code of RussianAnalyzer looks very c

Re: stemming

2006-06-21 Thread bb300
different forms)(text is in russian): - fish (different forms), - sea, - mission (only in main form), - electricity, - aquarium (different forms), - lighting (different forms). 1)with stemming - fish (main form and not) - find (stemming works) - sea - can't find - mission (onl

RE: stemming

2006-06-21 Thread Teruhiko Kurosaka
in", "durch", "wegen", "wird" }; // From src/java/org/apache/lucene/analysis/de/GermanAnalyzer.java // of Lucene 1.4.3 distribution. This could be slightly out of date. You'd have to either modify the source code in: src/plugin/analysis-de/src/java/org/apache/nutch/analysis/de/GermanAnalyzer.java to use the constructor that takes the word list or the file name of the word list, I think. > when I use trunk version should I change some code as it shown at wiki > in MultiLingual support page? Because, as I understand everything in > trunk version have been done for stemming plugins integration without > code changing. I believe Jérôme has implemented these code changes into the Trunk. -kuro

Re: Re: stemming

2006-06-21 Thread Jérôme Charron
Actually I could not find stopwords file. Could You help me with this. If you have simply wrapped a Lucene's analyzer (like fr and de analyzers), the default stop word list is inside the analyzer code (take a look at the analyzer source). Jérôme -- http://motrech.free.fr/ http://www.frutch.org

Re: stemming

2006-06-21 Thread Jérôme Charron
When I disable the stemming (the index is the same) it could find that words (of course it find only that form of the words which presents in the queries). Just a silly question: Do you build your index with the analyzers turned on? (does the documents language was correctly guessed and the

Re: stemming

2006-06-21 Thread bb300
to Jerome > Checks that these words are not in the stopword list of your analyzer. Actually I could not find stopwords file. Could You help me with this. Actually I am sure that such words as mission, sea, ocean, building, electricity, etc. couldn't be in stopwords file. (at my previous questio

Re: Re: stemming

2006-06-21 Thread bb300
to Jerome > Checks that these words are not in the stopword list of your analyzer. Actually I could not find stopwords file. Could You help me with this. Actually I am sure that such worda as a mission, sea, ocean, building, electricity, etc. couldn't be in stopwords file. (at my previous quest

Re: stemming

2006-06-21 Thread bb300
Thanks! to Jerome > Checks that these words are not in the stopword list of your analyzer. That words aren't in the stopword list. It couldn't find them at all. When I disable the stemming (the index is the same) it could find that words (of course it find only that form of th

Re: stemming

2006-06-21 Thread Jérôme Charron
1) some words it can find, but some not (I'am sure that "missing" words present in index, because when I disable the stemming it finds them) Checks that these words are not in the stopword list of your analyzer. 2) the queries with positive results have some strange thing - it

Re: stemming

2006-06-20 Thread bb300
At first I use trunk version. The problem is in the next points: 1) some words it can find, but some not (I'am sure that "missing" words present in index, because when I disable the stemming it finds them) 2) the queries with positive results have some strange thing - it finds not a

Re: stemming

2006-06-20 Thread Jérôme Charron
The analysis-xx plugin provides the stemming function to the analyzer used for indexing but it does not provide the same stemming function the the query analyzer. In the trunk, the analysis plugins are both used for documents analysis and queries analysis. The right thing to do is to make the

RE: stemming

2006-06-20 Thread Teruhiko Kurosaka
Here's my understanding of the current state of analyzer, which might be wrong. The analysis-xx plugin provides the stemming function to the analyzer used for indexing but it does not provide the same stemming function the the query analyzer. This means that instead of typing the complete

Re: stemming

2006-06-18 Thread bb300
Is there any way to setup stemming? I made necessary changes in includes by adding analysis-(de|fr|ru), but it seems to me that there is a problem with "search query module", because the words that present in index the nutch couldn't find (even with the right form of the words). What is my mistake?

Re: stemming

2006-05-23 Thread Jérôme Charron
> Could anybody help me with adding steming for russian language. As suggested by Matthias, you can use the lucene stemming package and wrap it in a NutchAnalyzer. See the analysis-fr and analysis-de sample plugins in Nutch. A description of the internal mechanism is available on the nutch w

Re: stemming

2006-05-23 Thread Matthias Jaekle
Hi, there is a lucene package for russion stemming: http://lucene.apache.org/java/docs/api/org/apache/lucene/analysis/ru/package-summary.html We have once implemented the corresponding german stemming module: http://wiki.apache.org/nutch/German But this is for an old nutch version. Maybe this

stemming

2006-05-23 Thread bb300
Could anybody help me with adding steming for russian language. Thanks

RE: ontology, stemming, wordnet?

2005-07-14 Thread Howie Wang
2) How do I get stemming to work on Nutch? Here's what I did: http://www.nutchhacks.com/ftopic873.php

ontology, stemming, wordnet?

2005-07-14 Thread J S
Hi, I wonder if you could help me out. I just have a few general questions which I'm really stuck on: 1) What does the ontology plugin do? 2) How do I get stemming to work on Nutch? 3) I've seen some literature on Wordnet and just wondered if there's a way to get that working on

ontology, stemming, wordnet?

2005-07-11 Thread J S
Hi, I wonder if you could help me out. I just have a few general questions which I'm really stuck on: 1) What does the ontology plugin do? 2) How do I get stemming to work on Nutch? 3) I've seen some literature on Wordnet and just wondered if there's a way to get that work