Re: ClassNotFoundException in https plugin

2005-07-19 Thread Feng Ji
Hi there, I have successfully checked out Nutch and compiled it, thanks for all the hints. By the way, what is the difference between Anonymous Subversion and Committer Subversion Access? I guess Committer Subversion Access has the right to check code back in. Is that right? Thanks, Michael

Re: NutchAnalysis and CJK

2005-07-19 Thread Transbuerg Tian
Hi, WebLucene does not use dictionary-based segmentation; on the contrary, it uses bi-gram segmentation. You can find more information at http://www.chedong.com or by searching for 车东 and lucene. At the moment I am trying to use dictionary-based segmentation; you could visit my

Re: bin/nutch issue - on Mac OS X

2005-07-19 Thread Piotr Kosiorowski
Hello, Tested on Cygwin and on a Linux box. The ":"-based syntax is used earlier in the nutch script too. Committed. Thanks, Piotr. Erik Hatcher wrote: I'm getting "expr: syntax error" when running all bin/nutch commands. It comes from this line: if expr match `uname` 'CYGWIN*' > /dev/null; then sho

RE: bin/nutch issue - on Mac OS X

2005-07-19 Thread Howie Wang
It works for me and I'm on Cygwin. Howie I'm getting "expr: syntax error" when running all bin/nutch commands. It comes from this line: if expr match `uname` 'CYGWIN*' > /dev/null; then should this be modified to be this instead: if expr `uname` : 'CYGWIN*' > /dev/null; then That

bin/nutch issue - on Mac OS X

2005-07-19 Thread Erik Hatcher
I'm getting "expr: syntax error" when running all bin/nutch commands. It comes from this line: if expr match `uname` 'CYGWIN*' > /dev/null; then should this be modified to be this instead: if expr `uname` : 'CYGWIN*' > /dev/null; then That works for me, but I'm not running Cygwin.
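A minimal sketch of the check discussed in this thread, for illustration only. The `expr match STRING REGEX` form is a GNU extension that BSD-derived `expr` implementations (such as the one on Mac OS X) reject, while `expr STRING : REGEX` is the POSIX form. The `is_cygwin` helper and the `cygwin` variable are hypothetical names used here, not taken from the bin/nutch script.

```shell
#!/bin/sh
# Hypothetical helper (an assumption for illustration, not part of bin/nutch):
# succeeds when the given system name starts with "CYGWI" plus any number of "N".
is_cygwin() {
  # POSIX form: expr STRING : REGEX. The regex is implicitly anchored at the
  # start of STRING, and expr exits 0 only when the match length is non-zero.
  expr "$1" : 'CYGWIN*' > /dev/null
}

# Use the helper on the current platform name.
if is_cygwin "$(uname)"; then
  cygwin=true
else
  cygwin=false
fi
echo "cygwin=$cygwin"
```

Note that 'CYGWIN*' is read as a regular expression, not a shell glob, so the trailing `*` applies to the final `N`. It still matches names such as CYGWIN_NT-5.1 because the pattern only has to match a prefix.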

[jira] Commented: (NUTCH-74) French Analyzer Plugin

2005-07-19 Thread Jerome Charron (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-74?page=comments#action_12316094 ] Jerome Charron commented on NUTCH-74: Christophe, I have already written such a plugin for French and German in order to test the Analyzer Factory. The difference with your approach

[crawl] Response content length is not known

2005-07-19 Thread Christophe Noel
Hello, I used to crawl with the last Nutch 0.6 release, but with the new CVS version I sometimes get "response content length is not known". What is it? Thanks for your help. (sample log) 050719 145804 http.auth.ntlm.username = 050719 145804 fetcher.server.delay = 2000 050719 145804 http.max.del

[jira] Updated: (NUTCH-74) French Analyzer Plugin

2005-07-19 Thread Christophe Noel (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-74?page=all ] Christophe Noel updated NUTCH-74: Attachment: analyze-french.zip A zip file for this plugin (plugin.xml is not right) > French Analyzer Plugin > Key: NUTCH-

[jira] Created: (NUTCH-74) French Analyzer Plugin

2005-07-19 Thread Christophe Noel (JIRA)
French Analyzer Plugin. Key: NUTCH-74 URL: http://issues.apache.org/jira/browse/NUTCH-74 Project: Nutch Type: New Feature Environment: Nutch Reporter: Christophe Noel Attachments: analyze-french.zip This is a DRAFT for a new plugin for Fren

Re: NutchAnalysis and CJK

2005-07-19 Thread Jack Tang
Hi Transbuerg, Could you please describe your solution in detail? I appreciate your time. Regards /Jack On 7/15/05, Transbuerg Tian <[EMAIL PROTECTED]> wrote: > hi, > Jack Tang > > I am in the same situation as you; could you share your total > NutchAnalysis.jj file

Re: NutchAnalysis and CJK

2005-07-19 Thread Jack Tang
Hi ShiBin, Thanks for your post. I have known of WebLucene since 2003. It was said that WebLucene used FMM (dictionary-based segmentation), but I have found nothing in the WebLucene CVS until now. I hope you can understand my mail: it is not difficult to make a cjk-plugin available, and I want Nutc