It's not out of laziness that this doesn't use the new API; the problem is that it shares code with ArabicAnalyzer.
At the time I submitted it, you could not mix APIs. Let me know if you need a new patch, but I agree it might be best to do all of contrib/analysis at once...

On Tue, Jul 14, 2009 at 3:11 PM, Mark Miller <markrmil...@gmail.com> wrote:
> Fine with me.
>
> On Tue, Jul 14, 2009 at 2:25 PM, Michael McCandless (JIRA) <j...@apache.org>
> wrote:
>>
>> [
>> https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12731017#action_12731017
>> ]
>>
>> Michael McCandless commented on LUCENE-1628:
>> --------------------------------------------
>>
>> I think we should go ahead and commit this and cutover to the new API as a
>> separate step?
>>
>> > Persian Analyzer
>> > ----------------
>> >
>> >                 Key: LUCENE-1628
>> >                 URL: https://issues.apache.org/jira/browse/LUCENE-1628
>> >             Project: Lucene - Java
>> >          Issue Type: New Feature
>> >          Components: contrib/analyzers
>> >            Reporter: Robert Muir
>> >            Assignee: Mark Miller
>> >            Priority: Minor
>> >             Fix For: 2.9
>> >
>> >         Attachments: LUCENE-1628.patch, LUCENE-1628.patch
>> >
>> >
>> > A simple persian analyzer.
>> > i measured trec scores with the benchmark package below against
>> > http://ece.ut.ac.ir/DBRG/Hamshahri/ :
>> > SimpleAnalyzer:
>> > SUMMARY
>> >   Search Seconds:         0.012
>> >   DocName Seconds:        0.020
>> >   Num Points:           981.015
>> >   Num Good Points:       33.738
>> >   Max Good Points:       36.185
>> >   Average Precision:      0.374
>> >   MRR:                    0.667
>> >   Recall:                 0.905
>> >   Precision At 1:         0.585
>> >   Precision At 2:         0.531
>> >   Precision At 3:         0.513
>> >   Precision At 4:         0.496
>> >   Precision At 5:         0.486
>> >   Precision At 6:         0.487
>> >   Precision At 7:         0.479
>> >   Precision At 8:         0.465
>> >   Precision At 9:         0.458
>> >   Precision At 10:        0.460
>> >   Precision At 11:        0.453
>> >   Precision At 12:        0.453
>> >   Precision At 13:        0.445
>> >   Precision At 14:        0.438
>> >   Precision At 15:        0.438
>> >   Precision At 16:        0.438
>> >   Precision At 17:        0.429
>> >   Precision At 18:        0.429
>> >   Precision At 19:        0.419
>> >   Precision At 20:        0.415
>> > PersianAnalyzer:
>> > SUMMARY
>> >   Search Seconds:         0.004
>> >   DocName Seconds:        0.011
>> >   Num Points:           987.692
>> >   Num Good Points:       36.123
>> >   Max Good Points:       36.185
>> >   Average Precision:      0.481
>> >   MRR:                    0.833
>> >   Recall:                 0.998
>> >   Precision At 1:         0.754
>> >   Precision At 2:         0.715
>> >   Precision At 3:         0.646
>> >   Precision At 4:         0.646
>> >   Precision At 5:         0.631
>> >   Precision At 6:         0.621
>> >   Precision At 7:         0.593
>> >   Precision At 8:         0.577
>> >   Precision At 9:         0.573
>> >   Precision At 10:        0.566
>> >   Precision At 11:        0.572
>> >   Precision At 12:        0.562
>> >   Precision At 13:        0.554
>> >   Precision At 14:        0.549
>> >   Precision At 15:        0.542
>> >   Precision At 16:        0.538
>> >   Precision At 17:        0.533
>> >   Precision At 18:        0.527
>> >   Precision At 19:        0.525
>> >   Precision At 20:        0.518
>>
>> --
>> This message is automatically generated by JIRA.
>> -
>> You can reply to this email to add a comment to the issue online.
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
>> For additional commands, e-mail: java-dev-h...@lucene.apache.org
>>
>
>
>
> --
> --
> - Mark
>
> http://www.lucidimagination.com
>
>

--
Robert Muir
rcm...@gmail.com