- Original Message -
From: William Wong
To: Lucene-user
Sent: Friday, October 05, 2001 5:12 PM
Subject: RE: Lucene has moved to Jakarta
How about adding filters for different file types such as
-HTML (there is one in the demo already)
-XML
-PDF
-MsWord/RTF
-other common file formats
THanks.
-william
-Original Message-
From: Doug Cutting [mailto:[EMAIL PROTECTED]]
Sent: Friday, October 05, 2001 11:42 AM
To: '[EMAIL PROTECTED]'; [EMAIL PROTECTED]
Subject: RE: Lucene has moved to Jakarta
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
Congratulations on the move!
Thanks!
As near as I can see, the two major changes for 1.2-rc1 are:
switch to org.apache.lucene package names.
Apache license instead of LGPL.
Yes. Thanks for pointing these out. These are big incompatible changes
that I forgot to mention.
Other changes since 1.01b include:
- ant-only build -- no more makefiles
- addition of lock files--now fully thread process safe
- addition of German stemmer
- MultiSearcher now supports low-level search API
- added RangeQuery, for term-range searching
- Analyzers can choose tokenizer based on field name
- misc bug fixes.
I need to work up detailed release notes for the final 1.2 release.
Sometime when someone has a chance, I'd love to hear a bit about what
plans there are for Lucene development.
Let's see, some short term tasks for the 1.2 release:
- get source code back into releases
- clean up example code
- write release notes
Some mid-term tasks:
- add contributed Chinese analyzers
- add Hits.SetOrdering() support
- add some term highlighting support
Longer term tasks:
- add JDBC-based Directory
- optimize simple conjunctive queries
- optionally store document vectors in index
Have I missed your favorite?
Doug