Sure, I'll fix that on the wiki. Thanks for pointing that out, Uwe.

On Fri, Feb 20, 2015 at 1:10 PM, Uwe Schindler <u...@thetaphi.de> wrote:
> Many thanks! :-) Nice work!
>
> I found a small typo in the announcement text on the mail and web page:
> "Those indexes can then be read (see next section) with Lucene 5..."
> The "see next section" should not be there, it's only relevant in the
> migration guide (because there is a section following). Maybe fix this on
> the web page, for the mail it's too late.
>
> Uwe
>
> -----
> Uwe Schindler
> H.-H.-Meier-Allee 63, D-28213 Bremen
> http://www.thetaphi.de
> eMail: u...@thetaphi.de
>
>
> > -----Original Message-----
> > From: Anshum Gupta [mailto:ans...@anshumgupta.net]
> > Sent: Friday, February 20, 2015 9:55 PM
> > To: d...@lucene.apache.org; gene...@lucene.apache.org;
> > java-u...@lucene.apache.org
> > Subject: [ANNOUNCE] Apache Lucene 5.0.0 released
> >
> > 20 February 2015, Apache Lucene™ 5.0.0 available
> >
> > The Lucene PMC is pleased to announce the release of Apache Lucene 5.0.
> >
> > Apache Lucene is a high-performance, full-featured text search engine
> > library written entirely in Java. It is a technology suitable for nearly
> > any application that requires full-text search, especially
> > cross-platform.
> >
> > This release contains numerous bug fixes, optimizations, and
> > improvements, some of which are highlighted below. The release is
> > available for immediate download at:
> > http://lucene.apache.org/core/mirrors-core-latest-redir.html
> >
> > See the CHANGES.txt file included with the release for a full list of
> > details.
> >
> > Lucene 5.0 Release Highlights:
> >
> > Stronger index safety
> >
> > * All file access now uses Java’s NIO.2 APIs which give Lucene stronger
> > index safety in terms of better error handling and safer commits.
> >
> > * Every Lucene segment now stores a unique id per-segment and
> > per-commit to aid in accurate replication of index files.
> >
> > * During merging, IndexWriter now always checks the incoming segments
> > for corruption before merging.
> > This can mean, on upgrading to 5.0.0, that merging may uncover
> > long-standing latent corruption in an older 4.x index.
> >
> > Reduced heap usage
> >
> > * Lucene now supports random-writable and advance-able sparse bitsets
> > (RoaringDocIdSet and SparseFixedBitSet), so the heap required is in
> > proportion to how many bits are set, not how many total documents exist
> > in the index.
> >
> > * Heap usage during IndexWriter merging is also much lower with the new
> > Lucene50Codec, since doc values and norms for the segments being merged
> > are no longer fully loaded into heap for all fields; now they are loaded
> > for the one field currently being merged, and then dropped.
> >
> > * The default norms format now uses sparse encoding when appropriate, so
> > indices that enable norms for many sparse fields will see a large
> > reduction in required heap at search time.
> >
> > * 5.0 has a new API to print a tree structure showing a recursive
> > breakdown of which parts are using how much heap.
> >
> > Other features
> >
> > * FieldCache is gone (moved to a dedicated UninvertingReader in the misc
> > module). This means when you intend to sort on a field, you should index
> > that field using doc values, which is much faster and less heap consuming
> > than FieldCache.
> >
> > * Tokenizers and Analyzers no longer require Reader on init.
> >
> > * NormsFormat now gets its own dedicated NormsConsumer/Producer
> >
> > * SortedSetSortField, used to sort on a multi-valued field, is promoted
> > from sandbox to Lucene's core.
> >
> > * PostingsFormat now uses a "pull" API when writing postings, just like
> > doc values. This is powerful because you can do things in your postings
> > format that require making more than one pass through the postings such
> > as iterating over all postings for each term to decide which compression
> > format it should use.
> >
> > * New DateRangeField type enables indexing and searching of date ranges,
> > particularly multi-valued ones.
> >
> > * A new ExitableDirectoryReader extends FilterDirectoryReader and
> > enables exiting requests that take too long to enumerate over terms.
> >
> > * Suggesters can now be built from multi-valued fields, as
> > DocumentDictionary now enumerates each value separately in a
> > multi-valued field.
> >
> > * ConcurrentMergeScheduler detects whether the index is on SSD or not
> > and does a better job defaulting its settings. This only works on Linux
> > for now; other OSes will continue to use the previous defaults (tuned
> > for spinning disks).
> >
> > * Auto-IO-throttling has been added to ConcurrentMergeScheduler, to rate
> > limit IO writes for each merge depending on incoming merge rate.
> >
> > * CustomAnalyzer has been added, which lets you configure analyzers as
> > you do in Solr's index schema. This class has a builder API to configure
> > Tokenizers, TokenFilters, and CharFilters based on their SPI names and
> > parameters as documented by the corresponding factories.
> >
> > * Memory index now supports payloads.
> >
> > * Added a filter cache with a usage tracking policy that caches filters
> > based on frequency of use.
> >
> > * The default codec has an option to control BEST_SPEED or
> > BEST_COMPRESSION for stored fields.
> >
> > * Stored fields are merged more efficiently, especially when upgrading
> > from previous versions or using SortingMergePolicy.
> >
> > NOTE: Lucene 5 no longer supports the Lucene 3.x index format. Opening
> > such indexes will result in IndexFormatTooOldException. It is recommended
> > to either reindex all your data, or upgrade the old indexes with the
> > IndexUpgrader tool of the latest Lucene 4 version (4.10.x). Those indexes
> > can then be read (see next section) with Lucene 5.
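
An aside on the filter cache item above: the announcement only says the
policy caches filters "based on frequency of use". Below is a minimal Java
sketch of that general idea, assuming a bounded history of recent uses and a
fixed threshold. The class name, method names, and both parameters are
hypothetical, and this is not Lucene's implementation.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch: decides whether a filter is used often enough to cache. */
final class UsageTrackingSketch {
    private final int historySize;   // how many recent uses are remembered
    private final int minFrequency;  // uses within the history needed before caching
    private final Deque<String> history = new ArrayDeque<>();
    private final Map<String, Integer> counts = new HashMap<>();

    UsageTrackingSketch(int historySize, int minFrequency) {
        if (historySize < 1 || minFrequency < 1) {
            throw new IllegalArgumentException("both parameters must be >= 1");
        }
        this.historySize = historySize;
        this.minFrequency = minFrequency;
    }

    /** Records one use of the filter identified by key (an assumed string id). */
    void onUse(String key) {
        history.addLast(key);
        counts.merge(key, 1, Integer::sum);
        if (history.size() > historySize) {
            // Forget the oldest use so that stale popularity ages out.
            String evicted = history.removeFirst();
            // Returning null removes the entry once its count would reach zero.
            counts.computeIfPresent(evicted, (k, c) -> c == 1 ? null : c - 1);
        }
    }

    /** True if the filter appeared at least minFrequency times in the history. */
    boolean shouldCache(String key) {
        return counts.getOrDefault(key, 0) >= minFrequency;
    }

    public static void main(String[] args) {
        UsageTrackingSketch policy = new UsageTrackingSketch(4, 2);
        policy.onUse("type:book");
        System.out.println(policy.shouldCache("type:book")); // false: seen once
        policy.onUse("type:book");
        System.out.println(policy.shouldCache("type:book")); // true: seen twice
    }
}
```

The bounded history is what keeps a filter that was popular long ago from
staying cache-worthy forever. For the policy that actually ships, see the
5.0 javadocs.
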
> >
> > To read more about the changes, also see:
> > http://blog.mikemccandless.com/2014/11/apache-lucene-500-is-coming.html
> >
> > Please read CHANGES.txt
> > (https://lucene.apache.org/core/5_0_0/changes/Changes.html) and
> > MIGRATE.txt for a full list of new features and notes on upgrading.
> >
> > Please report any feedback to the mailing lists
> > (http://lucene.apache.org/core/discussion.html)
> >
> > --
> > Anshum Gupta
> > http://about.me/anshumgupta
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
>

--
Anshum Gupta
http://about.me/anshumgupta