[ https://issues.apache.org/jira/browse/LUCENE-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Steven Rowe updated LUCENE-2181:
--------------------------------
Attachment: top.100k.words.de.en.fr.uk.wikipedia.2009-11.tar.bz2
LUCENE-2181.patch
Hi Robert,
In the new version of the patch, {{ant benchmark}} from the {{contrib/icu/}}
directory attempts to download the attached {{tar.bz2}} file from
{{http://people.apache.org/~rmuir/wikipedia}} (*please change this to the
location where you end up putting the file*), then unpacks the archive to the
{{contrib/icu/src/benchmark/work/}} directory, then compiles and runs the
benchmark.
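For readers without the patch at hand, the download-then-unpack flow described above could be sketched in Ant roughly as follows. This is only an illustration and is not copied from LUCENE-2181.patch: the project name, property names, and target names are assumptions, the URL is the placeholder location mentioned above, and the compile-and-run step is left as a stub. It relies only on the standard Ant tasks {{get}}, {{untar}}, {{mkdir}} and {{echo}}.

```xml
<project name="collation-benchmark-sketch" default="benchmark" basedir=".">
  <!-- Hypothetical property names. The URL is the placeholder location from
       the message and should be changed once the file has a final home. -->
  <property name="wordlists.url" value="http://people.apache.org/~rmuir/wikipedia"/>
  <property name="wordlists.file" value="top.100k.words.de.en.fr.uk.wikipedia.2009-11.tar.bz2"/>
  <!-- Relative to contrib/icu/, where the build is assumed to be run. -->
  <property name="work.dir" value="src/benchmark/work"/>

  <target name="get-wordlists">
    <mkdir dir="${work.dir}"/>
    <!-- usetimestamp skips the download when the local copy is up to date. -->
    <get src="${wordlists.url}/${wordlists.file}"
         dest="${work.dir}/${wordlists.file}"
         usetimestamp="true"/>
    <!-- Unpack the bzip2-compressed tar archive into the work directory. -->
    <untar src="${work.dir}/${wordlists.file}"
           dest="${work.dir}"
           compression="bzip2"/>
  </target>

  <!-- Stub: the real target compiles and runs the benchmark at this point. -->
  <target name="benchmark" depends="get-wordlists">
    <echo message="Word lists unpacked to ${work.dir}"/>
  </target>
</project>
```

Making the benchmark target depend on the download target keeps a plain {{ant benchmark}} invocation self-contained, which matches the behaviour described above.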
In addition to the top 100K word lists, the {{tar.bz2}} file contains
{{LICENSE.txt}}, which contains links to the Wikipedia dumps from which the
lists were extracted, along with a link to the license that Wikipedia uses.
> benchmark for collation
> -----------------------
>
> Key: LUCENE-2181
> URL: https://issues.apache.org/jira/browse/LUCENE-2181
> Project: Lucene - Java
> Issue Type: New Feature
> Components: contrib/benchmark
> Reporter: Robert Muir
> Assignee: Robert Muir
> Attachments: LUCENE-2181.patch,
> top.100k.words.de.en.fr.uk.wikipedia.2009-11.tar.bz2
>
>
> Steven Rowe attached a contrib/benchmark-based benchmark for collation (both
> jdk and icu) under LUCENE-2084, along with some instructions to run it...
> I think it would be nice if we could turn this into a committable patch and
> add it to benchmark.