ShingleFilter benchmark
-----------------------
Key: LUCENE-2223
URL: https://issues.apache.org/jira/browse/LUCENE-2223
Project: Lucene - Java
Issue Type: New Feature
Components: contrib/benchmark
Affects Versions: 3.0
Reporter: Steven Rowe
Priority: Minor
Spawned from LUCENE-2218: a benchmark for ShingleFilter, along with a new
task, NewShingleAnalyzerTask, that instantiates ShingleAnalyzerWrapper through
a non-default constructor.
The included shingle.alg runs ShingleAnalyzerWrapper, wrapping the default
StandardAnalyzer, with 4 different configurations over 10,000 Reuters documents
each. To allow ShingleFilter timings to be isolated from the rest of the
pipeline, StandardAnalyzer is also run over the same set of Reuters documents.
This set of 5 runs is then repeated 5 times.
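
One way the isolation described above could be computed is sketched below in
Java. This is not code from the patch: the class and method names are
hypothetical, the timings are made-up placeholders, and subtracting the
StandardAnalyzer baseline from the wrapper time is an assumption about how the
ShingleFilter share would be estimated.

```java
import java.util.Arrays;

// Hypothetical helper, not part of the LUCENE-2223 patch.
final class ShingleTimingSketch {

    /** Smallest elapsed time (ms) across the repeated rounds of one configuration. */
    static long minElapsed(long[] roundsMs) {
        // getAsLong() throws NoSuchElementException if no rounds were recorded.
        return Arrays.stream(roundsMs).min().getAsLong();
    }

    /**
     * Assumed estimate of the time attributable to ShingleFilter alone:
     * best wrapper time minus best StandardAnalyzer-only time.
     */
    static long shingleOnlyMs(long[] wrapperRoundsMs, long[] baselineRoundsMs) {
        return minElapsed(wrapperRoundsMs) - minElapsed(baselineRoundsMs);
    }

    public static void main(String[] args) {
        // Placeholder timings for 5 rounds each; not measured results.
        long[] standardAnalyzer = {1210, 1185, 1192, 1201, 1188};
        long[] shingleWrapper = {2050, 1990, 2010, 2003, 1995};
        System.out.println("ShingleFilter share (ms): "
                + shingleOnlyMs(shingleWrapper, standardAnalyzer));
    }
}
```

Taking the minimum over the rounds, rather than the mean, tends to reduce the
influence of JVM warm-up and other transient noise on the comparison.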
The patch includes two Perl scripts. The first outputs timing information as a
JIRA-formatted table, giving the minimum elapsed time for each of the 4
ShingleAnalyzerWrapper runs and for the StandardAnalyzer run. The second
compares the JIRA output of two runs and produces another JIRA table showing
the % improvement.
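
The comparison step can be illustrated with a short Java sketch. It does not
reproduce the Perl scripts in the patch: the names are hypothetical, the
column layout is illustrative, and the improvement formula
(before - after) / before * 100 is an assumption, since the exact definition
used by the scripts is not stated here.

```java
import java.util.Locale;

// Hypothetical helper, not part of the LUCENE-2223 patch.
final class ImprovementSketch {

    /** Assumed definition: percentage reduction in elapsed time. */
    static double percentImprovement(double beforeMs, double afterMs) {
        if (beforeMs <= 0) {
            throw new IllegalArgumentException("beforeMs must be positive");
        }
        return (beforeMs - afterMs) / beforeMs * 100.0;
    }

    /** Formats one data row in JIRA wiki table markup (cells separated by '|'). */
    static String jiraRow(String label, double beforeMs, double afterMs) {
        return String.format(Locale.ROOT, "|%s|%.0f|%.0f|%.1f%%|",
                label, beforeMs, afterMs, percentImprovement(beforeMs, afterMs));
    }

    public static void main(String[] args) {
        // JIRA wiki markup uses '||' to delimit header cells.
        System.out.println("||Configuration||Before (ms)||After (ms)||Improvement||");
        // Placeholder values; not measured results.
        System.out.println(jiraRow("example configuration", 200, 150));
    }
}
```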