ShingleFilter benchmark
-----------------------

                 Key: LUCENE-2223
                 URL: https://issues.apache.org/jira/browse/LUCENE-2223
             Project: Lucene - Java
          Issue Type: New Feature
          Components: contrib/benchmark
    Affects Versions: 3.0
            Reporter: Steven Rowe
            Priority: Minor


Spawned from LUCENE-2218: a benchmark for ShingleFilter, along with a new task 
to instantiate (non-default-constructor) ShingleAnalyzerWrapper: 
NewShingleAnalyzerTask.

The included shingle.alg runs ShingleAnalyzerWrapper, wrapping the default 
StandardAnalyzer, with 4 different configurations over 10,000 Reuters documents 
each.  To allow ShingleFilter timings to be isolated from the rest of the 
pipeline, StandardAnalyzer is also run over the same set of Reuters documents.  
This set of 5 runs is then run 5 times.

The patch includes two perl scripts, the first to output JIRA table formatted 
timing information, with the minimum elapsed time for each of the 4 
ShingleAnalyzerWrapper runs and the StandardAnalyzer run, and the second to 
compare two runs' JIRA output, producing another JIRA table showing % 
improvement.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to