[jira] [Commented] (LUCENE-3242) Rename Lucene common-build.xml project name

2011-06-25 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054820#comment-13054820
 ] 

Uwe Schindler commented on LUCENE-3242:
---

Do it, no complaints :-)

 Rename Lucene common-build.xml project name
 ---

 Key: LUCENE-3242
 URL: https://issues.apache.org/jira/browse/LUCENE-3242
 Project: Lucene - Java
  Issue Type: Improvement
  Components: general/build
Reporter: Chris Male
Priority: Minor

 While adding the new common module, I ran into a name collision with Lucene's 
 common-build.xml project name.  I've since renamed the common module's 
 project name to be clearer, but I think we should rename common-build's one 
 as well.  Solr's common-build.xml uses common-solr, so lets rename Lucene's 
 to common-lucene.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3239) drop java 5 support

2011-06-25 Thread Simon Willnauer (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054827#comment-13054827
 ] 

Simon Willnauer commented on LUCENE-3239:
-

bq. So let me make sure we are on the same page:

+1

 drop java 5 support
 -

 Key: LUCENE-3239
 URL: https://issues.apache.org/jira/browse/LUCENE-3239
 Project: Lucene - Java
  Issue Type: Task
Reporter: Robert Muir

 its been discussed here and there, but I think we need to drop java 5 
 support, for these reasons:
 * its totally untested by any continual build process. Testing java5 only 
 when there is a release candidate ready is not enough. If we are to claim 
 support then we need a hudson actually running the tests with java 5.
 * its now unmaintained, so bugs have to either be hacked around, tests 
 disabled, warnings placed, but some things simply cannot be fixed... we 
 cannot actually support something that is no longer maintained: we do find 
 JRE bugs (http://wiki.apache.org/lucene-java/SunJavaBugs) and its important 
 that bugs actually get fixed: cannot do everything with hacks.
 * because of its limitations, we do things like allow 20% slower grouping 
 speed. I find it hard to believe we are sacrificing performance for this.
 So, in summary: because we don't test it at all, because its buggy and 
 unmaintained, and because we are sacrificing performance, I think we need to 
 cutover the build system for the next release to require java 6.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Issue Comment Edited] (LUCENE-3239) drop java 5 support

2011-06-25 Thread Simon Willnauer (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054827#comment-13054827
 ] 

Simon Willnauer edited comment on LUCENE-3239 at 6/25/11 7:07 AM:
--

bq. So let me make sure we are on the same page:

+1

we should call out an official vote on the dev list though

  was (Author: simonw):
bq. So let me make sure we are on the same page:

+1
  
 drop java 5 support
 -

 Key: LUCENE-3239
 URL: https://issues.apache.org/jira/browse/LUCENE-3239
 Project: Lucene - Java
  Issue Type: Task
Reporter: Robert Muir

 its been discussed here and there, but I think we need to drop java 5 
 support, for these reasons:
 * its totally untested by any continual build process. Testing java5 only 
 when there is a release candidate ready is not enough. If we are to claim 
 support then we need a hudson actually running the tests with java 5.
 * its now unmaintained, so bugs have to either be hacked around, tests 
 disabled, warnings placed, but some things simply cannot be fixed... we 
 cannot actually support something that is no longer maintained: we do find 
 JRE bugs (http://wiki.apache.org/lucene-java/SunJavaBugs) and its important 
 that bugs actually get fixed: cannot do everything with hacks.
 * because of its limitations, we do things like allow 20% slower grouping 
 speed. I find it hard to believe we are sacrificing performance for this.
 So, in summary: because we don't test it at all, because its buggy and 
 unmaintained, and because we are sacrificing performance, I think we need to 
 cutover the build system for the next release to require java 6.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3229) Overlaped SpanNearQuery

2011-06-25 Thread ludovic Boutros (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ludovic Boutros updated LUCENE-3229:


Priority: Major  (was: Minor)

 Overlaped SpanNearQuery
 ---

 Key: LUCENE-3229
 URL: https://issues.apache.org/jira/browse/LUCENE-3229
 Project: Lucene - Java
  Issue Type: Bug
  Components: core/search
Affects Versions: 3.1
 Environment: Windows XP, Java 1.6
Reporter: ludovic Boutros
 Attachments: LUCENE-3229.patch, LUCENE-3229.patch, SpanOverlap.diff, 
 SpanOverlap2.diff, SpanOverlapTestUnit.diff


 While using Span queries I think I've found a little bug.
 With a document like this (from the TestNearSpansOrdered unit test) :
 w1 w2 w3 w4 w5
 If I try to search for this span query :
 spanNear([spanNear([field:w3, field:w5], 1, true), field:w4], 0, true)
 the above document is returned and I think it should not because 'w4' is not 
 after 'w5'.
 The 2 spans are not ordered, because there is an overlap.
 I will add a test patch in the TestNearSpansOrdered unit test.
 I will add a patch to solve this issue too.
 Basicaly it modifies the two docSpansOrdered functions to make sure that the 
 spans does not overlap.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3229) SpanNearQuery: ordered spans should not overlap

2011-06-25 Thread Paul Elschot (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Paul Elschot updated LUCENE-3229:
-

Fix Version/s: 3.4
  Summary: SpanNearQuery: ordered spans should not overlap  (was: 
Overlaped SpanNearQuery)

 SpanNearQuery: ordered spans should not overlap
 ---

 Key: LUCENE-3229
 URL: https://issues.apache.org/jira/browse/LUCENE-3229
 Project: Lucene - Java
  Issue Type: Bug
  Components: core/search
Affects Versions: 3.1
 Environment: Windows XP, Java 1.6
Reporter: ludovic Boutros
 Fix For: 3.4

 Attachments: LUCENE-3229.patch, LUCENE-3229.patch, SpanOverlap.diff, 
 SpanOverlap2.diff, SpanOverlapTestUnit.diff


 While using Span queries I think I've found a little bug.
 With a document like this (from the TestNearSpansOrdered unit test) :
 w1 w2 w3 w4 w5
 If I try to search for this span query :
 spanNear([spanNear([field:w3, field:w5], 1, true), field:w4], 0, true)
 the above document is returned and I think it should not because 'w4' is not 
 after 'w5'.
 The 2 spans are not ordered, because there is an overlap.
 I will add a test patch in the TestNearSpansOrdered unit test.
 I will add a patch to solve this issue too.
 Basicaly it modifies the two docSpansOrdered functions to make sure that the 
 spans does not overlap.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2619) two sfields in geospatial search

2011-06-25 Thread jose rodriguez (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054849#comment-13054849
 ] 

jose rodriguez commented on SOLR-2619:
--

Hi David ,

q={!geofilt sfield=location_1}fq={!geofilt sfield=location_2}

it works for me . Maybe theres another workaround . tried houndreds 
posibilities with no success.

Thanks for reply. 



 two sfields in geospatial search
 

 Key: SOLR-2619
 URL: https://issues.apache.org/jira/browse/SOLR-2619
 Project: Solr
  Issue Type: Wish
  Components: clients - php
Affects Versions: 3.2
 Environment: Using with drupal
Reporter: jose rodriguez
 Fix For: 3.2


 Is it possible to create a query with two sfield (geospatial search)? .Want 
 to mean two diferents pt and d for each field.
 If i need from - to then i need fields around the from coordinate and around 
 the to coordinates.
 Thanks.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[JENKINS-MAVEN] Lucene-Solr-Maven-3.x #162: POMs out of sync

2011-06-25 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-Maven-3.x/162/

1 tests failed.
FAILED:  org.apache.lucene.index.TestCheckIndex.testLuceneConstantVersion

Error Message:
Invalid version: 3.3-SNAPSHOT

Stack Trace:
java.lang.AssertionError: Invalid version: 3.3-SNAPSHOT
at org.junit.Assert.fail(Assert.java:91)
at org.junit.Assert.assertTrue(Assert.java:43)
at 
org.apache.lucene.index.TestCheckIndex.testLuceneConstantVersion(TestCheckIndex.java:98)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:616)
at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:44)
at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:41)
at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
at org.junit.rules.TestWatchman$1.evaluate(TestWatchman.java:48)
at 
org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
at 
org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:76)
at 
org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1277)
at 
org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1195)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:193)
at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:52)
at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:191)
at org.junit.runners.ParentRunner.access$000(ParentRunner.java:42)
at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:184)
at 
org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
at 
org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
at org.junit.runners.ParentRunner.run(ParentRunner.java:236)
at 
org.apache.maven.surefire.junit4.JUnit4TestSet.execute(JUnit4TestSet.java:35)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:146)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:97)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:616)
at 
org.apache.maven.surefire.booter.ProviderFactory$ClassLoaderProxy.invoke(ProviderFactory.java:103)
at $Proxy0.invoke(Unknown Source)
at 
org.apache.maven.surefire.booter.SurefireStarter.invokeProvider(SurefireStarter.java:145)
at 
org.apache.maven.surefire.booter.SurefireStarter.runSuitesInProcess(SurefireStarter.java:87)
at 
org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:69)




Build Log (for compile errors):
[...truncated 15489 lines...]



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-1742) Wrap SegmentInfos in public class

2011-06-25 Thread Tyheem Backer (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-1742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054868#comment-13054868
 ] 

Tyheem Backer commented on LUCENE-1742:
---

[Thanks|http://rullymisar.com/]

 Wrap SegmentInfos in public class 
 --

 Key: LUCENE-1742
 URL: https://issues.apache.org/jira/browse/LUCENE-1742
 Project: Lucene - Java
  Issue Type: Improvement
  Components: core/index
Affects Versions: 2.4.1
Reporter: Jason Rutherglen
Priority: Trivial
 Fix For: 2.9

 Attachments: LUCENE-1742.patch, LUCENE-1742.patch, LUCENE-1742.patch, 
 LUCENE-1742.patch, LUCENE-1742.patch

   Original Estimate: 48h
  Remaining Estimate: 48h

 Wrap SegmentInfos in a public class so that subclasses of MergePolicy do not 
 need to be in the org.apache.lucene.index package.  

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3239) drop java 5 support

2011-06-25 Thread Dawid Weiss (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054896#comment-13054896
 ] 

Dawid Weiss commented on LUCENE-3239:
-

+1 from me.

 drop java 5 support
 -

 Key: LUCENE-3239
 URL: https://issues.apache.org/jira/browse/LUCENE-3239
 Project: Lucene - Java
  Issue Type: Task
Reporter: Robert Muir

 its been discussed here and there, but I think we need to drop java 5 
 support, for these reasons:
 * its totally untested by any continual build process. Testing java5 only 
 when there is a release candidate ready is not enough. If we are to claim 
 support then we need a hudson actually running the tests with java 5.
 * its now unmaintained, so bugs have to either be hacked around, tests 
 disabled, warnings placed, but some things simply cannot be fixed... we 
 cannot actually support something that is no longer maintained: we do find 
 JRE bugs (http://wiki.apache.org/lucene-java/SunJavaBugs) and its important 
 that bugs actually get fixed: cannot do everything with hacks.
 * because of its limitations, we do things like allow 20% slower grouping 
 speed. I find it hard to believe we are sacrificing performance for this.
 So, in summary: because we don't test it at all, because its buggy and 
 unmaintained, and because we are sacrificing performance, I think we need to 
 cutover the build system for the next release to require java 6.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[JENKINS-MAVEN] Lucene-Solr-Maven-trunk #159: POMs out of sync

2011-06-25 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-Maven-trunk/159/

No tests ran.

Build Log (for compile errors):
[...truncated 9467 lines...]



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3234) Provide limit on phrase analysis in FastVectorHighlighter

2011-06-25 Thread Mike Sokolov (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mike Sokolov updated LUCENE-3234:
-

Attachment: LUCENE-3234.patch

Sure - the test is fragile.  It was just meant to illustrate the use case; not 
really a good unit test for regression.  The last patch has it commented.

 Provide limit on phrase analysis in FastVectorHighlighter
 -

 Key: LUCENE-3234
 URL: https://issues.apache.org/jira/browse/LUCENE-3234
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 2.9.4, 3.0.3, 3.1, 3.2, 3.3
Reporter: Mike Sokolov
Assignee: Koji Sekiguchi
 Fix For: 3.4, 4.0

 Attachments: LUCENE-3234.patch, LUCENE-3234.patch, LUCENE-3234.patch


 With larger documents, FVH can spend a lot of time trying to find the 
 best-scoring snippet as it examines every possible phrase formed from 
 matching terms in the document.  If one is willing to accept
 less-than-perfect scoring by limiting the number of phrases that are 
 examined, substantial speedups are possible.  This is analogous to the 
 Highlighter limit on the number of characters to analyze.
 The patch includes an artifical test case that shows  1000x speedup.  In a 
 more normal test environment, with English documents and random queries, I am 
 seeing speedups of around 3-10x when setting phraseLimit=1, which has the 
 effect of selecting the first possible snippet in the document.  Most of our 
 sites operate in this way (just show the first snippet), so this would be a 
 big win for us.
 With phraseLimit = -1, you get the existing FVH behavior. At larger values of 
 phraseLimit, you may not get substantial speedup in the normal case, but you 
 do get the benefit of protection against blow-up in pathological cases.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-1889) FastVectorHighlighter: support for additional queries

2011-06-25 Thread Mike Sokolov (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054923#comment-13054923
 ] 

Mike Sokolov commented on LUCENE-1889:
--

I made an incremental change to FVH to support WildcardQuery, PrefixQuery and 
RegexpQuery.  Just uses Java regexes to match.  It is faster than 
HighlightQuery (4-5x in enwiki benchmark) although not as much faster as the 
comparison w/TermQuery.  

A possible issue is that regex support will differ from RegexpQuery, but I 
think? that Java's is a superset, so should be ok, but I'm not sure about this 
one.

This doesn't take you to nirvana, but does add support for a common case. If 
there's interest I'll post.


 FastVectorHighlighter: support for additional queries
 -

 Key: LUCENE-1889
 URL: https://issues.apache.org/jira/browse/LUCENE-1889
 Project: Lucene - Java
  Issue Type: Wish
  Components: modules/highlighter
Reporter: Robert Muir
Priority: Minor

 I am using fastvectorhighlighter for some strange languages and it is working 
 well! 
 One thing i noticed immediately is that many query types are not highlighted 
 (multitermquery, multiphrasequery, etc)
 Here is one thing Michael M posted in the original ticket:
 {quote}
 I think a nice [eventual] model would be if we could simply re-run the
 scorer on the single document (using InstantiatedIndex maybe, or
 simply some sort of wrapper on the term vectors which are already a
 mini-inverted-index for a single doc), but extend the scorer API to
 tell us the exact term occurrences that participated in a match (which
 I don't think is exposed today).
 {quote}
 Due to strange requirements I am using something similar to this (but 
 specialized to our case).
 I am doing strange things like forcing multitermqueries to rewrite into 
 boolean queries so they will be highlighted,
 and flattening multiphrasequeries into boolean or'ed phrasequeries.
 I do not think these things would be 'fast', but i had a few ideas that might 
 help:
 * looking at contrib/highlighter, you can support FilteredQuery in flatten() 
 by calling getQuery() right?
 * maybe as a last resort, try Query.extractTerms() ?

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2242) Get distinct count of names for a facet field

2011-06-25 Thread Bill Bell (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054931#comment-13054931
 ] 

Bill Bell commented on SOLR-2242:
-

re: whitespace

What are the settings supposed to be for tabs? Because on my editor it looks 
perfect. 4 space, tabs, 2 space per tab? ??

I will add some tests.

I think switching from if to switch and the movement to termList != null is 
mostly just style and does not really improve anything. I actually think it 
confuses things and makes the overall patch larger and more risky that we miss 
something or mess it up.

I will also look at the Integer generic... Thanks.

Bill


 Get distinct count of names for a facet field
 -

 Key: SOLR-2242
 URL: https://issues.apache.org/jira/browse/SOLR-2242
 Project: Solr
  Issue Type: New Feature
  Components: Response Writers
Affects Versions: 4.0
Reporter: Bill Bell
Assignee: Simon Willnauer
Priority: Minor
 Fix For: 4.0

 Attachments: SOLR-2242.patch, SOLR-2242.shard.patch, 
 SOLR-2242.shard.patch, SOLR-2242.solr3.1.patch, SOLR.2242.solr3.1.patch, 
 SOLR.2242.v2.patch


 When returning facet.field=name of field you will get a list of matches for 
 distinct values. This is normal behavior. This patch tells you how many 
 distinct values you have (# of rows). Use with limit=-1 and mincount=1.
 The feature is called namedistinct. Here is an example:
 http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solrindent=trueq=*:*facet=truefacet.mincount=1facet.numFacetTerms=2facet.limit=-1facet.field=price
 http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solrindent=trueq=*:*facet=truefacet.mincount=1facet.numFacetTerms=0facet.limit=-1facet.field=price
 http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solrindent=trueq=*:*facet=truefacet.mincount=1facet.numFacetTerms=1facet.limit=-1facet.field=price
 This currently only works on facet.field.
 {code}
 lst name=facet_fields
   lst name=price
 int name=numFacetTerms14/int
 int name=0.03/intint name=11.51/intint 
 name=19.951/intint name=74.991/intint name=92.01/intint 
 name=179.991/intint name=185.01/intint name=279.951/intint 
 name=329.951/intint name=350.01/intint name=399.01/intint 
 name=479.951/intint name=649.991/intint name=2199.01/int
   /lst
 /lst
 {code} 
 Several people use this to get the group.field count (the # of groups).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2242) Get distinct count of names for a facet field

2011-06-25 Thread Bill Bell (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054932#comment-13054932
 ] 

Bill Bell commented on SOLR-2242:
-

Just so you know I have been using the original patch in production for over 5 
months. I would say that the original one is tested.

But now that we are changing it, I agree that we need more coverage.

That will be my #1 priority.

 Get distinct count of names for a facet field
 -

 Key: SOLR-2242
 URL: https://issues.apache.org/jira/browse/SOLR-2242
 Project: Solr
  Issue Type: New Feature
  Components: Response Writers
Affects Versions: 4.0
Reporter: Bill Bell
Assignee: Simon Willnauer
Priority: Minor
 Fix For: 4.0

 Attachments: SOLR-2242.patch, SOLR-2242.shard.patch, 
 SOLR-2242.shard.patch, SOLR-2242.solr3.1.patch, SOLR.2242.solr3.1.patch, 
 SOLR.2242.v2.patch


 When returning facet.field=name of field you will get a list of matches for 
 distinct values. This is normal behavior. This patch tells you how many 
 distinct values you have (# of rows). Use with limit=-1 and mincount=1.
 The feature is called namedistinct. Here is an example:
 http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solrindent=trueq=*:*facet=truefacet.mincount=1facet.numFacetTerms=2facet.limit=-1facet.field=price
 http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solrindent=trueq=*:*facet=truefacet.mincount=1facet.numFacetTerms=0facet.limit=-1facet.field=price
 http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solrindent=trueq=*:*facet=truefacet.mincount=1facet.numFacetTerms=1facet.limit=-1facet.field=price
 This currently only works on facet.field.
 {code}
 lst name=facet_fields
   lst name=price
 int name=numFacetTerms14/int
 int name=0.03/intint name=11.51/intint 
 name=19.951/intint name=74.991/intint name=92.01/intint 
 name=179.991/intint name=185.01/intint name=279.951/intint 
 name=329.951/intint name=350.01/intint name=399.01/intint 
 name=479.951/intint name=649.991/intint name=2199.01/int
   /lst
 /lst
 {code} 
 Several people use this to get the group.field count (the # of groups).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-2622) ZkSolrResourceLoader does not support getConfigDir()

2011-06-25 Thread Stefan Matheis (steffkes) (JIRA)
ZkSolrResourceLoader does not support getConfigDir()


 Key: SOLR-2622
 URL: https://issues.apache.org/jira/browse/SOLR-2622
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud, web gui
Affects Versions: 4.0
 Environment: SVN-Revision: {{1139570}}
Startup-Command: {{cd solr/example  java -Dbootstrap_confdir=./solr/conf 
-Dcollection.configName=myconf -DzkRun -jar start.jar}}
Reporter: Stefan Matheis (steffkes)


Requesting 
{{/solr/admin/file/?contentType=text/xml;charset=utf-8file=schema.xml}} 
generates an HTTP 500:

{code}org.apache.solr.common.cloud.ZooKeeperException: ZkSolrResourceLoader 
does not support getConfigDir() - likely, what you are trying to do is not 
supported in ZooKeeper mode
at 
org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:99)
at 
org.apache.solr.handler.admin.ShowFileRequestHandler.handleRequestBody(ShowFileRequestHandler.java:126)
at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
at org.apache.solr.core.SolrCore.execute(SolrCore.java:1316)
at 
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:353)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:248)
at 
org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
at 
org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
at 
org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
at 
org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
at 
org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
at 
org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
at 
org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
at 
org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
at org.mortbay.jetty.Server.handle(Server.java:326)
at 
org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542)
at 
org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConnection.java:928)
at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:549)
at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212)
at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404)
at 
org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:228)
at 
org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582){code}

It's not related to a specific file, requesting {{/solr/admin/file}} is enough.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



RE: [JENKINS-MAVEN] Lucene-Solr-Maven-3.x #162: POMs out of sync

2011-06-25 Thread Uwe Schindler
Jenkins configs updated to set correct version number

-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de


 -Original Message-
 From: Apache Jenkins Server [mailto:jenk...@builds.apache.org]
 Sent: Saturday, June 25, 2011 1:42 PM
 To: dev@lucene.apache.org
 Subject: [JENKINS-MAVEN] Lucene-Solr-Maven-3.x #162: POMs out of sync
 
 Build: https://builds.apache.org/job/Lucene-Solr-Maven-3.x/162/
 
 1 tests failed.
 FAILED:
 org.apache.lucene.index.TestCheckIndex.testLuceneConstantVersion
 
 Error Message:
 Invalid version: 3.3-SNAPSHOT
 
 Stack Trace:
 java.lang.AssertionError: Invalid version: 3.3-SNAPSHOT
   at org.junit.Assert.fail(Assert.java:91)
   at org.junit.Assert.assertTrue(Assert.java:43)
   at
 org.apache.lucene.index.TestCheckIndex.testLuceneConstantVersion(TestC
 heckIndex.java:98)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.j
 ava:57)
   at
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
 sorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:616)
   at
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(Framework
 Method.java:44)
   at
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.jav
 a:15)
   at
 org.junit.runners.model.FrameworkMethod.invokeExplosively(Framework
 Method.java:41)
   at
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMeth
 od.java:20)
   at org.junit.rules.TestWatchman$1.evaluate(TestWatchman.java:48)
   at
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java
 :28)
   at
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31
 )
   at
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.j
 ava:76)
   at
 org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(Luc
 eneTestCase.java:1277)
   at
 org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(Luc
 eneTestCase.java:1195)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:193)
   at
 org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:52)
   at
 org.junit.runners.ParentRunner.runChildren(ParentRunner.java:191)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:42)
   at
 org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:184)
   at
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java
 :28)
   at
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31
 )
   at org.junit.runners.ParentRunner.run(ParentRunner.java:236)
   at
 org.apache.maven.surefire.junit4.JUnit4TestSet.execute(JUnit4TestSet.java:
 35)
   at
 org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Prov
 ider.java:146)
   at
 org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java
 :97)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.j
 ava:57)
   at
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
 sorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:616)
   at
 org.apache.maven.surefire.booter.ProviderFactory$ClassLoaderProxy.invok
 e(ProviderFactory.java:103)
   at $Proxy0.invoke(Unknown Source)
   at
 org.apache.maven.surefire.booter.SurefireStarter.invokeProvider(SurefireSt
 arter.java:145)
   at
 org.apache.maven.surefire.booter.SurefireStarter.runSuitesInProcess(Surefi
 reStarter.java:87)
   at
 org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:6
 9)
 
 
 
 
 Build Log (for compile errors):
 [...truncated 15489 lines...]
 
 
 
 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional
 commands, e-mail: dev-h...@lucene.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3234) Provide limit on phrase analysis in FastVectorHighlighter

2011-06-25 Thread Digy (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054935#comment-13054935
 ] 

Digy commented on LUCENE-3234:
--

I am not sure how much it is related to this issue but there was
a similar issue in Lucene.Net.
https://issues.apache.org/jira/browse/LUCENENET-350



 Provide limit on phrase analysis in FastVectorHighlighter
 -

 Key: LUCENE-3234
 URL: https://issues.apache.org/jira/browse/LUCENE-3234
 Project: Lucene - Java
  Issue Type: Improvement
Affects Versions: 2.9.4, 3.0.3, 3.1, 3.2, 3.3
Reporter: Mike Sokolov
Assignee: Koji Sekiguchi
 Fix For: 3.4, 4.0

 Attachments: LUCENE-3234.patch, LUCENE-3234.patch, LUCENE-3234.patch


 With larger documents, FVH can spend a lot of time trying to find the 
 best-scoring snippet as it examines every possible phrase formed from 
 matching terms in the document.  If one is willing to accept
 less-than-perfect scoring by limiting the number of phrases that are 
 examined, substantial speedups are possible.  This is analogous to the 
 Highlighter limit on the number of characters to analyze.
 The patch includes an artifical test case that shows  1000x speedup.  In a 
 more normal test environment, with English documents and random queries, I am 
 seeing speedups of around 3-10x when setting phraseLimit=1, which has the 
 effect of selecting the first possible snippet in the document.  Most of our 
 sites operate in this way (just show the first snippet), so this would be a 
 big win for us.
 With phraseLimit = -1, you get the existing FVH behavior. At larger values of 
 phraseLimit, you may not get substantial speedup in the normal case, but you 
 do get the benefit of protection against blow-up in pathological cases.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-3220) Implement various ranking models as Similarities

2011-06-25 Thread David Mark Nemeskey (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-3220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Mark Nemeskey updated LUCENE-3220:


Attachment: LUCENE-3220.patch

Implementation of the DFR framework added. Lots of nocommits, though. I things 
to think about:
 * lots of (float) conversions. Maybe the inner API (BasicModel, etc.) could 
use doubles? According to my experience, double is faster anyway, at least on 
64bit architectures
 * I am not overly happy about the naming scheme, i.e. BasicModelBE, etc. Maybe 
we should do it the same way as in Terrier, with a basicmodel package and class 
names like BE?
 * A regular SimilarityProvider implementation won't play well with 
DFRSimilarity, in case the user wants to use several different setups. 
Actually, this is a problem for all similarities that have parameters (e.g. 
BM25 with b and k).

Also, I think we need that NormConverter we talked earlier on irc, so that the 
Similarities can run on any index.

 Implement various ranking models as Similarities
 

 Key: LUCENE-3220
 URL: https://issues.apache.org/jira/browse/LUCENE-3220
 Project: Lucene - Java
  Issue Type: Sub-task
  Components: core/search
Affects Versions: flexscoring branch
Reporter: David Mark Nemeskey
Assignee: David Mark Nemeskey
  Labels: gsoc
 Attachments: LUCENE-3220.patch, LUCENE-3220.patch, LUCENE-3220.patch, 
 LUCENE-3220.patch, LUCENE-3220.patch, LUCENE-3220.patch

   Original Estimate: 336h
  Remaining Estimate: 336h

 With [LUCENE-3174|https://issues.apache.org/jira/browse/LUCENE-3174] done, we 
 can finally work on implementing the standard ranking models. Currently DFR, 
 BM25 and LM are on the menu.
 TODO:
  * {{EasyStats}}: contains all statistics that might be relevant for a 
 ranking algorithm
  * {{EasySimilarity}}: the ancestor of all the other similarities. Hides the 
 DocScorers and as much implementation detail as possible
  * _BM25_: the current mock implementation might be OK
  * _LM_
  * _DFR_
 Done:

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



RE: [JENKINS-MAVEN] Lucene-Solr-Maven-trunk #159: POMs out of sync

2011-06-25 Thread Steven A Rowe
The new common module's POM had artifactId lucene-common instead of 
lucene-common-module, so generate-maven-artifacts failed to find the built jar 
when deploying.

I've committed a fix to the POM, generate-maven-artifacts succeeds for me 
locally.

Steve

 -Original Message-
 From: Apache Jenkins Server [mailto:jenk...@builds.apache.org]
 Sent: Saturday, June 25, 2011 10:25 AM
 To: dev@lucene.apache.org
 Subject: [JENKINS-MAVEN] Lucene-Solr-Maven-trunk #159: POMs out of sync
 
 Build: https://builds.apache.org/job/Lucene-Solr-Maven-trunk/159/
 
 No tests ran.
 
 Build Log (for compile errors):
 [...truncated 9467 lines...]
 
 
 
 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org



[JENKINS] Lucene-Solr-tests-only-trunk - Build # 9074 - Failure

2011-06-25 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/9074/

1 tests failed.
REGRESSION:  org.apache.solr.cloud.CloudStateUpdateTest.testCoreRegistration

Error Message:
expected:2 but was:3

Stack Trace:
junit.framework.AssertionFailedError: expected:2 but was:3
at 
org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1430)
at 
org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1348)
at 
org.apache.solr.cloud.CloudStateUpdateTest.testCoreRegistration(CloudStateUpdateTest.java:208)




Build Log (for compile errors):
[...truncated 8893 lines...]



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-1889) FastVectorHighlighter: support for additional queries

2011-06-25 Thread Koji Sekiguchi (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054984#comment-13054984
 ] 

Koji Sekiguchi commented on LUCENE-1889:


Sounds great!

bq. This doesn't take you to nirvana, but does add support for a common case. 
If there's interest I'll post.

Sure, please post. Does it cover range queries?

 FastVectorHighlighter: support for additional queries
 -

 Key: LUCENE-1889
 URL: https://issues.apache.org/jira/browse/LUCENE-1889
 Project: Lucene - Java
  Issue Type: Wish
  Components: modules/highlighter
Reporter: Robert Muir
Priority: Minor

 I am using fastvectorhighlighter for some strange languages and it is working 
 well! 
 One thing i noticed immediately is that many query types are not highlighted 
 (multitermquery, multiphrasequery, etc)
 Here is one thing Michael M posted in the original ticket:
 {quote}
 I think a nice [eventual] model would be if we could simply re-run the
 scorer on the single document (using InstantiatedIndex maybe, or
 simply some sort of wrapper on the term vectors which are already a
 mini-inverted-index for a single doc), but extend the scorer API to
 tell us the exact term occurrences that participated in a match (which
 I don't think is exposed today).
 {quote}
 Due to strange requirements I am using something similar to this (but 
 specialized to our case).
 I am doing strange things like forcing multitermqueries to rewrite into 
 boolean queries so they will be highlighted,
 and flattening multiphrasequeries into boolean or'ed phrasequeries.
 I do not think these things would be 'fast', but i had a few ideas that might 
 help:
 * looking at contrib/highlighter, you can support FilteredQuery in flatten() 
 by calling getQuery() right?
 * maybe as a last resort, try Query.extractTerms() ?

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-1889) FastVectorHighlighter: support for additional queries

2011-06-25 Thread Mike Sokolov (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054985#comment-13054985
 ] 

Mike Sokolov commented on LUCENE-1889:
--

No, no range queries, sorry.  I don't think that's easily expressible as a 
regex? So it would add probably require yet another data structure in 
FieldQuery - right now we have MapString,QueryPhraseMap for TermQuery; I've 
added a ListQueryPhraseMap and QPM.Pattern for matching wildcards and 
regexes.  To handle RangeQuery, you'd need to add another such data structure: 
it would probably be best to introduce some new abstraction to represent all of 
these query-proxies.

It seemed a less useful case to me anyway since we don't usually use range 
queries in the context of full text; more often they come up in structured 
metadata?  Curious if  you have requests for that?

Anyway I will clean up a bit and post.

 FastVectorHighlighter: support for additional queries
 -

 Key: LUCENE-1889
 URL: https://issues.apache.org/jira/browse/LUCENE-1889
 Project: Lucene - Java
  Issue Type: Wish
  Components: modules/highlighter
Reporter: Robert Muir
Priority: Minor

 I am using fastvectorhighlighter for some strange languages and it is working 
 well! 
 One thing i noticed immediately is that many query types are not highlighted 
 (multitermquery, multiphrasequery, etc)
 Here is one thing Michael M posted in the original ticket:
 {quote}
 I think a nice [eventual] model would be if we could simply re-run the
 scorer on the single document (using InstantiatedIndex maybe, or
 simply some sort of wrapper on the term vectors which are already a
 mini-inverted-index for a single doc), but extend the scorer API to
 tell us the exact term occurrences that participated in a match (which
 I don't think is exposed today).
 {quote}
 Due to strange requirements I am using something similar to this (but 
 specialized to our case).
 I am doing strange things like forcing multitermqueries to rewrite into 
 boolean queries so they will be highlighted,
 and flattening multiphrasequeries into boolean or'ed phrasequeries.
 I do not think these things would be 'fast', but i had a few ideas that might 
 help:
 * looking at contrib/highlighter, you can support FilteredQuery in flatten() 
 by calling getQuery() right?
 * maybe as a last resort, try Query.extractTerms() ?

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-1889) FastVectorHighlighter: support for additional queries

2011-06-25 Thread Mike Sokolov (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mike Sokolov updated LUCENE-1889:
-

Attachment: LUCENE-1889.patch

Patch includes FVH support for Wildcard-, Regexp- and PrefixQuery.  Change to 
Enwiki benchmark (to generate wildcard queries) should maybe not be committed; 
just providing this as a validation of this approach.

 FastVectorHighlighter: support for additional queries
 -

 Key: LUCENE-1889
 URL: https://issues.apache.org/jira/browse/LUCENE-1889
 Project: Lucene - Java
  Issue Type: Wish
  Components: modules/highlighter
Reporter: Robert Muir
Priority: Minor
 Attachments: LUCENE-1889.patch


 I am using fastvectorhighlighter for some strange languages and it is working 
 well! 
 One thing i noticed immediately is that many query types are not highlighted 
 (multitermquery, multiphrasequery, etc)
 Here is one thing Michael M posted in the original ticket:
 {quote}
 I think a nice [eventual] model would be if we could simply re-run the
 scorer on the single document (using InstantiatedIndex maybe, or
 simply some sort of wrapper on the term vectors which are already a
 mini-inverted-index for a single doc), but extend the scorer API to
 tell us the exact term occurrences that participated in a match (which
 I don't think is exposed today).
 {quote}
 Due to strange requirements I am using something similar to this (but 
 specialized to our case).
 I am doing strange things like forcing multitermqueries to rewrite into 
 boolean queries so they will be highlighted,
 and flattening multiphrasequeries into boolean or'ed phrasequeries.
 I do not think these things would be 'fast', but i had a few ideas that might 
 help:
 * looking at contrib/highlighter, you can support FilteredQuery in flatten() 
 by calling getQuery() right?
 * maybe as a last resort, try Query.extractTerms() ?

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: [JENKINS-MAVEN] Lucene-Solr-Maven-trunk #159: POMs out of sync

2011-06-25 Thread Chris Male
Cheers Steve

On Sun, Jun 26, 2011 at 7:44 AM, Steven A Rowe sar...@syr.edu wrote:

 The new common module's POM had artifactId lucene-common instead of
 lucene-common-module, so generate-maven-artifacts failed to find the built
 jar when deploying.

 I've committed a fix to the POM, generate-maven-artifacts succeeds for me
 locally.

 Steve

  -Original Message-
  From: Apache Jenkins Server [mailto:jenk...@builds.apache.org]
  Sent: Saturday, June 25, 2011 10:25 AM
  To: dev@lucene.apache.org
  Subject: [JENKINS-MAVEN] Lucene-Solr-Maven-trunk #159: POMs out of sync
 
  Build: https://builds.apache.org/job/Lucene-Solr-Maven-trunk/159/
 
  No tests ran.
 
  Build Log (for compile errors):
  [...truncated 9467 lines...]
 
 
 
  -
  To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
  For additional commands, e-mail: dev-h...@lucene.apache.org




-- 
Chris Male | Software Developer | JTeam BV.| www.jteam.nl


[JENKINS] Lucene-3.x - Build # 419 - Failure

2011-06-25 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-3.x/419/

1 tests failed.
REGRESSION:  org.apache.lucene.index.TestNRTThreads.testNRTThreads

Error Message:
Some threads threw uncaught exceptions!

Stack Trace:
junit.framework.AssertionFailedError: Some threads threw uncaught exceptions!
at 
org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1277)
at 
org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1195)
at 
org.apache.lucene.util.LuceneTestCase.tearDown(LuceneTestCase.java:469)




Build Log (for compile errors):
[...truncated 12603 lines...]



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org