Re: Looking for a code pattern to pass stop words as an attribute

2012-08-21 Thread Dawid Weiss
Thanks for replies Steve, Uwe. > if you dont want to create your own "marker filter", you can use > KeywordMarkerFilter (http://goo.gl/OOgf4) instead This is pretty much what I had come up with, although I used a custom filter class (with a similar attribute). The thing I have trouble with is, h

[jira] [Commented] (LUCENE-4321) java.io.FilterReader considered harmful

2012-08-21 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13439309#comment-13439309 ] Dawid Weiss commented on LUCENE-4321: - Yeah, this looks good to me! I'd even make "in

Re: Diversifying Search Results - Custom Collector

2012-08-21 Thread Mikhail Khludnev
one more idea: first search is grouped by brand with limit 1, it gives you a most relevant products for this particular search. than second search boost top products from the first search result by ie. q=original:query ID:(44,56,78,99,22)^1000 On Tue, Aug 21, 2012 at 8:04 PM, Karthick Duraisamy S

Re: Avoid losing data on ZK connection-loss/session-timeout

2012-08-21 Thread Mark Miller
> Mark Miller: > Ad 3) Well, we can do some practical things right? I don't think we need to > support a node coming back from the dead a year later and it had some > updates the cluster doesn't have. A node coming up 2 minutes later is > something we want to worry about though. > > A year, no, but

[jira] [Commented] (LUCENE-2810) Explore Alternate Stored Field approaches for highly redundant data

2012-08-21 Thread Sabbir Kumar Manandhar (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13439266#comment-13439266 ] Sabbir Kumar Manandhar commented on LUCENE-2810: @Grant, I see this post

[JENKINS] Lucene-Solr-Tests-4.x-java7 - Build # 328 - Failure

2012-08-21 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-Tests-4.x-java7/328/ No tests ran. Build Log: [...truncated 7347 lines...] FATAL: hudson.remoting.RequestAbortedException: java.io.IOException: Unexpected termination of the channel hudson.remoting.RequestAbortedException: hudson.remoting.Request

[jira] [Commented] (SOLR-3721) Multiple concurrent recoveries of same shard?

2012-08-21 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13439176#comment-13439176 ] Mark Miller commented on SOLR-3721: --- connection-loss should not be involved here - the ge

[jira] [Created] (LUCENE-4321) java.io.FilterReader considered harmful

2012-08-21 Thread Robert Muir (JIRA)
Robert Muir created LUCENE-4321: --- Summary: java.io.FilterReader considered harmful Key: LUCENE-4321 URL: https://issues.apache.org/jira/browse/LUCENE-4321 Project: Lucene - Core Issue Type: Bug

[jira] [Updated] (LUCENE-4321) java.io.FilterReader considered harmful

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-4321: Attachment: LUCENE-4321.patch Here's a patch. > java.io.FilterReader considered h

Re: Caveat when upgrading from 4.0.0-ALPHA to 4.0.0-BETA (CharStream overrides).

2012-08-21 Thread Robert Muir
I think this is crazy, lets fix it so people don't trip up on this: https://issues.apache.org/jira/browse/LUCENE-4321 On Fri, Aug 17, 2012 at 4:15 PM, Dawid Weiss wrote: > I've just hit this and I thought I'd share -- don't know if this was > discussed. I had to update some code from 4.0.0-ALPHA

RE: Looking for a code pattern to pass stop words as an attribute

2012-08-21 Thread Uwe Schindler
Hi Dawid, if you dont want to create your own "marker filter", you can use KeywordMarkerFilter (http://goo.gl/OOgf4) instead StopFilter. This will work perfectly and don’t affect other filters, if you don’t have stemming in your analysis chain. The trick is to pass the stop-set to KeywordMarker

RE: Looking for a code pattern to pass stop words as an attribute

2012-08-21 Thread Steven A Rowe
Hi Dawid, Maybe you could use KeywordMarkerFilter, either directly or as a recipe for a StopwordMarkerFilter? Note that KeywordAttribute is used by most (all?) Lucene stemmers, so I wouldn't use KeywordMarkerFilter if your analysis chain already includes a stemmer. Steve -Original Mess

Looking for a code pattern to pass stop words as an attribute

2012-08-21 Thread Dawid Weiss
Seeking advice. I have an application where I need to know which tokens are stop words. Most analyzers construct the token stream in a way that those tokens are filtered out -- this isn't what I need, I want them in, but marked somehow. The question is how to do it nicely and in a simple way, poss

[jira] [Moved] (SOLR-3748) testDistribSearch test timeout (progress stalled)

2012-08-21 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dawid Weiss moved LUCENE-4320 to SOLR-3748: --- Lucene Fields: (was: New) Key: SOLR-3748 (was: LUCENE-4320)

Re: [JENKINS] Lucene-Solr-Tests-4.x-java7 - Build # 326 - Failure

2012-08-21 Thread Dawid Weiss
Fcuk, sorry -- filed the issue under Lucene, moved: https://issues.apache.org/jira/browse/SOLR-3748 Dawid On Tue, Aug 21, 2012 at 10:01 PM, Dawid Weiss wrote: > I filed this issue for this build failure: > https://issues.apache.org/jira/browse/LUCENE-4320 > > Uwe took a snapshot of log files and

Re: [JENKINS] Lucene-Solr-Tests-4.x-java7 - Build # 326 - Failure

2012-08-21 Thread Dawid Weiss
I filed this issue for this build failure: https://issues.apache.org/jira/browse/LUCENE-4320 Uwe took a snapshot of log files and I was able to analyze them. Conclusions: 1) Solr's BasicDistributedZkTest stalled after ~40 seconds and showed no sign of progress until timed out (grep for 'jstack' i

[jira] [Created] (LUCENE-4320) testDistribSearch test timeout (progress stalled)

2012-08-21 Thread Dawid Weiss (JIRA)
Dawid Weiss created LUCENE-4320: --- Summary: testDistribSearch test timeout (progress stalled) Key: LUCENE-4320 URL: https://issues.apache.org/jira/browse/LUCENE-4320 Project: Lucene - Core Issue

[jira] [Updated] (LUCENE-4320) testDistribSearch test timeout (progress stalled)

2012-08-21 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dawid Weiss updated LUCENE-4320: Attachment: lucene.log Output logs recovered from the events file of the crashed JVM.

[jira] [Updated] (SOLR-3745) SolrCore should fail to init in SolrCloud mode if schema does not contain _version_ field

2012-08-21 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-3745: --- Attachment: SOLR-3745.patch patch building on SOLR-3746 with the basic error checking in SolrCore ... works f

[jira] [Resolved] (SOLR-3432) deleteByQuery silently ignored if updateLog is enabled, but {{_version_}} field does not exist in schema

2012-08-21 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man resolved SOLR-3432. Resolution: Fixed Fix Version/s: 5.0 Assignee: Hoss Man (was: Yonik Seeley) > deleteByQ

RE: Making a contribution to Lucene - please help

2012-08-21 Thread Steven A Rowe
Hi Catherine, Welcome! I assume you are the same person as "jellyman" who posted earlier in this mailing list about exactly this same topic? If so, FYI, someone posting under multiple identities can be a predictor for various forms of mailing list nastiness, so you might want to pick one iden

[jira] [Commented] (LUCENE-4319) Rethink Fields.size() returning -1

2012-08-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438917#comment-13438917 ] Michael McCandless commented on LUCENE-4319: +1 to require Fields.size() is a

[jira] [Resolved] (SOLR-3746) updateLog should fail to init if there is no _version_ field in schema.xml

2012-08-21 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man resolved SOLR-3746. Resolution: Fixed Fix Version/s: 5.0 Committed revision 1375674. Committed revision 1375690.

[JENKINS] Lucene-Solr-Tests-4.x-java7 - Build # 326 - Failure

2012-08-21 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-Tests-4.x-java7/326/ All tests passed Build Log: [...truncated 8052 lines...] [junit4:junit4] ERROR: JVM J0 ended with an exception, command line: /usr/local/openjdk7/jre/bin/java -XX:+UseG1GC -Dtests.prefix=tests -Dtests.seed=606B185F45F0CD8B -X

[jira] [Commented] (SOLR-3747) Solr Score threshold 'reasonably', independent of results returned

2012-08-21 Thread Ramzi Alqrainy (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438833#comment-13438833 ] Ramzi Alqrainy commented on SOLR-3747: -- Many thanks for your effort, and I am already

Re: Diversifying Search Results - Custom Collector

2012-08-21 Thread Karthick Duraisamy Soundararaj
On Tue, Aug 21, 2012 at 11:32 AM, Tanguy Moal wrote: > Sorry then, my approach really disables pagination jumps. You're left with > the 'next' button only, or an "infinite-scroll" type of pagination, which > may not be what you wanted to do... You are right. > Did you try disabling tf/idf and

[jira] [Commented] (SOLR-3747) Solr Score threshold 'reasonably', independent of results returned

2012-08-21 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438814#comment-13438814 ] Erick Erickson commented on SOLR-3747: -- Your point isn't invalid, it's just not approp

Re: Diversifying Search Results - Custom Collector

2012-08-21 Thread Tanguy Moal
Sorry then, my approach really disables pagination jumps. You're left with the 'next' button only, or an "infinite-scroll" type of pagination, which may not be what you wanted to do... Did you try disabling tf/idf and using random field as a secondary sort ? I'm pretty sure it will give you the be

RE: [JENKINS] Lucene-Solr-trunk-Windows ([[ Exception while replacing ENV. Please report this as a bug. ]]

2012-08-21 Thread Uwe Schindler
Sorry, Java Update (1.6.0_34 and 1.7.0_06) + VirtualBox Update (4.1.20) - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Policeman Jenkins Server [mailto:jenk...@sd-datasolutions.de] > Sent: Tuesday, Aug

[JENKINS] Lucene-Solr-4.x-Windows ([[ Exception while replacing ENV. Please report this as a bug. ]]

2012-08-21 Thread Policeman Jenkins Server
{{ java.lang.NullPointerException }}) - Build # 372 - Failure! MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="=_Part_0_636101343.1345561855228" Precedence: bulk --=_Part_0_636101343.1345561855228 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit

Re: Diversifying Search Results - Custom Collector

2012-08-21 Thread Karthick Duraisamy Soundararaj
Hello Tanguy, I need pagination. The problem with your approach is that, to achieve pagination, you need to do a sort at application level for sorting rather than at the solr level which I think would become messy. Do you see a way around this? Thanks, Karthick On Tue, Aug

[jira] [Commented] (LUCENE-4319) Rethink Fields.size() returning -1

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438781#comment-13438781 ] Robert Muir commented on LUCENE-4319: - I guess my concern here is vectors, someone co

[jira] [Commented] (SOLR-3747) Solr Score threshold 'reasonably', independent of results returned

2012-08-21 Thread Ramzi Alqrainy (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438778#comment-13438778 ] Ramzi Alqrainy commented on SOLR-3747: -- Thank you for a wonderfully detailed answer.

[JENKINS] Lucene-Solr-trunk-Windows ([[ Exception while replacing ENV. Please report this as a bug. ]]

2012-08-21 Thread Policeman Jenkins Server
{{ java.lang.NullPointerException }}) - Build # 375 - Failure! MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="=_Part_20_1792413589.1345561018017" Precedence: bulk --=_Part_20_1792413589.1345561018017 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7

[jira] [Resolved] (LUCENE-4316) Deprecate Fields.getUniqueTermCount, remove AtomicReader.getUniqueTermCount

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-4316. - Resolution: Fixed Fix Version/s: 4.0 5.0 > Deprecate Fields.getUni

Re: Diversifying Search Results - Custom Collector

2012-08-21 Thread Tanguy Moal
Hello Karthick, 2012/8/21 Karthick Duraisamy Soundararaj > *"Find all the highest scoring document for each manufacuturer in the > current result set and place them ahead of the rest. Here as you can see, > the idea is to display one product from each unique manufacturer first"*. > Now to decid

Re: Avoid losing data on ZK connection-loss/session-timeout

2012-08-21 Thread Per Steffensen
Per Steffensen wrote: *Mark Miller:* *Ad 3)* Well, we can do some practical things right? I don't think we need to support a node coming back from the dead a year later and it had some updates the cluster doesn't have. A node coming up 2 minutes later is something we want to worry about though

[jira] [Commented] (LUCENE-3892) Add a useful intblock postings format (eg, FOR, PFOR, PFORDelta, Simple9/16/64, etc.)

2012-08-21 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438731#comment-13438731 ] Adrien Grand commented on LUCENE-3892: -- bq. Thanks Billy for all the hard work and e

[jira] [Commented] (LUCENE-3892) Add a useful intblock postings format (eg, FOR, PFOR, PFORDelta, Simple9/16/64, etc.)

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438728#comment-13438728 ] Robert Muir commented on LUCENE-3892: - Thanks Billy for all the hard work and endless

Re: Diversifying Search Results - Custom Collector

2012-08-21 Thread Karthick Duraisamy Soundararaj
Hi Lance, Thanks for your response. Wouldnt randomizing affect relevancy? Maybe I should explain my problem better: Lets say there are 1000 matches for a search of "Sofas". For the sake of simplcity, lets assume all of these 1000 matches(1000 sofas) have same Merchant

[jira] [Created] (LUCENE-4319) Rethink Fields.size() returning -1

2012-08-21 Thread Uwe Schindler (JIRA)
Uwe Schindler created LUCENE-4319: - Summary: Rethink Fields.size() returning -1 Key: LUCENE-4319 URL: https://issues.apache.org/jira/browse/LUCENE-4319 Project: Lucene - Core Issue Type: Task

[jira] [Resolved] (LUCENE-4315) Minor fixes for Fields abstract class, TermVectorsWriter

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-4315. --- Resolution: Fixed Committed trunk revision: 1375522 Committed 4.x revision: 1375530 I'll op

[jira] [Commented] (LUCENE-4315) Minor fixes for Fields abstract class, TermVectorsWriter

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438680#comment-13438680 ] Uwe Schindler commented on LUCENE-4315: --- I will commit this soon. I think the discu

[jira] [Resolved] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-4317. --- Resolution: Fixed Committed trunk revision: 1375507 Committed 4.x revision: 1375508 If you

[jira] [Updated] (LUCENE-4224) Simplify MultiValuedCase in TermsIncludingScoreQuery

2012-08-21 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martijn van Groningen updated LUCENE-4224: -- Attachment: LUCENE-4224.patch Updated patch. Tests pass now. Will commit short

RE: svn commit: r1375499 - /lucene/dev/trunk/lucene/core/src/java/org/apache/lucene/codecs/block/BlockPostingsReader.java

2012-08-21 Thread Uwe Schindler
I also disabled the Jenkins Job and will delete it at some time. I keep the history for a while as reference, maybe GSoC wants to look at it. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Uwe Schindle

Re: svn commit: r1375499 - /lucene/dev/trunk/lucene/core/src/java/org/apache/lucene/codecs/block/BlockPostingsReader.java

2012-08-21 Thread Michael McCandless
No need to apologize! Yes I used "svn merge --reintegrate" (got help from Robert on stuck merge props). Thanks for reminding me to remove the branch ... I just did. I just missed those compilation warnings before committing the first time. Mike McCandless http://blog.mikemccandless.com On Tue

[jira] [Commented] (LUCENE-3892) Add a useful intblock postings format (eg, FOR, PFOR, PFORDelta, Simple9/16/64, etc.)

2012-08-21 Thread Han Jiang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438653#comment-13438653 ] Han Jiang commented on LUCENE-3892: --- Thank you Mike! And thanks to all of you! I learnt

RE: svn commit: r1375499 - /lucene/dev/trunk/lucene/core/src/java/org/apache/lucene/codecs/block/BlockPostingsReader.java

2012-08-21 Thread Uwe Schindler
Hi, Sorry for interfering with you. I was expecting you will merge before reintegrating. Did you do not do merge --reintegrate? Don't forget to nuke branch, it cannot be used anymore once reintegrated! Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u.

[jira] [Closed] (LUCENE-4283) Support more frequent skip with Block Postings Format

2012-08-21 Thread Han Jiang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Han Jiang closed LUCENE-4283. - Resolution: Later We didn't get overall improvement with partial decode, and some patches here are more

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438645#comment-13438645 ] Uwe Schindler commented on LUCENE-4317: --- I wanted to add: The StringTokenStream in

[jira] [Resolved] (LUCENE-3892) Add a useful intblock postings format (eg, FOR, PFOR, PFORDelta, Simple9/16/64, etc.)

2012-08-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-3892. Resolution: Fixed Fix Version/s: (was: 4.1) 4.0

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438644#comment-13438644 ] Uwe Schindler commented on LUCENE-4317: --- bq. Mike: Or (crazy idea): maybe we could

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438639#comment-13438639 ] Michael McCandless commented on LUCENE-4317: +1 to the current patch and sepa

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438636#comment-13438636 ] Robert Muir commented on LUCENE-4317: - {quote} To come back to Robert: We can of cour

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438637#comment-13438637 ] Robert Muir commented on LUCENE-4317: - by the way, of course I am +1 to commit the pa

[jira] [Resolved] (LUCENE-4239) Provide access to PackedInts' low-level blocks <-> values conversion methods

2012-08-21 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrien Grand resolved LUCENE-4239. -- Resolution: Fixed Done has part of LUCENE-3892 which was just merged to trunk (r1375486).

[jira] [Comment Edited] (LUCENE-4239) Provide access to PackedInts' low-level blocks <-> values conversion methods

2012-08-21 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438635#comment-13438635 ] Adrien Grand edited comment on LUCENE-4239 at 8/21/12 11:08 PM: ---

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438633#comment-13438633 ] Uwe Schindler commented on LUCENE-4317: --- bq. But there is, at query time. it could

[jira] [Resolved] (SOLR-3747) Solr Score threshold 'reasonably', independent of results returned

2012-08-21 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erick Erickson resolved SOLR-3747. -- Resolution: Invalid This is really a usage question more appropriate to the user's list.

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438630#comment-13438630 ] Uwe Schindler commented on LUCENE-4317: --- To come back to Robert: We can of course r

Re: Custom Directory and IndexInput

2012-08-21 Thread Danil ŢORIN
Unfortunately test framework is not flexible enough. My constructor needs another directory, LuceneTestCase expect either a File, either a zero-arg constructor. On Tue, Aug 21, 2012 at 1:19 PM, Uwe Schindler wrote: > Hi, > > ** ** > > OK. If you test against only Lucene Core’s directories y

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438628#comment-13438628 ] Robert Muir commented on LUCENE-4317: - {quote} The problem with numeric is that they

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438627#comment-13438627 ] Chris Male commented on LUCENE-4317: I'm +1 to committing this as a first step improv

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438626#comment-13438626 ] Uwe Schindler commented on LUCENE-4317: --- The problem with numeric is that they are

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438625#comment-13438625 ] Robert Muir commented on LUCENE-4317: - {quote} In addition, the "Robert solution" sti

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438618#comment-13438618 ] Uwe Schindler commented on LUCENE-4317: --- Do we agree to fix the current problem and

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438604#comment-13438604 ] Chris Male commented on LUCENE-4317: Yeah I agree with Robert, using KeywordAnalyzer

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438589#comment-13438589 ] Robert Muir commented on LUCENE-4317: - {quote} We should maybe think about a solution

RE: Custom Directory and IndexInput

2012-08-21 Thread Uwe Schindler
Hi, OK. If you test against only Lucene Core’s directories you are fine. Wrapping the special wrappers from test-framework is not needed and wrong, they are just for testing Lucene itsself. In fact when running all Lucene tests against you own directory (-Dtests.directory=…), the test framew

Re: Custom Directory and IndexInput

2012-08-21 Thread Danil ŢORIN
I was trying to run the whole suite of randomized tests, wrapped inside my own implementation. I wasn't sure how my stuff will behave against MMap, NIO, RAM, NRT implementations, so I thought running fully randomized tests would work to find edge cases in my implementation. By randomizing everyth

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438555#comment-13438555 ] Uwe Schindler commented on LUCENE-4317: --- bq. Do you mean also for NumericTokenStrea

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438554#comment-13438554 ] Chris Male commented on LUCENE-4317: Thanks for tackling this Uwe. bq. We should m

RE: Custom Directory and IndexInput

2012-08-21 Thread Uwe Schindler
I don’t understand what you are doing! Do you wrap MockIndexInputWrapper again with your wrapper? This can of course not work, as MockIndexInputWrapper is an internal test-only class, so the big question is: What are you trying to do? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Br

Re: Custom Directory and IndexInput

2012-08-21 Thread Danil ŢORIN
Strange thing: If I wrap all "reasonable" directories into my directory, my tests pass. However if I wrap MockIndexInputWrapper, it calls "checkIndex()" after close on delegated DataInput, and obviously fails to read proper data. CheckIndex failed ERROR: could not read any segments file in directo

[jira] [Commented] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438548#comment-13438548 ] Uwe Schindler commented on LUCENE-4317: --- This patch also fixes StringTokenStream to

[jira] [Updated] (LUCENE-4317) Field.java does not reuse its inlined Keyword-TokenStream

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-4317: -- Attachment: LUCENE-4317.patch Patch: - Streamlines handling of NumericTokenStreanm and a new i

[jira] [Commented] (SOLR-3747) Solr Score threshold 'reasonably', independent of results returned

2012-08-21 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438544#comment-13438544 ] Adrien Grand commented on SOLR-3747: Hi Ramzi, bq. When the results are returned with

[jira] [Commented] (LUCENE-2000) Use covariant clone() return types

2012-08-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438532#comment-13438532 ] Uwe Schindler commented on LUCENE-2000: --- Hi, in revision 1375428 (trunk) and 137542

RE: Custom Directory and IndexInput

2012-08-21 Thread Uwe Schindler
In my commit I also solved the casting problem with covariant override. But DataInput.clone() was always public. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de From: Danil ŢORIN [mailto:torin...@gmail.c

Re: Custom Directory and IndexInput

2012-08-21 Thread Adrien Grand
Hi Danil, On Tue, Aug 21, 2012 at 10:15 AM, Danil ŢORIN wrote: > Is there a way to force run full lucene & solr test suite against my new > Directory implementation? > > Just to make sure everything is covered :) Running `ant test -Dtests.directory=my.Directory` will force your directory impl t

Re: Custom Directory and IndexInput

2012-08-21 Thread Danil ŢORIN
Is there a way to force run full lucene & solr test suite against my new Directory implementation? Just to make sure everything is covered :) On Tue, Aug 21, 2012 at 10:55 AM, Danil ŢORIN wrote: > My bad, few casts solved the issue. > > > On Tue, Aug 21, 2012 at 10:46 AM, Danil ŢORIN wrote: >

Re: Custom Directory and IndexInput

2012-08-21 Thread Danil ŢORIN
My bad, few casts solved the issue. On Tue, Aug 21, 2012 at 10:46 AM, Danil ŢORIN wrote: > One tricky thing, I'm doing a lot of delegation, I can't call "clone()" on > the DataInput I'm delegating to. > (unfortunately in my case it's delegation, not inheritance) > > The fact that the delegate im

Re: Custom Directory and IndexInput

2012-08-21 Thread Danil ŢORIN
One tricky thing, I'm doing a lot of delegation, I can't call "clone()" on the DataInput I'm delegating to. (unfortunately in my case it's delegation, not inheritance) The fact that the delegate implements Clonable, doesn't help, the clone() method is still protected :(. Any idea how to circumvent

AW: [VOTE] Release PyLucene 3.6.1-2

2012-08-21 Thread Thomas Koch
Thanks, JCC 2.14 now builds just fine with MS-VC9 on Windows. Could also build pylucene 3.6.1 and run the tests (except of the usual suspects on windows that fail due to the file lock issue). So +1 for release. kind regards Thomas -- OrbiTeam Software GmbH & Co. KG http://www.orbiteam.de >

RE: Custom Directory and IndexInput

2012-08-21 Thread Uwe Schindler
Thanks for „reporting“ lack of documentation, I will shortly commit some additional specs to IndexInput’s javadocs! Thanks, Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de From: Danil ŢORIN [mai

[jira] [Commented] (SOLR-3721) Multiple concurrent recoveries of same shard?

2012-08-21 Thread Per Steffensen (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438518#comment-13438518 ] Per Steffensen commented on SOLR-3721: -- Took the comments (parts) on this issue that i

Avoid losing data on ZK connection-loss/session-timeout

2012-08-21 Thread Per Steffensen
Hi Accidently started a discussion around SUBJECT on issue SOLR-3721. Not to mix things up too much I will encourage that we continue the discussion here. It is an important issue (at least for my organization), and I believe the current solution in Solr 4.x is not solid enough (have seen it