Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Stanislaw Osinski
Hi guys, thanks for the warm welcome! It's an honor. Like Dawid, I live in Poznan, we graduated in computer science from the same local university. My computer science experience started from electronics, Timex 2048 and Amiga 500/1200/PPC; I bought my first PC when I went to the university. My pre

[jira] Updated: (LUCENE-2881) Track FieldInfo per segment instead of per-IW-session

2011-02-08 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2881: -- Attachment: lucene-2881.patch New patch that adds a new junit for testing that field numbering

[jira] Commented: (LUCENE-2881) Track FieldInfo per segment instead of per-IW-session

2011-02-08 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992340#comment-12992340 ] Michael Busch commented on LUCENE-2881: --- bq. Maybe we can simply implement Iterable

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Mark Miller
Welcome guys! Thanks Dawid - your turn Stanislaw Osinksi ;) - Mark On Feb 8, 2011, at 5:05 PM, Dawid Weiss wrote: > Thank you very much, everyone! This is a great privilege and honor for me. > > In the spirit of previous posters, I would like to quickly introduce > myself. I'm 32, I was born an

Re: Potential contrib module

2011-02-08 Thread Ryan McKinley
sounds awesome, but... the dependency on software that is not installed/testable in the Apache infrastructure is kind of a show stopper for getting into the lucene code base. In general, everyone needs to be able to run "ant test" and make sure they have not broken something. However, check: ht

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Koji Sekiguchi
(11/02/09 3:13), Robert Muir wrote: I'm pleased to announce that the PMC has voted in Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers! Welcome! Welcome! Koji -- http://www.rondhuit.com/en/ - To unsubscribe, e-mai

Potential contrib module

2011-02-08 Thread Edward Drapkin
Hello all, Pending approval (which is almost certain) from management, I have a potential module that I'd like to contribute if possible. Before I go to management, I'd like to be able to make a case that I'm certain this will be approved by Lucene, although I am all but completely sure that

[jira] Commented: (SOLR-2342) Lock starvation can cause commit to never run when many clients are adding docs

2011-02-08 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992245#comment-12992245 ] Yonik Seeley commented on SOLR-2342: bq. Passing true does make the read lock acq more

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Yonik Seeley
On Tue, Feb 8, 2011 at 1:13 PM, Robert Muir wrote: > I'm pleased to announce that the PMC has voted in Dawid Weiss and > Stanislaw Osinski as Lucene/Solr committers! Welcome aboard guys! -Yonik http://lucidimagination.com - To

[jira] Commented: (LUCENE-2903) Improvement of PForDelta Codec

2011-02-08 Thread hao yan (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992237#comment-12992237 ] hao yan commented on LUCENE-2903: - I tried to move memory allocation out of readBlock() t

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Erick Erickson
Glad to see more committers, welcome aboard! Erick On Tue, Feb 8, 2011 at 5:05 PM, Dawid Weiss wrote: > Thank you very much, everyone! This is a great privilege and honor for me. > > In the spirit of previous posters, I would like to quickly introduce > myself. I'm 32, I was born and I still liv

[jira] Commented: (SOLR-1711) Race condition in org/apache/solr/client/solrj/impl/StreamingUpdateSolrServer.java

2011-02-08 Thread Aakarsh Nair (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992224#comment-12992224 ] Aakarsh Nair commented on SOLR-1711: We are still seeing this issue even after using Jo

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Dawid Weiss
Thank you very much, everyone! This is a great privilege and honor for me. In the spirit of previous posters, I would like to quickly introduce myself. I'm 32, I was born and I still live in Poznan, Poland, happily married and with two kids on board. My computer science experience is somewhat stra

[jira] Commented: (LUCENENET-392) Arabic Analyzer

2011-02-08 Thread Digy (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992189#comment-12992189 ] Digy commented on LUCENENET-392: If no objections, I am going to commit it in a few day

[jira] Updated: (LUCENENET-392) Arabic Analyzer

2011-02-08 Thread Digy (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Digy updated LUCENENET-392: --- Attachment: Analyzers.zip I merged Arabic analyzer and existing Brazilian analyzer in contrib. Since chan

[jira] Commented: (LUCENE-2881) Track FieldInfo per segment instead of per-IW-session

2011-02-08 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992174#comment-12992174 ] Simon Willnauer commented on LUCENE-2881: - I gave the patch another glance - here

[jira] Commented: (LUCENE-2881) Track FieldInfo per segment instead of per-IW-session

2011-02-08 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992175#comment-12992175 ] Michael Busch commented on LUCENE-2881: --- Thanks for reviewing! bq. I think you sho

[jira] Commented: (LUCENE-2881) Track FieldInfo per segment instead of per-IW-session

2011-02-08 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992156#comment-12992156 ] Simon Willnauer commented on LUCENE-2881: - {quote} New patch that removes the tra

[jira] Updated: (LUCENE-2881) Track FieldInfo per segment instead of per-IW-session

2011-02-08 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2881: -- Attachment: lucene-2881.patch New patch that removes the tracking of 'hasVectors' and 'hasProx

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Uwe Schindler
Welcome! -- Uwe Schindler H.-H.-Meier-Allee 63, 28213 Bremen http://www.thetaphi.de Simon Willnauer schrieb: Welcome! ;) Simon On Tue, Feb 8, 2011 at 8:06 PM, Steven A Rowe wrote: > Welcome Stanisław and Dawid! > >> -Original Message- >> From: Robert Muir [mailto:rcm...@gmail.com] >

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Simon Willnauer
Welcome! ;) Simon On Tue, Feb 8, 2011 at 8:06 PM, Steven A Rowe wrote: > Welcome Stanisław and Dawid! > >> -Original Message- >> From: Robert Muir [mailto:rcm...@gmail.com] >> Sent: Tuesday, February 08, 2011 1:13 PM >> To: gene...@lucene.apache.org; dev@lucene.apache.org >> Subject: Wel

[jira] Resolved: (LUCENE-2908) clean up serialization in the codebase

2011-02-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-2908. - Resolution: Fixed Assignee: Robert Muir Committed revision 1068526. > clean up serializat

[jira] Created: (SOLR-2352) HTTP 400 Undefined Filed: * with TV component enabled.

2011-02-08 Thread Jed Glazner (JIRA)
HTTP 400 Undefined Filed: * with TV component enabled. -- Key: SOLR-2352 URL: https://issues.apache.org/jira/browse/SOLR-2352 Project: Solr Issue Type: Bug Components: SearchCompo

RE: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Steven A Rowe
Welcome Stanisław and Dawid! > -Original Message- > From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Tuesday, February 08, 2011 1:13 PM > To: gene...@lucene.apache.org; dev@lucene.apache.org > Subject: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr > committers > > I'm please

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Michael McCandless
Welcome Dawid and Stanislaw! Mike On Tue, Feb 8, 2011 at 1:13 PM, Robert Muir wrote: > I'm pleased to announce that the PMC has voted in Dawid Weiss and > Stanislaw Osinski as Lucene/Solr committers! > > Welcome! > > - > To unsu

Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Robert Muir
I'm pleased to announce that the PMC has voted in Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers! Welcome! - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.

[jira] Commented: (LUCENE-2911) synchronize grammar/token types across StandardTokenizer, UAX29EmailURLTokenizer, ICUTokenizer, add CJK types.

2011-02-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992066#comment-12992066 ] Robert Muir commented on LUCENE-2911: - {quote} The generated top-level domain macro f

[jira] Commented: (LUCENE-2881) Track FieldInfo per segment instead of per-IW-session

2011-02-08 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992038#comment-12992038 ] Simon Willnauer commented on LUCENE-2881: - Michael, this looks very good! {quote

Re: Should ASCIIFoldingFilter be deprecated?

2011-02-08 Thread Robert Zotter
unsubscribe On 2/8/11 7:05 AM, David Smiley (@MITRE.org) wrote: Robert Muir wrote: On Tue, Feb 8, 2011 at 9:12 AM, David Smiley (@MITRE.org) wrote: I'm skeptical that whatever the difference is is relevant in the scheme of things. The cost to keeping it is introducing confusion on users, a

[jira] Commented: (LUCENE-2911) synchronize grammar/token types across StandardTokenizer, UAX29EmailURLTokenizer, ICUTokenizer, add CJK types.

2011-02-08 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992014#comment-12992014 ] Steven Rowe commented on LUCENE-2911: - The generated top-level domain macro file has

[jira] Commented: (SOLR-2155) Geospatial search using geohash prefixes

2011-02-08 Thread David Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12991999#comment-12991999 ] David Smiley commented on SOLR-2155: So Bill's talking about sorting, and Lance is talk

Re: Should ASCIIFoldingFilter be deprecated?

2011-02-08 Thread Robert Muir
On Tue, Feb 8, 2011 at 10:05 AM, David Smiley (@MITRE.org) wrote: > > Well then I see a path forward to speed up MappingCharFilter substantially. > There's your LUCENE-2788, and then you could easily add the same no-op > optimization for the smallest char value in the HashMap. only for the smalle

Re: Should ASCIIFoldingFilter be deprecated?

2011-02-08 Thread David Smiley (@MITRE.org)
Robert Muir wrote: > > On Tue, Feb 8, 2011 at 9:12 AM, David Smiley (@MITRE.org) > wrote: > >> I'm skeptical that whatever the difference is is relevant in the scheme >> of >> things. The cost to keeping it is introducing confusion on users, and >> more >> code to maintain. >> > > its pretty

Re: Should ASCIIFoldingFilter be deprecated?

2011-02-08 Thread Robert Muir
On Tue, Feb 8, 2011 at 9:12 AM, David Smiley (@MITRE.org) wrote: > I'm skeptical that whatever the difference is is relevant in the scheme of > things. The cost to keeping it is introducing confusion on users, and more > code to maintain. > its pretty significant. charfilters are not reusable, a

Re: Umlauts as Char

2011-02-08 Thread Stefan Bodewig
On 2011-02-08, Prescott Nasser wrote: > So I can take the source codes word that 'ü' is the u with dots over > it (becuase it says replace umlauts in the source notes). But, I > guess, is that really true? Is that perhaps u with a carrot over it > instead? I think the case has been settled by no

RE: Should ASCIIFoldingFilter be deprecated?

2011-02-08 Thread David Smiley (@MITRE.org)
Chris Hostetter-3 wrote: > > CharFilters and TokenFilters have different purposes though... > > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#When_To_use_a_CharFilter_vs_a_TokenFilter > > (ie: If you use MappingCharFilter, you can't then tokenize on some of the > characters you

[jira] Commented: (SOLR-2342) Lock starvation can cause commit to never run when many clients are adding docs

2011-02-08 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12991965#comment-12991965 ] Michael McCandless commented on SOLR-2342: -- OK, passing true to the ReentrantReadW

[jira] Updated: (LUCENE-2911) synchronize grammar/token types across StandardTokenizer, UAX29EmailURLTokenizer, ICUTokenizer, add CJK types.

2011-02-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2911: Attachment: LUCENE-2911.patch after applying the patch, you have to run 'ant jflex' from modules/

[jira] Created: (LUCENE-2911) synchronize grammar/token types across StandardTokenizer, UAX29EmailURLTokenizer, ICUTokenizer, add CJK types.

2011-02-08 Thread Robert Muir (JIRA)
synchronize grammar/token types across StandardTokenizer, UAX29EmailURLTokenizer, ICUTokenizer, add CJK types. -- Key: LUCENE-2911 URL: https://issues.apac

Re: Should ASCIIFoldingFilter be deprecated?

2011-02-08 Thread Robert Muir
On Mon, Feb 7, 2011 at 10:51 PM, Steven A Rowe wrote: > I haven't done any benchmarking, but I'm pretty sure that ASCIIFoldingFilter > can achieve a significantly higher throughput rate than MappingCharFilter, > and given that, it probably makes sense to keep both, to allow people to make > the

Re: CustomScoreQueryWithSubqueries

2011-02-08 Thread Simon Willnauer
Hi Fernando, I didn't follow this really but in general we fix stuff in trunk and then backport to older versions. Usually if something is useful for 2.9 its also useful for 4.0 & 3.x if the issue still applies. simon On Tue, Feb 8, 2011 at 12:34 PM, Fernando Wasylyszyn wrote: > Hi Doron. Thank

Re: CustomScoreQueryWithSubqueries

2011-02-08 Thread Fernando Wasylyszyn
Hi Doron. Thanks for your answer. Maybe the question seems simple, but I want to be sure about the procedure. By the way, there is a chance, if the patch is really useful, that it could be "adapted" for other versions (in this case, lucene 3.0). Thanks. Regards. Fernando. ___

[jira] Updated: (LUCENE-2881) Track FieldInfo per segment instead of per-IW-session

2011-02-08 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2881: -- Attachment: lucene-2881.patch * Creates for every segment a new FieldInfos * Changes Field

[jira] Commented: (SOLR-1191) NullPointerException in delta import

2011-02-08 Thread Gunnlaugur Thor Briem (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12991884#comment-12991884 ] Gunnlaugur Thor Briem commented on SOLR-1191: - bq. There seems to be TestSqlEnt