[jira] Updated: (SOLR-1967) New Native PHP Response Writer Class

2010-06-21 Thread Israel Ekpo (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Israel Ekpo updated SOLR-1967: -- Attachment: phpnative.tar.gz phpnativeresponsewriter.jar Attaching the source code and t

Korean analyzer in PyLucene

2010-06-21 Thread Boris D
Hi, I would like to make an additional analyzer for Korean available to pylucene (http://sourceforge.net/projects/lucenekorean/). Can someone describe the procedure to do so? Thanks in advance. -- Boris

[jira] Updated: (SOLR-1959) SolrJ GET operation does not send correct encoding

2010-06-21 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lance Norskog updated SOLR-1959: Attachment: SOLR-1959.patch This patch applies against tags/release-1.4.0 and trunk, so this bit of

Re: can lucene search more than one word

2010-06-21 Thread Otis Gospodnetic
Hi, The words "have" and "on" are probably in your list of stopwords and are being removed from the query/documents. Please use the u...@lucene list for questions about Lucene usage. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ -

[jira] Created: (SOLR-1967) New Native PHP Response Writer Class

2010-06-21 Thread Israel Ekpo (JIRA)
New Native PHP Response Writer Class Key: SOLR-1967 URL: https://issues.apache.org/jira/browse/SOLR-1967 Project: Solr Issue Type: New Feature Components: clients - php, Response Writers Aff

[jira] Updated: (LUCENE-2507) automaton spellchecker

2010-06-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2507: Attachment: LUCENE-2507.patch prototype patch that adds 'DirectSpellChecker', with some tests show

[jira] Created: (LUCENE-2507) automaton spellchecker

2010-06-21 Thread Robert Muir (JIRA)
automaton spellchecker -- Key: LUCENE-2507 URL: https://issues.apache.org/jira/browse/LUCENE-2507 Project: Lucene - Java Issue Type: New Feature Components: contrib/spellchecker Reporter: Robert Muir

[jira] Updated: (LUCENE-2348) DuplicateFilter incorrectly handles multiple calls to getDocIdSet for segment readers

2010-06-21 Thread Karthick Sankarachary (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karthick Sankarachary updated LUCENE-2348: -- Attachment: LUCENE-2348.patch > DuplicateFilter incorrectly handles multiple c

[jira] Commented: (LUCENE-2348) DuplicateFilter incorrectly handles multiple calls to getDocIdSet for segment readers

2010-06-21 Thread Karthick Sankarachary (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12881010#action_12881010 ] Karthick Sankarachary commented on LUCENE-2348: --- Hi, All, Having run into t

[jira] Updated: (LUCENE-2506) A Stateful Filter That Works Across Index Segments

2010-06-21 Thread Karthick Sankarachary (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karthick Sankarachary updated LUCENE-2506: -- Attachment: LUCENE-2506.patch > A Stateful Filter That Works Across Index Segm

[jira] Updated: (LUCENE-2506) A Stateful Filter That Works Across Index Segments

2010-06-21 Thread Karthick Sankarachary (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karthick Sankarachary updated LUCENE-2506: -- Fix Version/s: 3.0.2 Affects Version/s: 3.0.2 Component/s: In

[jira] Created: (LUCENE-2506) A Stateful Filter That Works Across Index Segments

2010-06-21 Thread Karthick Sankarachary (JIRA)
A Stateful Filter That Works Across Index Segments -- Key: LUCENE-2506 URL: https://issues.apache.org/jira/browse/LUCENE-2506 Project: Lucene - Java Issue Type: Improvement Reporter

Re: Distributed search forces standard request handler?

2010-06-21 Thread Mark Miller
On 6/21/10 4:58 PM, Chris Hostetter wrote: : I registered a request handler named after my application. Aside from : various defaults that I configured, I also registered some components : I've written. When I tried to do a distributed search, I noticed that : the secondary Solr requests intern

Re: [VOTE] RC2 Release Solr 1.4.1

2010-06-21 Thread Ryan McKinley
dropped into existing 1.4 product -- all dependent tests pass and things behave well +1 On Thu, Jun 17, 2010 at 6:35 PM, Mark Miller wrote: > Let's try this again: > > Please vote on releasing the Solr 1.4.1 artifacts located at > http://people.apache.org/~markrmiller/staging-area/rc2 > > CHANG

Hudson build is back to normal : Lucene-3.x #48

2010-06-21 Thread Apache Hudson Server
See - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (LUCENE-2426) change sort order to binary order

2010-06-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12880974#action_12880974 ] Robert Muir commented on LUCENE-2426: - How to deal with Term? I really don't like th

[jira] Issue Comment Edited: (SOLR-1782) stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields

2010-06-21 Thread Wojtek Piaseczny (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12880964#action_12880964 ] Wojtek Piaseczny edited comment on SOLR-1782 at 6/21/10 8:15 PM: -

[jira] Updated: (SOLR-1782) stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields

2010-06-21 Thread Wojtek Piaseczny (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wojtek Piaseczny updated SOLR-1782: --- Attachment: SOLR-1782.2.patch First batch was unusably slow with ~1M documents. New patch uses

Re: Brics Automaton version

2010-06-21 Thread Robert Muir
On Mon, Jun 21, 2010 at 3:16 PM, eks dev wrote: > i would even argue it makes sense to keep some (all?) of these methods, > especially if intended use of the Automaton code gets expanded to Analyzer > chains. This particular method has usage in our code for optimizing matching > based on minimum

Re: Brics Automaton version

2010-06-21 Thread Robert Muir
On Mon, Jun 21, 2010 at 3:16 PM, eks dev wrote: > ok, that explains it, but I didn't expect it, considering small size of the > library. > well, it's not that small. for example, the original brics jar is 170KB. Our minimal use takes up significantly less space (i don't remember i think 30-40KBish

Re: Brics Automaton version

2010-06-21 Thread eks dev
ok, that explains it, but I didn't expect it, considering small size of the library. i would even argue it makes sense to keep some (all?) of these methods, especially if intended use of the Automaton code gets expanded to Analyzer chains. This particular method has usage in our code for optimizin

Re: Distributed search forces standard request handler?

2010-06-21 Thread Chris Hostetter
: I registered a request handler named after my application. Aside from : various defaults that I configured, I also registered some components : I've written. When I tried to do a distributed search, I noticed that : the secondary Solr requests internal to the workings of the distributed : s

Re: Distributed Search Components

2010-06-21 Thread Chris Hostetter
: I mean the implementation of the distributed search in Solr. Those classes : that are responsible for the search-logic. I mean, from somewhere the : searcher (or whatever) must get the knowledge about which shards exist, : which of them to query and what their addresses are. : I want to learn m

Re: Brics Automaton version

2010-06-21 Thread Chris Hostetter
: Subject: Brics Automaton version : In-Reply-To: <28834904.106271277107103144.javamail.j...@thor> http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fres

Re: [VOTE] RC2 Release Solr 1.4.1

2010-06-21 Thread Mark Miller
On 6/21/10 1:39 PM, Yonik Seeley wrote: On Thu, Jun 17, 2010 at 6:35 PM, Mark Miller wrote: Please vote on releasing the Solr 1.4.1 artifacts located at http://people.apache.org/~markrmiller/staging-area/rc2 +1 -Yonik http://www.lucidimagination.com -

Re: [VOTE] RC2 Release Solr 1.4.1

2010-06-21 Thread Yonik Seeley
On Thu, Jun 17, 2010 at 6:35 PM, Mark Miller wrote: > Please vote on releasing the Solr 1.4.1 artifacts located at > http://people.apache.org/~markrmiller/staging-area/rc2 +1 -Yonik http://www.lucidimagination.com - To unsubscr

Re: [VOTE] RC2 Release Solr 1.4.1

2010-06-21 Thread Bill Au
+1 I tested it with my app which has custom analysis filter, handler, parser, and update processor. Everything works without any problem. Bill On Fri, Jun 18, 2010 at 5:06 PM, Mark Miller wrote: > On 6/18/10 4:52 PM, Chris Hostetter wrote: > > version info still indicates local modifications

Re: Brics Automaton version

2010-06-21 Thread Robert Muir
we are based on the latest version (1.11.2). getShortestExample (among other methods) is not available because we don't have anything using it in lucene... we only have the stuff we need. On Mon, Jun 21, 2010 at 11:22 AM, eks dev wrote: > I have been trying to use automaton library from Lucen

I'm looking for a consultant/expert developer Lucene/Solr

2010-06-21 Thread keepu.pedro
keepU comunicació interactiva is looking for a Lucene/Solr consultant or expert developer to optimize the indexing and search engine of a web application. The main task is the analysis and optimization of the processes of indexing and search technologies based on lucene / SOLR and identify

[jira] Resolved: (LUCENE-2381) Use packed ints for sort ords (in FieldCache.getStringIndex/.getTermBytesIndex)

2010-06-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-2381. Resolution: Duplicate Dup of LUCENE-2380. > Use packed ints for sort ords (in >

Brics Automaton version

2010-06-21 Thread eks dev
I have been trying to use the automaton library from Lucene (instead of a direct import of the brics lib), and noticed some methods I need are not there (e.g. getShortestExample). Looking at the change log of the brics automaton (http://www.brics.dk/automaton/ChangeLog): 1.3-1 -> 1.3-2 =

[jira] Created: (SOLR-1966) QueryElevationComponent: Add option to return only the specified results

2010-06-21 Thread Grant Ingersoll (JIRA)
QueryElevationComponent: Add option to return only the specified results Key: SOLR-1966 URL: https://issues.apache.org/jira/browse/SOLR-1966 Project: Solr Issue Type: I

Re: Solr relational data

2010-06-21 Thread Jan Høydahl / Cominvent
Hi, Please re-post this question to the solr-user list, and we'll answer over there. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Training in Europe - www.solrtraining.com On 21 June 2010, at 12:04, Chris Finch wrote:

Hudson build is back to normal : Solr-trunk #1184

2010-06-21 Thread Apache Hudson Server
See - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (SOLR-1954) Highlighter component should expose snippet character offsets and the score.

2010-06-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12880805#action_12880805 ] Robert Muir commented on SOLR-1954: --- bq. And without either, there's no feature here to di

[jira] Commented: (SOLR-1954) Highlighter component should expose snippet character offsets and the score.

2010-06-21 Thread David Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12880798#action_12880798 ] David Smiley commented on SOLR-1954: Character offsets may not be perfect, but bytes are

Re: Build failed in Hudson: Lucene-3.x #46

2010-06-21 Thread Robert Muir
for a while, hudson has been failing about 50% of the time due to some problems on the machine. At this point it's useless; I propose we disable it and seek an alternative. On Mon, Jun 21, 2010 at 6:04 AM, Apache Hudson Server < hud...@hudson.zones.apache.org> wrote: > See

RE: can lucene search more than one word

2010-06-21 Thread Itamar Syn-Hershko
This happens because those two words are "stop words", and are being filtered by StandardAnalyzer, which is what you probably used. You may want to use java-u...@lucene.apache.org for this type of question. Itamar. > -Original Message- > From: danielkimo [mailto:danielk...@gmail.com]
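
To illustrate the stop-word explanation above, here is a minimal sketch in plain Java of how an analyzer that drops stop words can leave a query like "have on" with nothing to match. It does not use Lucene: the class name StopWordDemo, the analyze method, and the stop list are all hypothetical, and whether a given word is on the real list depends on the analyzer and its configuration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class StopWordDemo {
    // Hypothetical stop list, for illustration only; not Lucene's actual list.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("a", "and", "have", "on", "the"));

    // Lowercase the text, split on non-alphanumeric characters,
    // and drop every token that appears in the stop list.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty() && !STOP_WORDS.contains(token)) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Every word of this query is a stop word, so no terms remain to search for.
        System.out.println(analyze("have on"));
        // Only the non-stop words survive analysis.
        System.out.println(analyze("The cat sat on a mat"));
    }
}
```

If the same filtering is applied at index time and at query time, a query made up only of stop words ends up with no terms, which would be consistent with the zero hits reported in this thread.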

RE: Doppleganger threads after ingestion completed

2010-06-21 Thread karl.wright
Some answers below. (1) netstat -an shows no sockets at all. Remember, the client process is gone, dead, shut down. (2) This is Solr 1.5 from approximately mid-March. (3) Autocommit was on, using the standard configuration present in the example. This could well be a jetty bug and, no, I have n

Re: Distributed Search Components

2010-06-21 Thread MitchK
Hoss, > I honestly don't know what you mean by "those implementations" and "both > implementations" ... impls of what? > I mean the implementation of the distributed search in Solr. Those classes that are responsible for the search-logic. I mean, from somewhere the searcher (or whatever) mus

Build failed in Hudson: Lucene-3.x #46

2010-06-21 Thread Apache Hudson Server
See -- [...truncated 20377 lines...] [javadoc] Building index for all classes... [javadoc] Generating

Solr relational data

2010-06-21 Thread Chris Finch
I want to be able to store property information in Solr, including descriptions, tags, keywords etc. This is really easy to do. But also I need to be able to store a range of dates that the property is available along with costings. Currently we're us

Re: can lucene search more than one word

2010-06-21 Thread Li Li
BooleanQuery bQuery=new BooleanQuery(); TermQuery tQuery=new TermQuery(new Term("title","cat")); bQuery.add(tQuery,BooleanClause.Occur.SHOULD); tQuery=new TermQuery(new Term("title","dog")); bQuery.add(tQuery,BooleanCla

can lucene search more than one word

2010-06-21 Thread danielkimo
Dear all, I used Lucene to index the BNC (British National Corpus). However, when I search for "have on", the result is always zero, while the BNC website returns 82 results. I think the problem is that Lucene cannot search for two words at the same time. Does anyone have the same experience? Thanks -- View this me

[jira] Commented: (LUCENE-2505) The system cannot find the file specified - _0.fdt

2010-06-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12880760#action_12880760 ] Michael McCandless commented on LUCENE-2505: Can you post a complete testing (

[jira] Commented: (SOLR-1316) Create autosuggest component

2010-06-21 Thread Ankul Garg (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12880751#action_12880751 ] Ankul Garg commented on SOLR-1316: -- Stefan, yes the tree needs to be re-built at each commi

[jira] Created: (LUCENE-2505) The system cannot find the file specified - _0.fdt

2010-06-21 Thread Tej Kiran Sharma (JIRA)
The system cannot find the file specified - _0.fdt -- Key: LUCENE-2505 URL: https://issues.apache.org/jira/browse/LUCENE-2505 Project: Lucene - Java Issue Type: Bug Components: Index