Re: solr result problem

2011-03-28 Thread Grijesh
Try LucidImagination's KStemmer - Thanx: Grijesh www.gettinhahead.co.in -- View this message in context: http://lucene.472066.n3.nabble.com/solr-result-problem-tp2746849p2747106.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Suggest component

2011-03-28 Thread Grijesh
Have you checked with q=*:*? You mentioned buildOnCommit=true in your config, so have you checked that your indexing process ends with a commit? - Thanx: Grijesh www.gettinhahead.co.in -- View this message in context: http://lucene.472066.n3.nabble.com/Suggest-component-tp2725438p2747100.html Sent

Re: copyField at search time / multi-language support

2011-03-28 Thread Markus Jelsma
I haven't tried this as an UpdateProcessor but it relies on Tika and that LanguageIdentifier works well, except for short texts. > Thanks Markus. > > Do you know if this patch is good enough for production use? Thanks. > > Andy > > --- On Tue, 3/29/11, Markus Jelsma wrote: > > From: Markus Je

Re: copyField at search time / multi-language support

2011-03-28 Thread Andy
Thanks Markus. Do you know if this patch is good enough for production use? Thanks. Andy --- On Tue, 3/29/11, Markus Jelsma wrote: > From: Markus Jelsma > Subject: Re: copyField at search time / multi-language support > To: solr-user@lucene.apache.org > Cc: "Andy" > Date: Tuesday, March 29,

Re: copyField at search time / multi-language support

2011-03-28 Thread Markus Jelsma
https://issues.apache.org/jira/browse/SOLR-1979 > Tom, > > Could you share the method you use to perform language detection? Any open > source tools that do that? > > Thanks. > > --- On Mon, 3/28/11, Tom Mortimer wrote: > > From: Tom Mortimer > > Subject: copyField at search time / multi-lang

Re: solr result problem

2011-03-28 Thread anurag.walia
Is there any other filter that can solve my singular/plural problem? Anurag -- View this message in context: http://lucene.472066.n3.nabble.com/solr-result-problem-tp2746849p2746956.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: solr result problem

2011-03-28 Thread Gora Mohanty
On Tue, Mar 29, 2011 at 10:41 AM, anurag.walia wrote: > It will be polymers, but the results differ between polymer and > polymers (singular/plural), > and there can be more words like polymer. [...] Your only alternative then is to implement a filter that works the way you want it to.
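
As a rough illustration of the "implement your own filter" suggestion above, here is a minimal sketch of a plural-only stemmer. It is a toy with deliberately crude rules, not a Solr TokenFilter and not the KStemmer or Snowball algorithm; the function name `strip_plural` and the `protected` parameter are hypothetical and only mimic the idea of a protwords.txt list.

```python
def strip_plural(word, protected=frozenset()):
    """Strip a simple English plural ending and leave the rest of the word alone."""
    lower = word.lower()
    # Protected or very short words are returned unchanged.
    if lower in protected or len(lower) <= 3:
        return word
    # "ponies" -> "pony"
    if lower.endswith("ies"):
        return word[:-3] + "y"
    # "polymers" -> "polymer", but keep "glass" and "status" intact.
    if lower.endswith("s") and not lower.endswith(("ss", "us")):
        return word[:-1]
    return word


for w in ("polymer", "polymers", "bag", "bags"):
    print(w, "->", strip_plural(w))
```

With rules like these, singular and plural forms map to the same term without the more aggressive suffix stripping that turns "polymer" into "polym".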

Re: solr result problem

2011-03-28 Thread anurag.walia
It will be polymers, but the results differ between polymer and polymers (singular/plural), and there can be more words like polymer. Regards Anurag Walia -- View this message in context: http://lucene.472066.n3.nabble.com/solr-result-problem-tp2746849p2746947.html Sent from the Solr -

Re: [WKT] Spatial Searching

2011-03-28 Thread Mattmann, Chris A (388J)
LGPL licenses and Apache aren't exactly compatible, see: http://www.apache.org/legal/3party.html#transition-examples-lgpl http://www.apache.org/legal/resolved.html#category-x In practice, this was the reason we started the SIS project. Cheers, Chris On Mar 28, 2011, at 11:16 AM, Smiley, David W

Re: solr result problem

2011-03-28 Thread Gora Mohanty
On Tue, Mar 29, 2011 at 10:12 AM, anurag.walia wrote: > Hi Gora, > > Thanks for the reply. > > I applied SnowballPorterFilterFactory to remove the difference in results > between plural and singular. > If I enter polymer it works fine, but polymers still gives me > "polym", > while bag o

Re: solr result problem

2011-03-28 Thread anurag.walia
Hi Gora, Thanks for the reply. I applied SnowballPorterFilterFactory to remove the difference in results between plural and singular. If I enter polymer it works fine, but polymers still gives me "polym", while bag or bags gives me bag after SnowballPorterFilterFactory. Please find t

Re: solr result problem

2011-03-28 Thread Gora Mohanty
On Tue, Mar 29, 2011 at 9:46 AM, anurag.walia wrote: [...] > I have a problem with the number of characters in the Term Text. I entered > "Polymer", but after SnowballPorterFilterFactory it becomes "Polym", although it > is not listed in the "protwords.txt" file. I want if any word does not exist in > "protwords.

Re: copyField at search time / multi-language support

2011-03-28 Thread Andy
Tom, Could you share the method you use to perform language detection? Any open source tools that do that? Thanks. --- On Mon, 3/28/11, Tom Mortimer wrote: > From: Tom Mortimer > Subject: copyField at search time / multi-language support > To: solr-user@lucene.apache.org > Date: Monday, Marc

solr result problem

2011-03-28 Thread anurag.walia
Thanks in advance. Please find a screenshot of the analyzer for this problem: http://lucene.472066.n3.nabble.com/file/n2746849/solr.jpg I have a problem with the number of characters in the Term Text. I entered "Polymer", but after SnowballPorterFilterFactory it becomes "Polym", although it is not listed

Re: copyField at search time / multi-language support

2011-03-28 Thread Gora Mohanty
On Mon, Mar 28, 2011 at 2:15 PM, Tom Mortimer wrote: > Hi, > > Here's my problem: I'm indexing a corpus with text in a variety of > languages. I'm planning to detect these at index time and send the > text to one of a suitably-configured field (e.g. "mytext_de" for > German, "mytext_cjk" for Chine

Re: Question about the message Indexing failed. Rolled back all changes

2011-03-28 Thread Gora Mohanty
On Mon, Mar 28, 2011 at 3:59 PM, Firdous Ali wrote: > Hi, > I am unable to index data; it looks like the datasource is not even read by > Solr. I even created an empty dataimport.properties file at /conf but the > problem > persists. [...] Look at the Solr log files, which will probably have an except

Fields not being indexed?

2011-03-28 Thread Charles Wardell
Sorry for inadvertently Hijacking the last thread. Can someone take a look at this and let me know what I am doing wrong. According to luke, only guid, tags, and aquiDate are available. Schema is below as well. http://twitter.com/AshleyxArsenic/statuses/52164920388763648 []

Re: Fields not being indexed?

2011-03-28 Thread Chris Hostetter
: Subject: Fields not being indexed? : In-Reply-To: : References: : : http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you c

Fields not being indexed?

2011-03-28 Thread Charles Wardell
Can someone take a look at this and let me know what I am doing wrong. According to luke, only guid, tags, and aquiDate are available. Schema is below as well. http://twitter.com/AshleyxArsenic/statuses/52164920388763648 []

Re: Broken stats.js

2011-03-28 Thread Mark Mandel
Ah cool, thanks for your help. I'll get digging, and see what I can do. Mark On Tue, Mar 29, 2011 at 11:36 AM, Chris Hostetter wrote: > > : I can't seem to find any references to this issue anywhere except : > : https://issues.apache.org/jira/browse/SOLR-1750 > : > : (Which has more of a workar

Re: DIH relating multiple DataSources

2011-03-28 Thread Chris Hostetter
: Subject: DIH relating multiple DataSources : In-Reply-To: <1301054278.18711.1433747...@webmail.messagingengine.com> : References: : : : <1301054278.18711.1433747...@webmail.messagingengine.com> http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When start

Re: Broken stats.js

2011-03-28 Thread Chris Hostetter
: I can't seem to find any references to this issue anywhere except : : https://issues.apache.org/jira/browse/SOLR-1750 : : (Which has more of a workaround), and it seems that the SolrInfoMBeanHandler : is not in the 1.4.1 build. correct, it will be in 3.1 however. it's not so much a workarroun

Javabin to JSON convertor

2011-03-28 Thread paulohess
Hi guys, I have a Javabin object (it is actually a List data structure) and I need to convert it to a JSON object. I am using "Gson" and pass my List to it: gson.toJson(myList); the return value is just the same, with a couple of double quotes (") added to the beginning and end. Could anybody help here? T

Re: Cant retrieve data

2011-03-28 Thread Walter Andreas Pucko
On Mon, 28 Mar 2011 13:12 +0100, "Upayavira" wrote: > What query are you doing? > /solr/select/?q=welpe%0D%0A&version=2.2&start=0&rows=10&indent=on > Try q=*:* > returns: − 0 5 − *:* > Also, what does /solr/admin/stats.jsp report for number of docs? That's a good question. Core s

Question about the message Indexing failed. Rolled back all changes

2011-03-28 Thread Firdous Ali
Hi, I am unable to index data; it looks like the datasource is not even read by Solr. I even created an empty dataimport.properties file at /conf but the problem persists. Following is the response text: − 0 0 − − /home/username/data-config.xml full-import debug idle Configuration Re-loaded su

Highlighting Problem

2011-03-28 Thread pottwal1
dear solr specialists, my data looks like this: j]s(dh)fjk [hf]sjkadh asdj(kfh) [skdjfh aslkfjhalwe uigfrhj bsd bsdfga sjfg asdlfj. if I want to query for the first "word", the following queries must match: j]s(dh)fjk j]s(dhfjk j]sdhfjk jsdhfjk dhf So the matching should ignore some character

copyField at search time / multi-language support

2011-03-28 Thread Tom Mortimer
Hi, Here's my problem: I'm indexing a corpus with text in a variety of languages. I'm planning to detect these at index time and send the text to one of a suitably-configured field (e.g. "mytext_de" for German, "mytext_cjk" for Chinese/Japanese/Korean etc.) At search time I want to search all of

Re: SOLR - problems with non-english symbols when extracting HTML

2011-03-28 Thread kushti
Grijesh wrote: > > Try to send HTML data using format CDATA . > Doesn't work with > $content = ""; > And my goal is not to avoid extraction, but have no problems with non-english chars -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-problems-with-non-english-symb

Re: Solr 1.4.1 and Tika 0.9 - some tests not passing

2011-03-28 Thread Andreas Kemkes
I'm still interested in what steps I could take to get to the bottom of the failing tests. Is there additional information that I should provide? Some of the output below got mangled in the email - here are the (hopefully) complete lines: This has a http://www.apache.org";>link. (Tika 0.9)

Re: [WKT] Spatial Searching

2011-03-28 Thread Estrada Groups
Outstanding! Thanks David...I can't wait to take a look at it. Adam Sent from my iPhone On Mar 28, 2011, at 2:16 PM, "Smiley, David W." wrote: > (This is one of those messages that I would have responded to at the time if > I only noticed it.) > > There is not yet indexing of arbitrary shape

Re: Solrcore.properties

2011-03-28 Thread Jayendra Patil
Can you please attach the other files. It doesn't seem to find the enable.master property, so you may want to check the properties file exists on the box having issues We have the following configuration in the core :- Core - - solrconfig.xml - Master & Slave

Re: Cant retrieve data

2011-03-28 Thread Gora Mohanty
On Mon, Mar 28, 2011 at 4:58 PM, Merlin Morgenstern wrote: [...] You should probably hide passwords when posting to public lists. >         >         >             >             >             [...] Your select does not seem to have the ID field. Regards, Gora

Re: problems indexing web content

2011-03-28 Thread Markus Jelsma
> I have about 1000 documents per xml file. I am not really doing anything > with the data other than putting the xml tags around it. So essentially > the data is okay with the exception of a few documents that are causing > the errors. > > Let's say document # 47 in the xml file has a problem, i

Re: [WKT] Spatial Searching

2011-03-28 Thread Smiley, David W.
(This is one of those messages that I would have responded to at the time if I only noticed it.) There is not yet indexing of arbitrary shapes (i.e. your data can only be points), but with SOLR-2155 you can query via WKT thanks to JTS. If you want to index shapes then you'll have to wait a mon

Re: problems indexing web content

2011-03-28 Thread Charles Wardell
I have about 1000 documents per xml file. I am not really doing anything with the data other than putting the xml tags around it. So essentially the data is okay with the exception of a few documents that are causing the errors. Let's say document # 47 in the xml file has a problem, is the whole

Role of the "name" in spellchecker declaration. Can there be multiple instances of it?

2011-03-28 Thread Teruhiko Kurosaka
In the spellchecker search component declaration: http://wiki.apache.org/solr/SpellCheckComponent#Configuration What role does the "name" play, which is "default" in this sample? Can this be any arbitrary name? Should this name match with something else in the configuration files? I came to thi

Re: problems indexing web content

2011-03-28 Thread Markus Jelsma
Also, don't forget to encode entities or wrap them in CDATA. > Jan, > > thank you for such a quick reply. I have a feed coming in that I convert to > an Here is the type for text including index > and query with the changes suggested. > > > positionIncrementGap="100"> >
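
The advice above (encode entities or wrap the content in CDATA) can be sketched as follows. This is only an illustration under assumptions: the field names "content" and "raw" and the helper names are hypothetical, and the `<add><doc><field>` layout is the standard Solr XML update format rather than anything quoted from this thread.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr


def field_escaped(name, text):
    """Encode &, < and > as entities so the markup stays well-formed."""
    return f"<field name={quoteattr(name)}>{escape(text)}</field>"


def field_cdata(name, text):
    """Wrap the text in CDATA, splitting any literal ']]>' so it cannot end the section early."""
    safe = text.replace("]]>", "]]]]><![CDATA[>")
    return f"<field name={quoteattr(name)}><![CDATA[{safe}]]></field>"


def add_doc(fields):
    """Build a one-document update message from already serialized fields."""
    return "<add><doc>" + "".join(fields) + "</doc></add>"


def field_texts(xml_text):
    """Parse the message back; this raises ParseError if it is not well-formed."""
    return [f.text for f in ET.fromstring(xml_text).iter("field")]


html = "<p>Fish & chips ]]> done</p>"
message = add_doc([field_escaped("content", html), field_cdata("raw", html)])
print(field_texts(message))
```

Both variants round-trip the original text, so the choice is mostly about readability of the generated file.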

Re: problems indexing web content

2011-03-28 Thread Markus Jelsma
The analyzer order doesn't really matter: char filters are always executed first, regardless of their position in the analyzer. Multiple filters of the same type, however, are affected by order. Also, your error is not caused by a faulty analyzer, there is something wrong in your XML. Anyway, accordi
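
To make the ordering point above concrete, here is a toy model of an analysis chain. It is not Solr's API: `analyze`, `strip_html`, `lowercase` and `remove_stopwords` are hypothetical stand-ins, and whitespace splitting stands in for the tokenizer. It only shows that char filters run on the raw text before tokenizing, while the order of token filters changes the result.

```python
import re

STOPWORDS = {"the"}


def strip_html(text):
    """Char filter: operates on the raw character stream."""
    return re.sub(r"<[^>]+>", " ", text)


def lowercase(tokens):
    """Token filter: lowercases every token."""
    return [t.lower() for t in tokens]


def remove_stopwords(tokens):
    """Token filter: drops tokens that exactly match a (lowercase) stopword."""
    return [t for t in tokens if t not in STOPWORDS]


def analyze(text, char_filters, token_filters):
    # Char filters always run first, whatever order the config lists them in.
    for char_filter in char_filters:
        text = char_filter(text)
    tokens = text.split()  # stand-in for the tokenizer
    # Token filters run afterwards, in the order given.
    for token_filter in token_filters:
        tokens = token_filter(tokens)
    return tokens


print(analyze("<b>The</b> Cat", [strip_html], [lowercase, remove_stopwords]))
print(analyze("<b>The</b> Cat", [strip_html], [remove_stopwords, lowercase]))
```

In the second call the stopword filter sees "The" before it is lowercased, so the token survives.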

Re: problems indexing web content

2011-03-28 Thread Charles Wardell
Jan, thank you for such a quick reply. I have a feed coming in that I convert to an Here is the type for text including index and query with the changes suggested.

Re: problems indexing web content

2011-03-28 Thread Jan Høydahl
Hi, I assume you try to post HTML files from post.jar, and use HTMLStripCharFilter to sanitize the HTML. But you refer to "my file" as if you have multiple docs in one file? XML or HTML? Multiple files? To what UpdateRequestHandler are you posting? /update/xml or /update/extract ? For us to und

problems indexing web content

2011-03-28 Thread Charles Wardell
Hi Everyone, I setup a server and began to index my data. I have two questions I am hoping someone can help me with. Many of my files seem to index without any problems. Others, I get a host of different errors. I am indexing primarily web based content and have identified my text field as foll

Re: Cant retrieve data

2011-03-28 Thread Jeffrey Chang
I'm also new but I was able to get DIH working. From your response you have: ... Indexing completed. Added/Updated: 0 documents. Deleted 0 documents. ... 0 I believe your fetch (db source and query) is correct based on the response but perhaps your mapping isn't. I would check required fields o

Re: Default operator

2011-03-28 Thread Brian Lamb
Thank you both for your input. I ended up using Ahmet's way because it seems to fit better with the rest of the application. On Sat, Mar 26, 2011 at 6:02 AM, lboutros wrote: > The other way could be to extend the SolrQueryParser to read a per field > default operator in the solr config file. The

Re: copyField destination does not exist

2011-03-28 Thread Geert-Jan Brits
The error is saying you have a copyfield-directive in schema.xml that wants to copy the value of a field to the destination field 'text' that doesn't exist (which indeed is the case given your supplied fields) Search your schema.xml for 'copyField'. There's probably something configured related to
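
A quick way to spot the problem described above is to compare copyField destinations with the declared fields. The following is a minimal sketch, assuming a schema.xml that uses plain `<field>` and `<copyField>` elements; the function name is hypothetical and `dynamicField` patterns are deliberately ignored, so a destination matched only by a dynamic field would be reported as missing.

```python
import xml.etree.ElementTree as ET


def missing_copyfield_destinations(schema_xml):
    """Return copyField destinations that have no matching <field> declaration."""
    root = ET.fromstring(schema_xml)
    declared = {f.get("name") for f in root.iter("field")}
    wanted = {c.get("dest") for c in root.iter("copyField") if c.get("dest")}
    return sorted(wanted - declared)


SCHEMA = """
<schema name="example">
  <fields>
    <field name="id" type="string"/>
    <field name="name" type="string"/>
  </fields>
  <copyField source="name" dest="text"/>
</schema>
"""

print(missing_copyfield_destinations(SCHEMA))
```

Either declaring the missing destination field or removing the copyField directive should make the reported list empty.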

Re: delete by query

2011-03-28 Thread Gastone Penzo
i resolved: http://10.0.0.178:8983/solr/update?stream.body= (field1:value1)AND(field2:value2) Thanx 2011/3/28 Gastone Penzo > Hi, > i want to use delete by query method to delete indexes. > i try for example: > > http://10.0.0.178:8983/solr/update?stream.body= > field1:value > > and it works >
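
The XML inside stream.body appears to have been stripped from the archived message. As a sketch under that assumption, the request can be built with Solr's standard `<delete><query>` update message; the helper name `delete_by_query_url` is hypothetical, and the URL-encoding step is what keeps the spaces around AND from breaking the request.

```python
from urllib.parse import urlencode
from xml.sax.saxutils import escape


def delete_by_query_url(base_url, *clauses):
    """Build an update URL whose stream.body deletes documents matching all clauses."""
    query = " AND ".join(f"({clause})" for clause in clauses)
    # Assumed message format: the standard Solr XML delete-by-query body.
    body = f"<delete><query>{escape(query)}</query></delete>"
    params = urlencode({"stream.body": body})
    return f"{base_url}/update?{params}"


print(delete_by_query_url("http://10.0.0.178:8983/solr", "field1:value1", "field2:value2"))
```

A commit is still needed afterwards for the deletion to become visible to searches.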

delete by query

2011-03-28 Thread Gastone Penzo
Hi, I want to use the delete by query method to delete indexes. I tried for example: http://10.0.0.178:8983/solr/update?stream.body= field1:value and it works, but how can I delete indexes by 2 filters? http://10.0.0.178:8983/solr/update?stream.body=field1:value1 AND field2:value2 it doesn't work. i ne

copyField destination does not exist

2011-03-28 Thread Merlin Morgenstern
Hi there, I am trying to get solr indexing mysql tables. Seems like I have misconfigured schema.xml: HTTP ERROR: 500 Severe errors in solr configuration. - org.apache.solr.common.SolrException: copyField destination :'text' does not ex

Re: RamBufferSize and AutoCommit

2011-03-28 Thread Erick Erickson
Also note that making RAMBufferSize too big isn't useful. Lucid recommends 128M as the point over which you hit diminishing returns. But unless you're having problems speed-wise with the default, why change it? And are you actually getting OOMs or is this a background question? Best Erick On Mon

Re: Cant retrieve data

2011-03-28 Thread Upayavira
What query are you doing? Try q=*:* Also, what does /solr/admin/stats.jsp report for number of docs? Upayavira On Mon, 28 Mar 2011 04:28 -0700, "Merlin Morgenstern" wrote: > Hi there, > > I am new to solr and have just installed it on a suse box with mysql > backend. > > Install and MySQL c

Cant retrieve data

2011-03-28 Thread Merlin Morgenstern
Hi there, I am new to solr and have just installed it on a suse box with mysql backend. Install and MySQL connector seem to be running. I can see the solr admin interface. Now I tried to index a table with about 0.5 Mio rows. That seemed to work as well. However, I do get 0 results doing a querie

Re: RamBufferSize and AutoCommit

2011-03-28 Thread Li Li
There are 3 conditions that will trigger an auto flush in Lucene: 1. the size of the index in RAM is larger than the RAM buffer size; 2. the number of documents in memory is larger than the number set by setMaxBufferedDocs; 3. the number of deleted terms is larger than the number set by setMaxBufferedDeleteTerms. auto flushing by
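
The three triggers listed above can be modelled with a small predicate. This is a toy sketch of the decision only, not Lucene's IndexWriter code; the function and parameter names are hypothetical, `None` stands for "this limit is disabled", and the 32 MB default is taken from the question in this thread.

```python
def should_flush(ram_used_mb, buffered_docs, buffered_delete_terms,
                 ram_buffer_mb=32.0, max_buffered_docs=None,
                 max_buffered_delete_terms=None):
    """Return True if any enabled limit has been exceeded."""
    # 1. In-memory index size exceeds the RAM buffer size.
    if ram_buffer_mb is not None and ram_used_mb > ram_buffer_mb:
        return True
    # 2. Number of buffered documents exceeds the configured maximum.
    if max_buffered_docs is not None and buffered_docs > max_buffered_docs:
        return True
    # 3. Number of buffered delete terms exceeds the configured maximum.
    if (max_buffered_delete_terms is not None
            and buffered_delete_terms > max_buffered_delete_terms):
        return True
    return False


print(should_flush(ram_used_mb=10.0, buffered_docs=400, buffered_delete_terms=0))
print(should_flush(ram_used_mb=33.5, buffered_docs=400, buffered_delete_terms=0))
```

Any one condition is enough on its own, which is why a flush can happen well before an autocommit threshold is reached.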

RamBufferSize and AutoCommit

2011-03-28 Thread Isan Fulia
Hi all, I would like to know whether there is any relation between autocommit and rambuffersize. My Solr config does not contain rambuffersize, which means it is the default (32mb). Autocommit settings are after 500 docs or 80 sec, whichever is first. Solr starts with Xmx 2700M. Total RAM is 4 GB. Does the rambuffer

AW: stopwords not working in multicore setup

2011-03-28 Thread Martin Rödig
Hi, you must encode the umlaut in the URL. In your case it must be &q=title:f%FCr; then it should work. From: Christopher Bottaro [mailto:cjbott...@onespot.com] Sent: Friday, 25 March 2011 18:48 To: solr-user@lucene.apache.org Cc: Martin Rödig Subject: Re:
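
For reference, %FC is the single-byte Latin-1 (ISO-8859-1) encoding of "ü". A short sketch of producing it follows; the helper name `encode_term` is hypothetical, and which charset the server expects for query strings depends on the servlet container configuration, so treat the Latin-1 default here as an assumption taken from the reply above.

```python
from urllib.parse import quote


def encode_term(term, charset="latin-1"):
    """Percent-encode a query term using the given character set."""
    return quote(term, safe="", encoding=charset)


print("q=title:" + encode_term("für"))           # Latin-1 bytes
print("q=title:" + encode_term("für", "utf-8"))  # UTF-8 bytes
```

If the container is configured for UTF-8 URLs, the second form would be the one to send.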

Re: ComplexPhraseQueryParser and wildcards

2011-03-28 Thread jmr
Chandan Tamrakar-2 wrote: > > did you get any exceptions ? > usually wild card term you mentioned would be expanded before being > actually > searched . > No exception. Just no results returned. JMR -- View this message in context: http://lucene.472066.n3.nabble.com/ComplexPhraseQueryParser

Re: problem with snowballporterfilterfactory

2011-03-28 Thread anurag.walia
Thanks, Erick, for the reply. I used "protwords.txt" for matching the results for singular and plural words like bag and bags. Regards Anurag Walia -- View this message in context: http://lucene.472066.n3.nabble.com/problem-with-snowballporterfilterfactory-tp2729589p2742365.html Sent from the Sol

Re: ComplexPhraseQueryParser and wildcards

2011-03-28 Thread Chandan Tamrakar
did you get any exceptions ? usually wild card term you mentioned would be expanded before being actually searched . thanks. On Mon, Mar 28, 2011 at 1:24 PM, jmr wrote: > Hi, > > I'm using ComplexPhraseQueryParser and I'm quite happy with it. > However, there are some queries using wildcards no

ComplexPhraseQueryParser and wildcards

2011-03-28 Thread jmr
Hi, I'm using ComplexPhraseQueryParser and I'm quite happy with it. However, there are some queries using wildcards that are not working. Example: I want to do a proximity search between the word compiler and the expression 'cross linker' or 'cross linking' or 'cross linked' ... ("cross-linker compiler"~

Re: FW: DIH relating multiple DataSources

2011-03-28 Thread Jeffrey Chang
I'll post the solution to this thread myself (with a different email address). I did some debugging on the 1.4.1 source code; my issue is in the data-config.xml file: the field column name, when stored in the Map object, uses the DB's column casing (e.g. ID --> id): ... ... The config above does no
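
The casing pitfall described above can be reproduced with a plain map. This is a toy illustration only, not DIH code: the row contents, the column names and the helper `lookup_column` are hypothetical, and it simply shows why a case-sensitive key lookup misses when the configured column name and the database's column casing differ.

```python
def lookup_column(row, column):
    """Look up a column, falling back to a case-insensitive match."""
    if column in row:
        return row[column]
    folded = {key.lower(): value for key, value in row.items()}
    return folded.get(column.lower())


row = {"ID": 42, "NAME": "example"}  # keys as the database reports them

print(row.get("id"))             # exact-case lookup misses
print(lookup_column(row, "id"))  # case-insensitive lookup finds the value
```

Matching the column attribute in the config to the casing the database returns avoids the need for any such fallback.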