Re: Does Solr support 'Value Search'?

2012-08-07 Thread Mikhail Khludnev
Hello, Have you checked http://lucidworks.lucidimagination.com/display/lweug/Wildcard+Queries ? On Wed, Aug 8, 2012 at 12:56 AM, Bing Hua wrote: > Hi folks, > > Just wondering if there is a query handler that simply takes a query string > and search on all/part of fields for field values? > > e

Re: Recovery problem in solrcloud

2012-08-07 Thread Jam Luo
Aug 06, 2012 10:05:55 AM org.apache.solr.common.SolrException log SEVERE: null:java.lang.RuntimeException: java.lang.OutOfMemoryError: Java heap space at org.apache.solr.servlet.SolrDispatchFilter.sendError(SolrDispatchFilter.java:456) at org.apache.solr.servlet.SolrDispatchFilter.d

Re: Is this too much time for full Data Import?

2012-08-07 Thread Mikhail Khludnev
Hello, Does your indexer utilize CPU/IO? - check it by iostat/vmstat. If it doesn't, take several thread dumps by jvisualvm sampler or jstack, try to understand what blocks your threads from progress. It might happen you need to speedup your SQL data consumption, to do this, you can enable threads

Solr Ping Request Handler Response problem

2012-08-07 Thread vempap
Hello, I've a problem with SOLR 4.0 Alpha ping request handler. If there are many cores & if I do start all the solr instances and they are up & running successfully, when I do a create index it fails with logs saying that one of the instances is down. I really donno why it is happening as start

Re: replication from lucene to solr

2012-08-07 Thread Lance Norskog
Look at how the older rsync-based snapshooter works: it uses the Unix rsync program to very efficiently spot and copy updated files in the master index. It runs from each query slave, just like Java replication. Unlike Java replication, it just uses the SSH copy protocol, and does not talk to the m

Re: exclusions by query and many values

2012-08-07 Thread Jack Krupansky
The usual technique is via filter queries that constrain what documents a user query can "see", either by OR-ing the doc classes it can see or starting with "*:*" and NOT-ing the doc classes it can't see, or a combination of the two techniques. The filter queries could either be supplied as qu

Re: Solr 4.0 Alpha incompatible with Index created with 3 Months old trunk code

2012-08-07 Thread Jack Krupansky
The last index format change I recall seeing was on June 11, which was like two weeks before the 4.0 Alpha. So, yeah, any 4.0 index created before that June 11 commit would have to be reindexed. -- Jack Krupansky -Original Message- From: roz dev Sent: Tuesday, August 07, 2012 7:38

exclusions by query and many values

2012-08-07 Thread caddmngr
we have Solr docs for manufacturer parts, where each part is available from 100's of suppliers. those suppliers are stored within each Mfg part document in a multi-field. customers searching our parts by keyword against part titles and descriptions, each have a unique list of what suppliers they a

Re: Custom Search Logic

2012-08-07 Thread Lance Norskog
Does "closeby" mean other words nearby in the text? For that, you want Lucene or Solr. Lucene is a text search library which does this nearby search very very quickly, and Solr is an app that wraps Lucene. On Tue, Aug 7, 2012 at 1:14 PM, Michael Della Bitta wrote: > Hello Joshy, > > You might wan

RE: null:java.lang.RuntimeException: [was class java.net.SocketTimeoutException] null

2012-08-07 Thread Markus Jelsma
Jack, There are no peculiarities in the JVM graphs. Only increase in used threads and GC time. Heap space is collected quickly and doesn't suddenly increase. There's only 256MB available for the heap but it's fine. Yonik, I'll increase the time out to five seconds tomorrow and try to reproduc

Re: null:java.lang.RuntimeException: [was class java.net.SocketTimeoutException] null

2012-08-07 Thread Yonik Seeley
Could this be just a simple case of a socket timeout? Can you raise the timout on request threads in Tomcat? It's a lot easier to reproduce/diagnose stuff like this when people use the stock jetty server shipped with Solr. -Yonik http://lucidimagination.com On Tue, Aug 7, 2012 at 5:39 PM, Marku

Solr 3.5 vs 3.6

2012-08-07 Thread bbarani
Hi, I heard SOLR 3.5 performs better than SOLR 3.6 (I havent tested though, will do that very soon), just want to hear thoughts from this forum regarding that... Also is SOLR 3.6 a stable release? Thanks, BB -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-3-5-vs-3-6-t

Re: java.lang.RuntimeException: [was class java.net.SocketTimeoutException] null

2012-08-07 Thread Jack Krupansky
I'm wondering if the timeout occurs due to a JVM garbage collection due to a large number of Lucene segments to merge. What is the JVM heap usage like, compared to the total heap space available? In other words, maybe the JVM needs more heap memory. -- Jack Krupansky -Original Message

RE: null:java.lang.RuntimeException: [was class java.net.SocketTimeoutException] null

2012-08-07 Thread Markus Jelsma
A signicant detail is the batch size which we set to 64 documents due to earlier memory limitations. We index segments of roughly 300-500k records each time. Lowering the batch size to 32 lead to an early internal server error and the stack trace below. Increasing it to 128 allowed us to index s

Solr index storage strategy on FileSystem

2012-08-07 Thread Bing Hua
Hi folks, With StandardDirectoryFactory, index is stored under data/index in forms of frq, tim, tip and a few other files. While index grows larger, more files are generated and sometimes it merges a few of them. It's like there're some kinds of separation and merging strategies there. My questio

Re: Syntax for parameter substitution in function queries?

2012-08-07 Thread Yonik Seeley
On Tue, Aug 7, 2012 at 3:01 PM, Timothy Hill wrote: > Hello, all ... > > According to http://wiki.apache.org/solr/FunctionQuery/#What_is_a_Function.3F, > it is possible under Solr 4.0 to perform parameter substitutions > within function queries. > > However, I can't get the syntax provided in the

Re: Solr search – Tika extracted text from PDF not return highlighting snippet

2012-08-07 Thread Jack Krupansky
The out-of-the-box example for SolrCell/Tika redirects the Tika "content" to the "text" field, which is not stored/highlighted, so the Tika content is indexed but not retrievable/highligtable. What field are you highlighting for your database text? You should direct your Tika "content" to a st

Does Solr support 'Value Search'?

2012-08-07 Thread Bing Hua
Hi folks, Just wondering if there is a query handler that simply takes a query string and search on all/part of fields for field values? e.g. q=*admin* Response may look like author: [admin, system_admin, sub_admin] last_modifier: [admin, system_admin, sub_admin] doctitle: [AdminGuide, AdminMan

Solr search – Tika extracted text from PDF not return highlighting snippet

2012-08-07 Thread anarchos78
Greetings friends, I have successfully indexed Pdf –using Tika- and pure text –fetched from database- in one single collection. Now I am trying to implement highlighting. When I querying Solr i placing in the url the following: http://localhost:8090/solr/ktimatologio/select/?q=BlahBlah&; &start=0&

Re: Custom Search Logic

2012-08-07 Thread Michael Della Bitta
Hello Joshy, You might want to look at MoreLikeThis: http://wiki.apache.org/solr/MoreLikeThis Michael Della Bitta Appinions | 18 East 41st St., Suite 1806 | New York, NY 10017 www.appinions.com Where Influence Isn’t a Game On Tue, Aug 7, 2012 a

Re: Select where in select

2012-08-07 Thread in.abdul
Yes you use filter query ... check filter query . Syed Abdul kather send from Samsung S3 On Aug 7, 2012 2:28 PM, "JoniJnm [via Lucene]" < ml-node+s472066n3999545...@n3.nabble.com> wrote: > Hi! > > I'm trying to do a query with a select in another. > > I would like to do something like: > > selec

Custom Search Logic

2012-08-07 Thread joshy_m
I am a new user to Solr and I am still learning the techniques used here. I had a requirement to do a relative search based on a specific logic. Its something like, I have a text string which when searched, should return all the items that are matching that text string for an attribute and all th

Re: Connect to SOLR over socket file

2012-08-07 Thread Walter Underwood
Yes. You connect over a socket and talk HTTP. --wunder On Aug 7, 2012, at 12:43 PM, Jason Axelson wrote: > Hi, > > Is it possible to connect to SOLR over a socket file as is possible > with mysql? I've looked around and I get the feeling that I may be > mi-understanding part of SOLR's architectu

Connect to SOLR over socket file

2012-08-07 Thread Jason Axelson
Hi, Is it possible to connect to SOLR over a socket file as is possible with mysql? I've looked around and I get the feeling that I may be mi-understanding part of SOLR's architecture. Any pointers are welcome. Thanks, Jason

Syntax for parameter substitution in function queries?

2012-08-07 Thread Timothy Hill
Hello, all ... According to http://wiki.apache.org/solr/FunctionQuery/#What_is_a_Function.3F, it is possible under Solr 4.0 to perform parameter substitutions within function queries. However, I can't get the syntax provided in the documentation there to work *at all* with Solr 4.0 out of the box

Re: Synonym file for American-British words

2012-08-07 Thread SUJIT PAL
Hi Alex, I implemented something similar using the rules described in this page: http://en.wikipedia.org/wiki/American_and_British_English_spelling_differences The idea is to normalize the British spelling form to the American form during indexing and query using a tokenizer that takes in a wo

Re: Wildcard searches in phrases throws exception

2012-08-07 Thread Tomás Fernández Löbbe
Maybe you can take a look at this Jira: https://issues.apache.org/jira/browse/SOLR-1604 On Tue, Aug 7, 2012 at 2:54 PM, Alexander Cougarman wrote: > Hi, > > Is it possible to do wildcard searches on multiple words? Here's an > example: We need to search on the words "Dearly loved friends" using t

null:java.lang.RuntimeException: [was class java.net.SocketTimeoutException] null

2012-08-07 Thread Markus Jelsma
Hello, We sometimes see the error below in our `master` when indexing. Our master is currently the node we send documents to - we've not yet implemented CloudSolrServer in Apache Nutch. This causes the indexer to crash when using Nutch locally, the task is retried when running on Hadoop. We're

Wildcard searches in phrases throws exception

2012-08-07 Thread Alexander Cougarman
Hi, Is it possible to do wildcard searches on multiple words? Here's an example: We need to search on the words "Dearly loved friends" using this text:dearly * friends This blows up Solr with this exception. From my Googling, I see that the error has to do with too many tokens being creat

replication from lucene to solr

2012-08-07 Thread Robert Stewart
Hi, I have a client who uses Lucene in a home grown CMS system they developed in Java. They have a lot of code that uses the Lucene API directly and they can't change it now. But they also need to use SOLR for some other apps which must use the same Lucene index data. So I need to make a good w

Re: Recovery problem in solrcloud

2012-08-07 Thread Mark Miller
Still no idea on the OOM - please send the stacktrace if you can. As for doing a replication recovery when it should not be necessary, yonik just committed a fix for that a bit ago. On Aug 7, 2012, at 9:41 AM, Mark Miller wrote: > > On Aug 7, 2012, at 5:49 AM, Jam Luo wrote: > >> Hi >> I

Re: Multiple Embedded Servers Pointing to single solrhome/index

2012-08-07 Thread Bing Hua
Thanks Lance. The use case is to have a cluster of nodes which runs the same application with EmbeddedSolrServer on each of them, and they all point to the same index on NFS. Every application is designed equal, meaning that everyone may index and/or search. In such way, after every commit the wr

RE: Synonym file for American-British words

2012-08-07 Thread Alexander Cougarman
Sorry, the VarCon file is here: http://wordlist.sourceforge.net/ Sincerely, Alex -Original Message- From: Alexander Cougarman [mailto:acoug...@bwc.org] Sent: 7 August 2012 5:09 PM To: solr-user@lucene.apache.org Subject: Synonym file for American-British words Dear friends, Is there

Synonym file for American-British words

2012-08-07 Thread Alexander Cougarman
Dear friends, Is there a downloadable synonym file for American-British words? This page has some, for example the VarCon file, but it's not in the Solr synonym.txt file. We need something that can normalize words like "center" to "centre". The VarCon file has it, but it's in the wrong format.

Re: Stemming questions

2012-08-07 Thread Jack Krupansky
You could use a synonym filter to map "ran" to "run". ran => run (and apply same filter at query and index time) or ran, run (only apply filter at index time, synonym filtering not needed at query time) But you would have to manually add all such word forms. -- Jack Krupansky -Original

Re: Recovery problem in solrcloud

2012-08-07 Thread Mark Miller
On Aug 7, 2012, at 5:49 AM, Jam Luo wrote: > Hi >I have big index data files more then 200g, there are two solr > instance in a shard. leader startup and is ok, but the peer alway OOM > when it startup. Can you share the OOM msg and stacktrace please? > The peer alway download index

RE: Recovery problem in solrcloud

2012-08-07 Thread Markus Jelsma
Perhaps this describes your problem: https://issues.apache.org/jira/browse/SOLR-3685 -Original message- > From:Jam Luo > Sent: Tue 07-Aug-2012 11:52 > To: solr-user@lucene.apache.org > Subject: Recovery problem in solrcloud > > Hi > I have big index data files more then 200g, t

Re: termFrequncy off and still use fastvector highlighter?

2012-08-07 Thread Tanguy Moal
Hum sorry I think I didn't get your point right! Maybe what you want to do is more like providing a custom similarity for scoring of matches, see http://lucene.apache.org/core/old_versioned_docs/versions/3_5_0/api/core/org/apache/lucene/search/package-summary.html#changingSimilarity That way you

Select where in select

2012-08-07 Thread JoniJnm
Hi! I'm trying to do a query with a select in another. I would like to do something like: select?q=*:* AND id_user=5&fl=id_other select?q=test AND -id(the result of the other select) So: select?q=test AND -id(select?q=* AND id_user=5&fl=id_other) Is it possible? Or I have to do two separates

Re: termFrequncy off and still use fastvector highlighter?

2012-08-07 Thread Tanguy Moal
May be it wasn't clear in my response, sorry! You can use a different field for searching (qf parameter for dismax) than the one for highlighting (hl.fl) : q="a phrase query"&qf="text_without_termFreqs"&hl=on&hl.fl="text_with_termFreqs". Scoring will be based on fq's fields only (i.e. those withou

Re: Stemming questions

2012-08-07 Thread Tanguy Moal
Dear Alexander, A few questions on stemming support in Solr 3.6.1: > - Can you do non-English stemming? > With solr, many languages are supported, see http://wiki.apache.org/solr/LanguageAnalysis - We're using solr.PorterStemFilterFactory on the "text_en" field type. We > will index a ton of PD

Re: Updating document with the Solr Java API

2012-08-07 Thread Sami Siren
On Tue, Jul 31, 2012 at 5:16 PM, Jonatan Fournier wrote: > Hi, > > What is the Java syntax to create an update document? Currently to update a field in a document with solrj you need to do something like this: doc = new SolrInputDocument(); doc.addField("id", "unique"); doc.addField(

Stemming questions

2012-08-07 Thread Alexander Cougarman
Dear friends, A few questions on stemming support in Solr 3.6.1: - Can you do non-English stemming? - We're using solr.PorterStemFilterFactory on the "text_en" field type. We will index a ton of PDF, DOCX, etc. docs in multiple languages. Is this the best filter factory to use for stemming?