Re: solr version error

2013-06-14 Thread Jenny Huang
This is a duplicate of another topic "strange solr version error ( http://lucene.472066.n3.nabble.com/strange-solr-version-error-td4070636.html)". Please ignore or delete it. On Fri, Jun 14, 2013 at 5:54 PM, Jenny Huang < sunearthmoonwaterf...@gmail.com> wrote: > Hi, > > I need to use solrj to d

Re: strange solr version error

2013-06-14 Thread Jenny Huang
Thanks a lot, Shawn. It works now. Have a nice weekend, -Jenny On Fri, Jun 14, 2013 at 6:47 PM, Shawn Heisey wrote: > On 6/14/2013 4:26 PM, Jenny Huang wrote: > > Hi, > > > > I need to use solrj to do a full data import from a table in database, > and > > e

Re: strange solr version error

2013-06-14 Thread Shawn Heisey
On 6/14/2013 4:26 PM, Jenny Huang wrote: > Hi, > > I need to use solrj to do a full data import from a table in database, and > encountered the solr version error: "java.lang.RuntimeException: Invalid > version (expected 2, but 60) or the data in not in 'javabin' format". To > figure out what wen

solr version error

2013-06-14 Thread Jenny Huang
Hi, I need to use solrj to do a full data import from a table in database, and encountered the solr version error: "java.lang.RuntimeException: Invalid version (expected 2, but 60) or the data in not in 'javabin' format". To figure out what went wrong, I stripped the program to bare bone and let

Re: retrieve datefield value from document

2013-06-14 Thread Mingfeng Yang
Figured out the solution. The datefield in those documents were stored as binary, so what I should do is Fieldable df = doc.getFieldable(fname); byte[] ary = df.getBinaryValue(); ByteBuffer bb = ByteBuffer.wrap(ary); long num = bb.getLong(); ate dt = DateTools.stringToDate(DateTools.timeToString(

yet another optimize question

2013-06-14 Thread Petersen, Robert
Hi guys, We're on solr 3.6.1 and I've read the discussions about whether to optimize or not to optimize. I decided to try not optimizing our index as was recommended. We have a little over 15 million docs in our biggest index and a 32gb heap for our jvm. So without the optimizes the index fo

Re: retrieve datefield value from document

2013-06-14 Thread Michael Della Bitta
Use EmbeddedSolrServer rather than Lucene directly. On Jun 14, 2013 6:47 PM, "Mingfeng Yang" wrote: > How did you solve the problem then? > > MIng > > > On Fri, Jun 14, 2013 at 3:24 PM, Michael Della Bitta < > michael.della.bi...@appinions.com> wrote: > > > Yes, that should be what happens. But t

strange solr version error

2013-06-14 Thread Jenny Huang
Hi, I need to use solrj to do a full data import from a table in database, and encountered the solr version error: "java.lang.RuntimeException: Invalid version (expected 2, but 60) or the data in not in 'javabin' format". To figure out what went wrong, I stripped the program to bare bone and let

Re: retrieve datefield value from document

2013-06-14 Thread Mingfeng Yang
How did you solve the problem then? MIng On Fri, Jun 14, 2013 at 3:24 PM, Michael Della Bitta < michael.della.bi...@appinions.com> wrote: > Yes, that should be what happens. But then I'd guess you'd be able to > retrieve no dates. I've encountered this myself. > On Jun 14, 2013 6:05 PM, "Mingfe

Re: retrieve datefield value from document

2013-06-14 Thread Michael Della Bitta
Yes, that should be what happens. But then I'd guess you'd be able to retrieve no dates. I've encountered this myself. On Jun 14, 2013 6:05 PM, "Mingfeng Yang" wrote: > Michael, > > That's what I thought as well. I would assume an optimization of the index > would rewrite all documents in the ne

Solr cloud: zkHost in solr.xml gets wiped out

2013-06-14 Thread Al Wold
Hi, I'm working on setting up a solr cloud test environment, and the target environment I need to put it in has multiple webapps per tomcat instance. With that in mind, I wanted/had to avoid putting any configs in system properties. I tried putting the zkHost in solr.xml, like this: > > >

Re: retrieve datefield value from document

2013-06-14 Thread Mingfeng Yang
HI Dmitry, No, the docs are not deleted. Ming- On Fri, Jun 14, 2013 at 1:31 PM, Dmitry Kan wrote: > Maybe a document was marked as deleted? > > *isDeleted< > http://lucene.apache.org/core/3_6_0/api/all/org/apache/lucene/index/IndexReader.html#isDeleted(int) > > > * > > > On Fri, Jun 14, 2013

Re: data consistency in solrcloud cluster deployed in aws

2013-06-14 Thread Luis Carlos Guerrero Covo
Thank you for your reply otis. I found two open issues which may relate to this issue: https://issues.apache.org/jira/browse/SOLR-4924 https://issues.apache.org/jira/browse/SOLR-4260 We recently changed some settings to make commits happen on a more periodic nature (5 mins or 25000 docs). Before

Re: retrieve datefield value from document

2013-06-14 Thread Mingfeng Yang
Michael, That's what I thought as well. I would assume an optimization of the index would rewrite all documents in the newer format then? Ming- On Fri, Jun 14, 2013 at 1:25 PM, Michael Della Bitta < michael.della.bi...@appinions.com> wrote: > Shot in the dark: > > You're using Lucene to read

Re: data consistency in solrcloud cluster deployed in aws

2013-06-14 Thread Otis Gospodnetic
Yes, sounds like it's because of the second node being in a different AZ. In AWS, AZ really means a DC (Data Center), so the node that is in a different AZ/DC is naturally going to replicate more slowly. Otis -- Solr & ElasticSearch Support -- http://sematext.com/ On Fri, Jun 14, 2013 at 8:5

RE: Slow Highlighter Performance Even Using FastVectorHighlighter

2013-06-14 Thread Andy Brown
Bryan, For specifics, I'll refer you back to my original email where I specified all the fields/field types/handlers I use. Here's a general overview. I really only have 3 fields that I index and search against: "name", "description", and "content". All of which are just general text (string) f

Re: retrieve datefield value from document

2013-06-14 Thread Dmitry Kan
Maybe a document was marked as deleted? *isDeleted * On Fri, Jun 14, 2013 at 11:25 PM, Michael Della Bitta < michael.della.bi...@appinions.com> wrote: > Shot in the dark: > > You're using Lucene

Re: retrieve datefield value from document

2013-06-14 Thread Michael Della Bitta
Shot in the dark: You're using Lucene to read the index. That's sort of circumventing all the typing stuff that Solr does. Solr can deal with an index where some of the segments are in one format (say 1.4) and others are in another (3.6). Maybe they're being stored in a format in the newer (or old

retrieve datefield value from document

2013-06-14 Thread Mingfeng Yang
I have an index first built with solr1.4 and later upgraded to solr3.6, which has 150million documents, and all docs have a datefield which are not blank. (verified by solr query). I am using the following code snippet to retrieve import org.apache.lucene.index.IndexReader; import org.apache.luce

Re: SolrCloud excluding certain files in conf from zookeeper

2013-06-14 Thread Daniel Collins
We had something similar, we had backup copies of files that were getting uploaded to ZK and we didn't want them to. The morale I learned from that was that the files for ZK don't need to live anywhere under the Solr deployment area, they can be in a totally separate directory structure (in fa

Re: Debugging Solr XSL

2013-06-14 Thread O. Olson
Thank you Upayavira & Miguel. I decided to use Visual Studio – since I can at least set breakpoints and do interactive debugging in the UI. I hope the way Visual Studio treats XSL is the same as Solr - else I would have problems :-). Thanks again, O.O. -- View this message in context: http://l

SolrCloud excluding certain files in conf from zookeeper

2013-06-14 Thread Bill Au
When using SolrCloud, is it possible to exclude certain files in the conf directory from being loaded into Zookeeper? We are keeping our own solr related config files in the conf directory that is actually different for each node. Right now the copy in Zookeeper is overriding the local copy. Bil

Re: Solr Server Add causes java.net.SocketException: No buffer space available

2013-06-14 Thread Travis Low
If it's a windows box, then you may be experiencing a kernel sockets leak problem. http://support.microsoft.com/kb/2577795 On Fri, Jun 14, 2013 at 1:20 PM, Shawn Heisey wrote: > On 6/14/2013 8:57 AM, Snubbel wrote: > >> Hello, >> >> I am upgrading from Solr 4.0 to 4.3 and a Testcase that worke

Re: Solr Server Add causes java.net.SocketException: No buffer space available

2013-06-14 Thread Shawn Heisey
On 6/14/2013 8:57 AM, Snubbel wrote: Hello, I am upgrading from Solr 4.0 to 4.3 and a Testcase that worked fine is failing since. I do commit 1 Documents to Solr, then reload them and add a value to a multi-valued field with Atomic Update. I do commit every 50 Documents, so it's not so many

Re: The 'threads' parameter in DIH - SOLR 4.3.0

2013-06-14 Thread Java One
Hello,      I'm more than happy to contribute to this effort as well.    We are still on Solr 3.5 and never got solr 'threads' working properly. I've heard much of this was fixed in 3.6 but still a bit buggy and deprecated in later versions. Fully support in 4.X is a major wish-list item, th

Re: Suggest and Filtering

2013-06-14 Thread Brendan Grainger
Hi Otis, Sorry was a bit tired when I wrote that. I think what I'd like is to be able spellcheck the suggestions. For example. If a user types in brayk (as opposed to brake) I'd still get the following suggestions say: brake line brake condition Does that make sense? Thanks Brendan On Thu, J

Re: ngroups does not show correct number of groups when used in SolrCloud

2013-06-14 Thread Markus.Mirsberger
Hi Shreejay, Thanks for the info. I read about this too but as far as I understand this this feature is not really usefull in my case. This means I would have to reindex my documents just to get the grouping that I need now. It would be OK to do that one time but I would have to do that again

Replicas and soft commit

2013-06-14 Thread Giovanni Bricconi
I have recently upgraded our application from solr 3.6 to solr 4.2.1, and I have just started learning about soft commits and partial updates. Currently I have one indexing node and 3 replicas of the same core, and every modification goes through a dih delta index. This is usually ok but I have so

Re: Atomic Update Configurations how to?

2013-06-14 Thread Snubbel
Thanks, I started out with the original solrconfig and schema now, and it works. Just need to put in everything we are missing and figure out what the problem really was. Best regards Snubbel -- View this message in context: http://lucene.472066.n3.nabble.com/Atomic-Update-Configurations-ho

Solr Server Add causes java.net.SocketException: No buffer space available

2013-06-14 Thread Snubbel
Hello, I am upgrading from Solr 4.0 to 4.3 and a Testcase that worked fine is failing since. I do commit 1 Documents to Solr, then reload them and add a value to a multi-valued field with Atomic Update. I do commit every 50 Documents, so it's not so many at once, because the multi-valued fie

Re: Solr 3.5 Optimization takes index file size almost double

2013-06-14 Thread Pravin Bhutada
One thing that you can try is optimize incrementally. Instead of optimizing to 1 segment, optimize to 100, then 50 , 25, 10 ,5 ,2 ,1 After each step, the index size should go down. This way you dont have to wait 7 hours to get some results. Pravin On Fri, Jun 14, 2013 at 10:45 AM, Viresh Modi <

Re: Solr 3.5 Optimization takes index file size almost double

2013-06-14 Thread Viresh Modi
Hi pravin I have nearly 2 TB Disk space for optimization.And after optimization get response of Qtime nearly 7hours (Obvious which in milisecond).So i think not issue of disk space. Thanks& Regards, Viresh modi Mobile: 91 (0) 9714567430 On 14 June 2013 20:10, Pravin Bhutada wrote: > Hi Vi

Re: Solr 3.5 Optimization takes index file size almost double

2013-06-14 Thread Pravin Bhutada
Hi Viresh, How much free disc space do you have? if you have dont have enough space on disc, optimization process stops and rollsback to some intermediate state. Pravin On Fri, Jun 14, 2013 at 2:50 AM, Viresh Modi wrote: > Hi Rafal > > Here i attached solr index file snapshot as well .. >

Re: The 'threads' parameter in DIH - SOLR 4.3.0

2013-06-14 Thread Mikhail Khludnev
Hello, Most times users end-up with coding multithread SolrJ indexer that I consider as a sad thing. As 3.x fix contributor I want to share my vision to the problem. While I did that work I realized that join operation itself is too hard and even impossible to make concurrent. I propose to add con

Re: How spell checker used if indexed document is containing misspelled words

2013-06-14 Thread Shreejay
Hi, Have you tried this? http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.onlyMorePopular Of course this is assuming that your corpus has correct words occurring more frequently than incorrect ones! -- Shreejay On Friday, June 14, 2013 at 2:49, venkatesham.gu...@igate.com wrot

Re: Solr using a ridiculous amount of memory

2013-06-14 Thread Toke Eskildsen
On Fri, 2013-06-14 at 14:55 +0200, John Nielsen wrote: > Sorry for not getting back to the list sooner. Time not important, only feedback important (apologies to Fifth Element). > After some major refactoring, our 15 cores have now turned into ~500 cores > and our memory consumption has dropped d

Re: ngroups does not show correct number of groups when used in SolrCloud

2013-06-14 Thread Shreejay
Hi Markus, For ngroups to work in a cloud environment you have to make sure that all docs belonging to a group reside on the same shard. Custom hashing has been introduced in the recent versions of solr cloud. You might want to look into that https://issues.apache.org/jira/browse/SOLR-2592

Re: Solr using a ridiculous amount of memory

2013-06-14 Thread John Nielsen
Sorry for not getting back to the list sooner. It seems like I finally solved the memory problems by following Toke's instruction of splitting the cores up into smaller chunks. After some major refactoring, our 15 cores have now turned into ~500 cores and our memory consumption has dropped dramati

data consistency in solrcloud cluster deployed in aws

2013-06-14 Thread Luis Carlos Guerrero Covo
Hi, I currently have solrcloud setup with single shards and two nodes behind a load balancer in aws. I also have an additional node in the cluster which is outside the load balancer (not receiving any client requests) importing data into the cluster using data import handler. So that takes my clus

Re: Solr Hangs on startup

2013-06-14 Thread Jack Krupansky
What are the last few lines of the Solr log? No errors, exceptions, or warnings? -- Jack Krupansky -Original Message- From: Cool Techi Sent: Friday, June 14, 2013 4:49 AM To: solr-user@lucene.apache.org Subject: Solr Hangs on startup Hi, We are using solr4.3 cloud setup, but for som

ngroups does not show correct number of groups when used in SolrCloud

2013-06-14 Thread Markus.Mirsberger
Hi, I just noticed (after long time testing and finally looking into the docu :p) that the ngroups parameter does not show the correct number of groups when used in anything else than a single shard environment (in my case SolrCloud). Is there another way to get the amount of all groups with

Want to avoid setting the solr.xml in conf/Catalina/localhost

2013-06-14 Thread bsargurunathan
Hi All, I want to avoid to set the solr.xml in the conf/Catalina/localhost path for Tomcat server in Windows OS. So please suggest me to do that with the sample configuration. Thanks, Guru -- View this message in context: http://lucene.472066.n3.nabble.com/Want-to-avoid-setting-the-solr-xm

How spell checker used if indexed document is containing misspelled words

2013-06-14 Thread venkatesham.gu...@igate.com
My data is picked from social media sites and misspelled words are very frequent in social text because of the informal mode of communication.Spellchecker does not work here because misspelled words are present in the text corpus and not in the search query. Finding documents with all the different

Re: Debugging Solr XSL

2013-06-14 Thread Miguel
Hi You can use an online xsl validator, example: http://xslttest.appspot.com/ but I think it's better use XSLT editor. It's sure visual studio should have someone. regars. El 13/06/2013 23:45, O. Olson escribió: Hi, I am attempting to transform the XML output of Solr using the Xs

Solr Hangs on startup

2013-06-14 Thread Cool Techi
Hi, We are using solr4.3 cloud setup, but for some reason solr fails to startup, I see the following in the log file, post this there are no logs., org.apache.solr.search.SolrIndexSearcher ? Opening Searcher@17586ed7 main 15293 [coreLoadExecutor-3-thread-1] INFO org.apache.solr.core.SolrCore

RE: Solr 4.0 Optimize query very slow before the optimize end of a few minutes

2013-06-14 Thread Jeffery Wang
Yes, I used the same query url for each curl-call, it is very simple "http://...q=OS01W:sina*&fl=SecId,OS01W&rows=1&wt=xml&indent=true";. -Original Message- From: Toke Eskildsen [mailto:t...@statsbiblioteket.dk] Sent: 2013年6月14日 16:20 To: solr-user@lucene.apache.org Subject: RE: Solr 4.

RE: Solr 4.0 Optimize query very slow before the optimize end of a few minutes

2013-06-14 Thread Toke Eskildsen
On Fri, 2013-06-14 at 06:59 +0200, Jeffery Wang wrote: > Time queryTime(ms), CPU % r/s w/s rMB/s wMB/s IO % > ... > 7:30:52 16594 26 36 0 0.140 99.3 > 7:30:53 31 80 368 0 42.43 0 94.3 > 7:31:23 28575