Re: Search results after importing from Dih

2010-08-26 Thread hemant.verma
Check your index folder: does it contain files other than segment files? If yes, then your data is in the index and you need to commit it. Try restarting Solr. -- View this message in context: http://lucene.472066.n3.nabble.com/Search-results-after-importing-from-Dih-tp1365720p1366104.html Sent

Re: Search results after importing from Dih

2010-08-26 Thread Grijesh.singh
Have you committed the data? Use a *:* query to see whether the data has been committed yet. -- View this message in context: http://lucene.472066.n3.nabble.com/Search-results-after-importing-from-Dih-tp1365720p1365927.html Sent from the Solr - User mailing list archive at Nabble.com.
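The two suggestions in this thread (commit, then check with a match-all query) can be sketched as follows. This is a minimal Python sketch that only builds the request URLs; the base URL http://localhost:8080/solr and the helper names are assumptions for illustration, not something stated in the thread.

```python
from urllib.parse import urlencode

# Assumed base URL of the Solr instance; adjust to your installation.
SOLR_BASE = "http://localhost:8080/solr"


def commit_url(base=SOLR_BASE):
    """URL asking the update handler to commit pending documents."""
    return base + "/update?" + urlencode({"commit": "true"})


def match_all_url(base=SOLR_BASE, rows=0):
    """URL for a *:* query; numFound in the response shows how many
    documents are visible to searches."""
    return base + "/select?" + urlencode({"q": "*:*", "rows": rows})


if __name__ == "__main__":
    print(commit_url())
    print(match_all_url())
```

If the match-all query reports zero documents after a commit, the import itself probably did not add anything, which points back at the DIH configuration rather than at the search side.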

Broken links in Solr FAQ's "Why don't International Characters Work?"

2010-08-26 Thread Teruhiko Kurosaka
In http://wiki.apache.org/solr/FAQ#Why_don.27t_International_Characters_Work.3F These three links are broken. http://www.nabble.com/International-Charsets-in-embedded-XML-tf1780147.html#a4897795 (International Charsets in embedded XML for Jetty 5.1) http://www.nabble.com/Problem-with-surrogate-

Search results after importing from Dih

2010-08-26 Thread Pavan Gupta
Hi, I was able to successfully index rows of a simple table in a MySQL database using DIH. However, when I tried searching for the indexed data using the Solr admin interface, no results based on the data in the table were displayed. Any idea why? Regards, Pavan

Re: Doing Shingle but also keep special single word

2010-08-26 Thread 朱炎詹
Thanks! It seems that I really went in the wrong direction. - Original Message - From: "Ahmet Arslan" To: Sent: Tuesday, August 24, 2010 4:21 PM Subject: Re: Doing Shingle but also keep special single word The request is from our business team, they wish users of our product could type in

Re: solr working...

2010-08-26 Thread satya swaroop
Hi all, Thanks for your response and information. I used the slf4j logger and put a log.info call in every class of the solr module to find out which classes get invoked for a particular request handler or when Solr starts. I was able to do this only in the solr module but not in the lucene module... I get an error wh

Re: Search Results optimization

2010-08-26 Thread Chris Hostetter
: : if user searches for "swingline red stapler hammer hand rigid", then : documents that matches max number of words written in query should come : first : e.g a document with name field as "swingline stapler" should come later than : the document with "swingline red stapler" at a fundamental le

Re: Delete by query issue

2010-08-26 Thread Chris Hostetter
: Here's the problem: the standard Solr parser is a little weird about : negative queries. The way to make this work is to say : *:* AND -field:[* TO *] the default parser actually works ok ... it's a bug specific to deletion... https://issues.apache.org/jira/browse/SOLR-381 -Hoss
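The workaround quoted above can be turned into a delete-by-query update message. The following is a minimal sketch that only builds the XML body; "field" is the placeholder field name from the quoted message, and the helper name is an assumption. Whether the explicit *:* prefix is still needed depends on your Solr version (see SOLR-381).

```python
import xml.etree.ElementTree as ET

# Query from the quoted message: all documents that have no value in "field".
MISSING_FIELD_QUERY = "*:* AND -field:[* TO *]"


def delete_by_query_xml(query):
    """Build a <delete><query>...</query></delete> update message."""
    delete = ET.Element("delete")
    ET.SubElement(delete, "query").text = query
    return ET.tostring(delete, encoding="unicode")


if __name__ == "__main__":
    print(delete_by_query_xml(MISSING_FIELD_QUERY))
```

The resulting string would be posted to the update handler and followed by a commit.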

Re: Is there any strss test tool for testing Solr?

2010-08-26 Thread Chris Hostetter
: References: : : In-Reply-To: : Subject: Is there any strss test tool for testing Solr? http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh e

Re: XSL import/include relative to app server home directory...

2010-08-26 Thread Chris Hostetter
Brian: I think the problem you are encountering is similar to this issue... https://issues.apache.org/jira/browse/SOLR-1656 ...if you have any thoughts on whether the patch/ideas in that issue would also solve the problem you are looking at please post a comment -Hoss -- http://lucenerevol

Re: Field getting tokenized prior to charFilter on select query

2010-08-26 Thread Chris Hostetter
You are seeing the effects of the default QueryParser. whitespace (like '+','-','"','*', etc...) is a "special character" to the Lucene QueryParser. Un-Escaped/Quoted whitespace tells the query parser to construct a BooleanQuery containing multiple clauses -- each clause is analyzed separatel
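The two ways of keeping the query parser from splitting on whitespace (escaping or quoting) can be sketched on the client side as below. The helper names are hypothetical, and this is only an illustration of the string handling, not of how Solr analyzes the resulting query.

```python
import re


def escape_whitespace(term):
    """Backslash-escape each whitespace character so the query parser
    does not split the term into separate clauses."""
    return re.sub(r"(\s)", r"\\\1", term)


def quote_phrase(text):
    """Alternative: wrap the text in double quotes, escaping any
    embedded backslashes and double quotes first."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return '"' + escaped + '"'


if __name__ == "__main__":
    print(escape_whitespace("new york"))
    print(quote_phrase("new york"))
```

Note that the two forms are not equivalent: quoting produces a phrase query, while escaping passes the whole string, whitespace included, to the field's analyzer as one chunk.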

Re: Query speed decreased dramatically, not sure why though?

2010-08-26 Thread Chris Hostetter
: ".../select?q=*&sort=evalDate+desc,score+desc&start=0&rows=10" : : This query takes around 5 seconds to complete. : : I changed the query to the following; : : ".../select?q=[* TO NOW]&sort=evalDate+desc,score+desc&start=0&rows=10" : : The query now returns in around 600 milliseconds. : : Ca

Re: Solr Admin Schema Browser and field named "keywords"

2010-08-26 Thread Chris Hostetter
: I have a field named "keywords" in my index. The schema browser page is not : able to deal with this, so I have trouble getting statistical information on : this field. When I click on the field, Firefox hangs for a minute and then : gives the "unresponsive script" warning. I assume (without

Re: Creating new Solr cores using relative paths

2010-08-26 Thread Chris Hostetter
: http://localhost:8080/solr/admin/cores : ?action=CREATE : &name=core1 : &instanceDir=core1 : &config=core0/conf/solrconfig.xml : &schema=core0/conf/schema.xml : (core1 is the name for the new core to be created, and I want to use the : config and schema from core0 to create the new core). : : b
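For reference, the CREATE request quoted above can be assembled programmatically. This sketch only rebuilds the URL from the parameters shown in the message; the function name is an assumption, and it says nothing about how Solr resolves the relative config and schema paths, which is the actual question in the thread.

```python
from urllib.parse import urlencode


def create_core_url(base, name, instance_dir, config, schema):
    """Build a CoreAdmin CREATE request URL from its parameters."""
    params = [
        ("action", "CREATE"),
        ("name", name),
        ("instanceDir", instance_dir),
        ("config", config),
        ("schema", schema),
    ]
    # Keep "/" unescaped so the paths stay readable in the URL.
    return base + "/admin/cores?" + urlencode(params, safe="/")


if __name__ == "__main__":
    print(create_core_url(
        "http://localhost:8080/solr",
        name="core1",
        instance_dir="core1",
        config="core0/conf/solrconfig.xml",
        schema="core0/conf/schema.xml",
    ))
```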

Re: spellcheck index blown away during rebuild

2010-08-26 Thread Chris Hostetter
: What you're talking about is effectively promoting the spellcheck : index to a first-class Solr index, instead of an appendage bolted on : the side of an existing core. Given sharding and distributed search, : this may be a better design. even w/o promoting the spell index to be a "main" index,

Status of Solr in the cloud?

2010-08-26 Thread Charlie Jackson
There seem to be a few parallel efforts at putting Solr in a cloud configuration. See http://wiki.apache.org/solr/KattaIntegration, which is based off of https://issues.apache.org/jira/browse/SOLR-1395. Also http://wiki.apache.org/solr/SolrCloud which is https://issues.apache.org/jira/browse/SOLR-1

Re: Search Results optimization

2010-08-26 Thread Rob Casson
you might find these helpful...similar question came up last week: http://ln-s.net/7WpX http://robotlibrarian.billdueber.com/solr-forcing-items-with-all-query-terms-to-the-top-of-a-solr-search/ not exactly the same, as this case wanted to boost if *every* term matched, but a similar ta

Re: Search Results optimization

2010-08-26 Thread Hasnain
Perhaps I wasn't clear in my earlier post. If a user searches for "swingline red stapler hammer hand rigid", then documents that match the maximum number of words in the query should come first, e.g. a document with name field "swingline stapler" should come later than the document with "swingline red
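The ordering being asked for can be illustrated with a toy sketch that ranks names purely by how many query words they contain. This is not how Solr or Lucene scores documents; it is only a plain-Python illustration of the requirement, with hypothetical function names.

```python
def matched_terms(query, name):
    """Count how many distinct query words appear in the name."""
    query_words = set(query.lower().split())
    return len(query_words & set(name.lower().split()))


def rank(query, names):
    """Order names so that those matching more query words come first."""
    return sorted(names, key=lambda n: matched_terms(query, n), reverse=True)


if __name__ == "__main__":
    query = "swingline red stapler hammer hand rigid"
    for name in rank(query, ["swingline stapler", "swingline red stapler"]):
        print(matched_terms(query, name), name)
```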

Cutom filter implementation, advice needed

2010-08-26 Thread Ingo Renner
Hi *, I implemented a custom filter and am using it through a QParserPlugin. I'm wondering however, whether my implementation is that clever yet... Here's my QParser; I'm wondering whether I should apply the filter to all documents in the index (I already guess it's a bad idea) or whether I sho

RE: how to deal with virtual collection in solr?

2010-08-26 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks so much for your help! I will try it. -Original Message- From: Thomas Joiner [mailto:thomas.b.joi...@gmail.com] Sent: Thursday, August 26, 2010 2:36 PM To: solr-user@lucene.apache.org Subject: Re: how to deal with virtual collection in solr? I don't know about the shards, etc. H

Re: how to deal with virtual collection in solr?

2010-08-26 Thread Thomas Joiner
I don't know about the shards, etc. However I recently encountered that exception while indexing pdfs as well. The way that I resolved it was to upgrade to a nightly build of Solr. (You can find them at https://hudson.apache.org/hudson/view/Solr/job/Solr-trunk/). The problem is that the version of

A few query issues with solr

2010-08-26 Thread David Yang
Hi, I'm new to using Solr, and I have started an index with it and it works great. I have encountered a few minor issues that I currently solve by modifying the query beforehand - however I feel like there is a much more configuration-oriented and Solr-correct way of achieving this. Current manu

RE: how to deal with virtual collection in solr?

2010-08-26 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks so much for your help, Jan Høydahl! I made multiple cores (aa public, aa private, bb public and bb private). I know how to query them individually. Please tell me whether I can now query a combination of them through the shards parameter. If yes, I tried to append &shards=aapub,bbpub after query string. Un
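One likely cause is that each entry in the shards parameter needs to be a full shard address (host:port/path), not a bare core name. A minimal sketch of building such a request follows; the host localhost:8080/solr and the function names are assumptions, while the core names aapub and bbpub come from the message above.

```python
from urllib.parse import urlencode


def shards_param(host, cores):
    """Join cores into a comma-separated list of host:port/path addresses
    (no http:// prefix)."""
    return ",".join(f"{host}/{core}" for core in cores)


def distributed_query_url(host, query_core, cores, q):
    """Query one core and let it fan the request out to all listed shards."""
    params = {"q": q, "shards": shards_param(host, cores)}
    # Keep common query and address characters readable in the URL.
    return f"http://{host}/{query_core}/select?" + urlencode(params, safe="/:,*")


if __name__ == "__main__":
    print(distributed_query_url("localhost:8080/solr", "aapub", ["aapub", "bbpub"], "*:*"))
```

Distributed search also expects the cores to share a compatible schema with the same unique key field, so that is worth checking if the combined query still fails.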

Re: Is there any strss test tool for testing Solr?

2010-08-26 Thread Gora Mohanty
On Wed, 25 Aug 2010 19:58:36 -0700 Amit Nithian wrote: > i recommend JMeter. We use that to do load testing on a search > server. [...] JMeter is certainly good, but we have also found Apache bench to be of much use. Maybe it is just us, and what we are familiar with, but Apache bench seeme

Re: Matching exact words

2010-08-26 Thread Erick Erickson
See below: On Thu, Aug 26, 2010 at 10:24 AM, ahammad wrote: > > Hello Erick, > > Thanks for the reply. I am a little confused by this whole stemming thing. > What exactly does it refer to? > In your schema file, for the "text" field type, you'll see a line like: Which inserts a stemmer in

Re: Matching exact words

2010-08-26 Thread ahammad
Hello Erick, Thanks for the reply. I am a little confused by this whole stemming thing. What exactly does it refer to? Basically, I already have a field which is essentially a collection of many other fields (done using copyField). This field is a text field. So what you're saying is to have a d

Re: Matching exact words

2010-08-26 Thread Erick Erickson
You'll have to change your index I'm afraid. The problem is that all the index sees is the stemmed version (assuming you're stemming at index time). There's no information in the index about what the original version was, so it's impossible to back this out. One solution is to use copyField to mak

Re: How to do ? Articles and Its Associated Comments Indexing , One to Many relationship

2010-08-26 Thread Erick Erickson
See below... On Thu, Aug 26, 2010 at 4:31 AM, Sumit Arora wrote: > Thanks Ephraim for your response. > > If I use MultiValued for Comments Field then While Picking data from Solr, > Should I use following Logic : > > /* Sample PseudoCode */ > > Get Rows from Article and Article-Comments Table ;

Re: sort by field length

2010-08-26 Thread Shawn Heisey
On 5/24/2010 6:30 AM, Sascha Szott wrote: Hi folks, is it possible to sort by field length without having to (redundantly) save the length information in a separate index field? At first, I thought to accomplish this using a function query, but I couldn't find an appropriate one. I have

Multiple passes with WordDelimiterFilterFactory

2010-08-26 Thread Shawn Heisey
Can I pass my data through WordDelimiterFilterFactory more than once? It occurs to me that I might get better results if I can do some of the filters separately and use preserveOriginal on some of them but not others. Currently I am using the following definition on both indexing and queryin

Re: Slow facet sorting - lex vs count

2010-08-26 Thread Eric Grobler
Hi Yonik, Thanks for your help. I will check the memory. It might also be related to patch SOLR-792 tree faceting I installed. I will remove it and try the same query tomorrow again. Regards Eric On Wed, Aug 25, 2010 at 10:25 PM, Yonik Seeley wrote: > On Wed, Aug 25, 2010 at 7:22 AM, Eric Gr

Matching exact words

2010-08-26 Thread ahammad
Hello, I have a case where if I search for the word "windows", I get results containing both "windows" and "window" (and probably other things like "windowing" etc.). Is there a way to find exact matches only? The field in which I am searching is a text field, which as I understand causes this b

Re: TurkishLowerCaseFilterFactory

2010-08-26 Thread Robert Muir
On Thu, Aug 26, 2010 at 7:28 AM, Yavuz Selim YILMAZ wrote: > I downloaded latest jars except snowball 3-1.jar. I can't find it any > place? > -- > > Yavuz Selim YILMAZ > > Hello, in 3.1 the contrib/snowball is now integrated with contrib/analyzers, so you just need the analyzers jar! This way,

Re: solr working...

2010-08-26 Thread Geert-Jan Brits
Check out Drew Farris' explanation for remote debugging Solr with Eclipse posted a couple of days ago: http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-td1262050.html Geert-Jan 2010/8/26 Michael

RE: solr working...

2010-08-26 Thread Michael Griffiths
Take a look at the code? It _is_ open source. Open it up in Eclipse and debug it. -Original Message- From: satya swaroop [mailto:sswaro...@gmail.com] Sent: Thursday, August 26, 2010 8:24 AM To: solr-user@lucene.apache.org Subject: Re: solr working... Hi peter, I am already w

Re: solr working...

2010-08-26 Thread satya swaroop
Hi Peter, I am already working with Solr and it is working well. But I want to understand the code and know where the actual work happens, how indexing is done, how the requests are parsed, how it responds, and so on. To understand the code I asked how to start?

Re: Candidate Profile Search which have multiple employers and Educations.

2010-08-26 Thread Sumit Arora
Thanks Ephraim for your response. Actually I am not using DIH to sync the data from the DB; I wrote a DB sync myself, and I am directly retrieving rows from the MySQL DB and indexing them to Solr. On my Earlier cases - I Picked Rows with Column Label from DB, and Similar Column Defined in my Sync Program,

Re: solr working...

2010-08-26 Thread Peter Karich
Hi! What do you mean? You want a quickstart? Then see http://lucene.apache.org/solr/tutorial.html (But I thought you already got solr working (from previous threads)!?) Or do you want to know if solr is running? Then try the admin view: http://localhost:8080/solr/admin/ Regards, Peter. > Hi al

Re: How to do ? Articles and Its Associated Comments Indexing , One to Many relationship

2010-08-26 Thread Sumit Arora
Thanks Ephraim for your response. If I use MultiValued for Comments Field then While Picking data from Solr, Should I use following Logic : /* Sample PseudoCode */ Get Rows from Article and Article-Comments Table ; *// It will retrieve - 1 Article and 20 Comments* Begin; Include 'Article Fie

RE: Candidate Profile Search which have multiple employers and Educations.

2010-08-26 Thread Ephraim Ofir
As far as I can tell you should use multiValued for these fields: In order to get the data from the DB you should either create a sub entity with its own query or (the better performance option) use something like: SELECT cp.name, GROUP_CONCAT(ce.CandidateEducation SEPARATOR '|') A
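GROUP_CONCAT collapses the per-candidate rows into one '|'-separated string, so that string has to be split back into individual values before it goes into a multiValued field. In DIH this is normally configured on the field itself (for example with RegexTransformer's splitBy attribute); the sketch below only shows the equivalent step in plain Python, with a hypothetical helper name and made-up sample values.

```python
def split_multivalued(value, separator="|"):
    """Split a GROUP_CONCAT result into the list of values for a
    multiValued field. MySQL returns NULL (None here) when there are
    no rows to concatenate, which maps to an empty list."""
    if not value:
        return []
    return [part for part in value.split(separator) if part]


if __name__ == "__main__":
    # Sample values are illustrative only.
    print(split_multivalued("BSc Physics|MSc Computer Science"))
    print(split_multivalued(None))
```

The separator must be a character that cannot occur in the data itself, otherwise values will be split in the wrong places.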

Re: TurkishLowerCaseFilterFactory

2010-08-26 Thread Yavuz Selim YILMAZ
I downloaded latest jars except snowball 3-1.jar. I can't find it any place? -- Yavuz Selim YILMAZ 2010/8/26 Ahmet Arslan > > Is there a version of solr which has > > TurkishLowerCaseFilterFactory.java > > I downloaded 1.4.1 version of solr , but it hasn't it. > > According to wiki that filter

RE: How to do ? Articles and Its Associated Comments Indexing , One to Many relationship

2010-08-26 Thread Ephraim Ofir
Why not define the comment field as multiValued? That way you only index each document once and you don't need to collapse anything... Ephraim Ofir -Original Message- From: Sumit Arora [mailto:sumit1...@gmail.com] Sent: Thursday, August 26, 2010 12:54 PM To: solr-user@lucene.apache.org

Candidate Profile Search which have multiple employers and Educations.

2010-08-26 Thread Sumit Arora
I have to search candidate's profile , on which I have following Tables : Candidate Profile Record : CandidateProfile_Table CandidateEducation : CandidateEducation_Table // EducationIn Different Institutes or Colleges : Employers : Employers_Table //More than One Employers : If I denormaliz

solr working...

2010-08-26 Thread satya swaroop
Hi all, I am interested in seeing how Solr works. 1) Can anyone tell me where to start in order to understand how it works? Regards, satya

How to do ? Articles and Its Associated Comments Indexing , One to Many relationship

2010-08-26 Thread Sumit Arora
I have a set of Articles and then Comments on them, so in the database I have two major tables, one for Articles and one for Comments, but each Article could have many comments (One to Many). If One Article will have 20 Comments, then on DB to SOLR - Index - Sync : Solr will index 20 Similar Documents wit

FieldCache.DEFAULT.getInts vs FieldCache.DEFAULT.getStringIndex. Memory usage

2010-08-26 Thread Marc Sturlese
I need to load a FieldCache for a field which is a Solr "integer" type and has at most 3 digits. Let's say my index has 10M docs. I am wondering which is more optimal and less memory-consuming: loading FieldCache.DEFAULT.getInts or FieldCache.DEFAULT.getStringIndex. The second one will have a
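A back-of-the-envelope estimate helps here. getInts returns an int[] with one 4-byte entry per document, while getStringIndex holds an int[] of ordinals of the same size plus a String[] of the unique terms. The sketch below does that arithmetic; the 1,000 unique values (0 to 999) and the 48 bytes per String are assumptions, and array and object headers are ignored.

```python
def get_ints_bytes(max_doc):
    """Approximate size of the int[maxDoc] returned by getInts."""
    return 4 * max_doc


def get_string_index_bytes(max_doc, unique_terms, bytes_per_term):
    """Approximate size of a StringIndex: an int[maxDoc] of ordinals
    plus one String per unique term."""
    return 4 * max_doc + unique_terms * bytes_per_term


if __name__ == "__main__":
    max_doc = 10_000_000   # 10M docs, as in the question
    unique_terms = 1_000   # assumption: values 0..999
    bytes_per_term = 48    # assumption: rough JVM cost of a short String
    print(get_ints_bytes(max_doc))
    print(get_string_index_bytes(max_doc, unique_terms, bytes_per_term))
```

Under these assumptions both come to roughly 40 MB, with the string index slightly larger, so getInts is unlikely to be the worse choice for memory.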

Re: TurkishLowerCaseFilterFactory

2010-08-26 Thread Ahmet Arslan
> Is there a version of solr which has > TurkishLowerCaseFilterFactory.java > I downloaded 1.4.1 version of solr , but it hasn't it. According to wiki that filter will be available in solr 3.1 http://wiki.apache.org/solr/LanguageAnalysis#Turkish You can checkout branch 3.1 http://svn.apache.org

Re: JVM GC is very frequent.

2010-08-26 Thread Marc Sturlese
http://www.lucidimagination.com/blog/2009/09/19/java-garbage-collection-boot-camp-draft/ -- View this message in context: http://lucene.472066.n3.nabble.com/JVM-GC-is-very-frequent-tp1345760p1348065.html

RE: How to delete documents from SOLR index using DIH

2010-08-26 Thread Ephraim Ofir
You have several options here: 1. Use the deletedPkQuery in delta import - you'll need to make a DB query which generates the IDs to be deleted (something like: SELECT id FROM your_table WHERE deleted = 1). 2. Add the $deleteDocById special command to your full/delta import. 3. Use preImportDeleteQ

TurkishLowerCaseFilterFactory

2010-08-26 Thread Yavuz Selim YILMAZ
Is there a version of solr which has TurkishLowerCaseFilterFactory.java I downloaded 1.4.1 version of solr , but it hasn't it. -- Yavuz Selim YILMAZ

Re: How to delete documents from SOLR index using DIH

2010-08-26 Thread Grijesh.singh
DIH is basically not meant for deletion; it is for inserting data into the index. It does have a "clean" parameter, which is true by default and cleans the index every time the full-import command is issued, meaning it creates the index from scratch. If your requirement is to delete the whole index you can also
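The effect of the "clean" parameter is easiest to see in the request itself. The following sketch builds a full-import URL with clean switched on or off; the /dataimport handler path and the base URL are assumptions (the path depends on how the handler is registered in solrconfig.xml), and the function name is hypothetical.

```python
from urllib.parse import urlencode


def full_import_url(base, clean=True, commit=True):
    """Build a DIH full-import request. clean=True wipes the index before
    importing; clean=False keeps existing documents."""
    params = [
        ("command", "full-import"),
        ("clean", str(clean).lower()),
        ("commit", str(commit).lower()),
    ]
    return base + "/dataimport?" + urlencode(params)


if __name__ == "__main__":
    base = "http://localhost:8080/solr"  # assumed base URL
    print(full_import_url(base))               # rebuilds the index from scratch
    print(full_import_url(base, clean=False))  # adds to the existing index
```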

Re: Regd WSTX EOFException

2010-08-26 Thread Pooja Verlani
Hi, The client being used is PHP cURL. Could that be a problem? On Wed, Aug 25, 2010 at 7:10 PM, Yonik Seeley wrote: > On Wed, Aug 25, 2010 at 6:41 AM, Pooja Verlani > wrote: >> Hi, >> Sometimes while indexing to solr, I am getting the following exception. >> "com.ctc.wstx.exc.WstxEOFException: