Solr 4.9.0 Premgen space error

2014-08-21 Thread Jawwad
Hi, I am new to Solr. I am trying to index some file via *SolrServer* . I deployed the war from the latest Solr release on my tomcat 6 server. As startup I tried indexing some fields using SolrDocument which worked fine. Then as a next step I started indexing file content using

Query regarding URL Analysers

2014-08-21 Thread Sathyam
Hi, I needed to generate tokens out of a URL such that I am able to get hierarchical units of the URL as well as each individual entity as tokens. For example: *Given a URL : * http://www.google.com/abcd/efgh/ijkl/mnop.php?a=10b=20c=30#xyz The tokens that I need are : *Hierarchical subsets of

Re: Query regarding URL Analysers

2014-08-21 Thread Aurélien MAZOYER
Hi, Maybe I am wrong but I am not that you can find such a tokenizer in solr out-of-the-box. I can suggest to have a look to PatternTokenizer and PathTokenizer. Note that you can also implement your own tokenizer and add it to Solr as a plugin. Regards, Aurélien MAZOYER Le 21/08/2014

Re: Help with StopFilterFactory

2014-08-21 Thread heaven
Any ideas? Doesn't that seems like a bug? -- View this message in context: http://lucene.472066.n3.nabble.com/Help-with-StopFilterFactory-tp4153839p4154202.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Help with StopFilterFactory

2014-08-21 Thread Shawn Heisey
On 8/21/2014 7:25 AM, heaven wrote: Any ideas? Doesn't that seems like a bug? I think it should have worked even with autoGeneratePhraseQueries enabled by the older schema version. The relative positions are the same -- it's 1,2,3 in the index and 2,3,4 in the query. Absolute positions don't

Re: Help with StopFilterFactory

2014-08-21 Thread Jack Krupansky
For the sake of completeness, please post the parsed query that you get when you add the debug=true parameter. IOW, how Solr/Lucene actually interprets the query itself. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Thursday, August 21, 2014 10:03 AM To:

Re: Help with StopFilterFactory

2014-08-21 Thread heaven
With 1.5 schema it work but not as it is expected. I am indexing twitter.com/testuser and only need to get exact matches, not those that match twitter or com. so my search results should contain just one record: * http://twitter.com/testuser but what I see with 1.5 schema is: *

Help Highlight Snippets Score

2014-08-21 Thread Tomer Levi
Hi, I have a document with a textual field, I would like to sort the highlighted snippets by the number of term occurrences. For instance, when I have the following snippets: Solr Solr Solr Solr Solr Solr Solr Solr Solr I would like to get them ordered as: Solr Solr Solr Solr Solr Solr

Re: Substring and Case In sensitive Search

2014-08-21 Thread Jack Krupansky
Yes, wildcards can be slow. That's why I suggested that the use cases be reviewed more carefully. But... using the reversed wildcard filter doesn't accomplish any good for the substring case where there is a wildcard on both ends. A prefix wildcard query should actually deliver decent

Re: Help with StopFilterFactory

2014-08-21 Thread Shawn Heisey
On 8/21/2014 8:40 AM, heaven wrote: With 1.5 schema it work but not as it is expected. I am indexing twitter.com/testuser and only need to get exact matches, not those that match twitter or com. so my search results should contain just one record: * http://twitter.com/testuser but what I

Re: Help with StopFilterFactory

2014-08-21 Thread Shawn Heisey
On 8/21/2014 9:52 AM, Shawn Heisey wrote: On 8/21/2014 8:40 AM, heaven wrote: With 1.5 schema it work but not as it is expected. I am indexing twitter.com/testuser and only need to get exact matches, not those that match twitter or com. so my search results should contain just one record: *

Re: Dynamically loaded core.properties file

2014-08-21 Thread Ryan Josal
Thanks Erick, I tested that does work, and provide a solution to my problem! So property expansion does work in core.properties, I did not know that, and I got the impression from Chris' comment that that would open up a can of worms when it comes to persisting core.properties. I guess while

Spell field type suggestion for business names

2014-08-21 Thread Corey Gerhardt
Solr 4.8.1 I'm using the DirectSolrSpellChecker class with str name=fieldspell/str. int name=minPrefix1/int float name=thresholdTokenFrequency1/float The field type definition for the field spell is fieldType class=solr.TextField name=textSpell2 positionIncrementGap=100 analyzer

RE: Spell field type suggestion for business names

2014-08-21 Thread Corey Gerhardt
I should also mention that I havestr name=maxCollationTries500/str just to see if it's possible to get a collation value. -Original Message- From: Corey Gerhardt [mailto:corey.gerha...@directwest.com] Sent: August-21-14 11:54 AM To: Solr User List Subject: Spell field type

Strange Behavior

2014-08-21 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
Hi , I have a field type text_general where query type for worddelimiter I am using the below type: where wddftype.txt contains - DIGIT When I do a query I am not getting the right results. E.g. Name:Wi-Fi Gets results but Name:Wi-Fi Devices Make not getting any results but if I change it to

Re: Dynamically loaded core.properties file

2014-08-21 Thread Erick Erickson
Ryan: A slight misunderstanding here: The idea of allowing different names for the core.properties file was the can of worms. Or at least a new can of worms. Using the existing stuff is an existing can of worms ;)... Under any circumstances the only time this would come in to play is if you are

Re: Query regarding URL Analysers

2014-08-21 Thread Steve Rowe
UAX29URLEmailTokenizer recognizes URLs (among other things) - you could start with its JFlex grammar and modify it to do what you want. Steve www.lucidworks.com On Aug 21, 2014, at 8:35 AM, Sathyam sathyam.dorasw...@gmail.com wrote: Hi, I needed to generate tokens out of a URL such that I

Incorrect group.ngroups value

2014-08-21 Thread Bryan Bende
Is there any known issue with using group.ngroups in a distributed Solr using version 4.8.1 ? I recently upgraded a cluster from 4.6.1 to 4.8.1, and I'm noticing several queries where ngroups will be more than the actual groups returned in the response. For example, ngroups will say 5, but then

Minimum Match with filters that add tokens

2014-08-21 Thread Schmidt, Matthew
Is there a good way of handling a minimum match value greater than 1 with token filters that add tokens to the stream? Say you have field with the DoubleMetaphone filter for phonetic matching: filter class=solr.DoubleMetaphoneFilterFactory inject=true maxCodeLength=6/ This would add two

RE: Spell field type suggestion for business names

2014-08-21 Thread Corey Gerhardt
It seems that my problem was the spell field type configuration. I switched it to a keyword tokenizer and now things appear to be working so far. fieldType class=solr.TextField name=textSpell positionIncrementGap=100 analyzer tokenizer class=solr.KeywordTokenizerFactory/

Re: Grouping based on multiple filters/criterias

2014-08-21 Thread deniz
umeshprasad wrote Grouping supports group by queries. https://cwiki.apache.org/confluence/display/solr/Result+Grouping However you will need to form the group queries before hand. Thanks Regards Umesh Prasad Search Lead@ in.linkedin.com/pub/umesh-prasad/6/5bb/580/ have seen

Re: Near Realtime get

2014-08-21 Thread Steve Davids
Did you happen to set the field ('fl') request parameter? By default the RealTimeGetHandler returns all stored fields, here is some more information on the parameter: http://wiki.apache.org/solr/CommonQueryParameters#fl -Steve On Tue, Aug 19, 2014 at 11:18 AM, Philippe Soares