Re: QueryAutoStopWordAnalyzer

2014-10-23 Thread Bernd Fehling
On 23.10.2014 at 18:03, Alexandre Rafalovitch wrote: > How is this different from using StopFilterFactory in Solr: > http://www.solr-start.com/javadoc/solr-lucene/org/apache/lucene/analysis/core/StopFilterFactory.html > ? With StopFilterFactory you have to set up a file with stopwords and maint

Re: Synonyms Search using solr

2014-10-23 Thread Walter Underwood
Use the SynonymFilterFactory in the indexer part of your analyzer chain. https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ On Oct 23, 2014, at 11:04 PM, Danesh Kuruppu wrote: >
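A minimal sketch of what index-time synonym expansion does, assuming the usual synonyms.txt conventions (comma-separated equivalents, and "=>" for explicit mappings). This is a simplified illustration in Python, not Solr's SynonymFilterFactory; the sample rules and the function names are hypothetical, and multi-word synonyms are not handled.

```python
def parse_synonym_rules(lines):
    """Map each source term to the list of terms it expands to."""
    mapping = {}
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if "=>" in line:
            # Explicit mapping: terms on the left are replaced by terms on the right.
            left, right = line.split("=>", 1)
            sources = [t.strip().lower() for t in left.split(",") if t.strip()]
            targets = [t.strip().lower() for t in right.split(",") if t.strip()]
        else:
            # Equivalent terms: every term expands to the whole group.
            sources = targets = [t.strip().lower() for t in line.split(",") if t.strip()]
        for source in sources:
            expansions = mapping.setdefault(source, [])
            for target in targets:
                if target not in expansions:
                    expansions.append(target)
    return mapping


def expand(tokens, mapping):
    """Expand a token stream the way an index-time synonym filter would."""
    out = []
    for token in tokens:
        out.extend(mapping.get(token.lower(), [token.lower()]))
    return out


# Hypothetical sample rules, for illustration only.
RULES = ["# sample rules", "tv, television", "i-pod, ipod => ipod"]
```

With these rules, a document containing "TV" is indexed under both "tv" and "television", so a query for either term matches it.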

Synonyms Search using solr

2014-10-23 Thread Danesh Kuruppu
Hi all, I need to implement synonym search using Solr. What is the best way of doing this? Is there any documentation to follow? Thanks, Danesh

Re: recip function error

2014-10-23 Thread Yonik Seeley
On Thu, Oct 23, 2014 at 7:47 PM, Michael Sokolov wrote: > 3.16e-11.0 looks fishy to me Indeed... looks like it should be "3.16e-11" Standard scientific notation shouldn't have decimal points in the exponent. Not sure if that causes Java problems or not though... -Yonik http://heliosearch.org -

Re: recip function error

2014-10-23 Thread Michael Sokolov
3.16e-11.0 looks fishy to me On 10/23/14 5:09 PM, eShard wrote: Good evening, I'm using solr 4.0 Final. I tried using this function boost=recip(ms(NOW/HOUR,startdatez,3.16e-11.0,0.08,0.05)) but it fails with this error: org.apache.lucene.queryparser.classic.ParseException: Expected ')' at posi

Re: SolrCloud 4.7 not doing distributed search when querying from a load balancer.

2014-10-23 Thread S.L
Shawn, just wanted to follow up. I still face this issue of inconsistent search results on Solr Cloud 4.1.0.1. Looking further into the logs, I found a few exceptions; the obvious ones were zkConnection timeout issues, along with other exceptions. Please take a look. *Logs* /opt/tomcat1/logs

Re: recip function error

2014-10-23 Thread Chris Hostetter
: I tried using this function : boost=recip(ms(NOW/HOUR,startdatez,3.16e-11.0,0.08,0.05)) : but it fails with this error: : org.apache.lucene.queryparser.classic.ParseException: Expected ')' at : position 29 in 'recip(ms(NOW/HOUR,startdatez,3.16e-11.0,0.08,0.05))' look very carefully at your inpu
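A small sketch for reading the parser error, assuming the reported position is 0-based and that, as far as I know, Solr's ms() accepts at most two arguments while recip() takes four. The helper names are hypothetical; the code only inspects the string and does not call Solr.

```python
EXPR = "recip(ms(NOW/HOUR,startdatez,3.16e-11.0,0.08,0.05))"


def show_position(expr, pos):
    """Return the expression with a caret under a 0-based position."""
    return expr + "\n" + " " * pos + "^"


def split_top_level_args(call):
    """Split 'name(a,b(c,d),e)' into (name, [top-level argument strings])."""
    open_idx = call.index("(")
    name, body = call[:open_idx], call[open_idx + 1:-1]
    args, depth, current = [], 0, []
    for ch in body:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == "," and depth == 0:
            args.append("".join(current))
            current = []
        else:
            current.append(ch)
    args.append("".join(current))
    return name, args


print(show_position(EXPR, 29))
outer_name, outer_args = split_top_level_args(EXPR)
inner_name, inner_args = split_top_level_args(outer_args[0])
print(outer_name, len(outer_args), inner_name, len(inner_args))
```

Under those assumptions, position 29 is the start of the third argument inside ms(...): all five values were passed to ms, and recip received a single argument.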

Re: recip function error

2014-10-23 Thread eShard
Thanks, we're planning on going to 4.10.1 in a few months. I discovered that recip only works with dismax; I use edismax by default. Does anyone know why I can't use recip with edismax? I hope this is fixed in 4.10.1... Thanks, -- View this message in context: http://lucene.472066.n3.nabble.

Re: Analytics component

2014-10-23 Thread nabil Kouici
Thank you for this reply. Yes, but many analytics functions are not available, like percentile, median, standard deviation... Regards, Nabil On Thursday, 23 October 2014 at 16:34, Jorge Luis Betancourt González wrote: I believe some of the statistics functions that you're trying to use are pre

Re: recip function error

2014-10-23 Thread Shawn Heisey
On 10/23/2014 3:09 PM, eShard wrote: > Good evening, > I'm using solr 4.0 Final. > I tried using this function > boost=recip(ms(NOW/HOUR,startdatez,3.16e-11.0,0.08,0.05)) > but it fails with this error: > org.apache.lucene.queryparser.classic.ParseException: Expected ')' at > position 29 in 'recip(

RE: How to properly use Levenstein distance with ~ in Java

2014-10-23 Thread Will Martin
Given your expressed interest in recent work with edit distance (specifically Levenshtein), you might find this paper provocative. "We measure the keyword similarity between two strings by lemmatizing them, removing stopwords, and computing the cosine similarity. We then include the keyword sim
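A rough sketch of the keyword similarity the quoted passage describes, assuming a bag-of-words model. The lemmatization step is skipped here, and the stopword list is a tiny hypothetical one, so this illustrates only the cosine-similarity idea and is not the paper's method.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list.
STOPWORDS = {"a", "an", "the", "of", "in", "and", "to", "is"}


def keywords(text):
    """Lowercase, tokenize, and drop stopwords; return term counts."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(t for t in tokens if t not in STOPWORDS)


def cosine_similarity(a, b):
    """Cosine of the angle between the two keyword count vectors."""
    va, vb = keywords(a), keywords(b)
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # one side has no keywords left
    return dot / (norm_a * norm_b)
```

Unlike edit distance, this measure ignores word order and spelling variation; it scores strings by the keywords they share.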

recip function error

2014-10-23 Thread eShard
Good evening, I'm using solr 4.0 Final. I tried using this function boost=recip(ms(NOW/HOUR,startdatez,3.16e-11.0,0.08,0.05)) but it fails with this error: org.apache.lucene.queryparser.classic.ParseException: Expected ')' at position 29 in 'recip(ms(NOW/HOUR,startdatez,3.16e-11.0,0.08,0.05))' I a
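A sketch of what the boost is presumably meant to compute, assuming the intended expression was recip(ms(NOW/HOUR,startdatez),3.16e-11,0.08,0.05) and that recip(x,m,a,b) evaluates a/(m*x+b) as the Solr function query documentation describes. The Python function below is only a model of that formula, not Solr code.

```python
def recip(x, m, a, b):
    """Model of Solr's recip(x,m,a,b): a / (m*x + b)."""
    return a / (m * x + b)


# Milliseconds in a year; 3.16e-11 is roughly 1 / MS_PER_YEAR,
# so m*x is approximately the document age in years.
MS_PER_YEAR = 365.25 * 24 * 3600 * 1000

for years in (0, 1, 5):
    age_ms = years * MS_PER_YEAR
    print(years, recip(age_ms, 3.16e-11, 0.08, 0.05))
```

With these constants the boost is highest for brand-new documents (0.08/0.05) and falls off smoothly as documents age.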

Re: update external file

2014-10-23 Thread Michael Sokolov
That's what I thought; thanks, Markus. On 10/23/14 2:19 PM, Markus Jelsma wrote: You either need to upload them and issue the reload command, or download them from the machine, and then issue the reload command. There is no REST support for it (yet) like the synonym filter, or was it stop filt

RE: update external file

2014-10-23 Thread Ramzi Alqrainy
Right, there is no REST support for it like the synonym filter, or was it stop filter. -- View this message in context: http://lucene.472066.n3.nabble.com/update-external-file-tp4165563p4165577.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Difference between unloading of cores with LotsOfCores and unloading a core with CoreAdmin

2014-10-23 Thread Erick Erickson
bq: ..allocated by the core close for unloaded cores? And how about the processing time for unloaded cores to get it loaded first if we issue a query to it? Well, all resources are supposed to be returned to the system. Even 500 cores open at one time is a lot though. My theory is this has nothin

RE: update external file

2014-10-23 Thread Markus Jelsma
You either need to upload them and issue the reload command, or download them from the machine, and then issue the reload command. There is no REST support for it (yet) like the synonym filter, or was it stop filter? Markus -Original message- > From:Michael Sokolov > Sent: Thursday 2

Re: update external file

2014-10-23 Thread Ramzi Alqrainy
I hope I understand your question correctly. I had the same problem. This is what I did: 1. Create a file: solr_home/PROJECT/multicore/core1/data/external_popularProducts.txt The file should contain values like this: uniqueID_in_core=count Example: 873728721=19 842728342=20 2. Update schema.xml,
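A minimal sketch of generating the file described in step 1, assuming the key=value line format shown there. The function names are hypothetical, and sorting the keys is an assumption on my part (I believe the ExternalFileField documentation recommends sorted keys for faster loading). Getting the file onto the Solr machine and reloading it are outside this sketch.

```python
def format_external_file(values):
    """Render {uniqueID: count} as 'uniqueID=count' lines, sorted by key."""
    return "".join(f"{key}={values[key]}\n" for key in sorted(values))


def parse_external_file(text):
    """Inverse of format_external_file; values are read back as floats."""
    result = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # ignore blank lines
        key, _, value = line.partition("=")
        result[key.strip()] = float(value)
    return result


# Sample values taken from the message above.
content = format_external_file({"873728721": 19, "842728342": 20})
print(content, end="")
```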

Re: update external file

2014-10-23 Thread Michael Sokolov
Thanks for the links, Ramzi. I had already read the wiki page, which merely talks about how to reload the file into memory once it has been updated on disk. It doesn't mention any support for uploading that I can see. Did I miss it? -Mike On 10/23/14 1:36 PM, Ramzi Alqrainy wrote: Of cour

Re: update external file

2014-10-23 Thread Ramzi Alqrainy
Of course, there is support for uploading the external file via Solr; you can find more details at the links below: https://cwiki.apache.org/confluence/display/solr/Working+with+External+Files+and+Processes http://lucene.apache.org/solr/4_10_0/solr-core/org/apache/solr/schema/ExternalFileField.html

update external file

2014-10-23 Thread Michael Sokolov
I've been looking at ExternalFileField to handle popularity boosting, since Solr updatable docvalues (SOLR-5944) aren't quite there yet. My question is whether there is any support for uploading the external file via Solr, or if people do that some other (external, I guess) way? -Mike

Re: How to properly use Levenstein distance with ~ in Java

2014-10-23 Thread Alexandre Rafalovitch
The last real update on that is 2.5 years old. Is there a more recent update? I am interested in this topic as well. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://w

Re: QueryAutoStopWordAnalyzer

2014-10-23 Thread Alexandre Rafalovitch
How is this different from using StopFilterFactory in Solr: http://www.solr-start.com/javadoc/solr-lucene/org/apache/lucene/analysis/core/StopFilterFactory.html ? Lucene "wraps" analyzers, Solr has a chain instead (though analyzers are supported as well). You just configure the chain. Writing a f

Re: Difference between unloading of cores with LotsOfCores and unloading a core with CoreAdmin

2014-10-23 Thread Xiaolu Zhao
Hi Erick, Actually we are adding more cores. In this case, we set "transientCacheSize=500" and create 16,000 cores in total, each with 10k log entries. During the process, we can easily see JVM memory usage increase as the total number of cores grows. It runs out of memory when the total

Re: Analytics component

2014-10-23 Thread Jorge Luis Betancourt González
I believe some of the statistics functions that you're trying to use are present in facets. - Original Message - From: "nabil Kouici" To: solr-user@lucene.apache.org Sent: Thursday, October 23, 2014 5:57:27 AM Subject: Analytics component Hi All, I'm trying to use Solr to do some ana

QueryAutoStopWordAnalyzer

2014-10-23 Thread Bernd Fehling
I just located the QueryAutoStopWordAnalyzer in Lucene. Has anyone managed to use it with Solr? I could imagine having a language-independent search "clean up" for the text_all field. Can it be used with Solr right out of the box, or do I have to write a wrapper or factory? Regards Bernd
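A sketch of the general idea behind automatic, language-independent stop words, assuming they are derived from document frequency: terms that occur in more than some fraction of documents are dropped from queries. This is a Python illustration of the concept only, not the Lucene class; the function names and the threshold value are assumptions.

```python
from collections import Counter


def auto_stop_words(docs, max_fraction=0.5):
    """Return terms that appear in more than max_fraction of the documents."""
    if not docs:
        return set()
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc.lower().split()))  # count each term once per doc
    return {t for t, c in doc_freq.items() if c / len(docs) > max_fraction}


def clean_query(query, stop_words):
    """Drop the automatically detected stop words from a query."""
    return [t for t in query.lower().split() if t not in stop_words]


# Hypothetical sample documents.
DOCS = ["the quick fox", "the lazy dog", "the fox jumps", "a dog barks"]
STOPS = auto_stop_words(DOCS)
print(STOPS, clean_query("the fox", STOPS))
```

Because the list comes from the index itself, no per-language stopword file has to be maintained, which is the contrast with StopFilterFactory raised elsewhere in this thread.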

Re: StatelessScriptUpdateProcessorFactory Access to Solr Core/schema/analyzer etc

2014-10-23 Thread Shawn Heisey
On 10/23/2014 2:47 AM, Erik Hatcher wrote: > Ummm… see slides 10 and 11 here: > http://www.slideshare.net/erikhatcher/solr-indexing-and-analysis-tricks > > So yes, you can do analysis tricks in an update script. And it’s incredibly > useful and powerful! :) That's pretty amazing. I would not

Re: unstable results on refresh

2014-10-23 Thread Shawn Heisey
On 10/23/2014 2:44 AM, Giovanni Bricconi wrote: > My user interface shows some boxes to describe results categories. After > half a day of small updates and deletes I noticed with various queries that > the boxes started swapping while browsing. > I certainly relied too much on getting the same resul

Re: How to properly use Levenstein distance with ~ in Java

2014-10-23 Thread Walter Underwood
We’re reimplementing fuzzy support in edismax on Solr 4.x right now. See: https://issues.apache.org/jira/browse/SOLR-629 wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ On Oct 22, 2014, at 11:08 PM, karsten-s...@gmx.de wrote: > Hi Aleksander, > > The Fuzzy Search
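For reference, a minimal sketch of the Levenshtein edit distance that fuzzy matching builds on, using the standard dynamic-programming formulation. It is written in Python for illustration; it is not Lucene's automaton-based implementation and does not count a transposition as a single edit.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions, or
    substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]
```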

Re: Difference between unloading of cores with LotsOfCores and unloading a core with CoreAdmin

2014-10-23 Thread Erick Erickson
Memory should eventually be returned when a core is unloaded. There's a very small amount of overhead for keeping a list of all the cores and their locations, but this shouldn't increase with time unless you're adding more cores. Do note that the transient cache size is fixed, but may be exceeded.

Analytics component

2014-10-23 Thread nabil Kouici
Hi All, I'm trying to use Solr to do some analytic functions (percentile, median...). I checked out the trunk branch of Solr, which contains the analytics component implementation. I've rebuilt Solr, but unfortunately this component wasn't taken into consideration and no lib generated in /contrib/analytics

Re: StatelessScriptUpdateProcessorFactory Access to Solr Core/schema/analyzer etc

2014-10-23 Thread Erik Hatcher
On Oct 22, 2014, at 3:27 PM, Shawn Heisey wrote: > On 10/22/2014 11:50 AM, Tom LAMPERT wrote: >> I am attempting to create a script (java script) using the >> StatelessScriptUpdateProcessorFactory feature of solr but I am blocked on >> how to access the current core instance (ultimately to acc

Re: unstable results on refresh

2014-10-23 Thread Giovanni Bricconi
My user interface shows some boxes to describe results categories. After half a day of small updates and deletes, I noticed with various queries that the boxes started swapping while browsing. I certainly relied too much on getting the same results on each call; now I'm keeping the categories order in