Re: {soft}Commit and cache flushing

2013-10-01 Thread Bram Van Dam
> if there are no modifications to an index and a softCommit or hardCommit issued, then solr flushes the cache. Indeed. The easiest way to work around this is to disable auto commits and commit only when you have to.
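A minimal sketch of the "commit only when you have to" idea, written from the client side. It assumes auto commits are disabled in the Solr config and that the indexing client is the only writer. The class name, the update URL, and the choice to track changes with a counter are all hypothetical; the code only builds the request URL and does not contact Solr.

```python
from urllib.parse import urlencode


class CommitTracker:
    """Tracks pending index changes so a commit is requested only when needed."""

    def __init__(self, update_url):
        # update_url is a placeholder, e.g. "http://localhost:8983/solr/collection1/update"
        self.update_url = update_url
        self.pending = 0

    def record_update(self, count=1):
        """Call this after each add or delete sent to Solr."""
        self.pending += count

    def commit_url(self, soft=False):
        """Return the commit URL, or None when nothing has changed since the last commit."""
        if self.pending == 0:
            return None  # skipping the commit avoids opening a new searcher for no reason
        params = {"softCommit": "true"} if soft else {"commit": "true"}
        self.pending = 0
        return self.update_url + "?" + urlencode(params)
```

With this in place, a periodic job can call commit_url() as often as it likes; it gets a URL back only when there is something to commit.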

Re: Problems with maxShardsPerNode in 4.5

2013-10-01 Thread Shalin Shekhar Mangar
Thanks for reporting this, Brett. This is indeed a bug. A workaround is to specify replicationFactor=1 with the createShard command, which will create only one replica even if maxShardsPerNode=1000 at collection level. I'll open an issue. On Wed, Oct 2, 2013 at 7:25 AM, Brett Hoerner wrote: > Rel
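The workaround above could be sketched as a Collections API request along these lines. The host, collection name, and shard name are hypothetical, and the fact that CREATESHARD accepts replicationFactor is taken from the message above rather than verified independently. The helper only builds the URL.

```python
from urllib.parse import urlencode


def create_shard_url(base_url, collection, shard, replication_factor=1):
    """Build a CREATESHARD request URL that pins the replica count explicitly."""
    params = [
        ("action", "CREATESHARD"),
        ("collection", collection),
        ("shard", shard),
        # Passing this explicitly is the workaround described in the message above.
        ("replicationFactor", replication_factor),
    ]
    return base_url + "/admin/collections?" + urlencode(params)


# Hypothetical host, collection, and shard names.
url = create_shard_url("http://localhost:8983/solr", "events", "shard5")
```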

Re: Problems with maxShardsPerNode in 4.5

2013-10-01 Thread Brett Hoerner
Related, 1 more try: Created collection starting with 4 shards on 1 box. Had to set maxShardsPerNode to 4 to do this. Now I want to "roll over" my time window, so to attempt to deal with the problems noted above I delete the oldest shard first. That works fine. Now I try to add my new shard, whi

Problems with maxShardsPerNode in 4.5

2013-10-01 Thread Brett Hoerner
It seems that changes in 4.5 collection configuration now require users to set a maxShardsPerNode (or it defaults to 1). Maybe this was the case before, but with the new CREATESHARD API it seems very restrictive. I've just created a very simple test collection on 3 machines where I set maxShards

Re: Newbie to Solr

2013-10-01 Thread Alexandre Rafalovitch
Mamta, You are trying to do multiple things at once. Slow down before you drown. Use the default Solr distribution. That runs an embedded server. Do not switch to Tomcat. Do it on your personal machine if you need to (it's just unzip and run). Then, go through the Solr tutorial. That will answer some o

Re: Profiling Solr Lucene for query

2013-10-01 Thread Shawn Heisey
On 10/1/2013 4:04 PM, Isaac Hebsh wrote: Hi Shawn, I know that every node operates as a frontend. This is the way our cluster currently runs. If I separate the frontend from the nodes which hold the shards, I can give it different amounts of CPUs and RAM. (e.g. large amount of RAM to JVM, because t

Re: Profiling Solr Lucene for query

2013-10-01 Thread Isaac Hebsh
Hi Shawn, I know that every node operates as a frontend. This is the way our cluster currently runs. If I separate the frontend from the nodes which hold the shards, I can give it different amounts of CPUs and RAM. (e.g. large amount of RAM to JVM, because this server won't need the OS cache for read

Accent insensitive multi-words suggester

2013-10-01 Thread Dominique Bejean
Hi, Up to now, the best solution I found in order to implement a multi-words suggester was to use "ShingleFilterFactory" filter at index time and the termsComponent. At index time the analyzer was : articles="lang/contractions_fr.txt"/> words="stopword

Re: Profiling Solr Lucene for query

2013-10-01 Thread Shawn Heisey
On 10/1/2013 2:35 PM, Isaac Hebsh wrote: Hi Dmitry, I'm trying to examine your suggestion to create a frontend node. It sounds pretty useful. I saw that every node in a Solr cluster can serve requests for any collection, even if it does not hold a core of that collection. Because of that, I though

Re: Profiling Solr Lucene for query

2013-10-01 Thread Isaac Hebsh
Hi Dmitry, I'm trying to examine your suggestion to create a frontend node. It sounds pretty useful. I saw that every node in a Solr cluster can serve requests for any collection, even if it does not hold a core of that collection. Because of that, I thought that adding a new node to the cluster (ak

Advice for using Solr 4.5 custom sharding to handle rolling time-oriented event data

2013-10-01 Thread Brett Hoerner
I'm interested in using the new custom sharding features in the collections API to search a rolling window of event data. I'd appreciate a spot/sanity check of my plan/understanding. Say I only care about the last 7 days of events and I have thousands per second (billions per week). Am I correct
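A rough sketch of the bookkeeping behind a 7-day rolling window. The rest of the plan is cut off in this archive, so one shard per day and the "events_YYYYMMDD" naming scheme are assumptions made for illustration, not something stated in the message. The code only computes shard names; issuing the delete and create requests is left out.

```python
from datetime import date, timedelta


def shard_name(day):
    """Hypothetical naming scheme: one shard per calendar day."""
    return "events_" + day.strftime("%Y%m%d")


def window_shards(today, days=7):
    """Names of the shards covering the last `days` days, oldest first."""
    return [shard_name(today - timedelta(days=i)) for i in range(days - 1, -1, -1)]


def roll_over(today, days=7):
    """Return (shard_to_delete, shard_to_create) when the window advances to `today`."""
    expired = today - timedelta(days=days)  # the day that just fell out of the window
    return shard_name(expired), shard_name(today)
```

Deleting the expired shard before creating the new one keeps the shard count per node from rising above the window size during the rollover.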

Re: Sorting dependent on user preferences with FunctionQuery

2013-10-01 Thread Chris Hostetter
: select?q=*%3A*&sort=query(qf=category v='Book')desc : : but Solr returns "Can't determine a Sort Order (asc or desc) in sort". the root cause of that error is that you don't have any whitespace between your query function and "desc" as for your broader goal: doing a straight sort on the users
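The whitespace fix described above can be sketched as a small helper that always puts a single space between the sort expression and its direction before the parameters are URL-encoded. The function string is copied as it appears in the quoted request; the helper name is hypothetical.

```python
from urllib.parse import urlencode


def sort_clause(expression, direction="desc"):
    """Join a sort expression and its direction with the whitespace Solr expects."""
    if direction not in ("asc", "desc"):
        raise ValueError("direction must be 'asc' or 'desc'")
    return expression.strip() + " " + direction


# The space is encoded as '+' in the query string, so it survives the request.
params = urlencode({"q": "*:*", "sort": sort_clause("query(qf=category v='Book')")})
```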

Re: Doing time sensitive search in solr

2013-10-01 Thread Darniz
Thanks Erick. When I used Solr in 2010, I thought that by now it might have evolved to allow querying with a wildcard in the field name, but it looks like I have to provide a concrete dynamic field name to query. Anyway, I will look into catch-all fields. Do you have any examples on how a catch all fiel

Re: {soft}Commit and cache flushing

2013-10-01 Thread Dmitry Kan
Thanks a lot Shawn for an exhaustive reply! Regards, Dmitry On Tue, Oct 1, 2013 at 5:37 PM, Shawn Heisey wrote: > On 10/1/2013 2:48 AM, Dmitry Kan wrote: > > This is a minor thing, perhaps, but thought to ask / share: > > > > if there are no modifications to an index and a softCommit or hardCo

Re: Auto Suggest - Time decay

2013-10-01 Thread Ing. Jorge Luis Betancourt Gonzalez
Sorry, I forgot the link: [1] - http://wiki.apache.org/solr/SolrRelevancyFAQ - Original message - From: "Ing. Jorge Luis Betancourt Gonzalez" To: solr-user@lucene.apache.org Sent: Tuesday, October 1, 2013 13:34:03 Subject: Re: Auto Suggest - Time decay For that core just use a boost

Re: Auto Suggest - Time decay

2013-10-01 Thread Ing. Jorge Luis Betancourt Gonzalez
For that core just use a boost factor as explained on [1]: You could use a query like this to see (before making any change) how your suggestions will be retrieved. In this case a query for "goog" has been made, and recent documents will be boosted (an extra bonus will be given for the newer docu
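The actual query is cut off in this archive. As an illustration only, the date boost usually quoted from the FAQ referenced as [1] has the form recip(ms(NOW,mydatefield),3.16e-11,1,1), where recip(x,m,a,b) computes a/(m*x+b). Assuming that form (it may not be exactly what the poster used), the shape of the decay can be checked with a few lines of Python:

```python
def recip(x, m, a, b):
    """Same formula as Solr's recip function: a / (m*x + b)."""
    return a / (m * x + b)


MS_PER_YEAR = 365.25 * 24 * 3600 * 1000  # milliseconds in an average year


def recency_boost(age_ms):
    """Boost multiplier for a document that is age_ms milliseconds old."""
    # 3.16e-11 is roughly 1 / MS_PER_YEAR, so the boost halves after about a year.
    return recip(age_ms, 3.16e-11, 1, 1)
```

A brand-new document gets a multiplier of 1.0, a one-year-old document about 0.5, and older documents decay smoothly toward zero without ever going negative.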

Re: how to manually update a field in the index without re-crawling?

2013-10-01 Thread Shawn Heisey
On 10/1/2013 9:03 AM, eShard wrote: I'm currently using Solr 4.0 FINAL. I indexed a website and it took over 24 hours to crawl. I just realized I need to rename one of the fields (or add a new one). So I added the new field to the schema, but how do I copy the data over from the old field to the

Autosuggest - Custom sorting

2013-10-01 Thread SolrLover
Is there a way to sort the returned Autosuggest list based on a particular value (ex: score)? I am trying to sort the returned suggestions based on a field that has been calculated manually but not sure how to use that field for sorting suggestions. -- View this message in context: http://luc

Re: Auto Suggest - Time decay

2013-10-01 Thread SolrLover
I am using a totally separate core for storing the auto suggest keywords. Would you be able to send me some more details on your implementation? -- View this message in context: http://lucene.472066.n3.nabble.com/Auto-Suggest-Time-decay-tp4092965p4092969.html Sent from the Solr - User mailing

Re: XPathEntityProcessor nested in TikaEntityProcessor query null exception

2013-10-01 Thread Andreas Owen
I'm already using URLDataSource. On 30. Sep 2013, at 5:41 PM, P Williams wrote: > Hi Andreas, > > When using > XPathEntityProcessor your > DataSource > must be of type DataSource. You shouldn't be using > BinURLDataSource, it's

Re: Percolate feature?

2013-10-01 Thread Charlie Hull
On 01/10/2013 04:12, Otis Gospodnetic wrote: Just came across this "ancient" thread. Charlie, did this end up happening? I suspect Wolfgang may be interested, but that's just a wild guess. Hi Otis & all, Yes we're actually planning to talk about it at Lucene Revolution in November and open

Re: Auto Suggest - Time decay

2013-10-01 Thread Ing. Jorge Luis Betancourt Gonzalez
Are you using the suggester component or a separate core? I've used a separate core to store suggestions and order these suggestions (queries performed on the frontend) using a time decay function, and it works great for me. Regards, - Original message - From: "SolrLover" To: solr-u

Auto Suggest - Time decay

2013-10-01 Thread SolrLover
I am trying to implement an auto suggest based on a time decay function. I have a separate index just to store auto suggest keywords. I would be calculating the frequency over time rather than ranking on frequency alone. I am thinking of using a database to perform the calculat

Re: Solr 4.0 is stripping XML format from RSS content field

2013-10-01 Thread eShard
If anyone is interested, I managed to resolve this a long time ago. I used a Data Import Handler instead and it worked beautifully. DIH is very forgiving: it takes whatever XML data is there and injects it into the Solr index. It's a lot faster than crawling too. You use XPath to map the field

Re: Doing time sensitive search in solr

2013-10-01 Thread Erick Erickson
Try it and see :). Dynamic fields are just like regular fields once you index a document that uses one. After that, they should behave just like regular. If you're asking if you can create a query like *_txt:text meaning search all the fields that end with _txt for the word "text", I don't think

how to manually update a field in the index without re-crawling?

2013-10-01 Thread eShard
Good morning, I'm currently using Solr 4.0 FINAL. I indexed a website and it took over 24 hours to crawl. I just realized I need to rename one of the fields (or add a new one). So I added the new field to the schema, but how do I copy the data over from the old field to the new field without recra

Re: {soft}Commit and cache flushing

2013-10-01 Thread Shawn Heisey
On 10/1/2013 2:48 AM, Dmitry Kan wrote: > This is a minor thing, perhaps, but thought to ask / share: > > if there are no modifications to an index and a softCommit or hardCommit > issued, then solr flushes the cache. Any time you do a commit that opens a new Searcher object (openSearcher=true, w

solr cpu usage

2013-10-01 Thread adfel70
Hi, We're building a spec for a machine to purchase. We're going to buy 10 machines. We aren't sure yet how many processes we will run per machine. The question is: should we buy a faster CPU with fewer cores or a slower CPU with more cores? In any case we will have 2 CPUs in each machine. Should we bu

SolrCloud. Scale-test by duplicating same index to the shards and make it behave each index is different (uniqueId).

2013-10-01 Thread Thomas Egense
Hello everyone, I have a small challenge performance testing a SolrCloud setup. I have 10 shards, and each shard is supposed to have index-size ~200GB. However I only have a single index of 200GB because it will take too long to build another index with different data, and I hope to somehow use th

Re: Newbie to Solr

2013-10-01 Thread Mamta Alshi
I can have only one schema.xml file, right? Can I overwrite the one which originally comes with the Solr set-up? The original schema.xml is at C:\solr\solr\solr\conf along with post.sh et al. Where should my other document be? I need to run post.jar on my doc file (xml) to index it, right? I coul

Re: Sorting dependent on user preferences with FunctionQuery

2013-10-01 Thread Snubbel
Hello, thanks for your answers. I checked your suggestions, but I'm not quite there yet. With field collapsing, I only get the top result per category, which is not what I want, I want to have all results! And boosting is quite an interesting idea. With the following I get what I need, all resul

Re: Newbie to Solr

2013-10-01 Thread Kishan Parmar
You only have to create the schema file (don't change anything in the Solr config file) and the XML file which you want to index in Solr. If you are new to Solr, there is a core named collection1; you have to add the schema file to that collection's conf folder, C:\solr\example\solr\collection1\conf yo

Re: in Problem

2013-10-01 Thread Dmitry Kan
can you run both examples you provided through the query analysis of solr admin and see if there is any difference with term positions? On Tue, Oct 1, 2013 at 1:36 PM, PAVAN wrote: > Hi Dmitry, > > I already defined in the following way > > words="stopwords.txt" enablePositionIncrements="true"

RE: Newbie to Solr

2013-10-01 Thread Mamta S Kanade
Can you tell me what all docs I need to create...there needs to be a schema.xml and what else? A document having my data? Also, where these should be placed. There's already a schema.xml Thanks for the prompt response. Mamta. -Original Message- From: Kishan Parmar [mailto:kishan...

Re: Newbie to Solr

2013-10-01 Thread Kishan Parmar
Yes, you have to create your own schema, but in the schema file you have to add the field names from your XML file. Likewise, you can add your fields to the default schema file. Without a schema you cannot add your XML file to Solr. My schema is like this -

Re: Not able to run sample solr examples

2013-10-01 Thread Mamta Alshi
Hi, My problem is that I am not able to run the sample examples given in Solr. I cannot run them through the Solr admin console; it doesn't give me the result. I have already indexed the documents. Appreciate your help! Thanks, Mamta On Tue, Oct 1, 2013 at 3:08 PM, Kishan Parmar wrote: > > h

Re: Not able to run sample solr examples

2013-10-01 Thread Kishan Parmar
http://www.coretechnologies.com/products/AlwaysUp/Apps/RunApacheSolrAsAService.html Regards, Kishan Parmar Software Developer +91 95 100 77394 Jay Shree Krishnaa !! On Tue, Oct 1, 2013 at 12:48 AM, mamta wrote: > Hi, > > I am running Solr on Tomcat server and am able to go to the solr link f

Re: Solr doesn't return TermVectors

2013-10-01 Thread Alessandro Benedetti
Nope, it's not the "last component" problem, but it's definitely the request handler problem, it was the same for me ... Switching to the /tvrh requesthandler solved my problem. We should update the wiki! 2013/9/27 Shawn Heisey > On 9/27/2013 4:02 PM, Jack Krupansky wrote: > >> You are using

Re: in Problem

2013-10-01 Thread PAVAN
Hi Dmitry, I already defined in the following way -- View this message in context: http://lucene.472066.n3.nabble.com/in-Problem-tp4092866p4092899.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Problem regarding queries enclosed in double quotes in Solr 3.4

2013-10-01 Thread Upayavira
Which query parser are you using? It seems you are mixing them up. As far as I know, edismax doesn't support quoted phrases, it uses pf param to invoke phrase queries. Likewise, the lucene query parser doesn't support a phrase slop param, it uses a "phrase slop"~2 syntax. Upayavira On Tue, Oct 1

{soft}Commit and cache flushing

2013-10-01 Thread Dmitry Kan
Hello! This is a minor thing, perhaps, but thought to ask / share: if there are no modifications to an index and a softCommit or hardCommit issued, then solr flushes the cache. Is this designed on purpose? Regards, Dmitry

Re: OpenJDK or OracleJDK

2013-10-01 Thread Raheel Hasan
This sounds interesting... Thanks guyz for the replies.. :) On Tue, Oct 1, 2013 at 8:07 AM, Otis Gospodnetic wrote: > Hi, > > A while back I remember we noticed some SPM users were having issues > with OpenJDK. Since then we've been recommending Oracle's > implementation to our Solr and to SPM

Newbie to Solr

2013-10-01 Thread mamta
Hi, I want to know: if I have to fire some query through the Solr admin, do I need to create a new schema.xml? Where do I place it in case I have to create a new one? In case I can edit the original schema.xml, can there be two fields named id in my schema.xml? I desperately need help in running

Re: Problem regarding queries enclosed in double quotes in Solr 3.4

2013-10-01 Thread Dmitry Kan
Perhaps you can make a query parser to fix this? It would parse the incoming query and substitute "some_terms" with "some_terms" ~0 On Tue, Oct 1, 2013 at 7:43 AM, Kunal Mittal wrote: > We have a Solr 3.4 setup. When we try to do queries with double quotes > like : > "semantic web" , the query t
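Short of writing a full query parser, the substitution suggested above could be prototyped on the client side with a regular expression. This is a naive sketch: it assumes phrases contain no escaped quotes, it writes the slop in the "phrase"~0 form without a space, and whether adding ~0 actually resolves the Solr 3.4 problem is not confirmed in this thread. The function name is hypothetical.

```python
import re

# A quoted phrase, optionally followed by an explicit slop such as ~2.
_PHRASE = re.compile(r'"([^"]+)"(~\d+)?')


def add_zero_slop(query):
    """Append ~0 to every quoted phrase that has no explicit slop."""

    def fix(match):
        # Leave phrases that already carry a slop value untouched.
        return match.group(0) if match.group(2) else match.group(0) + "~0"

    return _PHRASE.sub(fix, query)
```

For example, add_zero_slop on the query "semantic web" (with its quotes) yields "semantic web"~0, while a phrase that already ends in ~2 passes through unchanged.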

Re: in Problem

2013-10-01 Thread Dmitry Kan
Hi, See here, hope it helps. http://stackoverflow.com/questions/2681393/solr-is-there-a-way-to-include-stopwords-when-searching-exact-phrases On Tue, Oct 1, 2013 at 9:34 AM, PAVAN wrote: > Hi, > > When i type any query string without "in" it is giving proper results. But > when i try same que

Re: Issue in parallel Indexing using multiple csv files

2013-10-01 Thread zaheer.java
Ran more tests. It works. -- View this message in context: http://lucene.472066.n3.nabble.com/Issue-in-parallel-Indexing-using-multiple-csv-files-tp4092452p4092873.html Sent from the Solr - User mailing list archive at Nabble.com.

Not able to run sample solr examples

2013-10-01 Thread mamta
Hi, I am running Solr on Tomcat server and am able to go to the Solr link from my Tomcat manager. I want to try running queries through the Solr admin page on the Solr examples which come built-in when I install Solr. How can I run queries on those examples? Thanks, Mamta -- View this messa