Re: How to get values of external file field(s) in Solr query?

2013-10-04 Thread Mikhail Khludnev
On Fri, Oct 4, 2013 at 12:24 AM, Peter Keegan peterlkee...@gmail.comwrote: But how to get the values for docs that match some arbitrary query? You can specify eff in fl param. it's implied by http://docs.lucidworks.com/display/solr/Function+Queries -- Sincerely yours Mikhail Khludnev

Re: Got error: Filter execution threw an exception

2013-10-04 Thread Shawn Heisey
On 10/3/2013 8:03 PM, Brian Robinson wrote: Hi Shawn, Thanks for responding. Just saw this email. I did go to the logging tab in the admin page, but it never finished loading. I'm using collection1, and I'm using the traditional config. Here is my complete code, with $options omitted for

Re: Regex Search at Solr

2013-10-04 Thread Furkan KAMACI
I will go with that. I've just wondered about could I do it with existence test data or not. 2013/10/4 Otis Gospodnetic otis.gospodne...@gmail.com Maybe storing the protocol info in a separate field would be cleaner. Otis Solr ElasticSearch Support http://sematext.com/ On Oct 3, 2013

Re: Indexing file system contents

2013-10-04 Thread Shawn Heisey
On 10/3/2013 11:29 PM, Sadler, Anthony wrote: Time: - On some servers we're dealing with something in the region of a million or more files. Indexing that many times takes upwards of 48 hours or more. While the script is now fairly stable and fault tolerant, that is still a pretty

Re: Indexing file system contents

2013-10-04 Thread Alexandre Rafalovitch
No direct help but a bunch of related random thoughts: 1) How are you running Tika? As a jar loading from scratch every time? Tika can also run in a server mode where it listens to a network socket. You send the file, it sends the extract back. Might be faster. 2) Deleting old stuff. You can

Re: Unable to deplay solr 4.3.0 on jboss EAP 6.1 in mode full JavaEE 6

2013-10-04 Thread Roland Everaert
Thanks for the tips. When I got time, I will have a look into it and I will try to use solr via the embedded jetty. Regards, Roland. On Thu, Oct 3, 2013 at 3:26 PM, Shawn Heisey s...@elyograg.org wrote: On 8/14/2013 5:16 AM, Roland Everaert wrote: For the past months I have deplaoyed and

Re: WikipediaTokenizer documentation

2013-10-04 Thread Furkan KAMACI
I suggest you to look at here: http://www.javadocexamples.com/java_source/org/apache/lucene/wikipedia/analysis/WikipediaTokenizerTest.java.html 2013/10/4 Ken Krugler kkrugler_li...@transpac.com Hi all, Where's the documentation on the WikipediaTokenizer? Specifically I'm wondering how

Re: FileNotFoundException

2013-10-04 Thread Furkan KAMACI
Did you check here at logs: *Caused by: java.io.FileNotFoundException: /opt/solr/myCore/data/index/_2he9.si (No such file or directory)* 2013/10/4 tamanjit.bin...@yahoo.co.in tamanjit.bin...@yahoo.co.in Hi, We migrated to Solr 4.3 from 3.5 yesterday. We use multicore Master Slave

Re: Solr client in CPP

2013-10-04 Thread Furkan KAMACI
There is an old question like that: http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201101.mbox/%3CAANLkTi=itRz7ni6HV-m=GTThzLb9G8XkWi92jBn=p...@mail.gmail.com%3E Also you check that page for general information: http://websolr.com/guides/solr-clients 2013/10/4 Neeraj Pandey

Zookeeper : What goes on the new node ?

2013-10-04 Thread maephisto
Hi! Imagin a collection collection1 with 3 shards, replicationFactor=2 and maxShardsPerNode=2 hosted on three machines. Then add a new collection, collection2 configured the same. Cool, so now we have three machines each with 4 cores, a shard leader and a replica for each of the two collection.

Size of ID field

2013-10-04 Thread Zahoor Mohamed
Hi Does the size of ID field matter .. in terms of memory usage...and query performance... i.e. will Solr use more memory if you use a URL string as ID field instead of a int value? ./zahoor

Re: ALIAS feature, can be used for what?

2013-10-04 Thread Jan Høydahl
Hi, I have been asked the same question. There are only DELETEALIAS and CREATEALIAS actions available, so is there a way to achieve uninterrupted switch of an alias from one index to another? Are we lacking a MOVEALIAS command? -- Jan Høydahl, search solution architect Cominvent AS -

Re: Size of ID field

2013-10-04 Thread Dmitry Kan
Using arbitrary strings affects at least on the traffic between the shard(s) and a querying client or shards and a frontend solr instance. We have actually hit such an issue, described here: https://issues.apache.org/jira/browse/SOLR-4903, which has triggered the suggestion for ids compaction:

Updating a indexed vs a non-indexed field

2013-10-04 Thread maephisto
In the last Solr versions Atomic Updates were introduced http://wiki.apache.org/solr/Atomic_Updates I'm wondering, updating a field that is stored=true indexed=true would be different as updating a field that is stored=true indexed=false Would Solr try to reindex the doc only if the field is

Re: FileNotFoundException

2013-10-04 Thread tamanjit.bin...@yahoo.co.in
I did see that. The file its looking for doesn't exist post slaves have been updated. I suspect there was segment info file of that name before the syncing happened and once the file was removed the searcher still looks for the file. Strange behavior, don't know why should this happen. -- View

Can I pass some Object as request parameter to solr server

2013-10-04 Thread Alok Bhandari
Hello , I am using solr 4.0 , I want to sent a list of objects to solr as a request parameter. Is it possible ? Please let me know. -- View this message in context: http://lucene.472066.n3.nabble.com/Can-I-pass-some-Object-as-request-parameter-to-solr-server-tp4093463.html Sent from the Solr

Re: Updating a indexed vs a non-indexed field

2013-10-04 Thread Upayavira
On Fri, Oct 4, 2013, at 11:19 AM, maephisto wrote: In the last Solr versions Atomic Updates were introduced http://wiki.apache.org/solr/Atomic_Updates I'm wondering, updating a field that is stored=true indexed=true would be different as updating a field that is stored=true

Re: ALIAS feature, can be used for what?

2013-10-04 Thread Upayavira
I've used this feature to great effect. I have logs coming in, and I create a core for each day. At the end of each day, I create a new core for tomorrow, unload any cores over 2 months old, then create a set of aliases (all, month, week, today) pointing to just the cores that are needed for that

Re: Can I pass some Object as request parameter to solr server

2013-10-04 Thread Jack Krupansky
The two techniques for sending lists of values/objects to Solr are comma or spaces delimited single parameter values, or multiple instances of the parameter name, one per value/object. Solr has no object-encoding per se, and has no parameters that are generalized objects, but you can use any

Activate Faceting based on DocValues

2013-10-04 Thread Alessandro Benedetti
Hi guys, I was thinking how to activate the DocValues approach for using faceting. Tell me if I am correct : 1) enable in schema.xml for the field of interest, the DocValues attribute set to true. 2) use one of these 2 faceting strategies : fc ( Field Cache) or fcs (Per segment Field Cache) 3)

Re: Can I pass some Object as request parameter to solr server

2013-10-04 Thread Furkan KAMACI
I've implemented a SearchRequest class at my application. It has some custom fields and filled via web services automatically from a JSON object (via jackson). Inside a core class I retrieve proper attributes from that class and send a query to Solr server. If you have an API class to reach your

Re: Size of ID field

2013-10-04 Thread Jack Krupansky
It all depends. I mean, if you have 20 million URLs averaging 40 characters each, that's 80 MB, not a big deal at all, but if you have 20 billion URLs that would take up 80 GB, which might be a big deal. But if you shard those 20 billion documents into 10 shards, 8 GB may or may not be a big

Re: Query among multiple cores

2013-10-04 Thread Snehalata Nalavade
Hi I want to display result as one Dataset thorough solr using Multicore.In one core Containg EnglishCollectionData and onther containg HindiCollectionData. When I am join two core result is displayed when I am giving English Parameter But does not work For Hindi Parameter.Could me give the

Re: Updating a indexed vs a non-indexed field

2013-10-04 Thread maephisto
Thank you! My question is actually what's the difference in updating an indexed field vs. updating a non-indexed field? Will updating an indexed field trigger a refresh in the solr indexes while in the other case wouldn't? -- View this message in context:

Removing BinaryUpdateRequestHandler which is deprecated

2013-10-04 Thread Erlend Garåsen
I'm trying to get rid of the following warnings: [2013-10-03 17:37:56.981] WARNING Using deprecated class: XmlUpdateRequestHandler -- replace with UpdateRequestHandler [2013-10-03 17:37:56.983] WARNING Using deprecated class: BinaryUpdateRequestHandler -- replace with UpdateRequestHandler

Re: Updating a indexed vs a non-indexed field

2013-10-04 Thread Shawn Heisey
On 10/4/2013 6:56 AM, maephisto wrote: My question is actually what's the difference in updating an indexed field vs. updating a non-indexed field? Will updating an indexed field trigger a refresh in the solr indexes while in the other case wouldn't? A change is a change. It doesn't matter

Re: Removing BinaryUpdateRequestHandler which is deprecated

2013-10-04 Thread Shawn Heisey
On 10/4/2013 7:16 AM, Erlend Garåsen wrote: [2013-10-03 17:37:56.981] WARNING Using deprecated class: XmlUpdateRequestHandler -- replace with UpdateRequestHandler [2013-10-03 17:37:56.983] WARNING Using deprecated class: BinaryUpdateRequestHandler -- replace with UpdateRequestHandler How do

Re: How to set a condition over stats result

2013-10-04 Thread Steven Bower
Check out: https://issues.apache.org/jira/browse/SOLR-5302 can do this using query facets On Fri, Jul 12, 2013 at 11:35 AM, Jack Krupansky j...@basetechnology.comwrote: sum(x, y, z) = x + y + z (sums those specific fields values for the current document) sum(x, y) = x + y (sum of those two

Re: StatsComponent with median

2013-10-04 Thread Steven Bower
Check out: https://issues.apache.org/jira/browse/SOLR-5302 it supports median value On Wed, Jul 3, 2013 at 12:11 PM, William Bell billnb...@gmail.com wrote: If you are a programmer, you can modify it and attach a patch in Jira... On Tue, Jun 4, 2013 at 4:25 AM, Marcin Rzewucki

Re: Removing BinaryUpdateRequestHandler which is deprecated

2013-10-04 Thread Erlend Garåsen
Many thanks for your reply. We're running Solr 4.4.0 in production, but SolrJ 3.6 is still used by our CMS connector. I will place your reply as a comment in our own Jira and do the necessary changes later when we are ready to upgrade SolrJ for our connector. Erlend On 10/4/13 4:05 PM,

Use function return value for range queries

2013-10-04 Thread SandroZbinden
Is there a way to use the function return value for a range query For example: I have two price fields pricea and priceb and now i want to get the values where the sum of the pricea and priceb is between [0 TO 5] Something like *select?q={!func}sum(pricea,priceb):[0 TO 5]* I can't calculate

Re: Use function return value for range queries

2013-10-04 Thread Jack Krupansky
I think the best you can do is compute sum(pricea,priceb) at index time as a third field, say priceSum, and then you can do a range query on that priceSum field. It would be nice to be able to have a query that evaluates arbitrary expressions combining field values, but there is no such

AW: Use function return value for range queries

2013-10-04 Thread Sandro Zbinden
Thanks for the quick answer. I thought that :-) Is there any plan add such a functionality in the future. Or is it completely against the concept. Bests Sandro -Ursprüngliche Nachricht- Von: Jack Krupansky [mailto:j...@basetechnology.com] Gesendet: Freitag, 4. Oktober 2013 16:41 An:

Re: WikipediaTokenizer documentation

2013-10-04 Thread Jack Krupansky
I have some info and examples for the WikipediaTokenizer in my book, but a tokenizer does not direct tokens to a field. Rather, you would use the tokenizer in the analyzer for whatever field you wish to store values in. You could use the same input for multiple fields and then filter the tokens

Re: AW: Use function return value for range queries

2013-10-04 Thread Jack Krupansky
No plan that I know of, but there is a new Lucene expression module, so maybe it is not so farfetched. Its performance might not be so great, but if you need the flexibility it might be worth it. -- Jack Krupansky -Original Message- From: Sandro Zbinden Sent: Friday, October 04,

Re: Got error: Filter execution threw an exception

2013-10-04 Thread Brian Robinson
Thanks for the guidance, Shawn. I am in fact using Tomcat instead of Jetty, but the logging was working OK at one point, so I'm not sure what changed to make it not work. I'll have to investigate that. I'll check out your other suggestions as well. Brian On 10/4/2013 1:43 AM, Shawn Heisey

Re: Adding attributes to Solr fields ?

2013-10-04 Thread jimmy nguyen
Hello, and thank you for your answer Shawn. I tried to simplify my problem but I realize I chose a bad example : I don't process phone numbers, and I do process unstructured documents. My GATE application might return several annotations for the same group of words (because I'm using an

SolrCloud distribute search question.

2013-10-04 Thread yriveiro
Hi, When a distributed search is done, the inital query is forwarded to all shards that are part of the specific collection that we are querying. My question here is, Which is the machine that does the aggregation for results from shards? Is the machine which receives the initial request? I

Re: Got error: Filter execution threw an exception

2013-10-04 Thread Brian Robinson
It ended up that I just needed to restart Tomcat. Once you mentioned the logging, it sounded like something somewhere just got stuck, so rebooting took care of it. I should have just done that in the first place. Brian On 10/4/2013 1:43 AM, Shawn Heisey wrote: On 10/3/2013 8:03 PM, Brian

Sort by score an int field value

2013-10-04 Thread user 01
I need to sort documents returned in order of (descending)score (descending)value of an int field within the document. How do I ensure proper sort order as well as good performance ? I don't need the sort-order defined by sort=score desc,intField desc. The sort order needs to be somewhat like

solr configuration templates when creating new cores on the fly?

2013-10-04 Thread helt
Hi, im playing around with solr 4.4. Just started it and tried to create new cores. First hurdle was the fact that no solrconfig.xml and no schema.xml were found in the classpath. So I copied and configured into example/resources, because that directory is in the classpath. Now I am able to

Re: solr configuration templates when creating new cores on the fly?

2013-10-04 Thread Shawn Heisey
On 10/4/2013 1:49 PM, helt wrote: Now I am able to create cores, but new cores don't have their own config file. Is this intended behavior? I haven't found much documentation for these features. I think I miss something basic here. Is it really normal, that no configuration files are being

Re: SolrCloud distribute search question.

2013-10-04 Thread Tomás Fernández Löbbe
Yes, the machine that gets the initial request is the one that distributes to the shards and the aggregates the results. On Fri, Oct 4, 2013 at 9:55 AM, yriveiro yago.rive...@gmail.com wrote: Hi, When a distributed search is done, the inital query is forwarded to all shards that are part