Re: Replication Error

2014-01-03 Thread anand chandak
Thanks Daniel. So, if I understand correctly the below exception is almost always caused because of merging segments ? Though I see different file names (for e.g download_av3.fdt in this case) in the exception messages [explicit-fetchindex-cmd] ERROR org.apache.solr.handler.ReplicationHand

Re: Tracking down the input that hits an analysis chain bug

2014-01-03 Thread Michael Sokolov
Have you considered using a custom UpdateProcessor to catch the exception and provide more context in the logs? -Mike On 01/03/2014 03:33 PM, Benson Margulies wrote: Robert, Yes, if the problem was not data-dependent, indeed I wouldn't need to index anything. However, I've run a small mountai

Re: SPLITSHARD failed to invoke SPLIT core admin command

2014-01-03 Thread sriram
Thanks Anshum. Will keep track of that. Also the shard is in construction state for long time. I see the size of the index folder inside the newly created cores : solr/collection1_shard1_0_replica1 solr/collection1_shard1_1_replica1 They are of almost similar size and more than half the size of

Re: SPLITSHARD failed to invoke SPLIT core admin command

2014-01-03 Thread Anshum Gupta
As far as I remember you can only delete slices which are either 1. Inactive or 2. A part of a collection that uses implicit routing. The delete shard API would not let you delete a slice directly. However, you can perhaps retry the command. As far as I remember, it's built to cleanup an earlier i

Re: SPLITSHARD failed to invoke SPLIT core admin command

2014-01-03 Thread sriram
Thanks a lot Anshum for your quick reply. I checked the clusterstate.json and there it says the two new shards as under construction. It shows : shard1 : active shard2 : active shard1_1 : construction shard1_0 : construction But the loggers don't indicate any other progress in that. Also when I

Re: how to include result ordinal in response

2014-01-03 Thread Jack Krupansky
Or just use the internal document ID: fl=*,[docid] Granted, the docID may change if a segment merge occurs and earlier documents have been deleted, but it may be sufficient for your purposes. -- Jack Krupansky -Original Message- From: Upayavira Sent: Friday, January 03, 2014 5:58 PM

Re: SPLITSHARD failed to invoke SPLIT core admin command

2014-01-03 Thread Anshum Gupta
Also, here's another issue that would help with the timeouts (SOLR-5477 ). On Fri, Jan 3, 2014 at 3:34 PM, Anshum Gupta wrote: > Hi Sriram, > > People hit timeout issues when trying to split larger collections/shards. > This is because of the tim

Re: SPLITSHARD failed to invoke SPLIT core admin command

2014-01-03 Thread Anshum Gupta
Hi Sriram, People hit timeout issues when trying to split larger collections/shards. This is because of the timing out of internal calls. About the UI showing everything as active, that's another open issue( SOLR-4777 ). You should instead check th

Re: Exact match on KeywordTokenizer

2014-01-03 Thread Chris Hostetter
Can you show us examples of the types of data you are indexing, and the types of queries you want to match? (as well as examples of queries you *don't* want to match) https://wiki.apache.org/solr/UsingMailingLists#Information_useful_for_searching_problems Best guess, based on your problem desc

Re: how to include result ordinal in response

2014-01-03 Thread Upayavira
On Fri, Jan 3, 2014, at 10:00 PM, Peter Keegan wrote: > Is there a simple way to output the result number (ordinal) with each > returned document using the 'fl' parameter? This would be useful when > visually comparing the results from 2 queries. I'm not aware of a simple way. If you're competent

SPLITSHARD failed to invoke SPLIT core admin command

2014-01-03 Thread sriram
Hi, I am using solr cloud 4.6 with 2 shards. I have loaded close to 13Million data in them. When I tried to give split shard, I am getting the following errors in the log, INFO - 2014-01-03 14:17:45.357; org.apache.solr.cloud.DistributedQueue$LatchChildWatcher; Watcher fired on path: /overseer/c

how to include result ordinal in response

2014-01-03 Thread Peter Keegan
Is there a simple way to output the result number (ordinal) with each returned document using the 'fl' parameter? This would be useful when visually comparing the results from 2 queries. Thanks, Peter

Re: SSLinitializationException after SOLR upgrade

2014-01-03 Thread Chris Hostetter
: We recently ungraded from solr 3.5 to solr4.4 . After the upgrade we are : facing the following issue. Could anyone please suggest on what could be : done. You are aparently running Solr code from a JVM configured to use an SSL keystore which has a password configured on it, and you either hav

Re: Tracking down the input that hits an analysis chain bug

2014-01-03 Thread Benson Margulies
Robert, Yes, if the problem was not data-dependent, indeed I wouldn't need to index anything. However, I've run a small mountain of data through our tokenizer on my machine, and never seen the error, but my customer gets these errors in the middle of a giant spew of data. As it happens, I _was_ mi

Re: Empty facets on Solr with MySQL

2014-01-03 Thread Andrea Gazzarini
Nice to hear you (not me) solved the problem. You're welcome Andrea On 3 Jan 2014 21:19, "PeterKerk" wrote: > No need, you solved it! > It was the id name, it had to be uppercase. > > btw the ; is still there in the query, but everything still works. > > Thanks! > > > > -- > View this message in

Re: Empty facets on Solr with MySQL

2014-01-03 Thread PeterKerk
No need, you solved it! It was the id name, it had to be uppercase. btw the ; is still there in the query, but everything still works. Thanks! -- View this message in context: http://lucene.472066.n3.nabble.com/Empty-facets-on-Solr-with-MySQL-tp4109170p4109425.html Sent from the Solr - User m

SSLinitializationException after SOLR upgrade

2014-01-03 Thread lakshmi guruparan
Hi All, We recently ungraded from solr 3.5 to solr4.4 . After the upgrade we are facing the following issue. Could anyone please suggest on what could be done. INFO [2013-12-13 07:54:50,149] [threadPoolTaskExecutor-2] [HttpClientUtil] [HttpClientUtil.java:103] - Creating new http client, config:m

Re: Empty facets on Solr with MySQL

2014-01-03 Thread Andrea Gazzarini
I don't remember your dih-config.xml (could you post it again?) - remove the trailing ; from the query. It is a valid delimiter only when you run queries in mysql worlbench; - I assume there's a parent entity named (name=) article. - are you sure the column of the article entity is "id" (NB this

RE: Indexing URLs for Binaries

2014-01-03 Thread Teague James
Thanks, Mark. I checked there, but pdf files are not listed. There are some file types in there that I might need in the future, so I appreciate the info. Any other ideas? -Original Message- From: Reyes, Mark Sent: Friday, January 03, 2014 1:39 PM To: solr-user@lucene.apache.org Subject:

Re: Tracking down the input that hits an analysis chain bug

2014-01-03 Thread Robert Muir
This exception comes from OffsetAttributeImpl (e.g. you dont need to index anything to reproduce it). Maybe you have a missing clearAttributes() call (your tokenizer 'returns true' without calling that first)? This could explain it, if something like a StopFilter is also present in the chain: basi

Tracking down the input that hits an analysis chain bug

2014-01-03 Thread Benson Margulies
Using Solr Cloud with 4.3.1. We've got a problem with a tokenizer that manifests as calling OffsetAtt.setOffsets() with invalid inputs. OK, so, we want to figure out what input provokes our code into getting into this pickle. The problem happens on SolrCloud nodes. The problem manifests as this

Re: Replication Error

2014-01-03 Thread Shawn Heisey
On 1/3/2014 10:34 AM, Daniel Collins wrote: We see this a lot as well, my understanding is that recovery asks the leader for a list of the files that it should download, then it downloads them. But if the leader has been merging segments whilst this is going on (recovery is taking a reasonable p

Re: Indexing URLs for Binaries

2014-01-03 Thread Reyes, Mark
Check suffix-urlfilter.txt in your conf directory for Nutch. You might be prohibiting those filetypes from the crawl. - Mark On 1/3/14, 10:29 AM, "Teague James" wrote: >I am using Nutch 1.7 with Solr 4.6.0 to index websites that have links to >binary files, such as Word, PDF, etc. The craw

Re: Empty facets on Solr with MySQL

2014-01-03 Thread PeterKerk
Hi Andrea, I think you helped me to get closer, but not quite there yet. When I replace wtr.object_id='${article.id}'; with wtr.object_id=18 the cat_name field holds a value, which I checked via the schema browser of Solr dashboard! I then checked my main query SELECT p.*, ( SELECT guid FROM wp

Indexing URLs for Binaries

2014-01-03 Thread Teague James
I am using Nutch 1.7 with Solr 4.6.0 to index websites that have links to binary files, such as Word, PDF, etc. The crawler crawls the site but I am not getting the URLs of the links for the binary files no matter how deep I set the settings for the site. I see the labels for the links in the conte

Re: Empty facets on Solr with MySQL

2014-01-03 Thread Andrea Gazzarini
Or, in other words, assuming that the example you reported (article.id=18) returns a record with cat_name, could you hardcode that id in your dih-config (replacing ${article.id} with 18) and try to run a full-import? Another important thing: in your example you are using the object_id as an int (o

Re: Empty facets on Solr with MySQL

2014-01-03 Thread Andrea Gazzarini
Yes, that definitely matters...I was talking about a simple java class. Peter, sincerely? I don't know. Are you sure you are doing then same queries with the same article.id as solr does? and sorry for the stupid question but are you sure that for those article ids the cat_name is not null? On 3

Re: Empty facets on Solr with MySQL

2014-01-03 Thread PeterKerk
But when I execute the query directly on MySQL I do get a cat_name column in there: select wt.name as cat_name,wt.slug,wtr.term_taxonomy_id,wtt.term_id,wtt.taxonomy from wp_term_relationships wtr INNER JOIN wp_term_taxonomy wtt ON wtt.term_taxonomy_id=wtr.term_taxonomy_id AND wtt.taxonomy='catego

Re: Replication Error

2014-01-03 Thread Daniel Collins
We see this a lot as well, my understanding is that recovery asks the leader for a list of the files that it should download, then it downloads them. But if the leader has been merging segments whilst this is going on (recovery is taking a reasonable period of time and you have an NRT system where

Re: Empty facets on Solr with MySQL

2014-01-03 Thread Andrea Gazzarini
Hi Peter, I can only guess that the result set doesn't contain a "cat_name" (case insensitive) column. Other option / question: do you have a transformer (like scriptTransformer) that manipulates the resultset? You can debug the resultset in a main class by doing rs.getString ("cat_name") Cheers,

Re: Very long running replication.

2014-01-03 Thread anand chandak
Folks, would really appreciate if somebody can help/throw some light on below issue . This issue is blocking our upgrade, we are doing a 3.x to 4.x upgrade and indexing around 100g of data. Any help would be highly appreciated. Thanks, Anand On 1/3/2014 11:46 AM, anand chandak wrote: Thank

Re: Invalid version (expected 2, but 60) or the data in not in 'javabin' format exception while deleting 30k records

2014-01-03 Thread Mark Miller
Just an FYI, newer version of Solr will deploy the proper error message rather than that cryptic one. - Mark On Jan 3, 2014, at 12:54 AM, Shawn Heisey wrote: > On 1/2/2014 10:22 PM, gpssolr2020 wrote: >> Caused by: java.lang.RuntimeException: Invalid version (expected 2, but 60) >> or the data

Re: Empty facets on Solr with MySQL

2014-01-03 Thread PeterKerk
Hi Andrea, Here you go: **data-config.xml** **schema.xml**

Re: Empty facets on Solr with MySQL

2014-01-03 Thread Andrea Gazzarini
Hi Peter, Umfprtunately I deleted your first email where you wrote a piece of your schema...the problem seems to be "cat_name" and not cat_name_raw...could you please post again your schema? On 3 Jan 2014 13:40, "PeterKerk" wrote: > Hi Andrea, > > You were right, I do see errors when setting the

Re: Empty facets on Solr with MySQL

2014-01-03 Thread PeterKerk
Hi Andrea, You were right, I do see errors when setting the required=true attribute...what can it be? Logging console homepage: 13:31:54 WARN SolrWriter Error creating document : SolrInputDocument[comment_status=open,​ post_content=algemeen kpn artikeltje ook over vodafone,​ guid=http://www.tel

Re: Empty facets on Solr with MySQL

2014-01-03 Thread Andrea Gazzarini
Hi Peter Sorry I didn't see your question about the log level. I'm here with my mobile so cannot guide you step by step (don't remember exactly) but is very very easy. Prior to solr 4 there was a "logging" hyperlink on the homepage of the console. Starting from 4 you should see the same link on the

Re: Empty facets on Solr with MySQL

2014-01-03 Thread Andrea Gazzarini
Sorry I didn't get the point...so that means - if you put the required clause you must have some kind of errors (because no documets are imported) - your query doesn't throw out that field I'm quite sure that if you put again the required attribute and set to finest the log level you must see

Re: Grouping results with group.limit return wrong numFound ?

2014-01-03 Thread tasmaniski
Hi and thanks all, There is no easy solution implemented in Solr for my problem. I will try "namedistinct" you said, might improve workaround solution. Perhaps, the main problem are the names :) because *group.limit* do same as basic rows

Re: Empty facets on Solr with MySQL

2014-01-03 Thread PeterKerk
Hi Andrea, I would say the JDBC driver must be working because when I leave out the required="true" from the cat_name field, 4 documents are imported. Since my entire DB currently holds only 4 records, there's no need for a LIMIT clause I guess? Andrea Gazzarini-4 wrote > In the solr console se