Re: Replication Problem from solr-3.6 to solr-4.0

2014-07-23 Thread Sree..
I did optimize the master and the slave started replicating the indices! -- View this message in context: http://lucene.472066.n3.nabble.com/Replication-Problem-from-solr-3-6-to-solr-4-0-tp4025028p4148953.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Performance of indexing using Solr

2014-07-23 Thread Joel Bernstein
It looks like you're committing too frequently. If you're explicitly committing from the application you may want to switch to using autoCommits. If you're not committing from the application your autocommit settings are probably too low. Joel Bernstein Search Engineer at Heliosearch On Wed, Jul 23,

Re: Question about ReRankQuery

2014-07-23 Thread Joel Bernstein
I like the FieldSortedHitQueue idea. If you want to work up a patch for that, it would be great. Joel Bernstein Search Engineer at Heliosearch On Wed, Jul 23, 2014 at 5:17 PM, Peter Keegan wrote: > > The ReRankingQParserPlugin uses the Lucene QueryRescorer, which only uses > the score from th

Re: Any Solr consultants available??

2014-07-23 Thread Alexandre Rafalovitch
On Thu, Jul 24, 2014 at 2:44 AM, Jack Krupansky wrote: > All the great Solr guys I know are quite busy. Sounds like an opportunity for somebody to put together a training hacker camp, similar to https://hackerbeach.org/ . Cross-train consultants in Solr, immediately increase their value. Do it s

Re: Any Solr consultants available??

2014-07-23 Thread Walter Underwood
When I see job postings like this, I have to assume they were written by people who really don’t understand the problem and have never met people with the various skills they are asking for. They are not going to find one person who does all this. This is an opening for zebra unicorn that walks

commons-configuration NoClassDefFoundError: Predicate

2014-07-23 Thread Peyman Faratin
Hi, I've tried all permutations with no results so I thought I'd write to the group for help. I am running commons config (http://commons.apache.org/proper/commons-configuration/) just fine via maven and ant but when I try to run the class calling the method PropertiesConfiguration via a SOLR s

Re: SOLR 4.4 - Slave always replicates full index

2014-07-23 Thread Robin Woods
Thanks Shawn. that makes sense.

Re: How to migrate content of a collection to a new collection

2014-07-23 Thread Chris Hostetter
: billions of documents (not enough memory). Please note that we are on 4.4, : which does not contain the new CURSOR-feature. Please also note that speed is : an important factor for us. for situations where you know you will be processing every doc and order doesn't matter you can use a "poor m

Issue with solr admin collection API 4.8.1

2014-07-23 Thread Hutchins, Jonathan
Solr 4.8.1 Zookeeper 3.4.5 Centos 6.5 We are running into an issue where one of our environments is unable to successfully execute commands via the collection API. We found that we were unable to add new collections and after doing some digging found that even /solr/admin/collections?action=LI

Re: Question about ReRankQuery

2014-07-23 Thread Peter Keegan
> The ReRankingQParserPlugin uses the Lucene QueryRescorer, which only uses the score from the re-rank query when re-ranking the top N documents. Understood, but if the re-rank scores produce new ties, wouldn't you want to resort them with the FieldSortedHitQueue? Anyway, I was looking to reimple

Re: integrating Accumulo with solr

2014-07-23 Thread Joe Gresock
We store data in both Solr and Accumulo -- do you have more details about what kind of data and indexing you want? Is there a reason you're thinking of using both databases in particular? On Wed, Jul 23, 2014 at 5:17 AM, Ali Nazemian wrote: > Dear All, > Hi, > I was wondering is there anybody

Performance of indexing using Solr

2014-07-23 Thread Ameya Aware
Hi, I am kind of in trouble regarding indexing documents using Solr. After every 15-20 documents, Solr gives below log: INFO - 2014-07-23 15:38:50.715; org.apache.solr.core.SolrDeletionPolicy; newest commit generation = 994 INFO - 2014-07-23 15:38:50.718; org.apache.solr.search.SolrIndexSearch

Re: Any Solr consultants available??

2014-07-23 Thread Steve McKay
Perhaps the requirement means a total of 10 years of experience spread across Solr, HTML, XML, Java, Tomcat, JBoss, and MySQL. This doesn't seem likely, but it is satisfiable, so if we proceed on the assumption that a job posting doesn't contain unsatisfiable requirements then it's more reasonab

Re: Any Solr consultants available??

2014-07-23 Thread Jack Krupansky
Yeah, I saw that, which is why I suggested not being too picky about specific requirements. If you have at least two or three years of solid Solr experience, that would make you at least worth looking at. -- Jack Krupansky From: Tri Cao Sent: Wednesday, July 23, 2014 3:57 PM To: solr-user@luce

Re: Any Solr consultants available??

2014-07-23 Thread Tri Cao
Well, it's kind of hard to find a person if the requirement is "10 years' experience with Solr" given that Solr was created in 2004. On Jul 23, 2014, at 12:45 PM, Jack Krupansky wrote: I occasionally get pinged by recruiters looking for Solr application developers... here’s the latest. If you

Any Solr consultants available??

2014-07-23 Thread Jack Krupansky
I occasionally get pinged by recruiters looking for Solr application developers... here’s the latest. If you are interested, either contact Jessica directly or reply to me and I’ll forward your reply. Even if you don’t strictly meet all the requirements... they are having trouble finding... any

Re: Question about ReRankQuery

2014-07-23 Thread Joel Bernstein
Blog on the RankQuery API http://heliosearch.org/solrs-new-rankquery-feature/ Joel Bernstein Search Engineer at Heliosearch On Wed, Jul 23, 2014 at 3:27 PM, Joel Bernstein wrote: > The ReRankingQParserPlugin uses the Lucene QueryRescorer, which only uses > the score from the re-rank query when

Re: Question about ReRankQuery

2014-07-23 Thread Joel Bernstein
The ReRankingQParserPlugin uses the Lucene QueryRescorer, which only uses the score from the re-rank query when re-ranking the top N documents. The ReRankingQParserPlugin is built as a RankQuery plugin so you can swap in your own implementation. Patches are also welcome for the existing implement
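For readers who have not used the re-rank parser, here is a minimal sketch of the request parameters it expects. The helper name `rerank_params` and the default values are hypothetical and chosen only for illustration; `rq`, `reRankQuery`, `reRankDocs` and `reRankWeight` are the parser's own parameter names. It only builds the parameters and does not address how ties are ordered after re-ranking, which is the question raised in this thread.

```python
from urllib.parse import urlencode


def rerank_params(main_query, rerank_query, rerank_docs=200, rerank_weight=2.0):
    """Build Solr request parameters that re-rank the top N hits.

    Hypothetical helper: the function name and defaults are assumptions.
    """
    return {
        "q": main_query,
        # Local-params syntax: the re-rank query is passed by reference ($rqq).
        "rq": f"{{!rerank reRankQuery=$rqq reRankDocs={rerank_docs} reRankWeight={rerank_weight}}}",
        "rqq": rerank_query,
    }


if __name__ == "__main__":
    params = rerank_params("title:solr", "category:search", rerank_docs=1000, rerank_weight=3)
    # URL-encode the parameters for use in a /select request.
    print(urlencode(params))
```

Passing the re-rank query through a separate parameter (`rqq` here) avoids quoting problems inside the local-params block.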

Re: How do I get faceting to work with Solr JOINs

2014-07-23 Thread Vinay B,
Thank You, Umesh ! That's a neat approach. Reading through your post, we decided to tweak our indexing strategy a bit, basically an inversion. We moved all our facetable (and frequently updated) fields to the main doc and the text and other static content fields to the sub doc (co-related via a pa

Re: NoClassDefFoundError while indexing in Solr

2014-07-23 Thread Steve McKay
BTW, Ameya, jhighlight-1.0.jar is in the Solr binary distribution, in contrib/extraction/lib. There are a bunch of different libraries that Tika uses for content extraction, so this seems like a good time to make sure that Tika has all the jars available that it might need to process the files you'

Re: [ANN] SIREn, a Lucene/Solr plugin for rich JSON data search

2014-07-23 Thread Walter Underwood
Querying nested data is very easy in MarkLogic, it was built for that. I used to work there. The founder is a former search engine guy from Infoseek and Ultraseek, so it has a lot of familiar behavior, like merging segments automatically. wunder Walter Underwood wun...@wunderwood.org http://obs

Re: Question about ReRankQuery

2014-07-23 Thread Peter Keegan
See http://heliosearch.org/solrs-new-re-ranking-feature/ On Wed, Jul 23, 2014 at 11:27 AM, Erick Erickson wrote: > I'm having a little trouble understanding the use-case here. Why use > re-ranking? > Isn't this just combining the original query with the second query with an > AND > and using th

Re: how to achieve static boost in solr

2014-07-23 Thread Erick Erickson
Take a look at Query Elevation Component perhaps? Best, Erick On Wed, Jul 23, 2014 at 8:05 AM, rahulmodi wrote: > Hi, > > I am struggling how to achieve static boost in solr, i have visited many > web > sites but not getting solid answer. > > The requirement is as below: > Suppose i have 100 k

Re: Question about ReRankQuery

2014-07-23 Thread Erick Erickson
I'm having a little trouble understanding the use-case here. Why use re-ranking? Isn't this just combining the original query with the second query with an AND and using the original sort? At the end, you have your original list in its original order, with (potentially) some documents removed tha

Re: solr 3.6 to 4.7 upgrade has changed the query string

2014-07-23 Thread Jack Krupansky
Did you blindly switch to the new solrconfig.xml? If so, the default query request handler sets the "df" parameter to "text", which would give you different results compared to having the "" set to some other field, like your "ID" field. Read the comments in the new schema.xml about being de

Re: Passivate core in Solr Cloud

2014-07-23 Thread Erick Erickson
Do note that the lots of cores stuff does NOT play nice in distributed mode (yet). Best, Erick On Wed, Jul 23, 2014 at 6:00 AM, Alexandre Rafalovitch wrote: > Solr has some support for large number of cores, including transient > cores: http://wiki.apache.org/solr/LotsOfCores > > Regards,

Re: solr 3.6 to 4.7 upgrade has changed the query string

2014-07-23 Thread Erick Erickson
Try adding &debug=all to both records. But these are very different queries. My guess is that something _else_ changed, probably in solrconfig.xml that's the cause, most probably your default field in your 3.6 case is the ID field. If that's the case you should be able to change it in the 4.7 soln

Re: Are stored fields compressed by default?

2014-07-23 Thread Erick Erickson
Yes, they have been since 4.1. And there's no handy option for turning this off at this point.. Best, Erick On Wed, Jul 23, 2014 at 2:31 AM, Gili Nachum wrote: > Hi! I'm planning to use atomic-updates > which means having all > fields > stored. >

Re: How to migrate content of a collection to a new collection

2014-07-23 Thread Erick Erickson
Per: Given that you said that the field redefinition also includes routing info I don't see any other way than re-indexing each collection. That said, could you use the collection aliasing and do one collection at a time? Best, Erick On Tue, Jul 22, 2014 at 11:45 PM, Per Steffensen wrote:

how to achieve static boost in solr

2014-07-23 Thread rahulmodi
Hi, I am struggling with how to achieve static boost in Solr. I have visited many web sites but am not getting a solid answer. The requirement is as below: Suppose I have 100 keywords to search for and for each keyword I want a particular URL to appear on top. Say.. for keyword *"car"* the URL *"http://c

Re: How to get Lacuma to match Lucuma

2014-07-23 Thread Warren Bell
Is there a way to make Solr do fuzzy searches automatically without having to add the tilde character? And are there disadvantages to doing fuzzy searches? Warren On Jul 22, 2014, at 1:54 PM, Anshum Gupta wrote: > Hi Warren, > > Check out the section about fuzzy search here > https://cwi
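One possible client-side approach, sketched here under assumptions rather than taken from the thread, is to append the fuzzy operator to each term before the query is sent to Solr. The helper name `add_fuzzy` is hypothetical, and the sketch assumes the input is plain whitespace-separated words with no field prefixes, quotes, or boolean operators.

```python
def add_fuzzy(query: str, max_edits: int = 1) -> str:
    """Append the fuzzy operator (~N) to every term that lacks one.

    Hypothetical helper; assumes simple whitespace-separated terms only.
    """
    terms = query.split()
    return " ".join(t if "~" in t else f"{t}~{max_edits}" for t in terms)


if __name__ == "__main__":
    # The user types a plain word; the application adds the tilde.
    print(add_fuzzy("Lacuma"))
```

Fuzzy matching generally costs more than exact term matching and can return looser matches, so it may be worth applying it only when the exact query returns nothing.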

Re: [ANN] SIREn, a Lucene/Solr plugin for rich JSON data search

2014-07-23 Thread Jay Vyas
Querying nested data is very difficult in any modern db that I have seen. If it works as you suggest, then it would be cool if the feature were eventually maintained inside Solr. > On Jul 23, 2014, at 7:13 AM, Renaud Delbru wrote: > > One of the coolest features of Lucene/Solr is

Re: NoClassDefFoundError while indexing in Solr

2014-07-23 Thread Ameya Aware
Thanks a lot for your suggestions. On Wed, Jul 23, 2014 at 9:53 AM, Pablo Queixalos < pqueixa...@customermatrix.com> wrote: > There is a source code "parser" in tika that in fact just renders the > source using an external source higlighter. > > Seen in you stack trace : > com.apache.tika.parser

Re: Solr 4.7.2 auto suggestion

2014-07-23 Thread benjelloun
Hello, the suggester "solr.SuggestComponent" with FuzzyLookupFactory DocumentDictionaryFactory doesn't work with field types that are not string and multivalued. Any idea?

RE: NoClassDefFoundError while indexing in Solr

2014-07-23 Thread Pablo Queixalos
There is a source code "parser" in tika that in fact just renders the source using an external source highlighter. Seen in your stack trace : com.apache.tika.parser.code.SourceCodeParser.getRenderer(SourceCodeParser.java:121) You are indexing code (java, c or groovy). Solr seems to be missing a

Question about ReRankQuery

2014-07-23 Thread Peter Keegan
I'm looking at how 'ReRankQuery' works. If the main query has a Sort criteria, it is only used to sort the first pass results. The QueryScorer used in the second pass only reorders the ScoreDocs based on score and docid, but doesn't use the original Sort fields. If the Sort criteria is 'score desc,

Re: text search problem

2014-07-23 Thread Josh Lincoln
Ravi, for the hyphen issue, try setting autoGeneratePhraseQueries=true for that fieldType (no re-index needed). As of 1.4, this defaults to false. One word of caution, autoGeneratePhraseQueries may not work as expected for languages that aren't whitespace delimited. As Erick mentioned, the Analysis

how to fully test a response writer

2014-07-23 Thread Matteo Grolla
Hi, I developed a new SolResponseWriter but I'm not happy with how I wrote tests. My problem is that I need to test it both with local requests and with distributed requests, since the Solr response objects (input to the response writer) are different. a) I tested the local request case

Re: Passivate core in Solr Cloud

2014-07-23 Thread Alexandre Rafalovitch
Solr has some support for large number of cores, including transient cores: http://wiki.apache.org/solr/LotsOfCores Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.co

Passivate core in Solr Cloud

2014-07-23 Thread Aurélien MAZOYER
Hello, We want to set up a Solr Cloud cluster in order to handle a high volume of documents with a multi-tenant architecture. The problem is that an application-level isolation for a tenant (using a mutual index with a field "customer") is not enough to fit our requirements. As a result, we ne

[ANN] SIREn, a Lucene/Solr plugin for rich JSON data search

2014-07-23 Thread Renaud Delbru
One of the coolest features of Lucene/Solr is its ability to index nested documents using a Blockjoin approach. While this works well for small documents and document collections, it becomes unsustainable for larger ones: Blockjoin works by splitting the original document in many documents, on

Re: Query using doc Id

2014-07-23 Thread Mukundaraman Valakumaresan
Exactly Alexandre, Thanks Regards Mukund On Wed, Jul 23, 2014 at 3:37 PM, Alexandre Rafalovitch wrote: > Perhaps you are looking for cursorMark: > http://solr.pl/en/2014/03/10/solr-4-7-efficient-deep-paging/ ? > > Regards, >Alex. > Personal: http://www.outerthoughts.com/ and @arafalov > So

Re: Query using doc Id

2014-07-23 Thread Alexandre Rafalovitch
Perhaps you are looking for cursorMark: http://solr.pl/en/2014/03/10/solr-4-7-efficient-deep-paging/ ? Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.com/groups?gid=
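The cursorMark approach mentioned here can be sketched as a simple loop. This is a minimal illustration, not code from the thread: the function name `iterate_with_cursor` and the `fetch` callable (anything that sends the parameters to Solr and returns the parsed JSON response) are assumptions, as is a unique key field named `id`. What comes from Solr itself is the protocol: start with `cursorMark=*`, sort on a clause that includes the unique key, and stop when the returned `nextCursorMark` equals the one that was sent.

```python
from typing import Callable, Dict, Iterator


def iterate_with_cursor(
    fetch: Callable[[Dict[str, str]], dict],
    query: str = "*:*",
    unique_key: str = "id",  # assumed name of the schema's uniqueKey field
    rows: int = 100,
) -> Iterator[dict]:
    """Yield every matching document using cursorMark deep paging.

    `fetch` is a hypothetical callable that performs the HTTP request
    and returns the decoded JSON response as a dict.
    """
    cursor = "*"  # initial cursor value defined by Solr
    while True:
        params = {
            "q": query,
            "sort": f"{unique_key} asc",  # sort must include the unique key
            "rows": str(rows),
            "cursorMark": cursor,
        }
        response = fetch(params)
        yield from response["response"]["docs"]
        next_cursor = response["nextCursorMark"]
        if next_cursor == cursor:
            # An unchanged cursor means there are no more results.
            break
        cursor = next_cursor
```

Because `fetch` is injected, the loop can be exercised against a stub before pointing it at a real Solr 4.7 or later instance.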

RE: SolrCloud replica dies under high throughput

2014-07-23 Thread Darren Lee
Thanks that helped. I no longer see the constant replica recovery. It also increased my throughput to 1.6/1.7 million per hour reliably. I actually then tried using SSDs instead and it flew up to 6.5 million updates per hour. Setup: 4 node cluster using m3.2xl AWS servers using general purpose S

Re: Query using doc Id

2014-07-23 Thread Mukundaraman Valakumaresan
@Alexandre No, I mean the same as you do: docId:[100 TO 200] @Santosh My intention is to query all the docs from Solr. If I give rows=100&start=100, I need to apply my query as *:*, which is not a good idea. Hence looking for an option to filter based on docId. Thanks & Regards Muku

solr 3.6 to 4.7 upgrade has changed the query string

2014-07-23 Thread shashi.rsb
Hi, Our backend application queries Solr to retrieve certain records. We were initially on version 3.6 and have now upgraded to Solr 4.7. Something has changed in terms of the query string, which now needs parentheses for the query below. Both queries are from Solr 4.7. returns 1 record http://

Re: Java heap Space error

2014-07-23 Thread Harald Kirsch
You may want to change your solr startup script such that it creates a heap dump on OOM. Add -XX:+HeapDumpOnOutOfMemoryError as an option. The heap dump can be nicely analyzed with http://www.eclipse.org/mat/. Just increasing -Xmx is a workaround that may help to get around for a while. With m

Are stored fields compressed by default?

2014-07-23 Thread Gili Nachum
Hi! I'm planning to use atomic-updates which means having all fields stored. Some docs might have text fields of up to 200K, I will feel better knowing that Solr automatically compresses stored fields (I know Lucene 4.x default codec does). *Are stored

integrating Accumulo with solr

2014-07-23 Thread Ali Nazemian
Dear All, Hi, I was wondering whether anybody out there has tried to integrate Solr with Accumulo? I was thinking about using Accumulo on top of HDFS and using Solr to index data inside Accumulo. Do you have any idea how I can do such an integration? Best regards. -- A.Nazemian