Fwd: Join in SOLR

2014-12-17 Thread Mikhail Khludnev
Hello Rajesh, Please find the brief below Thanks Mikhail. As per what you have mentioned can I get a list of sub entities with this new Zipper join. Because now in existing DIH I'm getting a list for individual fields of the sub entities. 1) If I get a list of sub entities how can I get it

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread lboutros
Andrei, Pivot faceting is the Solr implementation for Hierarchical Facets. I don't think this is what you need. Could you please describe the original use case? Just to eliminate the XY problem. I don't know if this is acceptable for you in terms of performance but you could try to solve your

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread andreic9203
Hello, First, thank you for the quick answer. Second, this is exactly my current problem. I'm doing this with 2 queries, but with the current performance issues, I want to change the implementation. Now I am using grouping and then I take the count and set that count to the all grouped

Large scale Update of solr indexed documents

2014-12-17 Thread atawfik
Hi all, I have a scenario where I need to generate summaries of indexed documents. So, I initially thought I should do that at Nutch because I am using Nutch to push documents to Solr. However, I will need some statistics about terms and documents. Hence, I will have to duplicate analysis at

Re: questions about BlockJoinParentQParser

2014-12-17 Thread Kydryavtsev Andrey
To support the scoreMode parameter in BlockJoinParentQParser, we have this JIRA issue with an attached patch: https://issues.apache.org/jira/browse/SOLR-5882 17.12.2014, 06:54, Michael Sokolov msoko...@safaribooksonline.com: I'm trying to use BJPQP and ran into a few little gotchas that I'd like to share

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread lboutros
I don't know if this is possible for you, but could you pre-process the groups and create nested documents with pre-computed document counts?
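
The pre-processing idea above can be sketched roughly as follows. This is only an illustration under assumptions: the `group_count` field name is hypothetical, the documents are plain dictionaries, and the nested-document part of the suggestion is left out. Each document simply gets the size of its group attached before indexing, so a single range filter on that field could replace the second query.

```python
from collections import Counter


def add_group_counts(docs, key="groupSignature"):
    """Return copies of docs with a pre-computed group size attached.

    'group_count' is a hypothetical field name chosen for this sketch.
    """
    # Count how many documents share each group key.
    counts = Counter(doc[key] for doc in docs)
    # Copy each document and attach the size of its group.
    return [dict(doc, group_count=counts[doc[key]]) for doc in docs]


docs = [
    {"id": "1", "groupSignature": "a"},
    {"id": "2", "groupSignature": "a"},
    {"id": "3", "groupSignature": "b"},
]
enriched = add_group_counts(docs)
```

With such a field indexed, a filter like `group_count:[2 TO *]` would select only documents whose group has at least two members. The counts have to be recomputed whenever a group changes, which is the cost of this approach.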

Keepwords DataImportHandler

2014-12-17 Thread leostro
Hi all, This is my first question in this forum :D I'm trying to import documents using a DataImportHandler. <document> <entity name="entry" query="select top 100 id, title from entry order by id desc"> </entity> </document> The first test is to import some document having only

Re: Join in SOLR

2014-12-17 Thread Mikhail Khludnev
On Wed, Dec 17, 2014 at 11:51 AM, Rajesh Panneerselvam rajesh.panneersel...@aspiresys.com wrote: Yes Mikhail. This is what I want exactly. My sub-entities should be added as child document to the root entity. But will this feature be available in upcoming release anytime soon? Rajesh, it's

Block join subqueries

2014-12-17 Thread marotosg
Hi, Is it possible to do a query joining three levels? For instance, with three cores: Person, Person Job and Company. I know it is possible to join from Person to Person Job and from Person to Company. For instance {!type=join from=PersonIdsS to=PersonID fromIndex=personjob}type_level:parent AND

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread andreic9203
Ok, if you have a bit of time, could you explain your last comment on this example, please? { name: Table, user: admin, updateDate: 2012-12-31T15:31:16.012Z, id: svsKXRqbVhqZa-SznsU8FUII, type: 1, groupSignature: 5FJHzWke16auaB2hdqcIP)3a, } {

Re: Block join subqueries

2014-12-17 Thread Mikhail Khludnev
Sergio, did you check http://blog.griddynamics.com/2013/12/grandchildren-and-siblings-with-block.html ? On Wed, Dec 17, 2014 at 1:07 PM, marotosg marot...@gmail.com wrote: Hi, Is it possible to do a query joining three levels? For instance, with three cores: Person, Person Job and Company. I

Re: Block join subqueries

2014-12-17 Thread marotosg
Hi Mikhail, Thanks for that. That's exactly what I was looking for but this is for the same core. This allows you to search in a document nested two levels. I was expecting to do the same for cross core joins. That's basically doing a join from Core1 to Core2 to Core3. I couldn't find anything

Re: questions about BlockJoinParentQParser

2014-12-17 Thread Michael Sokolov
Thanks Andrey! I voted for your patch -Mike On 12/17/2014 4:01 AM, Kydryavtsev Andrey wrote: To support the scoreMode parameter in BlockJoinParentQParser, we have this JIRA issue with an attached patch: https://issues.apache.org/jira/browse/SOLR-5882 17.12.2014, 06:54, Michael Sokolov

Re: Keepwords DataImportHandler

2014-12-17 Thread Ahmet Arslan
Hi Leo, You are doing OK. DIH and analysis are separate issues. Please note that analysis changes indexed values. You can see this in the analysis section of the Solr panel. When you retrieve stored values using the fl= parameter, the original values are displayed. So the natural question is, how are you doing to

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread lboutros
You could try to denormalize even more : Create two collections: - one with user groups in mind - the second collection with user and groupSignature groups in mind. For instance, with user groups in mind : { id: svsKQSFfzhu-SznsU8FUII, user: admin, furniture_count:2,

SolrJ insert PDF for different language

2014-12-17 Thread Gabi
Following the sample in http://wiki.apache.org/solr/ContentStreamUpdateRequestExample I'm able to insert a PDF and search words, etc. ContentStreamUpdateRequest up = new ContentStreamUpdateRequest("/update/extract"); up.addFile(new

Re: Keepwords DataImportHandler

2014-12-17 Thread Doug Turnbull
Leo, everything you describe sounds correct. Are you having any problems? are keep words not working for DIH for you? Or are you just looking for general pointers? If so, your approach to this sounds a lot like a blog post I recently wrote, which you might find useful:

Re: Block join subqueries

2014-12-17 Thread Mikhail Khludnev
Hello, BlockJoin doesn't work cross core, it's only possible with {!join On Wed, Dec 17, 2014 at 3:12 PM, marotosg marot...@gmail.com wrote: Hi Mikhail, Thanks for that. That's exactly what I was looking for but this is for the same core. This allows you to search in a document nested two

Re: Get groups which has the number of elements greater than...

2014-12-17 Thread andreic9203
Hello again, So, what you are saying is to create the collections and documents with this in mind, and what I want couldn't be done from querying? Yes, but my index is already big, and this kind of operation is hard to be done... Thank you for the idea. If I didn't understand as it should be,

Re: CLUSTERSTATUS timeout

2014-12-17 Thread adfel70
Hi Jonathan, We are having the exact same problem with Solr 4.8.0. Did you manage to resolve this one? Thanks.

Re: Block join subqueries

2014-12-17 Thread marotosg
Yes, that's true. I mean join then. Is it possible to join three cores A B C? I know it is possible to join A - B and A - C. Is it possible to join them A - B - C? Thanks

LIRE Solr plugin updated to 4.10.2 and new demo ...

2014-12-17 Thread Mathias Lux
Hi all! After the initial release I finally got around to updating the content based image retrieval plugin LIRE Solr to the current version and it has been extended to support more CBIR features. https://bitbucket.org/dermotte/liresolr I also took the liberty of updating the web client and the

Spellchecker delivers far too few suggestions

2014-12-17 Thread Martin Dietze
I recently upgraded to SOLR 4.10.1 and after that set up the spell checker, which I use for returning suggestions after searches with few or no results. When the spellchecker is active, this request handler is used (most of which is taken from examples I found on the net): requestHandler

Re: Block join subqueries

2014-12-17 Thread Mikhail Khludnev
I don't see any problem with it. Could you try? On Wed, Dec 17, 2014 at 5:18 PM, marotosg marot...@gmail.com wrote: Yes, that's true. I mean join then. Is it possible to join three cores A B C? I know it is possible to join A - B and A - C. Is it possible to join them A - B - C? Thanks
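
A chained join of the kind asked about above might be expressed by nesting one {!join} inside another through parameter dereferencing. The sketch below only builds the request parameters. The core names (B, C), the field names (a_id, b_id, id) and the parameter names (b_query, c_query) are all hypothetical placeholders, and whether this performs acceptably on a given index is untested here.

```python
from urllib.parse import urlencode


def nested_join_params(leaf_query):
    """Build request parameters for a query sent to core A.

    All core and field names here are hypothetical placeholders.
    """
    return {
        # Outer join: documents in B point at A through a_id.
        "q": "{!join from=a_id to=id fromIndex=B v=$b_query}",
        # Inner join: documents in C point at B through b_id.
        "b_query": "{!join from=b_id to=id fromIndex=C v=$c_query}",
        # Plain query evaluated against core C.
        "c_query": leaf_query,
    }


params = nested_join_params("name:acme")
query_string = urlencode(params)
```

Keeping each level in its own parameter avoids having to escape one join expression inside another.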

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Erick Erickson
First, I'd look in your corpus for bnak. The problem with index-based suggestions is that if your index contains garbage, they're correctly spelled since they're in the index. TermsComponent is very useful for this. You can also loosen up the match criteria, and as I remember the collations
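
Checking whether a misspelled term such as bnak really is in the index could be done with a TermsComponent request along these lines. This is a sketch only: the base URL, the core name and the field name `text` are assumptions, and it presumes a /terms request handler is configured for the core.

```python
from urllib.parse import urlencode


def terms_url(base_url, field, prefix, limit=10):
    """Build a TermsComponent URL listing indexed terms with a given prefix.

    base_url and field are placeholders for a real core and field.
    """
    params = {
        "terms": "true",
        "terms.fl": field,       # field whose indexed terms are listed
        "terms.prefix": prefix,  # only return terms starting with this
        "terms.limit": limit,
        "wt": "json",
    }
    return base_url.rstrip("/") + "/terms?" + urlencode(params)


url = terms_url("http://localhost:8983/solr/collection1", "text", "bnak")
```

If the response lists bnak with a non-zero count, the index-based spellchecker treats it as correctly spelled, which would explain the missing suggestions.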

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Dan Davis
What about the frequency comparison - I haven't used the spellchecker heavily, but it seems that if bnak is in the database, but bank is much more frequent, then bank should be a suggestion anyway... On Wed, Dec 17, 2014 at 10:41 AM, Erick Erickson erickerick...@gmail.com wrote: First, I'd look

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Martin Dietze
On 17 December 2014 at 16:41, Erick Erickson erickerick...@gmail.com wrote: First, I'd look in your corpus for bnak. The problem with index-based suggestions is that if your index contains garbage, they're correctly spelled since they're in the index. TermsComponent is very useful for this.

Re: Spellchecker delivers far too few suggestions

2014-12-17 Thread Erick Erickson
That seems fine. What happens if your prefix is just b? Just to verify that you're getting something back. Although I usually just enable the terms component and specify the field and all that on the URL, what you're doing should work fine. This is seeming like a puzzler... Erick On

/export - Why need sort criteria (4.10.2)?

2014-12-17 Thread Damien Dykman
The /export request handler mandates a sort order. Is there a particular reason? It'd be nice to have the option to tell Solr: just export in the order you want, to limit any kind of overhead added by sorting. Or am I missing something? If exports were distributed, I can see the need for some

Re: Partial match autosuggest (match a word occurring anywhere in a field)

2014-12-17 Thread bbarani
Thanks for your response. I fixed this issue by using the <filter class="solr.PositionFilterFactory"/> <fieldType name="edgytext" class="solr.TextField" positionIncrementGap="100" omitNorms="true"> <analyzer type="index"> <filter class="solr.LowerCaseFilterFactory"/> <tokenizer

converting to parent/child block indexing

2014-12-17 Thread Michael Sokolov
Have other people tried migrating an index that was created without block (parent/child) indexing to one that *does* have it? Did you find that you got duplicate documents, i.e. multiple documents with the same uniqueField value? That's what I found, and I don't see how that's possible.

Re: converting to parent/child block indexing

2014-12-17 Thread Mikhail Khludnev
Hm.. really sorry about that. The current implementation is not really ideal, you know. When it handles an update, it tries to recognize whether it is a block or not, and in fact it uses the _root_ field to enforce uniqueness. There are a few consequences: - _root_ field spans whole block, not the parent one -

Search Query with editdistance - throws Null Pointer

2014-12-17 Thread Trilok Prithvi
When I run the following query (Solr 4.10.2) with edit-distance, I'm getting a null pointer exception: *host/solr/select?q=fld:(Event ID)&fl=strdist(eventid,fld_alphaonly,edit)* <response><lst name="error"><str name="trace">java.lang.NullPointerException </str><int name="code">500</int></lst></response> It works if I

Exception writing document to the index; possible analysis error.

2014-12-17 Thread bengates
Hello everyone, I'm losing my hair trying to add a simple document on a freshly installed Solr core. I'm running Solr 4.10.2 on jetty with the start.jar. I have the following (managed) schema and my core has been reloaded: Assume my core has 0 documents. Now I'm trying to add a document via

Re: converting to parent/child block indexing

2014-12-17 Thread Michael Sokolov
Thanks, Mikhail! That explains the situation pretty well. -Mike On 12/17/14 4:49 PM, Mikhail Khludnev wrote: Hm.. really sorry about that. The current implementation is not really ideal, you know. When it handles an update, it tries to recognize whether it is a block or not, and in fact it uses the _root_

A field-wide remove duplicate tokens filter

2014-12-17 Thread Varun Rajput
The org.apache.solr.analysis.RemoveDuplicatesTokenFilter, as per its description, Filters out any tokens which are at the same logical position in the tokenstream as a previous token with the same text. A very useful filter would be one which filters out duplicate tokens throughout the field,

Re: A field-wide remove duplicate tokens filter

2014-12-17 Thread Walter Underwood
Why is that useful? It breaks phrase search. If you want to ignore term frequency in ranking, change the Similarity class. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ On Dec 17, 2014, at 2:40 PM, Varun Rajput varun...@hotmail.com wrote: The

Re: Exception writing document to the index; possible analysis error.

2014-12-17 Thread Shawn Heisey
On 12/17/2014 3:00 PM, bengates wrote: I'm losing my hair trying to add a simple document on a freshly installed Solr core. I'm running Solr 4.10.2 on jetty with the start.jar. I have the following (managed) schema and my core has been reloaded: Assume my core has 0 documents. Now I'm

can't make sense of spellchecker results when using techproducts example

2014-12-17 Thread Chris Hostetter
Ok, so I've been working on updating the ref guide to account for the new way to run the examples in 5.0. The spell checking page... https://cwiki.apache.org/confluence/display/solr/Spell+Checking ...has some examples that loosely correlate to the techproducts example, but even if

Fwd: Issue with SOLR Distributed Search

2014-12-17 Thread rashi gandhi
Hi, This is regarding the issue that we are facing with SOLR distributed search. In our application, we are managing multiple shards at the SOLR server to manage the load. But there is a problem with the order of results that we are going to return to the client during the search. For Example: Currently

Trying to understand error

2014-12-17 Thread Joel Stobart
When I try to upload a solr item by the index: site/solr/#/collection1/documents When I upload a simple text file via the interface: Response: { error: { msg: java.lang.NoSuchFieldError: LFH_SIG, trace: java.lang.RuntimeException: java.lang.NoSuchFieldError: LFH_SIG\n\tat

Re: Trying to understand error

2014-12-17 Thread Gora Mohanty
On 18 December 2014 at 13:10, Joel Stobart j...@stobart.co.uk wrote: When I try to upload a solr item by the index: site/solr/#/collection1/documents When I upload a simple text file via the interface: Response: { error: { msg: java.lang.NoSuchFieldError: LFH_SIG, trace: