Re: Query behavior.

2016-03-14 Thread Modassar Ather
Thanks Jack for your response. The following jira bug for this issue is already present so I have not created a new one. https://issues.apache.org/jira/browse/SOLR-8812 Kindly help me understand that whether it is possible to achieve search on ORed terms as it was done in earlier Solr version. Is

accessing data in hdfs by solr in standalone mode

2016-03-14 Thread vidya
Hi can solr access the data from HDFS in standalone mode? If so, can u brief how it is done. Thnaks in advance -- View this message in context: http://lucene.472066.n3.nabble.com/accessing-data-in-hdfs-by-solr-in-standalone-mode-tp4263805.html Sent from the Solr - User mailing list

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Anil
Thanks Guys. i will try two level document routing in case of file_collection. i really don't understand why index size is high for file_collection as same file is available in main_collection. (each file indexed as one document with all commands in main collection and same file is indexed as

Re: solr & docker in production

2016-03-14 Thread Jay Potharaju
Upayavira, Thanks for the feedback. I plan to deploy solr on its own instance rather than on instance running multiple applications. Jay On Mon, Mar 14, 2016 at 3:19 PM, Upayavira wrote: > There is a default Docker image for Solr on the Docker Registry. I've > used it to

Re: Using group.ngroups during query search

2016-03-14 Thread Zheng Lin Edwin Yeo
Hi Toke, I have tried using the HTTP URL command, and it works! The searching speed for using JSON Facet is much faster than using the group.ngroups. The time taken for returning my 6 million groups has been reduced from 2 minutes to less than 2 seconds. I used the following URL to get the JSON

Re: New to Solr 5.5

2016-03-14 Thread Erick Erickson
OK, take Cassandra out of it for the time being and spend some time familiarizing yourself with Solr would be my advice ;) Yeah, the Solr documentation is a bit scattered, but your most complete and up to date reference is the Solr reference guide here: In particular, see the

Re: solr & docker in production

2016-03-14 Thread Upayavira
There is a default Docker image for Solr on the Docker Registry. I've used it to great effect in creating a custom Solr install. The main thing I'd say is that Docker generally encourages you to run many apps on the same host, whereas Solr benefits hugely from a host of its own - so don't be

Re: Solr Indexation

2016-03-14 Thread Erick Erickson
Please review: http://wiki.apache.org/solr/UsingMailingLists On Mon, Mar 14, 2016 at 3:06 PM, fabigol wrote: > Hi, > i have a problem with the indexation. > My indexation works with 3 million of record but it don't work with 10 > millions of records. > Someone may

Solr Indexation

2016-03-14 Thread fabigol
Hi, i have a problem with the indexation. My indexation works with 3 million of record but it don't work with 10 millions of records. Someone may help me. Does it exist configuration files where i may fix thatN -- View this message in context:

solr & docker in production

2016-03-14 Thread Jay Potharaju
Hi, I was wondering is running solr inside a docker container. Are there any recommendations for this? -- Thanks Jay

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Susheel Kumar
If you can find/know which fields (or combination) in your document divides / groups the data together would be the fields for custom routing. Solr supports up to two level. E.g. if you have field with say documentType or country or etc. would help. See the document routing at

Re: Load pre-built index to Solr

2016-03-14 Thread praneethvarma
As you guessed, I'm trying to build a non-HDFS collection from the index files in HDFS (constructed by MRIT). To give you the overall picture, below is my workflow (Sorry if it is too long)- I have collection-A that is serving an index and I'm replacing the collection with another one -

Re: New to Solr 5.5

2016-03-14 Thread Bhanu Prasad
Hi Erick, I am very new to this, I haven't uploaded any configsets. I need help to get existing cassandra keyspace into solr to do analysis. I am completely new to this technology so having trouble with finding right documentation on how to do it. Regards, Bhanu On Mon, Mar 14, 2016 at 3:11 PM,

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Erick Erickson
Usually I just let the compositeId do its thing and only go for custom routing when the default proves inadequate. Note: your 480M documents may very well be too many for three shards! You really have to test Erick On Mon, Mar 14, 2016 at 10:04 AM, Anil wrote: > Hi

RE: Solr sort preferences number vs space vs character

2016-03-14 Thread vkrishna
Yes Andy, I am using field configured with KeywordTokenizerFactory. Thanks, Vamsi. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-sort-preferences-number-vs-space-vs-character-tp4263527p4263728.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: New to Solr 5.5

2016-03-14 Thread Erick Erickson
What configsets have you uploaded to Solr? The canned example does this for you. The configurations must reside in Zookeeper, NOT on the local disk. I think that's probably what you're seeing... Best, Erick On Mon, Mar 14, 2016 at 11:33 AM, Bhanu Prasad wrote: > I was

Re: New to Solr 5.5

2016-03-14 Thread Bhanu Prasad
I was able to create a core using create -c option, But this time with user as solr. It worked. How can I make sure that the solr user is running the webapplication command requests as well ? Any help ? [solr@solr bin]$ ./solr create -c cassie Copying configuration to new core instance

New to Solr 5.5

2016-03-14 Thread Bhanu Prasad
Hello, I installed a new solr instance in lab on Cent OS 7 # java -version java version "1.8.0_72" Java(TM) SE Runtime Environment (build 1.8.0_72-b15) Java HotSpot(TM) 64-Bit Server VM (build 25.72-b15, mixed mode) #wget http://apache.mirror.gtcomm.net/lucene/solr/5.5.0/solr-5.5.0.tgz #tar

Re: return and highlight the most relevant child with BlockJoinQuery

2016-03-14 Thread Mikhail Khludnev
Michae, Regarding the former, it's not a feature of [child] result transformer, it might be separately requested, but I prefer to provide via generic SOLR-8202. Regarding highlighting, I can't comment, I only saw that there is some highlighting case for {!parent} queries. Sorry. On Mon, Mar 14,

RE: Solr sort preferences number vs space vs character

2016-03-14 Thread Andrew Chillrud
Are you sorting against an untokenized field (either defined using the 'string' fieldType or a fieldType that is configured with KeywordTokenizerFactory)? Solr will let you sort against a tokenized field. Not sure what happens internally when you do this, but the results will not be what you

Re: Solr sort preferences number vs space vs character

2016-03-14 Thread vkrishna
Shawn, I think you did saw my required result order in previous update(which is different from what I asked first )space > number > character, sorry for confusion. Thanks, Krishna. On Mon, 3/14/16, Shawn Heisey-2 [via Lucene]

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Anil
Hi Erick, In b/w, Do you recommend any effective shard distribution method ? Regards, Anil On 14 March 2016 at 22:30, Erick Erickson wrote: > Try shards.info=true, but pinging the shard directly is the most certain. > > > Best, > Erick > > On Mon, Mar 14, 2016 at 9:48

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Anil
thanks Eric. i will try that. Some how i am not able to run a query on the shard directly because of kerberos. i even tried curl --negotiate. Regards, Anil On 14 March 2016 at 22:30, Erick Erickson wrote: > Try shards.info=true, but pinging the shard directly is the

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Erick Erickson
Try shards.info=true, but pinging the shard directly is the most certain. Best, Erick On Mon, Mar 14, 2016 at 9:48 AM, Anil wrote: > HI Erik, > > we have used document routing to balance the shards load and for > expand/collapse. it is mainly used for main_collection which

Re: Solr sort preferences number vs space vs character

2016-03-14 Thread Shawn Heisey
On 3/14/2016 10:28 AM, vkrishna wrote: > I completely forgot to mention that this kind of sorting is working fine in > 1.4 version now we are upgrading to 5.4. I know solr made many changes > between, because it's been years. Do you know when and in which version they > made changes for

Re: Not able to do DataImport from MSSQL server to solr server

2016-03-14 Thread Shawn Heisey
On 3/14/2016 2:31 AM, Adi@GTN wrote: > Am using the below config details in order to connect the database, and pull > the query but in the connection details it is mentioned as > authenticateMethod=ntlm, am bit confused here am not sure what is this > authentication method, now am not able to

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Anil
HI Erik, we have used document routing to balance the shards load and for expand/collapse. it is mainly used for main_collection which holds one to many relationship records. In file_collection, it is only for load distribution. 25GB for entire solr service. each machine will act as shard for

ant generate-maven-artifacts

2016-03-14 Thread Keith L
Hi all, Hoping someone else uses the maven capabilities and can help out here. Solr: 4.10.4 Ant-Task: ant generate-maven-artifacts Problem: When trying to publish to an internal artifactory using our SNAPSHOTs, where our user has update/delete permissions, everything builds ok. When trying to

RE: Solr sort preferences number vs space vs character

2016-03-14 Thread vkrishna
Hey Shawn, I completely forgot to mention that this kind of sorting is working fine in 1.4 version now we are upgrading to 5.4. I know solr made many changes between, because it's been years. Do you know when and in which version they made changes for sorting. Thanks, Krishna.

Re: Avoid Duplication of record in searching

2016-03-14 Thread Jack Krupansky
Are you using DSE Search or some custom integration of Solr and Cassandra? Generally, changes in Solr are only visible after a commit operation is performed, either an explicit commit or a time-based auto-commit. Recent DSE Search also has a real-time search feature that does not require commit -

Re: Avoid Duplication of record in searching

2016-03-14 Thread Binoy Dalal
Have you tried using de-duplication? https://cwiki.apache.org/confluence/display/solr/De-Duplication On Mon, Mar 14, 2016 at 9:48 PM wrote: > HI, > I am having SOLR Search on Cassandra Table, when I do some updation in > the Cassandra Table to which the SOLR is

Re: query against two fields

2016-03-14 Thread Erick Erickson
Ah, that's _Lucene_, support for Solr wasn't added until Solr 4.8.. sorry about that. On Mon, Mar 14, 2016 at 9:06 AM, Vis Sw wrote: > Thanks Erick... > > Strange... I am getting Unknown query parser 'complexphrase'... > > SOLR ver is 4.7.0 and I can see >

Avoid Duplication of record in searching

2016-03-14 Thread rajeshkumar . s
HI, I am having SOLR Search on Cassandra Table, when I do some updation in the Cassandra Table to which the SOLR is being configured he Updated record gets Duplicated in SOLR Search.But when we do RE-Index of the SOLR there we are getting unique records. We can do re-index every time via

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Anil
Hi Shusheel, we have enabled kerberos. so solr is accessed using Hue only. i will check if I can get the similar information using Hue. Thanks. Regards, Anil On 14 March 2016 at 19:34, Susheel Kumar wrote: > Hello Anil, > > Can you go to Solr Admin Panel -> Dashboard

Re: query against two fields

2016-03-14 Thread Vis Sw
Thanks Erick... Strange... I am getting Unknown query parser 'complexphrase'... SOLR ver is 4.7.0 and I can see org.apache.lucene.queryparser.complexPhrase.ComplexPhraseQueryParser in lucene-queryparser-4.7.0.jar http://localhost:8081/solr/collection1/select?q={!complexphrase

RE: Solr sort preferences number vs space vs character

2016-03-14 Thread Andrew Chillrud
No experience with this personally, but it seems like you are describing https://cwiki.apache.org/confluence/display/solr/Language+Analysis#LanguageAnalysis-UnicodeCollation - Andy - -Original Message- From: Shawn Heisey [mailto:apa...@elyograg.org] Sent: Monday, March 14, 2016 10:51

Single zookeeper ensemble for multiple solr instances/clusters

2016-03-14 Thread solr2020
Hi , I have multiple Solr clusters running(5 clusters), i would like to make use of single zookeeper ensemble contains (3 servers) . can anyone tell me what is the pros and cons of this approach? Thanks, Gomathi -- View this message in context:

Solr in Linux Platform

2016-03-14 Thread Adel Mohamed Khalifa
Hello everybody, I build a website (Java EE ) and want to search in some json files so I installed the solr server in an Ubuntu server and create a new core then indexing json files and the web searched correctly when I moved my code from windows to the server it stopped and cannot connect to

return and highlight the most relevant child with BlockJoinQuery

2016-03-14 Thread michael solomon
Hi, how can I *highlight* and *return* the most relevant child with BlockJoinQuery. for this: > {!parent which="is_parent:*" score=max}(title:(terms) I expect to get: . . . docs:[ { doc parent _childDocuments_:{the most relevant child} } { doc parent2 _childDocuments_:{the most

Not able to do DataImport from MSSQL server to solr server

2016-03-14 Thread Adi@GTN
Hi All, Am using the below config details in order to connect the database, and pull the query but in the connection details it is mentioned as authenticateMethod=ntlm, am bit confused here am not sure what is this authentication method, now am not able to connect the database so am not able to

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Erick Erickson
bq: The slowness is happening for file_collection. though it has 3 shards, documents are available in 2 shards. shard1 - 150M docs and shard2 has 330M docs , shard3 is empty. Well, this collection terribly balanced. Putting 330M docs on a single shard is pushing the limits, the only time I've

Re: Solr sort preferences number vs space vs character

2016-03-14 Thread Shawn Heisey
On 3/14/2016 12:05 AM, vkrishna wrote: > Hey Shawn, > > Is there any way to use ASCII? so I can get the result I want. I do not know whether Solr has any config facility to incorporate a custom Lucene sorting class. I tried to look at the Lucene code to see if I could figure out how/where the

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Susheel Kumar
For each of the solr machines/shards you have. Thanks. On Mon, Mar 14, 2016 at 10:04 AM, Susheel Kumar wrote: > Hello Anil, > > Can you go to Solr Admin Panel -> Dashboard and share all 4 memory > parameters under System / share the snapshot. ? > > Thanks, > Susheel > >

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Susheel Kumar
Hello Anil, Can you go to Solr Admin Panel -> Dashboard and share all 4 memory parameters under System / share the snapshot. ? Thanks, Susheel On Mon, Mar 14, 2016 at 5:36 AM, Anil wrote: > HI Toke and Jack, > > Please find the details below. > > * How large are your 3

Re: There is no jetty thread pool stats in solr JMX

2016-03-14 Thread forest_soup
I have read the articles below, but does not find the jetty.home/start.ini in solr/server folder and there is no etc/jetty-jmx.xml config file. http://www.eclipse.org/jetty/documentation/current/jmx-chapter.html http://wiki.apache.org/solr/SolrJmx -- View this message in context:

Need a group custom function(fieldcollapsing)

2016-03-14 Thread Abhishek Mishra
Hi all We are running on solr5.2.1 . Now the requirement come that we need the data on basis on some algo. The algorithm part we need to put on result obtained from query. So best we can do is using group.field,group.main,group.func. In group.func we need to use custom function which will run the

There is no jetty thread pool stats in solr JMX

2016-03-14 Thread forest_soup
I'm using solr v8.5.1 in SolrCloud mode and enabled in solrconfig.xml, and added those variables in solr.in.sh to enable jmx. -Dcom.sun.management.jmxremote -Dcom.sun.management.jmxremote.local.only=false -Dcom.sun.management.jmxremote.ssl=false

Re: [Solr Suggester Component] Unique suggestions

2016-03-14 Thread Alessandro Benedetti
Hi Roland, I agree with you, I updated the Jira issue and I will spend some more time on that. I think the best solution should be Solr core side, I will sort out some proposal, that we can discuss ( probably I will use the related spin off Jira issue, linked in :

Re: Solr Queries are very slow - Suggestions needed

2016-03-14 Thread Anil
HI Toke and Jack, Please find the details below. * How large are your 3 shards in bytes? (total index across replicas) -- *146G. i am using CDH (cloudera), not sure how to check the index size of each collection on each shard* * What storage system do you use (local SSD, local

Re: using data from external service in Solr: value source or auxiliary core?

2016-03-14 Thread Charlie Hull
On 11/03/2016 17:36, Jitka wrote: Hello. Our company uses Solr-4.10 in a distributed environment. We are considering how best to customize results based on user preferences, information about which is obtained from an external service. At present the preferences can be expressed as filters,

Re: Stopping Solr JVM on OOM

2016-03-14 Thread Binoy Dalal
I set the heap to 16 mb and tried to index about 350k records using a DIH. This did throw an OOM for that particular thread in the console, but the oom script wasn't called and solr was running properly. Moreover, solr also managed to index all 350k records. Is this the correct way to o about

Re: Re: Solr sort preferences number vs space vs character

2016-03-14 Thread Toke Eskildsen
On Sun, 2016-03-13 at 23:05 -0700, vkrishna wrote: > Is there any way to use ASCII? so I can get the result I want. If you use a plain StrField and stick to ASCII in your input, the sorting should be ASCII: https://en.wikipedia.org/wiki/ASCII#Unicode - Toke Eskildsen, State and University

Re: How to sort docs basing on nested docs' fields

2016-03-14 Thread Jhon Smith
Thanks, the following qyery worked /bjqfacet?q={!parent which=doc_type_s:parentDocument score=min}+doc_type_s:middleDocument^=0 +{!parent which=doc_type_s:middleDocument score=min v=$priceRef}=doc_type_s:childDocument^=0 AND {!func}price_d=*,[docid],score=score

Re: Re: Solr sort preferences number vs space vs character

2016-03-14 Thread vkrishna
Hey Shawn, Is there any way to use ASCII? so I can get the result I want. Thanks, Vamsi. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-sort-preferences-number-vs-space-vs-character-tp4263527p4263551.html Sent from the Solr - User mailing list archive at