Re: Need help on Solr authorization
My guess is that you're using a self-signed cert and the certificate path can't be verified. Either that or your cert was signed by a CA that your JVM doesn't recognize. There's a good article about diagnosing SSL problems here: https://confluence.atlassian.com/kb/unable-to-connect-to-ssl-services-due-to-pkix-path-building-failed-779355358.html Good luck! -Scott On Fri, Jan 18, 2019 at 6:03 AM sathish kumar wrote: > Hi, > Anyone got a chance to have a look at the issue i had posted? > Please throw some inputs. > > -Sathish > > On Fri, 11 Jan 2019, 8:10 pm sathish kumar, > wrote: > > > Hi, > > > > We have a two node Solr setup(version is 7.2.1) with embedded zookeeper > > running in Solr Server 1. > > > > We have recently enabled SSL and also enabled basic authentication and > > RuleBasedAuthorizationPlugin. > > > > As part of testing, created new user with admin role and assigned the > > permissions "collection-admin-read" & “read” to this role. > > > > When I try to query a data for any collection name, the system is unable > > to talk with shards of other server. > > > > I am getting the following error in both command line and Solr admin > > browser. > > > > Can someone help me to identify what configurations I am missing? Let me > > know if you need any more info. > > > > > > > > Followed this url for SSL setup: > > https://lucene.apache.org/solr/guide/7_2/enabling-ssl.html > > > > Command used: curl --cacert solr-ssl.cacert.pem --user solr:SolrRocks > > https://solr-node-1:8080/solr//select?q=*:* > > > > > > Error: > > > > { > > > > "error":{ > > > > "metadata":[ > > > > "error-class","org.apache.solr.common.SolrException", > > > > > > > "root-error-class","sun.security.provider.certpath.SunCertPathBuilderException"], > > > > "msg":"Error trying to proxy request for url: > > https://solr-node-2:8080/solr/ba_test/select";, > > > > "trace":"org.apache.solr.common.SolrException: Error trying to proxy > > request for url: https://solr-node-2:8080/solr/ba_test/select\n\tat > > > org.apache.solr.servlet.HttpSolrCall.remoteQuery(HttpSolrCall.java:646)\n\tat > > org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:500)\n\tat > > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:382)\n\tat > > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:326)\n\tat > > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1751)\n\tat > > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:582)\n\tat > > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)\n\tat > > > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:548)\n\tat > > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:226)\n\tat > > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1180)\n\tat > > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:512)\n\tat > > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)\n\tat > > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1112)\n\tat > > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)\n\tat > > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:213)\n\tat > > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:119)\n\tat > > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:134)\n\tat > > > org.eclipse.jetty.rewrite.handler.RewriteHandler.handle(RewriteHandler.java:335)\n\tat > > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:134)\n\tat > > org.eclipse.jetty.server.Server.handle(Server.java:534)\n\tat > > org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:320)\n\tat > > > org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:251)\n\tat > > org.eclipse.jetty.io > .AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:283)\n\tat > > org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:108)\n\tat > > > org.eclipse.jetty.io.ssl.SslConnection.onFillable(SslConnection.java:251)\n\tat > > org.eclipse.jetty.io > .AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:283)\n\tat > > org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:108)\n\tat > > org.eclipse.jetty.io > .SelectChannelEndPoint$2.run(SelectChannelEndPoint.java:93)\n\tat > > > org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.executeProduceConsume(ExecuteProduceConsume.java:303)\n\tat > > > org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.produceConsume(ExecuteProduceConsume.java:148)\n\tat > > > org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.run(ExecuteProduceConsume.java:136)\n\tat > > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:671)\n
Re: Need help on Solr authorization
Hi, Anyone got a chance to have a look at the issue i had posted? Please throw some inputs. -Sathish On Fri, 11 Jan 2019, 8:10 pm sathish kumar, wrote: > Hi, > > We have a two node Solr setup(version is 7.2.1) with embedded zookeeper > running in Solr Server 1. > > We have recently enabled SSL and also enabled basic authentication and > RuleBasedAuthorizationPlugin. > > As part of testing, created new user with admin role and assigned the > permissions "collection-admin-read" & “read” to this role. > > When I try to query a data for any collection name, the system is unable > to talk with shards of other server. > > I am getting the following error in both command line and Solr admin > browser. > > Can someone help me to identify what configurations I am missing? Let me > know if you need any more info. > > > > Followed this url for SSL setup: > https://lucene.apache.org/solr/guide/7_2/enabling-ssl.html > > Command used: curl --cacert solr-ssl.cacert.pem --user solr:SolrRocks > https://solr-node-1:8080/solr//select?q=*:* > > > Error: > > { > > "error":{ > > "metadata":[ > > "error-class","org.apache.solr.common.SolrException", > > > > "root-error-class","sun.security.provider.certpath.SunCertPathBuilderException"], > > "msg":"Error trying to proxy request for url: > https://solr-node-2:8080/solr/ba_test/select";, > > "trace":"org.apache.solr.common.SolrException: Error trying to proxy > request for url: https://solr-node-2:8080/solr/ba_test/select\n\tat > org.apache.solr.servlet.HttpSolrCall.remoteQuery(HttpSolrCall.java:646)\n\tat > org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:500)\n\tat > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:382)\n\tat > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:326)\n\tat > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1751)\n\tat > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:582)\n\tat > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)\n\tat > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:548)\n\tat > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:226)\n\tat > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1180)\n\tat > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:512)\n\tat > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)\n\tat > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1112)\n\tat > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)\n\tat > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:213)\n\tat > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:119)\n\tat > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:134)\n\tat > org.eclipse.jetty.rewrite.handler.RewriteHandler.handle(RewriteHandler.java:335)\n\tat > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:134)\n\tat > org.eclipse.jetty.server.Server.handle(Server.java:534)\n\tat > org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:320)\n\tat > org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:251)\n\tat > org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:283)\n\tat > org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:108)\n\tat > org.eclipse.jetty.io.ssl.SslConnection.onFillable(SslConnection.java:251)\n\tat > org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:283)\n\tat > org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:108)\n\tat > org.eclipse.jetty.io.SelectChannelEndPoint$2.run(SelectChannelEndPoint.java:93)\n\tat > org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.executeProduceConsume(ExecuteProduceConsume.java:303)\n\tat > org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.produceConsume(ExecuteProduceConsume.java:148)\n\tat > org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.run(ExecuteProduceConsume.java:136)\n\tat > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:671)\n\tat > org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(QueuedThreadPool.java:589)\n\tat > java.lang.Thread.run(Thread.java:748)\nCaused by: > javax.net.ssl.SSLHandshakeException: > sun.security.validator.ValidatorException: PKIX path building failed: > sun.security.provider.certpath.SunCertPathBuilderException: unable to find > valid certification path to requested target\n\tat > sun.security.ssl.Alerts.getSSLException(Alerts.java:192)\n\tat > sun.security.ssl.SSLSocketImpl.fatal(SSLSocketImpl.java:1959)\n\tat > sun.security.ssl.Handshaker.fatalSE(Handshaker.java:302)\n\tat > sun.security.ssl.Handshaker.fatalSE(Handshaker.java:296)\n\tat
Need help on Solr authorization
Hi, We have a two node Solr setup(version is 7.2.1) with embedded zookeeper running in Solr Server 1. We have recently enabled SSL and also enabled basic authentication and RuleBasedAuthorizationPlugin. As part of testing, created new user with admin role and assigned the permissions "collection-admin-read" & “read” to this role. When I try to query a data for any collection name, the system is unable to talk with shards of other server. I am getting the following error in both command line and Solr admin browser. Can someone help me to identify what configurations I am missing? Let me know if you need any more info. Followed this url for SSL setup: https://lucene.apache.org/solr/guide/7_2/enabling-ssl.html Command used: curl --cacert solr-ssl.cacert.pem --user solr:SolrRocks https://solr-node-1:8080/solr//select?q=*:* Error: { "error":{ "metadata":[ "error-class","org.apache.solr.common.SolrException", "root-error-class","sun.security.provider.certpath.SunCertPathBuilderException"], "msg":"Error trying to proxy request for url: https://solr-node-2:8080/solr/ba_test/select";, "trace":"org.apache.solr.common.SolrException: Error trying to proxy request for url: https://solr-node-2:8080/solr/ba_test/select\n\tat org.apache.solr.servlet.HttpSolrCall.remoteQuery(HttpSolrCall.java:646)\n\tat org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:500)\n\tat org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:382)\n\tat org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:326)\n\tat org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1751)\n\tat org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:582)\n\tat org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)\n\tat org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:548)\n\tat org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:226)\n\tat org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1180)\n\tat org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:512)\n\tat org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)\n\tat org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1112)\n\tat org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)\n\tat org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:213)\n\tat org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:119)\n\tat org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:134)\n\tat org.eclipse.jetty.rewrite.handler.RewriteHandler.handle(RewriteHandler.java:335)\n\tat org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:134)\n\tat org.eclipse.jetty.server.Server.handle(Server.java:534)\n\tat org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:320)\n\tat org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:251)\n\tat org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:283)\n\tat org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:108)\n\tat org.eclipse.jetty.io.ssl.SslConnection.onFillable(SslConnection.java:251)\n\tat org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:283)\n\tat org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:108)\n\tat org.eclipse.jetty.io.SelectChannelEndPoint$2.run(SelectChannelEndPoint.java:93)\n\tat org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.executeProduceConsume(ExecuteProduceConsume.java:303)\n\tat org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.produceConsume(ExecuteProduceConsume.java:148)\n\tat org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.run(ExecuteProduceConsume.java:136)\n\tat org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:671)\n\tat org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(QueuedThreadPool.java:589)\n\tat java.lang.Thread.run(Thread.java:748)\nCaused by: javax.net.ssl.SSLHandshakeException: sun.security.validator.ValidatorException: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target\n\tat sun.security.ssl.Alerts.getSSLException(Alerts.java:192)\n\tat sun.security.ssl.SSLSocketImpl.fatal(SSLSocketImpl.java:1959)\n\tat sun.security.ssl.Handshaker.fatalSE(Handshaker.java:302)\n\tat sun.security.ssl.Handshaker.fatalSE(Handshaker.java:296)\n\tat sun.security.ssl.ClientHandshaker.serverCertificate(ClientHandshaker.java:1514)\n\tat sun.security.ssl.ClientHandshaker.processMessage(ClientHandshaker.java:216)\n\tat sun.security.ssl.Handshaker.processLoop(Handshaker.java:1026)\n\tat sun.security.ssl.Handshaker.process_record(Handshaker.java:961)\n\tat sun.securi
Re: Need Help on Solr Client connection Pooling
On 8/30/2018 2:13 AM, Gembali Satish kumar wrote: *SolrClient client = new HttpSolrClient.Builder(* * SolrUtil.getSolrURL(tsConfigUtil.getClusterAdvertisedAddress(), aInCollectionName)).build();* after my job search done, I am closing my client. *client.close();* but from UI getting more requests to search the data I think to create the *client object *on every request is costly is there any way to pool the *SolrClient objects?*? If there kindly share me the reference Yes, creating the client on every request is costly. Supplementing what Shalin told you: Exactly which version of SolrJ you're running can affect how many threads can use the object at the same time, unless you explicitly build it to handle more. Newer versions set it up with lots of thread capability, but older versions just create the internal HttpClient object with defaults. By default, HttpClient only allows two threads. What version of SolrJ are you using? Thanks, Shawn
Re: Need Help on Solr Client connection Pooling
You should create a single HttpSolrClient and re-use for all requests. It is thread safe and creates an Http connection pool internally (well Apache HttpClient does). On Thu, Aug 30, 2018 at 2:28 PM Gembali Satish kumar < gembalisatishku...@gmail.com> wrote: > Hi Team, > > Need some help on Client connection object pooling > I am using SolrJ API to connect the Solr. > > This below snippet I used to create the client object. > > *SolrClient client = new HttpSolrClient.Builder(* > * SolrUtil.getSolrURL(tsConfigUtil.getClusterAdvertisedAddress(), > aInCollectionName)).build();* > > after my job search done, I am closing my client. > *client.close();* > > but from UI getting more requests to search the data > I think to create the *client object *on every request is costly is there > any way to pool the *SolrClient objects?*? > If there kindly share me the reference > > Thanks and Regards, > Satish > -- Regards, Shalin Shekhar Mangar.
Need Help on Solr Client connection Pooling
Hi Team, Need some help on Client connection object pooling I am using SolrJ API to connect the Solr. This below snippet I used to create the client object. *SolrClient client = new HttpSolrClient.Builder(* * SolrUtil.getSolrURL(tsConfigUtil.getClusterAdvertisedAddress(), aInCollectionName)).build();* after my job search done, I am closing my client. *client.close();* but from UI getting more requests to search the data I think to create the *client object *on every request is costly is there any way to pool the *SolrClient objects?*? If there kindly share me the reference Thanks and Regards, Satish
Need Help on solr for Email Search
HI Team, We are using solr for Quick retrieval of search result. Recently we are encountered with a problem while searching for Email in solr. search is performing well when i enter simple text ,but When i enter any special characters (Like @ ,(comma)) it is not returning any results. i have attached the schema file once please verify and let us know how to perform search on solr for email address . please get back to me as soon as possible. -- *Thanks & Regards,* *Santosh Palivela.*
need help on solr schema design
HI, i am building small search application with solr. i need help in designing the solr schema. data structure is like one to many. when user search text matches with parent document fields, system should not display any child fields information. when user search text matches with child document fields, system should display parent and child fields information. Any help would be appreciated. Thanks. Regards, Anil
Re: Need help on Solr
On Jun 20, 2013, at 18:26 , Abhishek Bansal wrote: > Yeah I know, out of the box there is one id field. I removed it from > schema.xml > > I have also added below code to automatically generate an ID. > > multiValued="false"/> > > Is that a valid configuration for an id field (assuming that the field "id" is also defined as "uniqueKey")?∆
Re: Need help on Solr
As I am running Solr on windows + tomcat I am using below command to index pdf. I hope this command is not faulty. Please check java -jar -Durl=" http://localhost:8080/solr-4.3.0/update/extract?literal.id=1&commit=true"; post.jar sample.pdf with regards, Abhishek Bansal On 20 June 2013 21:56, Abhishek Bansal wrote: > Yeah I know, out of the box there is one id field. I removed it from > schema.xml > > I have also added below code to automatically generate an ID. > > multiValued="false"/> > > > > with regards, > Abhishek Bansal > > > On 20 June 2013 21:49, Shreejay wrote: > >> org.apache.solr.common.SolrException: [schema.xml] Duplicate field >> definition for 'id' >> >> You might have defined an id field in the schema file. The out of box >> schema file already contains an id field . >> >> -- >> Shreejay >> >> >> On Thursday, June 20, 2013 at 9:16, Abhishek Bansal wrote: >> >> > Hello, >> > >> > I am trying to index a pdf file on Solr. I am running icurrently Solr on >> > Apache Tomcat 6. >> > >> > When I try to index it I get below error. Please help. I was not able to >> > rectify this error with help of internet. >> > >> > >> > >> > >> > ERROR - 2013-06-20 20:43:41.549; org.apache.solr.core.CoreContainer; >> Unable >> > to create core: collection1 >> > org.apache.solr.common.SolrException: [schema.xml] Duplicate field >> > definition for 'id' >> > >> [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, >> > required=true}]]] and >> > >> [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, >> > required=true}]]] >> > at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:502) >> > at org.apache.solr.schema.IndexSchema.(IndexSchema.java:176) >> > at >> > >> org.apache.solr.schema.ClassicIndexSchemaFactory.create(ClassicIndexSchemaFactory.java:62) >> > at >> > >> org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:36) >> > at >> > >> org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:946) >> > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:984) >> > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:597) >> > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:592) >> > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) >> > at java.util.concurrent.FutureTask.run(FutureTask.java:138) >> > at >> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439) >> > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) >> > at java.util.concurrent.FutureTask.run(FutureTask.java:138) >> > at >> > >> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) >> > at >> > >> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) >> > at java.lang.Thread.run(Thread.java:662) >> > ERROR - 2013-06-20 20:43:41.551; org.apache.solr.common.SolrException; >> > null:org.apache.solr.common.SolrException: Unable to create core: >> > collection1 >> > at >> > >> org.apache.solr.core.CoreContainer.recordAndThrow(CoreContainer.java:1450) >> > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:993) >> > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:597) >> > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:592) >> > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) >> > at java.util.concurrent.FutureTask.run(FutureTask.java:138) >> > at >> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439) >> > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) >> > at java.util.concurrent.FutureTask.run(FutureTask.java:138) >> > at >> > >> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) >> > at >> > >> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) >> > at java.lang.Thread.run(Thread.java:662) >> > Caused by: org.apache.solr.common.SolrException: [schema.xml] Duplicate >> > field definition for 'id' >> > >> [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, >> > required=true}]]] and >> > >> [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, >> > required=true}]]] >> > at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:502) >> > at org.apache.solr.schema.IndexSchema.(IndexSchema.java:176) >> > at >> > >> org.apache.solr.schema.ClassicIndexSchemaFactory.create(ClassicIndexSchemaFactory.java:62) >> > at >> > >> org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:36) >> > at >> > >> org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:946) >> > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:984) >> > ... 10 more >> > >> > INFO - 2013-06-20 20:43:41.553; >> > org.apache.solr.servle
Re: Need help on Solr
Yeah I know, out of the box there is one id field. I removed it from schema.xml I have also added below code to automatically generate an ID. with regards, Abhishek Bansal On 20 June 2013 21:49, Shreejay wrote: > org.apache.solr.common.SolrException: [schema.xml] Duplicate field > definition for 'id' > > You might have defined an id field in the schema file. The out of box > schema file already contains an id field . > > -- > Shreejay > > > On Thursday, June 20, 2013 at 9:16, Abhishek Bansal wrote: > > > Hello, > > > > I am trying to index a pdf file on Solr. I am running icurrently Solr on > > Apache Tomcat 6. > > > > When I try to index it I get below error. Please help. I was not able to > > rectify this error with help of internet. > > > > > > > > > > ERROR - 2013-06-20 20:43:41.549; org.apache.solr.core.CoreContainer; > Unable > > to create core: collection1 > > org.apache.solr.common.SolrException: [schema.xml] Duplicate field > > definition for 'id' > > > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > > required=true}]]] and > > > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > > required=true}]]] > > at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:502) > > at org.apache.solr.schema.IndexSchema.(IndexSchema.java:176) > > at > > > org.apache.solr.schema.ClassicIndexSchemaFactory.create(ClassicIndexSchemaFactory.java:62) > > at > > > org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:36) > > at > > > org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:946) > > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:984) > > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:597) > > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:592) > > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > > at > java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439) > > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > > at > > > java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) > > at > > > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) > > at java.lang.Thread.run(Thread.java:662) > > ERROR - 2013-06-20 20:43:41.551; org.apache.solr.common.SolrException; > > null:org.apache.solr.common.SolrException: Unable to create core: > > collection1 > > at > > > org.apache.solr.core.CoreContainer.recordAndThrow(CoreContainer.java:1450) > > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:993) > > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:597) > > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:592) > > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > > at > java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439) > > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > > at > > > java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) > > at > > > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) > > at java.lang.Thread.run(Thread.java:662) > > Caused by: org.apache.solr.common.SolrException: [schema.xml] Duplicate > > field definition for 'id' > > > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > > required=true}]]] and > > > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > > required=true}]]] > > at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:502) > > at org.apache.solr.schema.IndexSchema.(IndexSchema.java:176) > > at > > > org.apache.solr.schema.ClassicIndexSchemaFactory.create(ClassicIndexSchemaFactory.java:62) > > at > > > org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:36) > > at > > > org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:946) > > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:984) > > ... 10 more > > > > INFO - 2013-06-20 20:43:41.553; > > org.apache.solr.servlet.SolrDispatchFilter; user.dir=C:\Program > > Files\Apache Software Foundation\Tomcat 6.0 > > INFO - 2013-06-20 20:43:41.553; > > org.apache.solr.servlet.SolrDispatchFilter; SolrDispatchFilter.init() > done > > ERROR - 2013-06-20 20:43:41.820; org.apache.solr.common.SolrException; > > null:org.apache.solr.common.SolrException: SolrCore 'collection1' is not > > available due to init failure: [schema.xml] Duplicate field definition > for > > 'id' > > > [[[id{type=string,pro
Re: Need help on Solr
org.apache.solr.common.SolrException: [schema.xml] Duplicate field definition for 'id' You might have defined an id field in the schema file. The out of box schema file already contains an id field . -- Shreejay On Thursday, June 20, 2013 at 9:16, Abhishek Bansal wrote: > Hello, > > I am trying to index a pdf file on Solr. I am running icurrently Solr on > Apache Tomcat 6. > > When I try to index it I get below error. Please help. I was not able to > rectify this error with help of internet. > > > > > ERROR - 2013-06-20 20:43:41.549; org.apache.solr.core.CoreContainer; Unable > to create core: collection1 > org.apache.solr.common.SolrException: [schema.xml] Duplicate field > definition for 'id' > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > required=true}]]] and > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > required=true}]]] > at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:502) > at org.apache.solr.schema.IndexSchema.(IndexSchema.java:176) > at > org.apache.solr.schema.ClassicIndexSchemaFactory.create(ClassicIndexSchemaFactory.java:62) > at > org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:36) > at > org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:946) > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:984) > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:597) > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:592) > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439) > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > at > java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) > at java.lang.Thread.run(Thread.java:662) > ERROR - 2013-06-20 20:43:41.551; org.apache.solr.common.SolrException; > null:org.apache.solr.common.SolrException: Unable to create core: > collection1 > at > org.apache.solr.core.CoreContainer.recordAndThrow(CoreContainer.java:1450) > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:993) > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:597) > at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:592) > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439) > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > at > java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) > at java.lang.Thread.run(Thread.java:662) > Caused by: org.apache.solr.common.SolrException: [schema.xml] Duplicate > field definition for 'id' > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > required=true}]]] and > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > required=true}]]] > at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:502) > at org.apache.solr.schema.IndexSchema.(IndexSchema.java:176) > at > org.apache.solr.schema.ClassicIndexSchemaFactory.create(ClassicIndexSchemaFactory.java:62) > at > org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:36) > at > org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:946) > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:984) > ... 10 more > > INFO - 2013-06-20 20:43:41.553; > org.apache.solr.servlet.SolrDispatchFilter; user.dir=C:\Program > Files\Apache Software Foundation\Tomcat 6.0 > INFO - 2013-06-20 20:43:41.553; > org.apache.solr.servlet.SolrDispatchFilter; SolrDispatchFilter.init() done > ERROR - 2013-06-20 20:43:41.820; org.apache.solr.common.SolrException; > null:org.apache.solr.common.SolrException: SolrCore 'collection1' is not > available due to init failure: [schema.xml] Duplicate field definition for > 'id' > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > required=true}]]] and > [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, > required=true}]]] > at org.apache.solr.core.CoreContainer.getCore(CoreContainer.java:1212) > at > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:248) > at > org.apache.solr.servlet.SolrDispatchFilter.d
Need help on Solr
Hello, I am trying to index a pdf file on Solr. I am running icurrently Solr on Apache Tomcat 6. When I try to index it I get below error. Please help. I was not able to rectify this error with help of internet. ERROR - 2013-06-20 20:43:41.549; org.apache.solr.core.CoreContainer; Unable to create core: collection1 org.apache.solr.common.SolrException: [schema.xml] Duplicate field definition for 'id' [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, required=true}]]] and [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, required=true}]]] at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:502) at org.apache.solr.schema.IndexSchema.(IndexSchema.java:176) at org.apache.solr.schema.ClassicIndexSchemaFactory.create(ClassicIndexSchemaFactory.java:62) at org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:36) at org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:946) at org.apache.solr.core.CoreContainer.create(CoreContainer.java:984) at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:597) at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:592) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) at java.lang.Thread.run(Thread.java:662) ERROR - 2013-06-20 20:43:41.551; org.apache.solr.common.SolrException; null:org.apache.solr.common.SolrException: Unable to create core: collection1 at org.apache.solr.core.CoreContainer.recordAndThrow(CoreContainer.java:1450) at org.apache.solr.core.CoreContainer.create(CoreContainer.java:993) at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:597) at org.apache.solr.core.CoreContainer$2.call(CoreContainer.java:592) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) at java.lang.Thread.run(Thread.java:662) Caused by: org.apache.solr.common.SolrException: [schema.xml] Duplicate field definition for 'id' [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, required=true}]]] and [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, required=true}]]] at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:502) at org.apache.solr.schema.IndexSchema.(IndexSchema.java:176) at org.apache.solr.schema.ClassicIndexSchemaFactory.create(ClassicIndexSchemaFactory.java:62) at org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:36) at org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:946) at org.apache.solr.core.CoreContainer.create(CoreContainer.java:984) ... 10 more INFO - 2013-06-20 20:43:41.553; org.apache.solr.servlet.SolrDispatchFilter; user.dir=C:\Program Files\Apache Software Foundation\Tomcat 6.0 INFO - 2013-06-20 20:43:41.553; org.apache.solr.servlet.SolrDispatchFilter; SolrDispatchFilter.init() done ERROR - 2013-06-20 20:43:41.820; org.apache.solr.common.SolrException; null:org.apache.solr.common.SolrException: SolrCore 'collection1' is not available due to init failure: [schema.xml] Duplicate field definition for 'id' [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, required=true}]]] and [[[id{type=string,properties=indexed,stored,omitNorms,omitTermFreqAndPositions,sortMissingLast,required, required=true}]]] at org.apache.solr.core.CoreContainer.getCore(CoreContainer.java:1212) at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:248) at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:155) at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235) at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206) at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233) at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191) at org.apache.catalina.core.StandardH
Re: need help on solr search
Used mm parameter and it works! Right now preparing perf test. Please share if anybody has method to optimize dismax queries Thanks! Jeremy Otis Gospodnetic-5 wrote > Hi, > > Have a look at your solrconfig.xml and look for your default operator. > Also > look at the docs for the mm parameter on the Wiki. Let us know if that > does > it for you. -- View this message in context: http://lucene.472066.n3.nabble.com/need-help-on-solr-search-tp4017191p4018397.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: need help on solr search
Hi, Have a look at your solrconfig.xml and look for your default operator. Also look at the docs for the mm parameter on the Wiki. Let us know if that does it for you. Otis -- Performance Monitoring - http://sematext.com/spm On Nov 1, 2012 5:11 PM, "jchen2000" wrote: > Otis Gospodnetic-5 wrote > > You want "ordered term matching" (like in a phrase), but you cannot use > > AND > > because you do not want all query terms to be required. Correct? > > That's exactly right! actually none of the query term is required, but we > need to base similarity score on how many terms are matched. In addtion, > since we have unique prefixes like "a", "b", "c", we guarantee a1 would > never match anything in group "b" or "c", etc. > > > Otis Gospodnetic-5 wrote > > If so, would "a1 a2* a3 a4* b1 b2* c2 d1* d2"~someBigSlop work? > > This does not work, because a2* (just any term different from a2, not > wildcard), a4* etc do not appear in the doc. quoted proximity match seems > still to require all query terms to appear. > > Thanks, > Jeremy > > > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/need-help-on-solr-search-tp4017191p4017686.html > Sent from the Solr - User mailing list archive at Nabble.com. >
Re: need help on solr search
Otis Gospodnetic-5 wrote > You want "ordered term matching" (like in a phrase), but you cannot use > AND > because you do not want all query terms to be required. Correct? That's exactly right! actually none of the query term is required, but we need to base similarity score on how many terms are matched. In addtion, since we have unique prefixes like "a", "b", "c", we guarantee a1 would never match anything in group "b" or "c", etc. Otis Gospodnetic-5 wrote > If so, would "a1 a2* a3 a4* b1 b2* c2 d1* d2"~someBigSlop work? This does not work, because a2* (just any term different from a2, not wildcard), a4* etc do not appear in the doc. quoted proximity match seems still to require all query terms to appear. Thanks, Jeremy -- View this message in context: http://lucene.472066.n3.nabble.com/need-help-on-solr-search-tp4017191p4017686.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: need help on solr search
Hi, I don't follow. :( See inline. On Thu, Nov 1, 2012 at 2:16 PM, jchen2000 wrote: > Seems like phrase query is close, but not exactly what we needed. Here is > an > example assuming just one field: > the doc: a1 a2 a3 b1 b2 c1 c2 c3 c4 d1 d2 > the query: a1 a2* a3 a4* b1 b2* c2 d1* d2 > Are those *s wildcards? Are they relevant here? If not, remove them to keep your example as simple as possible. both doc and query terms are ordered. We know that a term should never go > match with b or c terms. Obviously, if we treat all query terms with OR, we > What does "should never go match with b or c terms" mean? I see b1, b2*, c2 in there. There are no b or c terms anywhere in example doc or querym just b1, b2*, c2. > could have the job done, but in a much slower way (also the returned list > would be too long). So the question is, do we have a way to speed this > query > up? or customization code is needed (and how)? > Is that query supposed to match that doc? Ah, I think I see what you are trying to do/say. You want "ordered term matching" (like in a phrase), but you cannot use AND because you do not want all query terms to be required. Correct? If so, would "a1 a2* a3 a4* b1 b2* c2 d1* d2"~someBigSlop work? I'm not sure if that would be faster than the OR query, but see if that even does what you are after. Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html
Re: need help on solr search
Seems like phrase query is close, but not exactly what we needed. Here is an example assuming just one field: the doc: a1 a2 a3 b1 b2 c1 c2 c3 c4 d1 d2 the query: a1 a2* a3 a4* b1 b2* c2 d1* d2 both doc and query terms are ordered. We know that a term should never go match with b or c terms. Obviously, if we treat all query terms with OR, we could have the job done, but in a much slower way (also the returned list would be too long). So the question is, do we have a way to speed this query up? or customization code is needed (and how)? Thanks, Jeremy -- View this message in context: http://lucene.472066.n3.nabble.com/need-help-on-solr-search-tp4017191p4017630.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: need help on solr search
Hi, Not sure if I follow your requirements correctly, but it sounds like you may be looking for phrase queries (as opposed to term/keyword queries). Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html On Wed, Oct 31, 2012 at 1:33 AM, jchen2000 wrote: > Hi Solr experts, > > Our documents as well as queries consist of 10 properties in a particular > order. Because of stringent requirements on search latency, we grouped them > into only 2 fields with 5 properties each (we may use just 1 field, field > number over 3 seems too slow), and each property value is split into > fixed-length terms (like n-gram, hopefully to save search time) and prefixed > with property name. What we want is to find out how similar the query is to > the documents by comparing terms. We can't use the default OR operator since > it's slow, we wanted to take advantage of the prefix and the defined order. > > My questions are: > 1) Can we do this simply through solr configuration, and how if possible? > 2) If we need to customize solr request handler or anything else, where to > start? > > Thanks a lot! > > Jeremy > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/need-help-on-solr-search-tp4017191.html > Sent from the Solr - User mailing list archive at Nabble.com.
Re: need help on solr search
Sure. here are some more details: 1) we are having 30M ~ 60M documents per node (right now we have 4 nodes, but that will increase in the future). Documents are relatively small (around 3K), but 99% searches must be returned within 200ms and this is measured by test drivers sitting right in front of solr servers. 2) throughput requirement right now is about 300 qps. The machines we use are quite powerful with 16 cores, lots of memory and with ssd drives. We haven't really achieved this throughput, but search latency is more of an issue 3) one property value may overlap with value in another different property, but we don't want to match those so we prefixed terms with property name Thanks, Fang -- View this message in context: http://lucene.472066.n3.nabble.com/need-help-on-solr-search-tp4017191p4017341.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: need help on solr search
You need to provide significantly more information than you have. What are your perf requirements? How big is your data set? What kinds of searches are you talking about here? How are you measuring response? This really feels like an "XY" problem. Best Erick On Wed, Oct 31, 2012 at 1:33 AM, jchen2000 wrote: > Hi Solr experts, > > Our documents as well as queries consist of 10 properties in a particular > order. Because of stringent requirements on search latency, we grouped them > into only 2 fields with 5 properties each (we may use just 1 field, field > number over 3 seems too slow), and each property value is split into > fixed-length terms (like n-gram, hopefully to save search time) and prefixed > with property name. What we want is to find out how similar the query is to > the documents by comparing terms. We can't use the default OR operator since > it's slow, we wanted to take advantage of the prefix and the defined order. > > My questions are: > 1) Can we do this simply through solr configuration, and how if possible? > 2) If we need to customize solr request handler or anything else, where to > start? > > Thanks a lot! > > Jeremy > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/need-help-on-solr-search-tp4017191.html > Sent from the Solr - User mailing list archive at Nabble.com.
need help on solr search
Hi Solr experts, Our documents as well as queries consist of 10 properties in a particular order. Because of stringent requirements on search latency, we grouped them into only 2 fields with 5 properties each (we may use just 1 field, field number over 3 seems too slow), and each property value is split into fixed-length terms (like n-gram, hopefully to save search time) and prefixed with property name. What we want is to find out how similar the query is to the documents by comparing terms. We can't use the default OR operator since it's slow, we wanted to take advantage of the prefix and the defined order. My questions are: 1) Can we do this simply through solr configuration, and how if possible? 2) If we need to customize solr request handler or anything else, where to start? Thanks a lot! Jeremy -- View this message in context: http://lucene.472066.n3.nabble.com/need-help-on-solr-search-tp4017191.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Need help on Solr Cell usage with specific Tika parser
Thanks, moving it to direcxt child worked. Olivier 2010/6/14 Chris Hostetter > > : In solrconfig, in update/extract requesthandler I specified : name="tika.config">./tika-config.xml , where tika-config.xml is in > : conf directory (same as solrconfig). > > can you show us the full requestHandler decalration? ... tika.config needs > to be a direct child of the requestHandler (not in the defaults) > > I also don't know if using a "local" path like that will work -- depends > on how that file is loaded (if solr loads it, then you might want to > remove the "./"; if solr just gives the path to tika, then you probably > need an absolute path. > > > -Hoss > >
Re: Need help on Solr Cell usage with specific Tika parser
: In solrconfig, in update/extract requesthandler I specified ./tika-config.xml , where tika-config.xml is in : conf directory (same as solrconfig). can you show us the full requestHandler decalration? ... tika.config needs to be a direct child of the requestHandler (not in the defaults) I also don't know if using a "local" path like that will work -- depends on how that file is loaded (if solr loads it, then you might want to remove the "./"; if solr just gives the path to tika, then you probably need an absolute path. -Hoss
Re: Need help on Solr Cell usage with specific Tika parser
Yeap, I do. As magic is not set, this is the reason why it looks for this specific mime-type. Unfortunatly, It seems it either do not read my specific tika-config file or the mime-type file. But there is no error log concerning those files... (not trying to load them?) 2010/6/14 Ken Krugler > Hi Olivier, > > Are you setting the mime type explicitly via the stream.type parameter? > > -- Ken > > > On Jun 14, 2010, at 9:14am, olivier sallou wrote: > > Hi, >> I use Solr Cell to send specific content files. I developped a dedicated >> Parser for specific mime types. >> However I cannot get Solr accepting my new mime types. >> >> In solrconfig, in update/extract requesthandler I specified > name="tika.config">./tika-config.xml , where tika-config.xml is in >> conf directory (same as solrconfig). >> >> In tika-config I added my mimetypes: >> >> > class="org.irisa.genouest.tools.readseq.ReadSeqParser"> >> biosequence/document >> biosequence/embl >> biosequence/genbank >> >> >> I do not know for: >> >> >> whereas path to tika mimetypes should be absolute or relative... and even >> if >> this file needs to be redefined if "magic" is not used. >> >> >> When I run my update/extract, I have an error that "biosequence/document" >> does not match any known parser. >> >> Thanks >> >> Olivier >> > > > Ken Krugler > +1 530-210-6378 > http://bixolabs.com > e l a s t i c w e b m i n i n g > > > > >
Re: Need help on Solr Cell usage with specific Tika parser
Hi Olivier, Are you setting the mime type explicitly via the stream.type parameter? -- Ken On Jun 14, 2010, at 9:14am, olivier sallou wrote: Hi, I use Solr Cell to send specific content files. I developped a dedicated Parser for specific mime types. However I cannot get Solr accepting my new mime types. In solrconfig, in update/extract requesthandler I specified name="tika.config">./tika-config.xml , where tika-config.xml is in conf directory (same as solrconfig). In tika-config I added my mimetypes: biosequence/document biosequence/embl biosequence/genbank I do not know for: whereas path to tika mimetypes should be absolute or relative... and even if this file needs to be redefined if "magic" is not used. When I run my update/extract, I have an error that "biosequence/ document" does not match any known parser. Thanks Olivier Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g
Need help on Solr Cell usage with specific Tika parser
Hi, I use Solr Cell to send specific content files. I developped a dedicated Parser for specific mime types. However I cannot get Solr accepting my new mime types. In solrconfig, in update/extract requesthandler I specified ./tika-config.xml , where tika-config.xml is in conf directory (same as solrconfig). In tika-config I added my mimetypes: biosequence/document biosequence/embl biosequence/genbank I do not know for: whereas path to tika mimetypes should be absolute or relative... and even if this file needs to be redefined if "magic" is not used. When I run my update/extract, I have an error that "biosequence/document" does not match any known parser. Thanks Olivier