How to define facet.prefix as case-insensitive
I am using facet.prefix for autocomplete. This is my definition:

<requestHandler name="/ac" class="solr.SearchHandler">
  <lst name="defaults">
    <str name="echoParams">explicit</str>
    ...
    <str name="lowercaseOperators">true</str>
    <str name="facet">on</str>
    <str name="facet.field">Suggest</str>
  </lst>
</requestHandler>

This is my field:

<field name="Suggest" type="text_auto" indexed="true" stored="true" required="false" multiValued="true"/>

and its field type:

<fieldType class="solr.TextField" name="text_auto">
  <analyzer>
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

Everything works fine, but when I search with caps lock on, it doesn't return any answers, even when the field value contains capital letters. I assume that the field is lowercased in Solr (because of the filter in the field type definition) but the search term is not. How can I control the case of the search term?

Thanks.
Re: requested url solr/update/extract not available on this server
I did define the request handler.

On Sun, Sep 22, 2013 at 12:51 AM, Erick Erickson [via Lucene] ml-node+s472066n4091391...@n3.nabble.com wrote:

bq: And im not using the example config file

It looks like you have not included the request handler in your solrconfig.xml, something like (from the stock distro):

<!-- Solr Cell Update Request Handler
     http://wiki.apache.org/solr/ExtractingRequestHandler -->
<requestHandler name="/update/extract" startup="lazy" class="solr.extraction.ExtractingRequestHandler">
  <lst name="defaults">
    <str name="lowernames">true</str>
    <str name="uprefix">ignored_</str>
    <!-- capture link hrefs but ignore div attributes -->
    <str name="captureAttr">true</str>
    <str name="fmap.a">links</str>
    <str name="fmap.div">ignored_</str>
  </lst>
</requestHandler>

I'd start with the stock config and try removing things one-by-one...

Best,
Erick

On Sat, Sep 21, 2013 at 7:34 AM, Nutan [hidden email] wrote:

Yes, I do get the Solr admin page. And I'm not using the example config file; I have created my own for my project as required. I have also defined update/extract in solrconfig.xml.

On Tue, Sep 17, 2013 at 4:45 AM, Chris Hostetter-3 [via Lucene] [hidden email] wrote:

: Is /solr/update working?

more importantly: does /solr/ work in your browser and return anything useful? (nothing you've told us yet gives us any way of knowing if solr is even up and running)

if 'http://localhost:8080/solr/' shows you the solr admin UI, and you are using the stock Solr 4.2 example configs, then http://localhost:8080/solr/update/extract should not give you a 404 error.

if however you are using some other configs, it might not work unless those configs register a handler with the path /update/extract.

Using the jetty setup provided with 4.2, and the example configs (from 4.2) I was able to index a sample PDF just fine using your curl command...

hossman@frisbee:~/tmp$ curl "http://localhost:8983/solr/update/extract?literal.id=1&commit=true" -F "myfile=@stump.winners.san.diego.2013.pdf"
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int name="QTime">1839</int></lst>
</response>

: Check solrconfig to see that /update/extract is configured as in the standard
: Solr example.
:
: Does /solr/update/extract work for you using the standard Solr example?
:
: -- Jack Krupansky
:
: -Original Message-
: From: Nutan
: Sent: Sunday, September 15, 2013 2:37 AM
: To: [hidden email]
: Subject: requested url solr/update/extract not available on this server
:
: I am working on Solr 4.2 on Windows 7. I am trying to index pdf files. I
: referred Solr Cookbook 4. Tomcat is using 8080 port number. I get this
: error: requested url solr/update/extract not available on this server
: When my curl is :
: curl "http://localhost:8080/solr/update/extract?literal.id=1&commit=true" -F
: "myfile=@cookbook.pdf"
: There is no entry in log files. Please help.

-Hoss
Facet search on a docvalue field in a multi shard collection
Hi

I have a problem doing facet search on a doc value field in a multi shard collection. Any ideas what I may be doing wrong?

My Solr schema specifies fieldA as a docvalue type and I have created a two shard collection using Solr 4.4.0. When I do a facet search on fieldA with a large facet.limit, the query fails with the exception below. A "large" facet.limit seems to be when

(10 + (facet.limit * 1.5)) * number of shards > rows matching my query

The exception does not occur when I run with a single shard collection. It can easily be reproduced by indexing a single row and querying it, as the default facet.limit is 100.

The facet query received by Solr looks as follows:

576793 [qtp170860084-18] INFO org.apache.solr.core.SolrCore – [trym_shard2_replica1] webapp=/solr path=/select params={facet=true&start=0&q=*:*&distrib=true&collection=trym&facet.field=fieldA&wt=javabin&version=2&rows=0} status=500 QTime=20

One of the internal queries sent by Solr to its shards looks like this:

576783 [qtp170860084-19] INFO org.apache.solr.core.SolrCore – [trym_shard1_replica1] webapp=/solr path=/select params={facet=true&distrib=false&collection=trym&wt=javabin&version=2&rows=0&NOW=1379855011787&shard.url=192.168.56.1:8501/solr/trym_shard1_replica1/&df=text&fl=id,score&f.fieldA.facet.limit=160&start=0&q=*:*&facet.field=fieldA&isShard=true&fsv=true} hits=1 status=500 QTime=2

576784 [qtp170860084-17] ERROR org.apache.solr.servlet.SolrDispatchFilter – null:java.lang.IllegalStateException: Cannot use facet.mincount=0 on a field which is not indexed
  at org.apache.solr.request.NumericFacets.getCounts(NumericFacets.java:257)
  at org.apache.solr.request.SimpleFacets.getTermCounts(SimpleFacets.java:423)
  at org.apache.solr.request.SimpleFacets.getFacetFieldCounts(SimpleFacets.java:530)
  at org.apache.solr.request.SimpleFacets.getFacetCounts(SimpleFacets.java:259)
  at org.apache.solr.handler.component.FacetComponent.process(FacetComponent.java:78)
  at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:208)
  at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
  at org.apache.solr.core.SolrCore.execute(SolrCore.java:1904)
  at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:659)
  at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:362)
  at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:158)
  at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1419)
  at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:455)
  at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:137)
  at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:557)
  at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:231)
  at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1075)
  at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:384)
  at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:193)
  at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1009)
  at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
  at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
  at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
  at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
  at org.eclipse.jetty.server.Server.handle(Server.java:368)
  at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:489)
  at org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:53)
  at org.eclipse.jetty.server.AbstractHttpConnection.content(AbstractHttpConnection.java:953)
  at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.content(AbstractHttpConnection.java:1014)
  at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:861)
  at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:235)
  at org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpConnection.java:72)
  at org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(SocketConnector.java:264)
  at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
  at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
  at java.lang.Thread.run(Thread.java:724)

My schema.xml contains the following lines (among others :-))

<dynamicField name="*A" type="dlong" indexed="false" stored="true"
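The threshold described in the message above can be checked with a little arithmetic. The following is a minimal sketch, assuming the per-shard limit is derived as 10 + facet.limit * 1.5, which is consistent with the f.fieldA.facet.limit=160 visible in the shard request for the default facet.limit of 100. The class and method names are illustrative only and are not part of Solr.

```java
final class FacetOverRequest {
    // Per-shard facet limit, assuming the 10 + facet.limit * 1.5 rule
    // quoted in the message (an assumption, not taken from Solr's source).
    static int shardLimit(int facetLimit) {
        return (int) (facetLimit * 1.5) + 10;
    }

    // True when the condition reported to trigger the failure holds:
    // shardLimit * numberOfShards > rows matching the query.
    static boolean exceedsMatches(int facetLimit, int shards, long matchingRows) {
        return (long) shardLimit(facetLimit) * shards > matchingRows;
    }

    public static void main(String[] args) {
        System.out.println(shardLimit(100));            // prints 160
        System.out.println(exceedsMatches(100, 2, 1));  // prints true
    }
}
```

With one indexed row, two shards, and the default limit, the condition holds (320 > 1), which matches the reproduction recipe given in the message.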
Equivalent Java client code
Hi,

I use the query below to check whether all of the words are present in the document:

http://localhost:18080/solr/select?defType=dismax&pf=tagged_skills^100&q=java+ejb+Spring&qf=tagged_skills&q.op=AND

I am not sure how to execute the same thing using the Solr Java client:

SolrQuery query = new SolrQuery();
query.setQuery( "tagged_skills:\"java+and+ejb\"" );
QueryResponse rsp = server.query( query );
SolrDocumentList docs = rsp.getResults();

I am also not sure how to check whether the given search terms appear in both the title and tagged_skills fields. Could you please help me with this?

Thanks,
Baskar.S
Xml file is not inserting from code java -jar post.jar *.xml
Hi,

I am a new user of Solr. I have written my schema file, but when I try to index my XML files from the command line with

java -jar post.jar *.xml

I get the error "Solr returned error 404 Not Found". What can I do?

Regards,
Kishan Parmar
Software Developer
+91 95 100 77394
Jay Shree Krishnaa !!
Re: Xml file is not inserting from code java -jar post.jar *.xml
Did you start Solr? How did you verify that Solr is running? Are you able to query Solr and access the Admin UI?

Most importantly, did you successfully complete the standard Solr tutorial? (IOW, you know all the necessary steps for basic operation of Solr.)

Lastly, did you verify (by examining the log) whether Solr was able to successfully load your schema changes without errors?

-- Jack Krupansky

-Original Message-
From: Kishan Parmar
Sent: Sunday, September 22, 2013 9:56 AM
To: solr-user@lucene.apache.org
Subject: Xml file is not inserting from code java -jar post.jar *.xml

Hi, I am a new user of Solr. I have written my schema file, but when I try to index my XML files from the command line with java -jar post.jar *.xml I get the error "Solr returned error 404 Not Found". What can I do?

Regards,
Kishan Parmar
Software Developer
+91 95 100 77394
Jay Shree Krishnaa !!
Re: Facet search on a docvalue field in a multi shard collection
Right, I think you're running into a bug I remember going by. I can't find it now; JIRA seems to be not responding. As I remember, if a shard doesn't have a doc on it, you get an error. Although why facet.limit should figure in here is a mystery to me, maybe a coincidence?

Significant work has been done about not requiring values for DocValues fields and stuff. Can you give it a try on 4.x or the soon-to-be-released 4.5?

Best,
Erick

On Sun, Sep 22, 2013 at 6:26 AM, Trym R. Møller t...@sigmat.dk wrote:

Hi

I have a problem doing facet search on a doc value field in a multi shard collection. Any ideas what I may be doing wrong?

My Solr schema specifies fieldA as a docvalue type and I have created a two shard collection using Solr 4.4.0. When I do a facet search on fieldA with a large facet.limit, the query fails with the exception below. A "large" facet.limit seems to be when

(10 + (facet.limit * 1.5)) * number of shards > rows matching my query

The exception does not occur when I run with a single shard collection. It can easily be reproduced by indexing a single row and querying it, as the default facet.limit is 100.
Re: requested url solr/update/extract not available on this server
Please review: http://wiki.apache.org/solr/UsingMailingLists

Erick

On Sun, Sep 22, 2013 at 5:52 AM, Nutan nutanshinde1...@gmail.com wrote:

I did define the request handler.
Re: Xml file is not inserting from code java -jar post.jar *.xml
Please review: http://wiki.apache.org/solr/UsingMailingLists

Best,
Erick

On Sun, Sep 22, 2013 at 8:06 AM, Jack Krupansky j...@basetechnology.com wrote:

Did you start Solr? How did you verify that Solr is running? Are you able to query Solr and access the Admin UI?

Most importantly, did you successfully complete the standard Solr tutorial? (IOW, you know all the necessary steps for basic operation of Solr.)

Lastly, did you verify (by examining the log) whether Solr was able to successfully load your schema changes without errors?

-- Jack Krupansky

-Original Message-
From: Kishan Parmar
Sent: Sunday, September 22, 2013 9:56 AM
To: solr-user@lucene.apache.org
Subject: Xml file is not inserting from code java -jar post.jar *.xml

Hi, I am a new user of Solr. I have written my schema file, but when I try to index my XML files from the command line with java -jar post.jar *.xml I get the error "Solr returned error 404 Not Found". What can I do?

Regards,
Kishan Parmar
Software Developer
+91 95 100 77394
Jay Shree Krishnaa !!
Re: How to define facet.prefix as case-insensitive
You'll have to lowercase the term in your app and set facet.prefix to that value; there's no analysis done on the facet.prefix value.

Best,
Erick

On Sun, Sep 22, 2013 at 4:07 AM, Mysurf Mail stammail...@gmail.com wrote:

I am using facet.prefix for autocomplete. This is my definition:

<requestHandler name="/ac" class="solr.SearchHandler">
  <lst name="defaults">
    <str name="echoParams">explicit</str>
    ...
    <str name="lowercaseOperators">true</str>
    <str name="facet">on</str>
    <str name="facet.field">Suggest</str>
  </lst>
</requestHandler>

This is my field:

<field name="Suggest" type="text_auto" indexed="true" stored="true" required="false" multiValued="true"/>

and its field type:

<fieldType class="solr.TextField" name="text_auto">
  <analyzer>
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

Everything works fine, but when I search with caps lock on, it doesn't return any answers, even when the field value contains capital letters. I assume that the field is lowercased in Solr (because of the filter in the field type definition) but the search term is not. How can I control the case of the search term?

Thanks.
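The client-side lowercasing suggested in this reply might look like the sketch below. It assumes the /ac handler from the question (which already sets facet=on and facet.field=Suggest in its defaults) and builds the request URL with the standard library only. The base URL, the q=*:* and rows=0 parameters, and the class and method names are assumptions made for illustration.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.Locale;

final class AutoCompleteRequest {
    // Lowercase the user's input so it can match the lowercased indexed terms,
    // then URL-encode it for use as the facet.prefix parameter.
    static String facetPrefixParam(String userInput) {
        String prefix = userInput.toLowerCase(Locale.ROOT);
        try {
            return "facet.prefix=" + URLEncoder.encode(prefix, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException("UTF-8 is always supported", e);
        }
    }

    // baseUrl is a placeholder such as "http://localhost:8983/solr/ac".
    static String buildUrl(String baseUrl, String userInput) {
        return baseUrl + "?q=*:*&rows=0&" + facetPrefixParam(userInput);
    }
}
```

Locale.ROOT is used so that the lowercasing does not depend on the JVM's default locale. Note that the facet values returned will also be lowercase, because they come from the indexed terms rather than the stored values.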
Re: Getting a query parameter in a TokenFilter
Hi Isaac,

In the process of writing Solr in Action (http://solrinaction.com), I have built the solution to SOLR-5053 for the multilingual search chapter (I didn't realize this ticket existed at the time). The solution was something I called a MultiTextField. Essentially, the field lets you map a list of defined prefixes to field types and dynamically substitute in one or more field types based upon the incoming content. For example:

#schema.xml#
<fieldType name="multiText" class="sia.ch14.MultiTextField" sortMissingLast="true" defaultFieldType="text_general" fieldMappings="en:text_english, es:text_spanish, fr:text_french"/>
<fieldType name="text_english" ... />
<fieldType name="text_spanish" ... />
<fieldType name="text_french" ... />
<field name="content" type="multiText" indexed="true" ... />

#document#
<add><doc>
  <field name="id">1</field>
  <field name="content">en,es|the schools, la escuela</field>
</doc></add>

#Outputted Token Stream#:
[Position 1]  [Position 2]  [Position 3]  [Position 4]
the           school        la            escuela
              schools                     escuel

#query on two languages#
q=en,es|la OR en,es|escuela

Essentially, this MultiText field type lets you dynamically combine one or more Analyzers (from a defined field type) and stack the tokens based upon term positions within each independent Analyzer. The use case here was multiple

To answer your original question... at query time, this implementation requires that you pass the prefix before EACH term in the query, not just the first term (you can see this in the q= I demonstrated above). If you have a Token Filter you have developed, you could probably accomplish what you are trying to do the same way.

You could write a custom QParserPlugin that would do this for you, I think. Alternatively, it may be possible to create a similar implementation that makes use of a dynamic field name (i.e. content|en,fr as the field name), which would pull the prefix from the field name and apply it to all tokens instead of requiring/allowing each token to specify its own prefix. I haven't done this in my implementation, but I could see where it might be more user-friendly for many Solr users.

I'm just finishing up the multilingual search chapter and code now and will be happy to post it to SOLR-5053 once I finish in the next few days if this would be helpful to you.

-Trey

On Sat, Sep 21, 2013 at 4:15 PM, Isaac Hebsh isaac.he...@gmail.com wrote:

Thought about that again. We can do this work as a search component, manipulating the query string. The cons are the double QParser work, and the double tokenization work.

Another approach which might solve this issue easily is "Dynamic query analyze chain": https://issues.apache.org/jira/browse/SOLR-5053

What would you do?

On Tue, Sep 17, 2013 at 10:31 PM, Isaac Hebsh isaac.he...@gmail.com wrote:

Hi everyone,

We developed a TokenFilter. It should act differently depending on a parameter supplied in the query (for the query chain only, not the index one, of course). We found no way to pass that parameter into the TokenFilter flow. I guess that the root cause is that TokenFilter is a pure Lucene object.

As a last resort, we tried to pass the parameter as the first term in the query text (q=...), and save it as a member of the TokenFilter instance. Although it is ugly, it might work fine. But the problem is that it is not guaranteed that all the terms of a particular query will be analyzed by the same instance of a TokenFilter. In this case, some terms will be analyzed without the required information of that parameter. We can produce such a race very easily.

How should I overcome this issue? Does anyone have a better resolution?
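The prefix convention described in this message (a comma-separated list of language keys, a '|', then the text) can be illustrated with plain string handling. This is only a sketch of the convention as shown in the examples; it is not the MultiTextField implementation, and the class and method names are hypothetical. The last method shows the query-side requirement that every term carries the prefix.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class LanguagePrefix {
    // Splits "en,es|the schools" into its language keys; no '|' means no keys.
    static List<String> languages(String value) {
        int bar = value.indexOf('|');
        if (bar < 0) {
            return new ArrayList<String>();
        }
        return new ArrayList<String>(Arrays.asList(value.substring(0, bar).split(",")));
    }

    // Returns the text after the first '|', or the whole value if there is none.
    static String content(String value) {
        int bar = value.indexOf('|');
        return bar < 0 ? value : value.substring(bar + 1);
    }

    // Prepends the same prefix to every whitespace-separated term, leaving
    // the boolean operators AND, OR, and NOT untouched.
    static String prefixTerms(String prefix, String query) {
        StringBuilder out = new StringBuilder();
        for (String term : query.trim().split("\\s+")) {
            if (out.length() > 0) {
                out.append(' ');
            }
            boolean operator = term.equals("AND") || term.equals("OR") || term.equals("NOT");
            out.append(operator ? term : prefix + "|" + term);
        }
        return out.toString();
    }
}
```

For instance, prefixTerms("en,es", "la OR escuela") yields the q value from the example above. A query-string rewrite like this is naive about phrases, field names, and parentheses, which is presumably why a custom QParserPlugin is suggested in the message.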
Re: dih delete doc per $deleteDocById
sorry, it works like this, i had a typo in my conf :-(

On 17. Sep 2013, at 2:44 PM, Andreas Owen wrote:

i would like to know how to get it to work and delete documents per xml and dih.

On 17. Sep 2013, at 1:47 PM, Shalin Shekhar Mangar wrote:

What is your question?

On Tue, Sep 17, 2013 at 12:17 AM, andreas owen a.o...@gmx.net wrote:

i am using dih and want to delete indexed documents by xml-file with ids. i have seen $deleteDocById used in <entity query="...">

data-config.xml:
<entity name="rec" processor="XPathEntityProcessor" url="file:///C:\ColdFusion10\cfusion\solr\solr\tkbintranet\docImportDelete.xml" forEach="/docs/doc" dataSource="main">
  <field column="$deleteDocById" xpath="//id" />
</entity>

xml-file:
<docs>
  <doc>
    <id>2345</id>
  </doc>
</docs>

--
Regards,
Shalin Shekhar Mangar.
Re: Equivalent Java client code
On 9/22/2013 7:43 AM, Baskar Sikkayan wrote: http://localhost:18080/solr/select?defType=dismax&pf=tagged_skills^100&q=java+ejb+Spring&qf=tagged_skills&q.op=AND I am not sure how to execute the same thing using Solr Java client. SolrQuery query = new SolrQuery(); query.setQuery( "tagged_skills:\"java+and+ejb\"" ); QueryResponse rsp = server.query( query ); SolrDocumentList docs = rsp.getResults(); Also not sure how to check if the given search is there in title and tagged_skills field. The solrj code for a query that's completely identical to your HTTP query is this: SolrQuery qry = new SolrQuery(); qry.set("defType", "dismax"); qry.set("pf", "tagged_skills^100"); qry.set("qf", "tagged_skills"); qry.set("q.op", "AND"); qry.setQuery("java ejb Spring"); If you change pf to "tagged_skills^100 title^100" and qf to "tagged_skills title" that should extend the search to the title field as well. Note that with q.op set to AND, the results may not be what you expect. I'm not sure whether dismax is smart enough to only apply the AND within each of the field searches, I would expect it to apply it across the board so all the query data must exist in all query fields. Thanks, Shawn
Near Duplicate Document Detection at Solr
I want to detect near-duplicate documents (for web documents). I know that there is an algorithm called Winnowing and there is another technique used by Google. However, I also know that Solr has a component called MoreLikeThis. Google's page explains that *mirroring and plagiarism* are easy to detect but near-duplicate detection involves much more than that. So I want to ask: what is the underlying algorithm the Solr MoreLikeThis component uses, and can I use it for this kind of purpose? Otherwise, I will implement an algorithm for near-duplicate document detection within a few days and I will be proud to contribute it and adapt it into Solr. Thanks; Furkan KAMACI
Re: Near Duplicate Document Detection at Solr
I also know that there is another mechanism in Solr: http://wiki.apache.org/solr/Deduplication I think that I should add a custom signature because that is the most usable one for me: http://wiki.apache.org/solr/TextProfileSignature On the other hand, are there any limitations for deduplication in SolrCloud? What do you think? 2013/9/22 Furkan KAMACI furkankam...@gmail.com I want to detect near-duplicate documents (for web documents). I know that there is an algorithm called Winnowing and there is another technique used by Google. However, I also know that Solr has a component called MoreLikeThis. Google's page explains that *mirroring and plagiarism* are easy to detect but near-duplicate detection involves much more than that. So I want to ask: what is the underlying algorithm the Solr MoreLikeThis component uses, and can I use it for this kind of purpose? Otherwise, I will implement an algorithm for near-duplicate document detection within a few days and I will be proud to contribute it and adapt it into Solr. Thanks; Furkan KAMACI
RE: Near Duplicate Document Detection at Solr
-Original message- From: Furkan KAMACI furkankam...@gmail.com Sent: Sunday 22nd September 2013 21:15 To: solr-user@lucene.apache.org Subject: Re: Near Duplicate Document Detection at Solr I also know that there is another mechanism in Solr: http://wiki.apache.org/solr/Deduplication I think that I should add a custom signature because that is the most usable one for me: http://wiki.apache.org/solr/TextProfileSignature Keep in mind, its results are really bad for short documents and it does not work for languages that do not use whitespace. On the other hand, are there any limitations for deduplication in SolrCloud? Yes, it does not work: https://issues.apache.org/jira/browse/SOLR-3473 What do you think? 2013/9/22 Furkan KAMACI furkankam...@gmail.com I want to detect near-duplicate documents (for web documents). I know that there is an algorithm called Winnowing and there is another technique used by Google. However, I also know that Solr has a component called MoreLikeThis. Google's page explains that *mirroring and plagiarism* are easy to detect but near-duplicate detection involves much more than that. So I want to ask: what is the underlying algorithm the Solr MoreLikeThis component uses, and can I use it for this kind of purpose? Otherwise, I will implement an algorithm for near-duplicate document detection within a few days and I will be proud to contribute it and adapt it into Solr. Thanks; Furkan KAMACI
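To make the caveat about short documents concrete, here is a rough Python sketch of a profile-style signature: count token frequencies, quantize the counts, drop tokens that fall below the quantum, and hash what remains. It is loosely modelled on the idea behind TextProfileSignature and is not Solr's implementation; the function name, the tokenization regex, and the default parameters are all assumptions made for illustration.

```python
import hashlib
import re
from collections import Counter


def profile_signature(text, min_token_len=2, quant_rate=0.01):
    """Hash a quantized token-frequency profile of the text.

    Illustrative only: a simplified stand-in, not Solr's code.
    """
    # Tokenize on runs of letters; this is why whitespace-free
    # languages are a poor fit for this kind of profile.
    tokens = [t for t in re.findall(r"[^\W\d_]+", text.lower())
              if len(t) >= min_token_len]
    counts = Counter(tokens)
    if not counts:
        return hashlib.md5(b"").hexdigest()

    max_freq = max(counts.values())
    quant = round(max_freq * quant_rate)
    if quant < 2:
        quant = 2 if max_freq > 1 else 1

    # Round each count down to a multiple of quant; drop zeroed tokens.
    profile = []
    for token, count in counts.items():
        quantized = (count // quant) * quant
        if quantized > 0:
            profile.append((token, quantized))

    # Sort by descending count, then token, for a stable hash input.
    profile.sort(key=lambda item: (-item[1], item[0]))
    data = "\n".join(f"{token} {count}" for token, count in profile)
    return hashlib.md5(data.encode("utf-8")).hexdigest()
```

With this sketch, two long texts that differ only in a rare word get the same signature, which is the intended near-duplicate behaviour. The same mechanism also makes "hello hello world" and "hello hello mars" collide, because the only token that survives quantization is "hello", which illustrates why results on short documents can be poor.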
import partition table from oracle
Hi, I want to import the dataset in one partition of a partitioned table with DIH, and I would like to explicitly specify the partition when I run the import job. To be specific: 1. I define the DIH configuration like this: <entity name="type" query="select id as tid from movie_type partition(${part})"/> 2. I send the URL http://localhost:8983/solr/dataimport?command=full-import&part=p2 and then the DIH handler does a full import of the p2 partition of the table. Any suggestion will be appreciated. Regards.
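One way this is commonly done: DIH exposes request parameters to the configuration as ${dataimporter.request.<name>}, so the entity query would reference ${dataimporter.request.part} rather than a bare ${part}. The Python sketch below only builds the request URL; the function name, the ENTITY_QUERY constant, and the validation rule are assumptions for illustration, not part of DIH.

```python
from urllib.parse import urlencode

# Assumed entity query for data-config.xml; DIH substitutes the
# request parameter "part" into ${dataimporter.request.part}.
ENTITY_QUERY = (
    "select id as tid from movie_type "
    "partition(${dataimporter.request.part})"
)


def full_import_url(base, partition):
    """Build a full-import URL that passes the partition name.

    The name is interpolated into SQL on the Solr side, so this
    hypothetical helper rejects anything that is not a plain
    identifier.
    """
    if not partition.isidentifier():
        raise ValueError(f"suspicious partition name: {partition!r}")
    params = urlencode({"command": "full-import", "part": partition})
    return f"{base}/dataimport?{params}"


if __name__ == "__main__":
    print(full_import_url("http://localhost:8983/solr", "p2"))
```

Because the parameter value ends up inside the SQL text, restricting it to known partition names on the calling side is a sensible precaution.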
Interesting edismax/qs bug in Solr 3.5
We have been seeing a strange bug in our prod Solr 3.5. I downloaded a fresh copy of Solr 3.5 with the default schema and indexed (with curl or post.jar) the following 2 docs: [ { "id":"1", "title":"One Earth" }, { "id":"2", "title":"One Love One Earth" } ] I could browse and see the docs in Solr. However, when I do: /solr/select?q=One Love One Earth&qf=title&qs=2&defType=edismax&pf=title I get nothing back. When I change to qs=4 in the query, I see the expected doc2. debugQuery=true does not reveal anything. - I have noticed that when I reverse the order of the documents in the input file, i.e. doc2 first, then doc1, and do the indexing (using curl or post.jar), the query above works and returns doc2 as expected. - Same when I index only doc2 (doc1 not indexed). I tested Solr 3.6.2 and 4.4.0 and I can confirm they are not affected by this issue. I looked at the change logs for 3.6.2 and at JIRA but could not find any trace of this problem. Any pointer to the ticket that addressed this issue will be appreciated. Thank you very much. Arcadius.
RE: Interesting edismax/qs bug in Solr 3.5
Sounds like https://issues.apache.org/jira/browse/LUCENE-3821 (issue seems to be fixed but still shows as open). -Michael -Original Message- From: Arcadius Ahouansou [mailto:arcad...@menelic.com] Sent: Sunday, September 22, 2013 11:15 PM To: solr-user Subject: Interesting edismax/qs bug in Solr 3.5 We have been seeing a strange bug in our prod Solr 3.5. I downloaded a fresh copy of Solr 3.5 with the default schema and indexed (with curl or post.jar) the following 2 docs: [ { "id":"1", "title":"One Earth" }, { "id":"2", "title":"One Love One Earth" } ] I could browse and see the docs in Solr. However, when I do: /solr/select?q=One Love One Earth&qf=title&qs=2&defType=edismax&pf=title I get nothing back. When I change to qs=4 in the query, I see the expected doc2. debugQuery=true does not reveal anything. - I have noticed that when I reverse the order of the documents in the input file, i.e. doc2 first, then doc1, and do the indexing (using curl or post.jar), the query above works and returns doc2 as expected. - Same when I index only doc2 (doc1 not indexed). I tested Solr 3.6.2 and 4.4.0 and I can confirm they are not affected by this issue. I looked at the change logs for 3.6.2 and at JIRA but could not find any trace of this problem. Any pointer to the ticket that addressed this issue will be appreciated. Thank you very much. Arcadius.
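For anyone who wants to re-run the reproduction from this thread against different Solr versions, the two documents and the query can be generated with a small Python sketch. The base URL passed in is an assumption (the original message gives only the /solr/select path), and the helper names are hypothetical; the field names and parameters come from the report.

```python
import json
from urllib.parse import urlencode

# The two documents from the report, in the order that triggers the issue.
DOCS = [
    {"id": "1", "title": "One Earth"},
    {"id": "2", "title": "One Love One Earth"},
]


def docs_payload(reverse=False):
    """Return the JSON body to post; reverse=True puts doc2 first."""
    docs = list(reversed(DOCS)) if reverse else DOCS
    return json.dumps(docs)


def select_url(base, qs):
    """Build the edismax query from the report with a given qs value."""
    params = [
        ("q", "One Love One Earth"),
        ("qf", "title"),
        ("qs", qs),
        ("defType", "edismax"),
        ("pf", "title"),
    ]
    return f"{base}/select?{urlencode(params)}"


if __name__ == "__main__":
    print(docs_payload())
    for qs in (2, 4):
        print(select_url("http://localhost:8983/solr", qs))
```

Comparing the responses for qs=2 and qs=4, with and without reverse=True at indexing time, covers the cases described in the report.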