How to define facet.prefix as case-insensitive

2013-09-22 Thread Mysurf Mail
I am using facet.prefix for auto complete.
This is my definition

 requestHandler name=/ac class=solr.SearchHandler
 lst name=defaults
  str name=echoParamsexplicit/str
  ...
  str name=lowercaseOperatorstrue/str
  str name=faceton/str
  str name=facet.fieldSuggest/str
/lst

this is my field

field name=Suggest type=text_auto indexed=true stored=true
required=false multiValued=true/

and

 fieldType class=solr.TextField name=text_auto
  analyzer
tokenizer class=solr.KeywordTokenizerFactory/
filter class=solr.LowerCaseFilterFactory/
  /analyzer
/fieldType

all works fine but when I search using caps lock it doesn't return answers.
Even when the field contains capitals letters - it doesn't.

I assume that the field in solr is lowered (from the field type filter
definition) but the search term is not.
How can I control the search term caps/no caps?

Thanks.


Re: requested url solr/update/extract not available on this server

2013-09-22 Thread Nutan
I did define the request handler.


On Sun, Sep 22, 2013 at 12:51 AM, Erick Erickson [via Lucene] 
ml-node+s472066n4091391...@n3.nabble.com wrote:

 bq: And im not using the example config file

 It looks like you have not included the request handler in your
 solrconfig.xml,
 something like (from the stock distro):

   !-- Solr Cell Update Request Handler

http://wiki.apache.org/solr/ExtractingRequestHandler

 --
   requestHandler name=/update/extract
   startup=lazy
   class=solr.extraction.ExtractingRequestHandler 
 lst name=defaults
   str name=lowernamestrue/str
   str name=uprefixignored_/str

   !-- capture link hrefs but ignore div attributes --
   str name=captureAttrtrue/str
   str name=fmap.alinks/str
   str name=fmap.divignored_/str
 /lst
   /requestHandler

 I'd start with the stock config and try removing things one-by-one...

 Best,
 Erick

 On Sat, Sep 21, 2013 at 7:34 AM, Nutan [hidden 
 email]http://user/SendEmail.jtp?type=nodenode=4091391i=0
 wrote:

  Yes I do get the solr admin page.And im not using the example config
 file,I
  have create mine own for my project as required.I have also defined
  update/extract in solrconfig.xml.
 
 
  On Tue, Sep 17, 2013 at 4:45 AM, Chris Hostetter-3 [via Lucene] 
  [hidden email] http://user/SendEmail.jtp?type=nodenode=4091391i=1
 wrote:
 
 
  : Is /solr/update working?
 
  more importantly: does /solr/ work in your browser and return
 anything
  useful?  (nothing you've told us yet gives us anyway of knowning if
  solr is even up and running)
 
  if 'http://localhost:8080/solr/' shows you the solr admin UI, and you
 are
  using the stock Solr 4.2 example configs, then
  http://localhost:8080/solr/update/extract should not give you a 404
  error.
 
  if however you are using some other configs, it might not work unless
  those configs register a handler with the path /update/extract.
 
  Using the jetty setup provided with 4.2, and the example configs (from
  4.2) I was able to index a sample PDF just fine using your curl
 command...
 
  hossman@frisbee:~/tmp$ curl 
  http://localhost:8983/solr/update/extract?literal.id=1commit=true; -F
  myfile=@stump.winners.san.diego.2013.pdf
  ?xml version=1.0 encoding=UTF-8?
  response
  lst name=responseHeaderint name=status0/intint
  name=QTime1839/int/lst
  /response
 
 
 
 
 
  :
  : Check solrconfig to see that /update/extract is configured as in the
  standard
  : Solr example.
  :
  : Does /solr/update/extract work for you using the standard Solr
 example?
  :
  : -- Jack Krupansky
  :
  : -Original Message- From: Nutan
  : Sent: Sunday, September 15, 2013 2:37 AM
  : To: [hidden email]
 http://user/SendEmail.jtp?type=nodenode=4090459i=0
  : Subject: requested url solr/update/extract not available on this
 server
  :
  : I am working on Solr 4.2 on Windows 7. I am trying to index pdf
 files.I
  : referred Solr Cookbook 4. Tomcat is using 8080 port number. I get
 this
  : error:requested url solr/update/extract not available on this server
  : When my curl is :
  : curl 
 http://localhost:8080/solr/update/extract?literal.id=1commit=true;
  -F
  : myfile=@cookbook.pdf
  : There is no entry in log files. Please help.
  :
  :
  :
  : --
  : View this message in context:
  :
 
 http://lucene.472066.n3.nabble.com/requested-url-solr-update-extract-not-available-on-this-server-tp4090153.html
  : Sent from the Solr - User mailing list archive at Nabble.com.
  :
 
  -Hoss
 
 
  --
   If you reply to this email, your message will be added to the
 discussion
  below:
 
 
 http://lucene.472066.n3.nabble.com/requested-url-solr-update-extract-not-available-on-this-server-tp4090153p4090459.html
   To unsubscribe from requested url solr/update/extract not available on
  this server, click here

  .
  NAML
 http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewerid=instant_html%21nabble%3Aemail.namlbase=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespacebreadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml

 
 
 
 
 
  --
  View this message in context:
 http://lucene.472066.n3.nabble.com/requested-url-solr-update-extract-not-available-on-this-server-tp4090153p4091371.html

  Sent from the Solr - User mailing list archive at Nabble.com.


 --
  If you reply to this email, your message will be added to the discussion
 below:

 http://lucene.472066.n3.nabble.com/requested-url-solr-update-extract-not-available-on-this-server-tp4090153p4091391.html
  To unsubscribe from requested url solr/update/extract not available on
 this server, click 
 herehttp://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_codenode=4090153code=bnV0YW5zaGluZGUxOTkyQGdtYWlsLmNvbXw0MDkwMTUzfC0xMzEzOTU5Mzcx
 .
 

Facet search on a docvalue field in a multi shard collection

2013-09-22 Thread Trym R. Møller

Hi

I have a problem doing facet search on a doc value field in a multi 
shard collection. Any ideas what I may be doing wrong?


My Solr schema specifies fieldA as a docvalue type and I have created a 
two shard collection using Solr 4.4.0.
When I do a facet search on fieldA with a large facet.limit then the 
query fails with the below exception
A large facet.limit seems to be when (10 + (facet.limit * 1,5)) * 
number of shards  rows matching my query


The exception does not occur when I run with a single shard collection.
It can easily be reproduced by indexing a single row and querying it, as 
the default facet.limit is 100.


The facet query received by Solr looks as follows:
576793 [qtp170860084-18] INFO  org.apache.solr.core.SolrCore  ¦ 
[trym_shard2_replica1] webapp=/solr path=/select 
params={facet=truestart=0q=*:*distrib=truecollection=trymfacet.field=fieldAwt=javabinversion=2rows=0} 
status=500 QTime=20

One of the internal query send by Solr to its shard looks like
576783 [qtp170860084-19] INFO  org.apache.solr.core.SolrCore  ¦ 
[trym_shard1_replica1] webapp=/solr path=/select 
params={facet=truedistrib=falsecollection=trym

 
wt=javabinversion=2rows=0NOW=1379855011787shard.url=192.168.56.1:8501/solr/trym_shard1_replica1/df=textfl=id,scoref.fieldA.facet.limit=160start=0q=*:
*facet.field=fieldAisShard=truefsv=true} hits=1 status=500 QTime=2

576784 [qtp170860084-17] ERROR 
org.apache.solr.servlet.SolrDispatchFilter  ¦ 
null:java.lang.IllegalStateException: Cannot use facet.mincount=0 on a 
field which is not indexed
at 
org.apache.solr.request.NumericFacets.getCounts(NumericFacets.java:257)
at 
org.apache.solr.request.SimpleFacets.getTermCounts(SimpleFacets.java:423)
at 
org.apache.solr.request.SimpleFacets.getFacetFieldCounts(SimpleFacets.java:530)
at 
org.apache.solr.request.SimpleFacets.getFacetCounts(SimpleFacets.java:259)
at 
org.apache.solr.handler.component.FacetComponent.process(FacetComponent.java:78)
at 
org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:208)
at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)

at org.apache.solr.core.SolrCore.execute(SolrCore.java:1904)
at 
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:659)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:362)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:158)
at 
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1419)
at 
org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:455)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:137)
at 
org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:557)
at 
org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:231)
at 
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1075)
at 
org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:384)
at 
org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:193)
at 
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1009)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
at 
org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
at 
org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)

at org.eclipse.jetty.server.Server.handle(Server.java:368)
at 
org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:489)
at 
org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:53)
at 
org.eclipse.jetty.server.AbstractHttpConnection.content(AbstractHttpConnection.java:953)
at 
org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.content(AbstractHttpConnection.java:1014)

at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:861)
at 
org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:235)
at 
org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpConnection.java:72)
at 
org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(SocketConnector.java:264)
at 
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)

at java.lang.Thread.run(Thread.java:724)


My schema.xml contains the following lines (among others :-))
dynamicField name=*A type=dlong indexed=false stored=true 

Equivalent Java client code

2013-09-22 Thread Baskar Sikkayan
Hi,
   I use the below query to check if the both of the word are there in the
document.

http://localhost:18080/solr/select?defType=dismaxpf=tagged_skills
^100q=java+ejb+Springqf=tagged_skillsq.op=AND

I am not sure how to execute the same thing using Solr Java client.

SolrQuery query = new SolrQuery();
query.setQuery( tagged_skills:\java+and+ejb\ );
QueryResponse rsp = server.query( query );
SolrDocumentList docs = rsp.getResults();


Also not sure how to check if the given search is there in title and
tageed_skills field.

Could you please help me on this?


Thanks,
Baskar.S


Xml file is not inserting from code java -jar post.jar *.xml

2013-09-22 Thread Kishan Parmar
hi

i am new user of Solr i have done my schema file and when i write a code to
insert xxl file to index from cmd .java -jar post.jar *.xml

it give us error solr returned errer 404 not found

what can i do???


Regards,

Kishan Parmar
Software Developer
+91 95 100 77394
Jay Shree Krishnaa !!


Facet search on a docvalue field in a multi shard collection

2013-09-22 Thread Trym R. Møller

Hi

I have a problem doing facet search on a doc value field in a multi 
shard collection. Any ideas what I may be doing wrong?


My Solr schema specifies fieldA as a docvalue type and I have created a 
two shard collection using Solr 4.4.0.
When I do a facet search on fieldA with a large facet.limit then the 
query fails with the below exception
A large facet.limit seems to be when (10 + (facet.limit * 1,5)) * 
number of shards  rows matching my query


The exception does not occur when I run with a single shard collection.
It can easily be reproduced by indexing a single row and querying it, as 
the default facet.limit is 100.


The facet query received by Solr looks as follows:
576793 [qtp170860084-18] INFO  org.apache.solr.core.SolrCore  ¦ 
[trym_shard2_replica1] webapp=/solr path=/select 
params={facet=truestart=0q=*:*distrib=truecollection=trymfacet.field=fieldAwt=javabinversion=2rows=0} 
status=500 QTime=20

One of the internal query send by Solr to its shard looks like
576783 [qtp170860084-19] INFO  org.apache.solr.core.SolrCore  ¦ 
[trym_shard1_replica1] webapp=/solr path=/select 
params={facet=truedistrib=falsecollection=trym

 
wt=javabinversion=2rows=0NOW=1379855011787shard.url=192.168.56.1:8501/solr/trym_shard1_replica1/df=textfl=id,scoref.fieldA.facet.limit=160start=0q=*:
*facet.field=fieldAisShard=truefsv=true} hits=1 status=500 QTime=2

576784 [qtp170860084-17] ERROR 
org.apache.solr.servlet.SolrDispatchFilter  ¦ 
null:java.lang.IllegalStateException: Cannot use facet.mincount=0 on a 
field which is not indexed
at 
org.apache.solr.request.NumericFacets.getCounts(NumericFacets.java:257)
at 
org.apache.solr.request.SimpleFacets.getTermCounts(SimpleFacets.java:423)
at 
org.apache.solr.request.SimpleFacets.getFacetFieldCounts(SimpleFacets.java:530)
at 
org.apache.solr.request.SimpleFacets.getFacetCounts(SimpleFacets.java:259)
at 
org.apache.solr.handler.component.FacetComponent.process(FacetComponent.java:78)
at 
org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:208)
at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)

at org.apache.solr.core.SolrCore.execute(SolrCore.java:1904)
at 
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:659)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:362)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:158)
at 
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1419)
at 
org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:455)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:137)
at 
org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:557)
at 
org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:231)
at 
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1075)
at 
org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:384)
at 
org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:193)
at 
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1009)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
at 
org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
at 
org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)

at org.eclipse.jetty.server.Server.handle(Server.java:368)
at 
org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:489)
at 
org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:53)
at 
org.eclipse.jetty.server.AbstractHttpConnection.content(AbstractHttpConnection.java:953)
at 
org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.content(AbstractHttpConnection.java:1014)

at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:861)
at 
org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:235)
at 
org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpConnection.java:72)
at 
org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(SocketConnector.java:264)
at 
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)

at java.lang.Thread.run(Thread.java:724)


My schema.xml contains the following lines (among others :-))
dynamicField name=*A type=dlong indexed=false stored=true 

Re: Xml file is not inserting from code java -jar post.jar *.xml

2013-09-22 Thread Jack Krupansky
Did you start Solr? How did you verify that Solr is running? Are you able to 
query Solr and access the Admin UI?


Most importantly, did you successfully complete the standard Solr tutorial? 
(IOW, you know all the necessarily steps for basic operation of Solr.)


Lastly, did you verify (by examining the log) whether Solr was able to 
successfully load your schema changes without errors?


-- Jack Krupansky

-Original Message- 
From: Kishan Parmar

Sent: Sunday, September 22, 2013 9:56 AM
To: solr-user@lucene.apache.org
Subject: Xml file is not inserting from code java -jar post.jar *.xml

hi

i am new user of Solr i have done my schema file and when i write a code to
insert xxl file to index from cmd .java -jar post.jar *.xml

it give us error solr returned errer 404 not found

what can i do???


Regards,

Kishan Parmar
Software Developer
+91 95 100 77394
Jay Shree Krishnaa !! 



Re: Facet search on a docvalue field in a multi shard collection

2013-09-22 Thread Erick Erickson
right, I think you're running into a bug I remember going by. I can't
find it now, JIRA seems to be not responding. As I remember,
where if a shard doesn't have a doc on it, you get an error.

Although why facet.limit should figure in here is a mystery to me,
maybe a coincidence?

Significant work has been done about not requiring values for
DocValues fields and stuff. Can you give a try on 4.x or the
soon-to-be-released 4.5?

Best,
Erick

On Sun, Sep 22, 2013 at 6:26 AM, Trym R. Møller t...@sigmat.dk wrote:
 Hi

 I have a problem doing facet search on a doc value field in a multi shard
 collection. Any ideas what I may be doing wrong?

 My Solr schema specifies fieldA as a docvalue type and I have created a two
 shard collection using Solr 4.4.0.
 When I do a facet search on fieldA with a large facet.limit then the query
 fails with the below exception
 A large facet.limit seems to be when (10 + (facet.limit * 1,5)) * number
 of shards  rows matching my query

 The exception does not occur when I run with a single shard collection.
 It can easily be reproduced by indexing a single row and querying it, as the
 default facet.limit is 100.

 The facet query received by Solr looks as follows:
 576793 [qtp170860084-18] INFO  org.apache.solr.core.SolrCore  ¦
 [trym_shard2_replica1] webapp=/solr path=/select
 params={facet=truestart=0q=*:*distrib=truecollection=trymfacet.field=fieldAwt=javabinversion=2rows=0}
 status=500 QTime=20
 One of the internal query send by Solr to its shard looks like
 576783 [qtp170860084-19] INFO  org.apache.solr.core.SolrCore  ¦
 [trym_shard1_replica1] webapp=/solr path=/select
 params={facet=truedistrib=falsecollection=trym

 wt=javabinversion=2rows=0NOW=1379855011787shard.url=192.168.56.1:8501/solr/trym_shard1_replica1/df=textfl=id,scoref.fieldA.facet.limit=160start=0q=*:
 *facet.field=fieldAisShard=truefsv=true} hits=1 status=500 QTime=2

 576784 [qtp170860084-17] ERROR org.apache.solr.servlet.SolrDispatchFilter  ¦
 null:java.lang.IllegalStateException: Cannot use facet.mincount=0 on a field
 which is not indexed
 at
 org.apache.solr.request.NumericFacets.getCounts(NumericFacets.java:257)
 at
 org.apache.solr.request.SimpleFacets.getTermCounts(SimpleFacets.java:423)
 at
 org.apache.solr.request.SimpleFacets.getFacetFieldCounts(SimpleFacets.java:530)
 at
 org.apache.solr.request.SimpleFacets.getFacetCounts(SimpleFacets.java:259)
 at
 org.apache.solr.handler.component.FacetComponent.process(FacetComponent.java:78)
 at
 org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:208)
 at
 org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
 at org.apache.solr.core.SolrCore.execute(SolrCore.java:1904)
 at
 org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:659)
 at
 org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:362)
 at
 org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:158)
 at
 org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1419)
 at
 org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:455)
 at
 org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:137)
 at
 org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:557)
 at
 org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:231)
 at
 org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1075)
 at
 org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:384)
 at
 org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:193)
 at
 org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1009)
 at
 org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
 at
 org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
 at
 org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
 at
 org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
 at org.eclipse.jetty.server.Server.handle(Server.java:368)
 at
 org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:489)
 at
 org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:53)
 at
 org.eclipse.jetty.server.AbstractHttpConnection.content(AbstractHttpConnection.java:953)
 at
 org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.content(AbstractHttpConnection.java:1014)
 at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:861)
 at
 org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:235)
 at
 

Re: requested url solr/update/extract not available on this server

2013-09-22 Thread Erick Erickson
Please review:

http://wiki.apache.org/solr/UsingMailingLists

Erick

On Sun, Sep 22, 2013 at 5:52 AM, Nutan nutanshinde1...@gmail.com wrote:
 I did define the request handler.


 On Sun, Sep 22, 2013 at 12:51 AM, Erick Erickson [via Lucene] 
 ml-node+s472066n4091391...@n3.nabble.com wrote:

 bq: And im not using the example config file

 It looks like you have not included the request handler in your
 solrconfig.xml,
 something like (from the stock distro):

   !-- Solr Cell Update Request Handler

http://wiki.apache.org/solr/ExtractingRequestHandler

 --
   requestHandler name=/update/extract
   startup=lazy
   class=solr.extraction.ExtractingRequestHandler 
 lst name=defaults
   str name=lowernamestrue/str
   str name=uprefixignored_/str

   !-- capture link hrefs but ignore div attributes --
   str name=captureAttrtrue/str
   str name=fmap.alinks/str
   str name=fmap.divignored_/str
 /lst
   /requestHandler

 I'd start with the stock config and try removing things one-by-one...

 Best,
 Erick

 On Sat, Sep 21, 2013 at 7:34 AM, Nutan [hidden 
 email]http://user/SendEmail.jtp?type=nodenode=4091391i=0
 wrote:

  Yes I do get the solr admin page.And im not using the example config
 file,I
  have create mine own for my project as required.I have also defined
  update/extract in solrconfig.xml.
 
 
  On Tue, Sep 17, 2013 at 4:45 AM, Chris Hostetter-3 [via Lucene] 
  [hidden email] http://user/SendEmail.jtp?type=nodenode=4091391i=1
 wrote:
 
 
  : Is /solr/update working?
 
  more importantly: does /solr/ work in your browser and return
 anything
  useful?  (nothing you've told us yet gives us anyway of knowning if
  solr is even up and running)
 
  if 'http://localhost:8080/solr/' shows you the solr admin UI, and you
 are
  using the stock Solr 4.2 example configs, then
  http://localhost:8080/solr/update/extract should not give you a 404
  error.
 
  if however you are using some other configs, it might not work unless
  those configs register a handler with the path /update/extract.
 
  Using the jetty setup provided with 4.2, and the example configs (from
  4.2) I was able to index a sample PDF just fine using your curl
 command...
 
  hossman@frisbee:~/tmp$ curl 
  http://localhost:8983/solr/update/extract?literal.id=1commit=true; -F
  myfile=@stump.winners.san.diego.2013.pdf
  ?xml version=1.0 encoding=UTF-8?
  response
  lst name=responseHeaderint name=status0/intint
  name=QTime1839/int/lst
  /response
 
 
 
 
 
  :
  : Check solrconfig to see that /update/extract is configured as in the
  standard
  : Solr example.
  :
  : Does /solr/update/extract work for you using the standard Solr
 example?
  :
  : -- Jack Krupansky
  :
  : -Original Message- From: Nutan
  : Sent: Sunday, September 15, 2013 2:37 AM
  : To: [hidden email]
 http://user/SendEmail.jtp?type=nodenode=4090459i=0
  : Subject: requested url solr/update/extract not available on this
 server
  :
  : I am working on Solr 4.2 on Windows 7. I am trying to index pdf
 files.I
  : referred Solr Cookbook 4. Tomcat is using 8080 port number. I get
 this
  : error:requested url solr/update/extract not available on this server
  : When my curl is :
  : curl 
 http://localhost:8080/solr/update/extract?literal.id=1commit=true;
  -F
  : myfile=@cookbook.pdf
  : There is no entry in log files. Please help.
  :
  :
  :
  : --
  : View this message in context:
  :
 
 http://lucene.472066.n3.nabble.com/requested-url-solr-update-extract-not-available-on-this-server-tp4090153.html
  : Sent from the Solr - User mailing list archive at Nabble.com.
  :
 
  -Hoss
 
 
  --
   If you reply to this email, your message will be added to the
 discussion
  below:
 
 
 http://lucene.472066.n3.nabble.com/requested-url-solr-update-extract-not-available-on-this-server-tp4090153p4090459.html
   To unsubscribe from requested url solr/update/extract not available on
  this server, click here

  .
  NAML
 http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewerid=instant_html%21nabble%3Aemail.namlbase=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespacebreadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml

 
 
 
 
 
  --
  View this message in context:
 http://lucene.472066.n3.nabble.com/requested-url-solr-update-extract-not-available-on-this-server-tp4090153p4091371.html

  Sent from the Solr - User mailing list archive at Nabble.com.


 --
  If you reply to this email, your message will be added to the discussion
 below:

 http://lucene.472066.n3.nabble.com/requested-url-solr-update-extract-not-available-on-this-server-tp4090153p4091391.html
  To unsubscribe from requested url solr/update/extract not available on
 this server, click 
 

Re: Xml file is not inserting from code java -jar post.jar *.xml

2013-09-22 Thread Erick Erickson
Please review:

http://wiki.apache.org/solr/UsingMailingLists

Best,
Erick

On Sun, Sep 22, 2013 at 8:06 AM, Jack Krupansky j...@basetechnology.com wrote:
 Did you start Solr? How did you verify that Solr is running? Are you able to
 query Solr and access the Admin UI?

 Most importantly, did you successfully complete the standard Solr tutorial?
 (IOW, you know all the necessarily steps for basic operation of Solr.)

 Lastly, did you verify (by examining the log) whether Solr was able to
 successfully load your schema changes without errors?

 -- Jack Krupansky

 -Original Message- From: Kishan Parmar
 Sent: Sunday, September 22, 2013 9:56 AM
 To: solr-user@lucene.apache.org
 Subject: Xml file is not inserting from code java -jar post.jar *.xml


 hi

 i am new user of Solr i have done my schema file and when i write a code to
 insert xxl file to index from cmd .java -jar post.jar *.xml

 it give us error solr returned errer 404 not found

 what can i do???


 Regards,

 Kishan Parmar
 Software Developer
 +91 95 100 77394
 Jay Shree Krishnaa !!


Re: How to define facet.prefix as case-insensitive

2013-09-22 Thread Erick Erickson
You'll have to lowercase the term in your app and set
terms.prefix to that value, there's no analysis done
on the terms.prefix value.

Best,
Erick

On Sun, Sep 22, 2013 at 4:07 AM, Mysurf Mail stammail...@gmail.com wrote:
 I am using facet.prefix for auto complete.
 This is my definition

  requestHandler name=/ac class=solr.SearchHandler
  lst name=defaults
   str name=echoParamsexplicit/str
   ...
   str name=lowercaseOperatorstrue/str
   str name=faceton/str
   str name=facet.fieldSuggest/str
 /lst

 this is my field

 field name=Suggest type=text_auto indexed=true stored=true
 required=false multiValued=true/

 and

  fieldType class=solr.TextField name=text_auto
   analyzer
 tokenizer class=solr.KeywordTokenizerFactory/
 filter class=solr.LowerCaseFilterFactory/
   /analyzer
 /fieldType

 all works fine but when I search using caps lock it doesn't return answers.
 Even when the field contains capitals letters - it doesn't.

 I assume that the field in solr is lowered (from the field type filter
 definition) but the search term is not.
 How can I control the search term caps/no caps?

 Thanks.


Re: Getting a query parameter in a TokenFilter

2013-09-22 Thread Trey Grainger
Hi Isaac,

In the process of writing Solr in Action (http://solrinaction.com), I have
built the solution to SOLR-5053 for the multilingual search chapter (I
didn't realize this ticket existed at the time).  The solution was
something I called a MultiTextField.  Essentially, the field let's you
map a list of defined pre-fixes to field types and dynamically substitute
in one or more field types based upon the incoming content.

For example:

#schema.xml#
 fieldType name=multiText
class=sia.ch14.MultiTextField sortMissingLast=true
defaultFieldType=text_general
fieldMappings=en:text_english,
   es:text_spanish,
   fr:text_french/

fieldType name=text_english ... /
fieldType name=text_spanish ... /
fieldType name=text_french ... /

field name=content type=multiText indexed=true ... /
#document#
adddoc
  field name=id1/field
  field name=contenten,es|the schools, la escuala/field
/doc/add

#Outputted Token Stream#:
[Position 1]   [Position 2]   [Position 3] [Position 4]
 the   school   la
escuela
 schools
escuel

#query on two languages#
q=en,es|la OR en,es|escuela

 Essentially, this MultiText field type lets you dynamically combine one or
more Analyzers (from a defined field type) and stack the tokens based upon
term positions within each independent Analyzer.  The use case here was
multiple

To answer your original question... at query time, this implementation
requires that you pass the prefix before EACH term in the query, not just
the first term (you can see this in the q= I demonstrated above).  If you
have a Token Filter you have developed, you could probably accomplish
what you are trying to do the same way.

You could write a custom QParserPlugin that would do this for you I think.
 Alternatively, it may be possible to create a similar implementation that
makes use of a dynamic field name (i.e.  content|en,fr as the field
name), which would pull the prefix from the field name and apply it to all
tokens instead of requiring/allowing each token to specify it's own prefix.
 I haven't done this in my implementation, but I could see where it might
be more user-friendly for many Solr users.

I'm just finishing up the multilingual search chapter and code now and
will be happy to post it to SOLR-5053 once I finish in the next few days if
this would be helpful to you.

-Trey


On Sat, Sep 21, 2013 at 4:15 PM, Isaac Hebsh isaac.he...@gmail.com wrote:

 Thought about that again,
 We can do this work as a search component, manipulating the query string.
 The cons are the double QParser work, and the double tokenization work.

 Another approach which might solve this issue easily is Dynamic query
 analyze chain: https://issues.apache.org/jira/browse/SOLR-5053

 What would you do?


 On Tue, Sep 17, 2013 at 10:31 PM, Isaac Hebsh isaac.he...@gmail.com
 wrote:

  Hi everyone,
 
  We developed a TokenFilter.
  It should act differently, depends on a parameter supplied in the
  query (for query chain only, not the index one, of course).
  We found no way to pass that parameter into the TokenFilter flow. I guess
  that the root cause is because TokenFilter is a pure lucene object.
 
  As a last resort, we tried to pass the parameter as the first term in the
  query text (q=...), and save it as a member of the TokenFilter instance.
 
  Although it is ugly, it might work fine.
  But, the problem is that it is not guaranteed that all the terms of a
  particular query will be analyzed by the same instance of a TokenFilter.
 In
  this case, some terms will be analyzed without the required information
 of
  that parameter. We can produce such a race very easily.
 
  How should I overcome this issue?
  Do anyone have a better resolution?
 



Re: dih delete doc per $deleteDocById

2013-09-22 Thread Andreas Owen
sorry, it works like this, i had a typo in my conf :-(

On 17. Sep 2013, at 2:44 PM, Andreas Owen wrote:

 i would like to know how to get it to work and delete documents per xml and 
 dih.
 
 On 17. Sep 2013, at 1:47 PM, Shalin Shekhar Mangar wrote:
 
 What is your question?
 
 On Tue, Sep 17, 2013 at 12:17 AM, andreas owen a.o...@gmx.net wrote:
 i am using dih and want to delete indexed documents by xml-file with ids. i 
 have seen $deleteDocById used in entity query=...
 
 data-config.xml:
 entity name=rec processor=XPathEntityProcessor 
 url=file:///C:\ColdFusion10\cfusion\solr\solr\tkbintranet\docImportDelete.xml
  forEach=/docs/doc dataSource=main 
   field column=$deleteDocById xpath=//id /
 /entity
 
 xml-file:
 docs
   doc
   id2345/id
   /doc
 /docs
 
 
 
 -- 
 Regards,
 Shalin Shekhar Mangar.



Re: Equivalent Java client code

2013-09-22 Thread Shawn Heisey
On 9/22/2013 7:43 AM, Baskar Sikkayan wrote:
 http://localhost:18080/solr/select?defType=dismaxpf=tagged_skills
 ^100q=java+ejb+Springqf=tagged_skillsq.op=AND
 
 I am not sure how to execute the same thing using Solr Java client.
 
 SolrQuery query = new SolrQuery();
 query.setQuery( tagged_skills:\java+and+ejb\ );
 QueryResponse rsp = server.query( query );
 SolrDocumentList docs = rsp.getResults();
 
 
 Also not sure how to check if the given search is there in title and
 tageed_skills field.

The solrj code for a query that's completely identical to your HTTP
query is this:

SolrQuery qry = new SolrQuery();
qry.set(defType, dismax);
qry.set(pf, tagged_skills^100);
qry.set(qf, tagged_skills);
qry.set(q.op, AND);
qry.setQuery(java ejb Spring);

If you change pf to tagged_skills^100 title^100 and qf to
tagged_skills title that should extend the search to the title field
as well.  Note that with q.op set to AND, the results may not be what
you expect.  I'm not sure whether dismax is smart enough to only apply
the AND within each of the field searches, I would expect it to apply it
across the board so all the query data must exist in all query fields.

Thanks,
Shawn



Near Duplicate Document Detection at Solr

2013-09-22 Thread Furkan KAMACI
I want to detect near duplicate documents (for web documents). I know that
there is an algorithm called Winnowing and there is another technique used
by Google. However I also know that Solr has a component called
MoreLikeThis. Google's page explains that *mirroring and plagiarism* is
easy to detect but near duplicate detection is much more behind it.

So I want to ask that what is the underlying algorithm Solr MoreLikeThis
component uses and can I use it for such kind of purposes?

Otherwise, I will implement an algorithm for near duplicate document
detection within few days and I will be proud to contribute and adopt it
into Solr.

Thanks;
Furkan KAMACI


Re: Near Duplicate Document Detection at Solr

2013-09-22 Thread Furkan KAMACI
I've also know that there is another mechanism at Solr:
http://wiki.apache.org/solr/Deduplication I think that I should add a
custom signature because that is the most usable one for me:
http://wiki.apache.org/solr/TextProfileSignature On the other hand are
there any limitation for deduplication at SolrCloud?

What do you think?


2013/9/22 Furkan KAMACI furkankam...@gmail.com

 I want to detect near duplicate documents (for web documents). I know that
 there is an algorithm called Winnowing and there is another technique used
 by Google. However I also know that Solr has a component called
 MoreLikeThis. Google's page explains that *mirroring and plagiarism* is
 easy to detect but near duplicate detection is much more behind it.

 So I want to ask that what is the underlying algorithm Solr MoreLikeThis
 component uses and can I use it for such kind of purposes?

 Otherwise, I will implement an algorithm for near duplicate document
 detection within few days and I will be proud to contribute and adopt it
 into Solr.

 Thanks;
 Furkan KAMACI



RE: Near Duplicate Document Detection at Solr

2013-09-22 Thread Markus Jelsma

-Original message-
 From:Furkan KAMACI furkankam...@gmail.com
 Sent: Sunday 22nd September 2013 21:15
 To: solr-user@lucene.apache.org
 Subject: Re: Near Duplicate Document Detection at Solr
 
 I've also know that there is another mechanism at Solr:
 http://wiki.apache.org/solr/Deduplication I think that I should add a
 custom signature because that is the most usable one for me:
 http://wiki.apache.org/solr/TextProfileSignature

Keep in mind, its results are really bad for short documents and does not work 
for languages not using whitespace.

 On the other hand are
 there any limitation for deduplication at SolrCloud?

Yes, it does not work:
https://issues.apache.org/jira/browse/SOLR-3473

 
 What do you think?
 
 
 2013/9/22 Furkan KAMACI furkankam...@gmail.com
 
  I want to detect near duplicate documents (for web documents). I know that
  there is an algorithm called Winnowing and there is another technique used
  by Google. However I also know that Solr has a component called
  MoreLikeThis. Google's page explains that *mirroring and plagiarism* is
  easy to detect but near duplicate detection is much more behind it.
 
  So I want to ask that what is the underlying algorithm Solr MoreLikeThis
  component uses and can I use it for such kind of purposes?
 
  Otherwise, I will implement an algorithm for near duplicate document
  detection within few days and I will be proud to contribute and adopt it
  into Solr.
 
  Thanks;
  Furkan KAMACI
 
 


import partition table from oracle

2013-09-22 Thread YouPeng Yang
Hi

  I want to import dataset in a partition of a partition table with DIH.
And I would like to explicitly define the partition when I do import job.

 To be specific.
  1. I define the DIH configuration like these
entity name=type query=select id as tid from movie_type  partition(${
part})''

  2.I send the url:
  http://localhost:8983/solr/dataimport?command=full-importpart=p2

  and then the DIHhandler will full import the p2 partition of the table.

Any suggestion will be appreciated.


Regards.


import partition table from oracle

2013-09-22 Thread YouPeng Yang
Hi

  I want to import dataset in a partition of a partition table with DIH.
And I would like to explicitly define the partition when I do import job.

 To be specific.
  1. I define the DIH configuration like these
entity name=type query=select id as tid from movie_type  partition(${
part})''

  2.I send the url:
  http://localhost:8983/solr/dataimport?command=full-importpart=p2

  and then the DIHhandler will full import the p2 partition of the table.

Any suggestion will be appreciated.


Regards.


Interesting edismax/qs bug in Solr 3.5

2013-09-22 Thread Arcadius Ahouansou
We have been seeing a strange bug in our prod Solr 3.5.

I went to download a fresh copy of Solr3.5, with default schema  and
indexed (curl or post.jar) the following 2 docs

[
   {
  id:1,
  title:One Earth
   },
   {
  id:2,
  title:One Love One Earth
   }

]


I could browse and see the docs in solr.

However, when I do:
/solr/select?q=One Love One Earthqf=titleqs=2defType=edismaxpf=title

I get nothing back.
when I change qs=4 in the query, then I see the expected doc2.
debugQuery=true does not reveal anything.

- I have noticed that when I reverse the order of the documents in the
input file i.e doc2 first, then doc1 , and do the  indexing (using curl or
post.jar), the the query above works and return doc2 as expected.
- Same when I index only doc2 (doc1 not indexed).

I tested solr3.6.2  and 4.4.0 and I can confirm they are not affected by
this issue.

I looked at the change logs for 3.6.2 and jira but could not find any trace
of this problem.

Any pointer to the ticket that addressed this issue will be appreciated.


Thank you very much.


Arcadius.
.


RE: Interesting edismax/qs bug in Solr 3.5

2013-09-22 Thread Michael Ryan
Sounds like https://issues.apache.org/jira/browse/LUCENE-3821 (issue seems to 
be fixed but still shows as open).

-Michael

-Original Message-
From: Arcadius Ahouansou [mailto:arcad...@menelic.com] 
Sent: Sunday, September 22, 2013 11:15 PM
To: solr-user
Subject: Interesting edismax/qs bug in Solr 3.5

We have been seeing a strange bug in our prod Solr 3.5.

I went to download a fresh copy of Solr3.5, with default schema  and indexed 
(curl or post.jar) the following 2 docs

[
   {
  id:1,
  title:One Earth
   },
   {
  id:2,
  title:One Love One Earth
   }

]


I could browse and see the docs in solr.

However, when I do:
/solr/select?q=One Love One Earthqf=titleqs=2defType=edismaxpf=title

I get nothing back.
when I change qs=4 in the query, then I see the expected doc2.
debugQuery=true does not reveal anything.

- I have noticed that when I reverse the order of the documents in the input 
file i.e doc2 first, then doc1 , and do the  indexing (using curl or post.jar), 
the the query above works and return doc2 as expected.
- Same when I index only doc2 (doc1 not indexed).

I tested solr3.6.2  and 4.4.0 and I can confirm they are not affected by this 
issue.

I looked at the change logs for 3.6.2 and jira but could not find any trace of 
this problem.

Any pointer to the ticket that addressed this issue will be appreciated.


Thank you very much.


Arcadius.
.