Re: Schema Parsing Failed: unknown field 'id' [Zookeeper, SolrCloud]

2014-09-22 Thread paulparsons
Thanks for the suggestions. I actually had both problems. I couldn't figure
out how to remove the configs from zookeeper through the cloud scripts, so I
just manually removed the files in the zookeeper data directory.
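
A less invasive alternative to deleting files from the ZooKeeper data directory is to remove the config znodes through a client. The following is a minimal Python sketch, not what was actually done in this thread. It assumes the third-party kazoo client and that the configs sit under /configs/NAME (where SolrCloud uploads them when no chroot is in use). The names remove_config and main, and the host and config name in the usage note, are hypothetical.

```python
def remove_config(zk, config_name, root="/configs"):
    """Recursively delete one named config set.

    `zk` is any client exposing kazoo-style exists() and delete().
    Returns True if the znode existed and was deleted, False otherwise.
    """
    path = f"{root}/{config_name}"
    if zk.exists(path) is None:
        return False
    zk.delete(path, recursive=True)
    return True


def main(zk_hosts, config_name):
    # kazoo is a third-party package; it is imported here so that
    # remove_config() above can be exercised without it installed.
    from kazoo.client import KazooClient

    zk = KazooClient(hosts=zk_hosts)
    zk.start()
    try:
        removed = remove_config(zk, config_name)
        print(f"{config_name}: {'removed' if removed else 'not found'}")
    finally:
        zk.stop()
```

Usage would look like main("localhost:2181", "mycore"), with both values replaced by the real ZooKeeper address and config name. After removing a config, it has to be uploaded again (for example by restarting with -Dbootstrap_conf=true as in the original command).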



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Schema-Parsing-Failed-unknown-field-id-Zookeeper-SolrCloud-tp4160478p4160580.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Schema Parsing Failed: unknown field 'id' [Zookeeper, SolrCloud]

2014-09-22 Thread paulparsons
Thanks. There is definitely a  in each of the schemas.

I am using 4.7.2.

Here is one of the *schema.xml* files (the others are similar):

[XML markup stripped by the archive; the only surviving text is the value 'id'.]

Here is the corresponding *solrconfig.xml*: 





  4.7

  
  
  

  

  
  
  

  ${solr.medline-citations.data.dir:}

  

   

  

  

  
${solr.lock.type:native}
true
  

  

  
  


  ${solr.ulog.dir:}


  
   ${solr.autoCommit.maxTime:15000} 
   false 
 

  
   ${solr.autoSoftCommit.maxTime:-1} 
 

  

  

1024







 


true

   20

   200


  

  


  

  static firstSearcher warming in solrconfig.xml

  


false

2

  

  
 




  

  

 
   explicit
   100
   catchall
   
 



  
  
 
   explicit
   json
   true
   text
 
  

  
 
   true
   json
   true
 
  

  
 
   explicit

   
   velocity
   browse
   layout
   Solritas

   
   edismax
   
  catchall^0.5 medline_abstract_text^1.0 medline_journal_title^1.2
medline_article_title^1.2 id^10.0 
   
   catchall
   100%
   *:*
   10
   *,score

   
 catchall^0.5 medline_abstract_text^1.0 medline_journal_title^1.2
medline_article_title^1.2 id^10.0
   
   catchall,medline_article_title,medline_journal_title
   3

   on
   medline_author_lastname
   medline_journal_title

   on
   medline_abstract_text
   html
   
   
   0
   medline_abstract_text

   on
   false   
   5
   2
   5   
   true
   true  
   5
   3   
 

 
 
   spellcheck
 
  


  

 application/json
   
  
  

 application/csv
   
  


  

  true
  ignored_

  
  true
  links
  ignored_

  

  

  

  

  
  

  solrpingquery


  all


  

  
  

 explicit 
 true

  

   
  



  

  ./medline-citations_DIHconfig.xml
  uima

  

  

text_general


  default
  catchall
  solr.DirectSolrSpellChecker
  
  internal
  
  0.5
  
  2
  
  1
  
  5
  
  4
  
  0.01
  




  wordbreak
  solr.WordBreakSolrSpellChecker  
  name
  true
  true
  10


  

  

  catchall
  
  default
  wordbreak
  on
  true   
  10
  5
  5   
  true
  true  
  10
  5 


  spellcheck

  

  

  suggest
  org.apache.solr.spelling.suggest.Suggester
  org.apache.solr.spelling.suggest.tst.TSTLookupFactory
  
  suggestions  
  
  true


  

  

  true
  suggest
  
  10
  


  suggest

  


  
  

  

  text
  true


  tvComponent

  

  

  lingo

  org.carrot2.clustering.lingo.LingoClusteringAlgorithm

  clustering/carrot2




  stc
  org.carrot2.clustering.stc.STCClusteringAlgorithm




  kmeans
  org.carrot2.clustering.kmeans.BisectingKMeansClusteringAlgorithm

  
  
  
  

  
  
 
  true
  false
 

  terms

  


  
  

string
elevate.xml
  

  
  

  explicit
  text


  elevator

  

  
  

  
  
  

  100

  

  
  

  
  70
  
  0.5
  
  [-\w ,/\n\"']{20,200}

  

  
  

  
  

  

  
  

  
  
  
  
  
  
  
  
  
  
  

  

  
  

  
  

  
  
  

  10
  .,!? 	


  
  
  

  
  WORD
  
  
  en
  US

  

  

   



  


  
  /org/apache/uima/desc/AggregateGeneAE.xml
  
  
  true
  
  
false

   medline_abstract_text

Schema Parsing Failed: unknown field 'id' [Zookeeper, SolrCloud]

2014-09-22 Thread paulparsons
Hi,

I'm trying to set up a multicore SolrCloud on HDFS. I am getting the
following error for all my cores when trying to start the server:

ERROR org.apache.solr.core.CoreContainer  – Unable to create core:
org.apache.solr.common.SolrException: Schema Parsing Failed: unknown field 'id'. Schema file is solr//schema.xml
    at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:618)
    at org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:166)
    at org.apache.solr.schema.IndexSchemaFactory.create(IndexSchemaFactory.java:55)
    at org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:69)
    at org.apache.solr.core.ZkContainer.createFromZk(ZkContainer.java:243)
    at org.apache.solr.core.CoreContainer.create(CoreContainer.java:595)
    at org.apache.solr.core.CoreContainer$1.call(CoreContainer.java:258)
    at org.apache.solr.core.CoreContainer$1.call(CoreContainer.java:250)
    at java.util.concurrent.FutureTask.run(FutureTask.java:273)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:482)
    at java.util.concurrent.FutureTask.run(FutureTask.java:273)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1176)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:641)
    at java.lang.Thread.run(Thread.java:853)
Caused by: java.lang.RuntimeException: unknown field 'id'
    at org.apache.solr.schema.IndexSchema.getIndexedField(IndexSchema.java:340)
    at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:536)
    ... 13 more


There is nothing wrong with the declaration of the 'id' field, and I have it
working fine when it's not using SolrCloud/HDFS. One odd thing is the part
that says "Schema file is solr//schema.xml", because there is no
schema file there. I have no idea where it is getting that path from. All of
the schema files are in solr//conf/schema.xml. I'm not sure if
this is the problem--it must be finding the schema, otherwise how does it
know about the 'id' field?
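
As a quick sanity check on whichever schema file Solr is actually reading, the following Python sketch parses a schema.xml and reports whether the uniqueKey names a declared field, which appears to be the lookup that fails in the trace above (IndexSchema.getIndexedField). It is an illustration only: check_unique_key is a hypothetical helper, the sample schema is made up, and dynamic fields are ignored.

```python
import xml.etree.ElementTree as ET


def check_unique_key(schema_xml):
    """Return (unique_key, declared) for a schema.xml given as a string.

    unique_key is the text of the first <uniqueKey> element, or None if
    there is none; declared says whether a <field> with that name exists.
    """
    root = ET.fromstring(schema_xml.strip())
    # Collect explicit <field name="..."> declarations at any nesting depth.
    declared_fields = {f.get("name") for f in root.iter("field")}
    key_element = next(root.iter("uniqueKey"), None)
    if key_element is None or not (key_element.text or "").strip():
        return None, False
    unique_key = key_element.text.strip()
    return unique_key, unique_key in declared_fields


# Made-up sample: the uniqueKey points at a field that is not declared.
SAMPLE = """
<schema name="example" version="1.5">
  <fields>
    <field name="title" type="string" indexed="true" stored="true"/>
  </fields>
  <uniqueKey>id</uniqueKey>
</schema>
"""

if __name__ == "__main__":
    print(check_unique_key(SAMPLE))
```

Running the same check against the copy of the schema stored in ZooKeeper, rather than the local conf directory, would show whether the two have drifted apart.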

I am running it with the following command (with <> fields filled in
appropriately):

java -DnumShards=2 -Dbootstrap_conf=true -DzkHost=:2181 -Dhost=
-DSTOP.PORT=7983 -DSTOP.KEY=key -Dsolr.directoryFactory=HdfsDirectoryFactory
-Dsolr.hdfs.confdir=/hadoop-conf -Dsolr.lock.type=hdfs
-Dsolr.hdfs.home=hdfs:///user/pparsons/solrcloud -jar start.jar



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Schema-Parsing-Failed-unknown-field-id-Zookeeper-SolrCloud-tp4160478.html


Re: Integrate UIMA and DIH

2014-08-25 Thread paulparsons
I forgot to mention in the previous post that I changed the analysis engine
from

/org/apache/uima/desc/OverridingParamsExtServicesAE.xml

to

/org/apache/uima/desc/AggregateSentenceAE

In doing so, I forgot the '.xml' extension, which is what was causing the
error. It would be helpful if the error messages were a little more
descriptive!
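
Since the error message gives no hint, a small pre-flight check can catch this kind of typo before Solr starts. The Python sketch below scans a solrconfig.xml for analysis engine paths that lack the '.xml' extension. It assumes the path is configured as a str element named analysisEngine, as in the UIMA integration guide; engine_paths_missing_xml is a hypothetical helper and the sample config is a trimmed, made-up fragment.

```python
import xml.etree.ElementTree as ET


def engine_paths_missing_xml(solrconfig_xml):
    """Return analysisEngine values that do not end in '.xml'."""
    root = ET.fromstring(solrconfig_xml.strip())
    paths = [
        (el.text or "").strip()
        for el in root.iter("str")
        if el.get("name") == "analysisEngine"
    ]
    return [p for p in paths if not p.endswith(".xml")]


# Trimmed, made-up fragment using the mistyped path from this post.
SAMPLE = """
<config>
  <updateRequestProcessorChain name="uima" default="true">
    <processor class="org.apache.solr.uima.processor.UIMAUpdateRequestProcessorFactory">
      <lst name="uimaConfig">
        <str name="analysisEngine">/org/apache/uima/desc/AggregateSentenceAE</str>
      </lst>
    </processor>
  </updateRequestProcessorChain>
</config>
"""

if __name__ == "__main__":
    for path in engine_paths_missing_xml(SAMPLE):
        print("analysisEngine path without .xml extension:", path)
```

This only checks the extension; it does not verify that the descriptor can actually be found on the classpath.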



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Integrate-UIMA-and-DIH-tp4154576p4155075.html


Re: Integrate UIMA and DIH

2014-08-25 Thread paulparsons
I added default="true" to my updateRequestProcessorChain:



Now I'm getting errors when running the DIH:


ERROR org.apache.solr.core.SolrCore  – org.apache.solr.common.SolrException: org.apache.uima.resource.ResourceInitializationException
    at org.apache.solr.uima.processor.UIMAUpdateRequestProcessorFactory.getInstance(UIMAUpdateRequestProcessorFactory.java:64)
    at org.apache.solr.update.processor.UpdateRequestProcessorChain.createProcessor(UpdateRequestProcessorChain.java:204)
    at org.apache.solr.handler.dataimport.DataImportHandler.handleRequestBody(DataImportHandler.java:178)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:1962)
    at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:777)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:418)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:207)
    at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1419)
    at org.eclipse.jetty.servlets.CrossOriginFilter.handle(CrossOriginFilter.java:247)
    at org.eclipse.jetty.servlets.CrossOriginFilter.doFilter(CrossOriginFilter.java:210)
    at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1419)
    at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:455)
    at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:137)
    at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:557)
    at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:231)
    at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1075)
    at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:384)
    at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:193)
    at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1009)
    at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
    at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
    at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
    at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
    at org.eclipse.jetty.server.Server.handle(Server.java:368)
    at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:489)
    at org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:53)
    at org.eclipse.jetty.server.AbstractHttpConnection.content(AbstractHttpConnection.java:953)
    at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.content(AbstractHttpConnection.java:1014)
    at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:861)
    at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:240)
    at org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpConnection.java:72)
    at org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(SocketConnector.java:264)
    at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
    at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
    at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.uima.resource.ResourceInitializationException
    at org.apache.lucene.analysis.uima.ae.BasicAEProvider.getAE(BasicAEProvider.java:58)
    at org.apache.solr.uima.processor.UIMAUpdateRequestProcessorFactory.getInstance(UIMAUpdateRequestProcessorFactory.java:61)
    ... 35 more
Caused by: java.lang.NullPointerException
    at org.apache.uima.util.XMLInputSource.<init>(XMLInputSource.java:118)
    at org.apache.lucene.analysis.uima.ae.BasicAEProvider.getInputSource(BasicAEProvider.java:84)
    at org.apache.lucene.analysis.uima.ae.BasicAEProvider.getAE(BasicAEProvider.java:50)
    ... 36 more



I've looked at the source code that is pointed to, but can't determine what
the problem is. I've also noticed from other posts that people in the past
have had a similar problem with ResourceInitializationException, but there
doesn't seem to be any general solution.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Integrate-UIMA-and-DIH-tp4154576p4155039.html


Integrate UIMA and DIH

2014-08-22 Thread paulparsons
Hi,

I am trying to integrate UIMA and Solr. I'm following the guide here:
https://cwiki.apache.org/confluence/display/solr/UIMA+Integration


I'm also already using DIH to import from XML files.

Here is what I've added to solrconfig.xml:

libraries:

  
  
  


UpdateRequestProcessorChain:


  

  /org/apache/uima/desc/OverridingParamsExtServicesAE.xml
  false
  
false

   text_en

  
  

  org.apache.uima.SentenceAnnotation
  
coveredText
sentence
  

  

  
  
  



and the DIH:

  

  ./medline_DIHconfig.xml
  uima

  


and I've added this field to the schema:




The DIH part of the indexing works fine and no errors are given. But nothing
happens with the sentence field. I'm not sure what I'm missing here. I've
been searching all over but can't seem to find any useful information.

Thanks.

 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Integrate-UIMA-and-DIH-tp4154576.html


Use XSD or DTD to make Solr schema?

2014-05-06 Thread paulparsons
I'm new to Solr, so forgive me if this is a silly question. Although I can
find some related information (in this list and elsewhere), I can't seem to
find a clear answer to my specific question: 

If I have a DTD or XSD that describes the structure of a set of XML
documents that I have, is there some way to convert that into the schema.xml
that Solr requires? Or do I need to construct the schema from scratch
manually?

Thanks.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Use-XSD-or-DTD-to-make-Solr-schema-tp4135031.html