i have now changed some things and the import runs without error. in schema.xml i haven't got the field "text" but "contentsExact". unfortunatly the text (from file) isn't indexed even though i mapped it to the proper field. what am i doing wrong?
data-config.xml: <dataConfig> <dataSource type="BinFileDataSource" name="data"/> <dataSource type="BinURLDataSource" name="dataUrl"/> <dataSource type="URLDataSource" baseUrl="http://127.0.0.1/tkb/internet/" name="main"/> <document> <entity name="rec" processor="XPathEntityProcessor" url="docImport.xml" forEach="/albums/album" dataSource="main"> <!--transformer="script:GenerateId"--> <field column="title" xpath="//title" /> <field column="id" xpath="//file" /> <field column="path" xpath="//path" /> <field column="Author" xpath="//author" /> <!-- <field column="tstamp">2013-07-05T14:59:46.889Z</field> --> <entity name="f" processor="FileListEntityProcessor" baseDir="C:\web\development\tkb\internet\public" fileName="${rec.id}" dataSource="data" onError="skip"> <entity name="tika" processor="TikaEntityProcessor" url="${f.fileAbsolutePath}"> <field column="text" name="contentsExact" /> </entity> </entity> </entity> </document> </dataConfig> i noticed, that when I move the field author into the tika-<entity> it isn't indexed. can this have something to do why the text from the file isn't indexed? Do I have to do something special about the <entity>-levels in <document> ps: how do i import tsstamp, it's a static value? On 14. Jul 2013, at 10:30 PM, Jack Krupansky wrote: > "Caused by: java.lang.NoSuchMethodError:" > > That means you have some out of date jars or some newer jars mixed in with > the old ones. > > -- Jack Krupansky > > -----Original Message----- From: Andreas Owen > Sent: Sunday, July 14, 2013 3:07 PM > To: solr-user@lucene.apache.org > Subject: Re: solr autodetectparser tikaconfig dataimporter error > > hi > > is there nowone with a idea what this error is or even give me a pointer > where to look? If not is there a alternitave way to import documents from a > xml-file with meta-data and the filename to parse? > > thanks for any help. > > > On 12. Jul 2013, at 10:38 PM, Andreas Owen wrote: > >> i am using solr 3.5, tika-app-1.4 and tagcloud 1.2.1. when i try to = >> import a >> file via xml i get this error, it doesn't matter what file format i try = >> to index txt, cfm, pdf all the same error: >> >> SEVERE: Exception while processing: rec document : >> SolrInputDocument[{id=3Did(1.0)=3D{myTest.txt}, >> title=3Dtitle(1.0)=3D{Beratungsseminar kundenbrief}, = >> contents=3Dcontents(1.0)=3D{wie >> kommuniziert man}, author=3Dauthor(1.0)=3D{Peter Z.}, >> = >> path=3Dpath(1.0)=3D{download/online}}]:org.apache.solr.handler.dataimport.= >> DataImportHandlerException: >> java.lang.NoSuchMethodError: >> = >> org.apache.tika.parser.AutoDetectParser.setConfig(Lorg/apache/tika/config/= >> TikaConfig;)V >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.jav= >> a:669) >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.jav= >> a:622) >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:2= >> 68) >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:187)= >> >> at >> = >> org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.= >> java:359) >> at >> = >> org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:4= >> 27) >> at >> = >> org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:40= >> 8) >> Caused by: java.lang.NoSuchMethodError: >> = >> org.apache.tika.parser.AutoDetectParser.setConfig(Lorg/apache/tika/config/= >> TikaConfig;)V >> at >> = >> org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityP= >> rocessor.java:122) >> at >> = >> org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityPr= >> ocessorWrapper.java:238) >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.jav= >> a:596) >> ... 6 more >> >> Jul 11, 2013 5:23:36 PM org.apache.solr.common.SolrException log >> SEVERE: Full Import >> failed:org.apache.solr.handler.dataimport.DataImportHandlerException: >> java.lang.NoSuchMethodError: >> = >> org.apache.tika.parser.AutoDetectParser.setConfig(Lorg/apache/tika/config/= >> TikaConfig;)V >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.jav= >> a:669) >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.jav= >> a:622) >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:2= >> 68) >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:187)= >> >> at >> = >> org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.= >> java:359) >> at >> = >> org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:4= >> 27) >> at >> = >> org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:40= >> 8) >> Caused by: java.lang.NoSuchMethodError: >> = >> org.apache.tika.parser.AutoDetectParser.setConfig(Lorg/apache/tika/config/= >> TikaConfig;)V >> at >> = >> org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityP= >> rocessor.java:122) >> at >> = >> org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityPr= >> ocessorWrapper.java:238) >> at >> = >> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.jav= >> a:596) >> ... 6 more >> >> Jul 11, 2013 5:23:36 PM org.apache.solr.update.DirectUpdateHandler2 = >> rollback >> >> data-config.xml: >> <dataConfig> >> <dataSource type=3D"BinURLDataSource" name=3D"data"/> >> <dataSource type=3D"URLDataSource" = >> baseUrl=3D"http://127.0.0.1/tkb/internet/" >> name=3D"main"/> >> <document> >> <entity name=3D"rec" processor=3D"XPathEntityProcessor" = >> url=3D"docImport.xml" >> forEach=3D"/albums/album" dataSource=3D"main">=20 >> <field column=3D"title" xpath=3D"//title" /> >> <field column=3D"id" xpath=3D"//file" /> >> <field column=3D"contents" xpath=3D"//description" /> >> <field column=3D"path" xpath=3D"//path" /> >> <field column=3D"Author" xpath=3D"//author" /> >> =09 >> =09 >> =09 >> <entity processor=3D"TikaEntityProcessor" >> = >> url=3D"file:///C:\web\development\tkb\internet\public\download\online\${re= >> c.id}" >> dataSource=3D"data" onerror=3D"skip"> >> <field column=3D"contents" name=3D"text" /> >> </entity> >> </entity> >> </document> >> </dataConfig> >> >> the lib are included and declared in the logs, i have also tried = >> tika-app >> 1.0 and tagsoup 1.2 with the same result. can someone please help, i = >> don't >> know where to start looking for the error.