"path" was set to stored, "text" wasn't, but it doesn't make a difference. My importer says 1 row fetched, 0 docs processed, 0 docs skipped. I don't understand how it can have 2 docs indexed with such an output.
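For reference, a minimal sketch of what "stored" definitions for these two fields might look like in schema.xml. The type names ("string", "text_general") and the indexed/multiValued settings are assumptions based on the stock Solr 4.x example schema, not taken from the schema discussed in this thread:

```xml
<!-- Hypothetical field definitions. The types and the indexed/multiValued
     settings are assumed from the stock Solr 4.x example schema. -->
<field name="path" type="string" indexed="true" stored="true"/>
<field name="text" type="text_general" indexed="true" stored="true" multiValued="true"/>
```

A field with stored="false" can still be searched, but its value is not returned in query results, which can make it look as if the field was never populated.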
On 20. Jul 2013, at 12:47 PM, Shalin Shekhar Mangar wrote:

> Are the "path" and "text" fields set to "stored" in the schema.xml?
>
>
> On Sat, Jul 20, 2013 at 3:37 PM, Andreas Owen <a...@conx.ch> wrote:
>
>> they are in my schema, path is typed correctly the others are default
>> fields which already exist. all the other fields are populated and i can
>> search for them, just path and text aren't.
>>
>>
>> On 19. Jul 2013, at 6:16 PM, Alexandre Rafalovitch wrote:
>>
>>> Dumb question: they are in your schema? Spelled right, in the right
>>> section, using types also defined? Can you populate them by hand with a
>>> CSV file and post.jar?
>>>
>>> Regards,
>>> Alex.
>>>
>>> Personal website: http://www.outerthoughts.com/
>>> LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
>>> - Time is the quality of nature that keeps events from happening all at
>>> once. Lately, it doesn't seem to be working. (Anonymous - via GTD book)
>>>
>>>
>>> On Fri, Jul 19, 2013 at 12:09 PM, Andreas Owen <a...@conx.ch> wrote:
>>>
>>>> i'm using solr 4.3 which i just downloaded today and am using only jars
>>>> that came with it. i have enabled the dataimporter and it runs without
>>>> error. but the field "path" (included in schema.xml) and "text" (file
>>>> content) aren't indexed. what am i doing wrong?
>>>>
>>>> solr-path: C:\ColdFusion10\cfusion\jetty-new
>>>> collection-path: C:\ColdFusion10\cfusion\jetty-new\solr\collection1
>>>> pdf-doc-path: C:\web\development\tkb\internet\public
>>>>
>>>>
>>>> data-config.xml:
>>>>
>>>> <dataConfig>
>>>>   <dataSource type="BinFileDataSource" name="data"/>
>>>>   <dataSource type="BinURLDataSource" name="dataUrl"/>
>>>>   <dataSource type="URLDataSource" baseUrl="http://127.0.0.1/tkb/internet/" name="main"/>
>>>>   <document>
>>>>     <entity name="rec" processor="XPathEntityProcessor"
>>>>             url="docImportUrl.xml" forEach="/albums/album" dataSource="main">
>>>>       <!-- transformer="script:GenerateId"-->
>>>>       <field column="title" xpath="//title" />
>>>>       <field column="id" xpath="//file" />
>>>>       <field column="path" xpath="//path" />
>>>>       <field column="Author" xpath="//author" />
>>>>
>>>>       <!-- <field column="tstamp">2013-07-05T14:59:46.889Z</field> -->
>>>>
>>>>       <entity name="tika" processor="TikaEntityProcessor"
>>>>               url="../../../../../web/development/tkb/internet/public/${rec.path}/${rec.id}"
>>>>               dataSource="data" >
>>>>         <field column="text" />
>>>>
>>>>       </entity>
>>>>     </entity>
>>>>   </document>
>>>> </dataConfig>
>>>>
>>>>
>>>> docImportUrl.xml:
>>>>
>>>> <?xml version="1.0" encoding="utf-8"?>
>>>> <albums>
>>>>   <album>
>>>>     <author>Peter Z.</author>
>>>>     <title>Beratungsseminar kundenbrief</title>
>>>>     <description>wie kommuniziert man</description>
>>>>     <file>0226520141_e-banking_Checkliste_CLX.Sentinel.pdf</file>
>>>>     <path>download/online</path>
>>>>   </album>
>>>>   <album>
>>>>     <author>Marcel X.</author>
>>>>     <title>kuchen backen</title>
>>>>     <description>torten, kuchen, gebäck ...</description>
>>>>     <file>Kundenbrief.pdf</file>
>>>>     <path>download/online</path>
>>>>   </album>
>>>> </albums>
>>
>>
>
>
> --
> Regards,
> Shalin Shekhar Mangar.