i have tried post.jar and it works when i set the literal.id in solrconfig.xml. i can't pass the id with post.jar (-Dparams=literal.id=abc) because i get a error: "could not find or load main class .id=abc".
On 20. Jul 2013, at 7:05 PM, Andreas Owen wrote: > path was set text wasn't, but it doesn't make a difference. my importer says > 1 row fetched, 0 docs processed, 0 docs skipped. i don't understand how it > can have 2 docs indexed with such a output. > > > On 20. Jul 2013, at 12:47 PM, Shalin Shekhar Mangar wrote: > >> Are the "path" and "text" fields set to "stored" in the schema.xml? >> >> >> On Sat, Jul 20, 2013 at 3:37 PM, Andreas Owen <a...@conx.ch> wrote: >> >>> they are in my schema, path is typed correctly the others are default >>> fields which already exist. all the other fields are populated and i can >>> search for them, just path and text aren't. >>> >>> >>> On 19. Jul 2013, at 6:16 PM, Alexandre Rafalovitch wrote: >>> >>>> Dumb question: they are in your schema? Spelled right, in the right >>>> section, using types also defined? Can you populate them by hand with a >>> CSV >>>> file and post.jar? >>>> >>>> Regards, >>>> Alex. >>>> >>>> Personal website: http://www.outerthoughts.com/ >>>> LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch >>>> - Time is the quality of nature that keeps events from happening all at >>>> once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) >>>> >>>> >>>> On Fri, Jul 19, 2013 at 12:09 PM, Andreas Owen <a...@conx.ch> wrote: >>>> >>>>> i'm using solr 4.3 which i just downloaded today and am using only jars >>>>> that came with it. i have enabled the dataimporter and it runs without >>>>> error. but the field "path" (included in schema.xml) and "text" (file >>>>> content) aren't indexed. what am i doing wrong? >>>>> >>>>> solr-path: C:\ColdFusion10\cfusion\jetty-new >>>>> collection-path: C:\ColdFusion10\cfusion\jetty-new\solr\collection1 >>>>> pdf-doc-path: C:\web\development\tkb\internet\public >>>>> >>>>> >>>>> data-config.xml: >>>>> >>>>> <dataConfig> >>>>> <dataSource type="BinFileDataSource" name="data"/> >>>>> <dataSource type="BinURLDataSource" name="dataUrl"/> >>>>> <dataSource type="URLDataSource" baseUrl=" >>>>> http://127.0.0.1/tkb/internet/" name="main"/> >>>>> <document> >>>>> <entity name="rec" processor="XPathEntityProcessor" >>>>> url="docImportUrl.xml" forEach="/albums/album" dataSource="main"> <!-- >>>>> >>>>> transformer="script:GenerateId"--> >>>>> <field column="title" xpath="//title" /> >>>>> <field column="id" xpath="//file" /> >>>>> <field column="path" xpath="//path" /> >>>>> <field column="Author" xpath="//author" /> >>>>> >>>>> <!-- <field >>>>> column="tstamp">2013-07-05T14:59:46.889Z</field> --> >>>>> >>>>> <entity name="tika" processor="TikaEntityProcessor" >>>>> url="../../../../../web/development/tkb/internet/public/${rec.path}/${ >>>>> rec.id}" >>>>> >>>>> dataSource="data" > >>>>> <field column="text" /> >>>>> >>>>> </entity> >>>>> </entity> >>>>> </document> >>>>> </dataConfig> >>>>> >>>>> >>>>> docImportUrl.xml: >>>>> >>>>> <?xml version="1.0" encoding="utf-8"?> >>>>> <albums> >>>>> <album> >>>>> <author>Peter Z.</author> >>>>> <title>Beratungsseminar kundenbrief</title> >>>>> <description>wie kommuniziert man</description> >>>>> >>>>> <file>0226520141_e-banking_Checkliste_CLX.Sentinel.pdf</file> >>>>> <path>download/online</path> >>>>> </album> >>>>> <album> >>>>> <author>Marcel X.</author> >>>>> <title>kuchen backen</title> >>>>> <description>torten, kuchen, geb‰ck ...</description> >>>>> <file>Kundenbrief.pdf</file> >>>>> <path>download/online</path> >>>>> </album> >>>>> </albums> >>> >>> >> >> >> -- >> Regards, >> Shalin Shekhar Mangar.