Re: dataimporter, custom fields and parsing error

2013-07-23 Thread Andreas Owen
I have tried post.jar, and it works when I set literal.id in solrconfig.xml. 
I can't pass the id on the post.jar command line (-Dparams=literal.id=abc) 
because I get an error: could not find or load main class .id=abc.
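
The wording of that error suggests the argument was split at the dot before it reached Java, so that ".id=abc" was taken as the main class name. A plausible explanation, not confirmed anywhere in this thread, is the shell: PowerShell on Windows, for instance, is known to split unquoted arguments of the form -Dname=a.b in this way. The sketch below avoids shell parsing altogether by handing Java an argument list. The file name, the id value and the /update/extract URL are assumptions for illustration only.

```python
import subprocess


def build_post_command(pdf_path, doc_id,
                       url="http://localhost:8983/solr/update/extract"):
    """Return the post.jar invocation as an argument list (no shell involved)."""
    return [
        "java",
        f"-Durl={url}",                   # assumed extract handler URL
        f"-Dparams=literal.id={doc_id}",  # stays a single argument, dot included
        "-Dtype=application/pdf",
        "-jar", "post.jar",
        pdf_path,
    ]


def post_file(pdf_path, doc_id):
    """Run post.jar; call this from the directory that contains post.jar."""
    subprocess.run(build_post_command(pdf_path, doc_id), check=True)


if __name__ == "__main__":
    # Only prints the command; call post_file() to actually send the file.
    print(" ".join(build_post_command("Kundenbrief.pdf", "abc")))
```

When typing the command by hand, the equivalent workaround is to quote the whole argument, for example "-Dparams=literal.id=abc".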


On 20. Jul 2013, at 7:05 PM, Andreas Owen wrote:

 path was set, text wasn't, but it doesn't make a difference. My importer says 
 1 row fetched, 0 docs processed, 0 docs skipped. I don't understand how it 
 can have 2 docs indexed with such an output.
 
 
 On 20. Jul 2013, at 12:47 PM, Shalin Shekhar Mangar wrote:
 
 Are the path and text fields set to stored in the schema.xml?
 
 
 On Sat, Jul 20, 2013 at 3:37 PM, Andreas Owen a...@conx.ch wrote:
 
 They are in my schema. path is typed correctly; the others are default
 fields which already exist. All the other fields are populated and I can
 search for them, just path and text aren't.
 
 
 On 19. Jul 2013, at 6:16 PM, Alexandre Rafalovitch wrote:
 
 Dumb question: they are in your schema? Spelled right, in the right
 section, using types also defined? Can you populate them by hand with a
 CSV file and post.jar?
 
 Regards,
 Alex.
 
 Personal website: http://www.outerthoughts.com/
 LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
 - Time is the quality of nature that keeps events from happening all at
 once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)
 
 
 On Fri, Jul 19, 2013 at 12:09 PM, Andreas Owen a...@conx.ch wrote:
 
 I'm using Solr 4.3, which I just downloaded today, and am using only the jars
 that came with it. I have enabled the dataimporter and it runs without
 error, but the fields path (included in schema.xml) and text (file
 content) aren't indexed. What am I doing wrong?
 
 solr-path: C:\ColdFusion10\cfusion\jetty-new
 collection-path: C:\ColdFusion10\cfusion\jetty-new\solr\collection1
 pdf-doc-path: C:\web\development\tkb\internet\public
 
 
 data-config.xml:
 
 <dataConfig>
   <dataSource type="BinFileDataSource" name="data"/>
   <dataSource type="BinURLDataSource" name="dataUrl"/>
   <dataSource type="URLDataSource" baseUrl="http://127.0.0.1/tkb/internet/" name="main"/>
   <document>
     <entity name="rec" processor="XPathEntityProcessor"
             url="docImportUrl.xml" forEach="/albums/album" dataSource="main">
       <!-- transformer="script:GenerateId" -->
       <field column="title" xpath="//title" />
       <field column="id" xpath="//file" />
       <field column="path" xpath="//path" />
       <field column="Author" xpath="//author" />
 
       <!-- <field column="tstamp">2013-07-05T14:59:46.889Z</field> -->
 
       <entity name="tika" processor="TikaEntityProcessor"
               url="../../../../../web/development/tkb/internet/public/${rec.path}/${rec.id}"
               dataSource="data" >
         <field column="text" />
       </entity>
     </entity>
   </document>
 </dataConfig>
 
 
 docImportUrl.xml:
 
 <?xml version="1.0" encoding="utf-8"?>
 <albums>
   <album>
     <author>Peter Z.</author>
     <title>Beratungsseminar kundenbrief</title>
     <description>wie kommuniziert man</description>
     <file>0226520141_e-banking_Checkliste_CLX.Sentinel.pdf</file>
     <path>download/online</path>
   </album>
   <album>
     <author>Marcel X.</author>
     <title>kuchen backen</title>
     <description>torten, kuchen, gebäck ...</description>
     <file>Kundenbrief.pdf</file>
     <path>download/online</path>
   </album>
 </albums>
 
 
 
 
 -- 
 Regards,
 Shalin Shekhar Mangar.
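
For the data-config.xml and docImportUrl.xml quoted in this message, the following sketch shows which rows the outer "rec" entity is intended to produce and which file path the inner "tika" entity would then be asked to open. It uses Python's ElementTree rather than Solr's XPathEntityProcessor, so it illustrates only the intended mapping, not Solr's actual behaviour. The function names are made up for this example.

```python
import xml.etree.ElementTree as ET

# Relative base path from the url attribute of the "tika" entity.
TIKA_BASE = "../../../../../web/development/tkb/internet/public"

DOC_IMPORT_XML = """<albums>
  <album>
    <author>Peter Z.</author>
    <title>Beratungsseminar kundenbrief</title>
    <file>0226520141_e-banking_Checkliste_CLX.Sentinel.pdf</file>
    <path>download/online</path>
  </album>
  <album>
    <author>Marcel X.</author>
    <title>kuchen backen</title>
    <file>Kundenbrief.pdf</file>
    <path>download/online</path>
  </album>
</albums>"""


def rows(xml_text):
    """Yield one dict per <album>, mirroring forEach="/albums/album"."""
    root = ET.fromstring(xml_text)
    for album in root.findall("album"):
        yield {
            "title": album.findtext("title"),
            "id": album.findtext("file"),      # column "id" comes from <file>
            "path": album.findtext("path"),
            "Author": album.findtext("author"),
        }


def tika_url(row):
    """Resolve ${rec.path}/${rec.id} the way the inner entity's url is written."""
    return f"{TIKA_BASE}/{row['path']}/{row['id']}"


if __name__ == "__main__":
    for row in rows(DOC_IMPORT_XML):
        print(row["id"], "->", tika_url(row))
```

Printing the resolved paths makes it easy to check by hand that each PDF exists at the location the relative path points to.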



Re: dataimporter, custom fields and parsing error

2013-07-20 Thread Andreas Owen
They are in my schema. path is typed correctly; the others are default fields 
which already exist. All the other fields are populated and I can search for 
them, just path and text aren't.





Re: dataimporter, custom fields and parsing error

2013-07-20 Thread Shalin Shekhar Mangar
Are the path and text fields set to stored in the schema.xml?
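
One way to answer this question without reading schema.xml by eye is to list the indexed and stored attributes of the fields concerned. The sketch below does that with Python's ElementTree. The sample schema is a hypothetical fragment written for this example, not the poster's actual file, and field_flags is a made-up helper name.

```python
import xml.etree.ElementTree as ET

# Hypothetical schema fragment for illustration only.
SAMPLE_SCHEMA = """<schema name="example" version="1.5">
  <fields>
    <field name="id" type="string" indexed="true" stored="true" required="true"/>
    <field name="path" type="string" indexed="true" stored="true"/>
    <field name="text" type="text_general" indexed="true" stored="false" multiValued="true"/>
  </fields>
</schema>"""


def field_flags(schema_xml, names):
    """Return the indexed/stored attributes of the named <field> elements.

    A value of None means the attribute is not set on the field itself
    and is inherited from the field type.
    """
    root = ET.fromstring(schema_xml)
    flags = {}
    for field in root.iter("field"):
        name = field.get("name")
        if name in names:
            flags[name] = {
                "indexed": field.get("indexed"),
                "stored": field.get("stored"),
            }
    return flags


if __name__ == "__main__":
    for name, attrs in field_flags(SAMPLE_SCHEMA, {"path", "text"}).items():
        print(name, attrs)
```

A field with stored="false" can still be searched, but its value is not returned in query results, which can make it look as if it was never populated.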






-- 
Regards,
Shalin Shekhar Mangar.


Re: dataimporter, custom fields and parsing error

2013-07-20 Thread Andreas Owen
path was set, text wasn't, but it doesn't make a difference. My importer says 1 
row fetched, 0 docs processed, 0 docs skipped. I don't understand how it can 
have 2 docs indexed with such an output.





Re: dataimporter, custom fields and parsing error

2013-07-19 Thread Alexandre Rafalovitch
Dumb question: they are in your schema? Spelled right, in the right
section, using types also defined? Can you populate them by hand with a CSV
file and post.jar?
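
A minimal sketch of such a hand-made test: it builds a one-row CSV whose header names the fields in question and assembles the post.jar command for it. The id and the field values are invented for this example, and the field names assume that id, path and text exist in the schema as described in the thread.

```python
import csv
import io


def make_test_csv():
    """Return CSV text with a header row of field names and one test document."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["id", "path", "text"])  # header row = Solr field names
    writer.writerow(["test-1", "download/online", "hand-made test content"])
    return buffer.getvalue()


def build_csv_post_command(csv_path):
    """Return the post.jar invocation for a CSV file as an argument list."""
    return ["java", "-Dtype=text/csv", "-jar", "post.jar", csv_path]


if __name__ == "__main__":
    with open("test.csv", "w", newline="", encoding="utf-8") as handle:
        handle.write(make_test_csv())
    print(" ".join(build_csv_post_command("test.csv")))
```

If a document posted this way shows path and text in query results, the schema is probably fine and the problem lies in the import configuration.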

Regards,
   Alex.

Personal website: http://www.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
- Time is the quality of nature that keeps events from happening all at
once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)

