Alex,

Same results on recursive=true / recursive=false.

I also tried importing plain text files instead of epub (still using TikaEntityProcessor, though) and got exactly the same result, i.e. all files are fetched, but only one document is indexed in Solr.

With verbose output, I get a row for each file in the directory, but only the first one has a non-empty documentImport entity. All subsequent documentImport entities just have an empty document#2 entry, e.g.:

 ....
  "verbose-output": [
    "entity:files",
    [
      null,
      "----------- row #1-------------",
      "fileSize",
      2609004,
      "fileLastModified",
      "2015-02-25T11:37:25.217Z",
      "fileAbsolutePath",
      "c:\\Users\\gt\\Documents\\epub\\issue018.epub",
      "fileDir",
      "c:\\Users\\gt\\Documents\\epub",
      "file",
      "issue018.epub",
      null,
      "---------------------------------------------",
      "entity:documentImport",
      [
        "document#1",
        [
          "query",
          "c:\\Users\\gt\\Documents\\epub\\issue018.epub",
          "time-taken",
          "0:0:0.0",
          null,
          "----------- row #1-------------",
          "text",
          "< ... parsed epub text - snip ... >",
          "title",
          "Issue 18 title",
          "Author",
          "Author text",
          null,
          "---------------------------------------------"
        ],
        "document#2",
        []
      ],
      null,
      "----------- row #2-------------",
      "fileSize",
      4428804,
      "fileLastModified",
      "2015-02-25T11:37:36.399Z",
      "fileAbsolutePath",
      "c:\\Users\\gt\\Documents\\epub\\issue019.epub",
      "fileDir",
      "c:\\Users\\gt\\Documents\\epub",
      "file",
      "issue019.epub",
      null,
      "---------------------------------------------",
      "entity:documentImport",
      [
        "document#2",
        []
      ],
      null,
      "----------- row #3-------------",
      "fileSize",
      2580266,
      "fileLastModified",
      "2015-02-25T11:37:41.188Z",
      "fileAbsolutePath",
      "c:\\Users\\gt\\Documents\\epub\\issue020.epub",
      "fileDir",
      "c:\\Users\\gt\\Documents\\epub",
      "file",
      "issue020.epub",
      null,
      "---------------------------------------------",
      "entity:documentImport",
      [
        "document#2",
        []
      ],
....
....
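
In case it helps anyone reproduce this, here is a rough Python sketch for checking which file rows came back with an empty documentImport entity. It is not part of my setup: the function name summarize_rows is made up for illustration, and it assumes the list under "entity:files" has exactly the flat key/value layout pasted above.

```python
def summarize_rows(files_entity):
    """Map each file name to True if its documentImport entity has any non-empty document.

    `files_entity` is assumed to be the list that follows "entity:files"
    in the verbose output (hypothetical helper, layout as pasted above).
    """
    result = {}
    current_file = None
    i = 0
    while i < len(files_entity) - 1:
        key, value = files_entity[i], files_entity[i + 1]
        if key == "file":
            # Remember which file the following entity block belongs to.
            current_file = value
            i += 2
        elif key == "entity:documentImport":
            # The value alternates "document#N" labels and lists of fields;
            # an empty list means nothing was extracted for that document.
            result[current_file] = any(isinstance(v, list) and v for v in value)
            i += 2
        else:
            i += 1
    return result
```

Run against the output above, I would expect this to report True for issue018.epub only and False for the other files.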

