Tilman,
Thank you for the toe-stubbing. I'm sorry that it wasn't easier...
I created a new user with collab permissions and ran through the process.
You are right about the privileges on the tmp directory... POI needs a
writable tmp directory to write xlsx. I created a tmp directory in /work/eval
and added an instruction to set the tmp dir via -Djava.io.tmpdir=tmp.
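A minimal sketch of that setup, assuming it is run from /work/eval with the jar and db names used elsewhere in this thread. The helper names prepare_tmpdir and run_report are hypothetical and not part of tika-eval; only the -Djava.io.tmpdir option comes from the instruction above.

```shell
# Hypothetical helpers; the names are illustrative, not part of tika-eval.

# Create the temp directory (relative to the current directory) if it is
# missing, and fail early when the current user cannot write to it.
prepare_tmpdir() {
    local dir="${1:-tmp}"
    mkdir -p "$dir" || return 1
    [ -w "$dir" ]
}

# Run the Report step with the JVM temp dir pointed at that directory.
# Jar and db names are the ones from this thread; adjust as needed.
run_report() {
    local dir="${1:-tmp}"
    prepare_tmpdir "$dir" || { echo "cannot write to $dir" >&2; return 1; }
    java -Djava.io.tmpdir="$dir" -jar tika-eval-1.17-SNAPSHOT.jar Report -db pdfboxAvsB
}
```

Calling run_report from /work/eval would then have POI create its temporary files under /work/eval/tmp rather than the system default.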
I'm not sure what you mean by this one, sorry:
>- "H" is missing, which is identical to "C"
I updated the permissions on appBatchExecutor.sh.
I also added a recommendation to run umask g+rw before starting.
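A rough sketch of those two permission steps, assuming bash (which accepts the symbolic umask form). The helper name setup_shared_perms is hypothetical; the 775 mode is the one suggested in the quoted message below.

```shell
# Hypothetical helper: make the script executable (mode 775) and relax the
# umask so that files created afterwards are group-readable and -writable.
setup_shared_perms() {
    local script="$1"
    chmod 775 "$script" || return 1
    # Symbolic umask: clear the group read/write bits from the current mask.
    umask g+rw
}
```

Because the function runs in the current shell, the umask change applies to everything started from that shell afterwards, for example `setup_shared_perms ./appBatchExecutor.sh` followed by the batch run.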
Let me know if I need to fix anything else, or if I overlooked something
you've already identified. ☹
Thank you, again.
Best,
Tim
-----Original Message-----
From: Tilman Hausherr [mailto:[email protected]]
Sent: Thursday, November 2, 2017 5:47 PM
To: [email protected]
Subject: Re: Running tika-eval on the Rackspace vm
I'm almost done... then I got this when doing the last step:
[tilman@cloud-server-02 eval]$ java -jar tika-eval-1.17-SNAPSHOT.jar Report -db pdfboxAvsB
0 [main] INFO org.apache.tika.eval.reports.Report - Writing report: All Mimes In A to mimes/all_mimes_A.xlsx
Exception in thread "main" java.io.IOException: Permission denied
    at java.io.UnixFileSystem.createFileExclusively(Native Method)
    at java.io.File.createTempFile(File.java:2024)
    at org.apache.poi.util.DefaultTempFileCreationStrategy.createTempFile(DefaultTempFileCreationStrategy.java:110)
    at org.apache.poi.util.TempFile.createTempFile(TempFile.java:66)
    at org.apache.poi.xssf.streaming.SXSSFWorkbook.write(SXSSFWorkbook.java:924)
    at org.apache.tika.eval.reports.Report.dumpXLSX(Report.java:85)
    at org.apache.tika.eval.reports.Report.writeReport(Report.java:64)
    at org.apache.tika.eval.reports.ResultsReporter.execute(ResultsReporter.java:305)
    at org.apache.tika.eval.reports.ResultsReporter.main(ResultsReporter.java:266)
    at org.apache.tika.eval.TikaEvalCLI.handleReport(TikaEvalCLI.java:264)
    at org.apache.tika.eval.TikaEvalCLI.execute(TikaEvalCLI.java:52)
    at org.apache.tika.eval.TikaEvalCLI.main(TikaEvalCLI.java:273)
I changed the source, and now I have the path: it is
/work/eval/reports/mimes/all_mimes_A.xlsx. The file exists and is empty.
I tried with a 1.16 version and the same thing happened.
Then I thought, maybe the file with the permission problem isn't the target at
all; could this be some temp file / temp directory where I don't have
permission?
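One way to test that hypothesis, sketched under the assumption that the JVM was using its Linux default temp directory /tmp, since java.io.tmpdir was not overridden in the run above. The helper name is hypothetical. POI may also place its files in a subdirectory of the temp directory (I believe it is named poifiles, but treat that as an assumption and check on the machine), so it is worth probing that path as well.

```shell
# Hypothetical probe: try to create and remove a temp file in a directory,
# which is roughly what File.createTempFile needs to be able to do there.
can_create_tempfile() {
    local dir="${1:-/tmp}"
    local f
    f=$(mktemp "$dir/probe.XXXXXX" 2>/dev/null) || return 1
    rm -f "$f"
}
```

For example, `can_create_tempfile /tmp || echo "no write access"` run as the same user that ran the Report step would show whether the temp directory is the problem.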
Smaller improvements for the documentation:
- appBatchExecutor.sh should have 775 permissions, or the documentation
  should say "nohup sh ./appBatchExecutor.sh &"
- "H" is missing, which is identical to "C"
- mention that "pdfboxAvsB" db files are to be removed before starting?
  I had accidentally aborted a run and couldn't restart.
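The cleanup in the last item could look roughly like this. It is a sketch under an assumption: that the db is an H2 database stored in the working directory as pdfboxAvsB.mv.db, possibly with a pdfboxAvsB.trace.db next to it. Check the actual file names before deleting anything; the helper name is hypothetical.

```shell
# Hypothetical helper: remove leftover database files from an aborted run.
# Assumes H2-style file names (<name>.mv.db, <name>.trace.db).
remove_stale_db() {
    local name="$1"
    [ -n "$name" ] || { echo "usage: remove_stale_db <db-name>" >&2; return 1; }
    rm -f -- "$name.mv.db" "$name.trace.db"
}
```

Running `remove_stale_db pdfboxAvsB` in /work/eval before the Compare step would then clear the files left behind by the aborted run.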
Tilman
memo for me:
java -jar tika-eval-1.17-SNAPSHOT.jar Compare \
    -extractsA /data4/batch_runs/pdfbox_2_0_4 \
    -extractsB /data4/batch_runs/pdfbox_2_0_9-SNAPSHOT1 \
    -db pdfboxAvsB
java -jar tika-eval-1.17-SNAPSHOT.jar Report -db pdfboxAvsB