Tilman,
  Thank you for the toe-stubbing.  I'm sorry that it wasn't easier...

I created a new user with collab permissions and ran through the process.

You are right about the privileges on the tmp directory... POI needs a tmp 
directory to write xlsx.  I created a tmp directory in /work/eval and added a 
direction to set tmp dir via -Djava.io.tmpdir=tmp

I'm not sure what you mean by...sorry
>- "H" is missing, which is identical to "C"

I updated the permissions on appBatchExecutor.sh

I also added a recommendation to umask g+rw before starting. 

Let me know if I need to fix anything else or if I missed something you've 
already identified but I missed. ☹

Thank you, again.

Best,

        Tim

-----Original Message-----
From: Tilman Hausherr [mailto:[email protected]] 
Sent: Thursday, November 2, 2017 5:47 PM
To: [email protected]
Subject: Re: Running tika-eval on the Rackspace vm

I'm almost done... then I got this when doing the last step:


[tilman@cloud-server-02 eval]$ java -jar tika-eval-1.17-SNAPSHOT.jar Report -db 
pdfboxAvsB
0    [main] INFO  org.apache.tika.eval.reports.Report  - Writing report: 
All Mimes In A to mimes/all_mimes_A.xlsx Exception in thread "main" 
java.io.IOException: Permission denied
         at java.io.UnixFileSystem.createFileExclusively(Native Method)
         at java.io.File.createTempFile(File.java:2024)
         at
org.apache.poi.util.DefaultTempFileCreationStrategy.createTempFile(DefaultTempFileCreationStrategy.java:110)
         at org.apache.poi.util.TempFile.createTempFile(TempFile.java:66)
         at
org.apache.poi.xssf.streaming.SXSSFWorkbook.write(SXSSFWorkbook.java:924)
         at org.apache.tika.eval.reports.Report.dumpXLSX(Report.java:85)
         at org.apache.tika.eval.reports.Report.writeReport(Report.java:64)
         at
org.apache.tika.eval.reports.ResultsReporter.execute(ResultsReporter.java:305)
         at
org.apache.tika.eval.reports.ResultsReporter.main(ResultsReporter.java:266)
         at
org.apache.tika.eval.TikaEvalCLI.handleReport(TikaEvalCLI.java:264)
         at org.apache.tika.eval.TikaEvalCLI.execute(TikaEvalCLI.java:52)
         at org.apache.tika.eval.TikaEvalCLI.main(TikaEvalCLI.java:273)


I changed the source, and now I got the path, it is 
/work/eval/reports/mimes/all_mimes_A.xlsx . The file exists and it is empty.

I tried with a 1.16 version and the same happened.

Then I thought, maybe the file with the permission problem isn't the target at 
all; could this be some temp file / temp directory where I don't have 
permission?

smaller improvements for the documentation:

- appBatchExecutor.sh should have 775 permission or the documentation should 
have "nohup sh ./appBatchExecutor.sh &"

- "H" is missing, which is identical to "C"

- mention that "pdfboxAvsB" db files are to be removed before starting? 
I had accidentally aborted a run and couldn't restart.


Tilman

memo for me:


java -jar tika-eval-1.17-SNAPSHOT.jar Compare -extractsA
/data4/batch_runs/pdfbox_2_0_4 -extractsB
/data4/batch_runs/pdfbox_2_0_9-SNAPSHOT1 -db pdfboxAvsB

java -jar tika-eval-1.17-SNAPSHOT.jar Report -db pdfboxAvsB

Reply via email to