> When I send a scanned PDF to the extraction request
> handler, the icon below appears in my Dock.
> 
> http://tinypic.com/r/2mpmo7o/6
> http://tinypic.com/r/28ukxhj/6

I found that text-extractable PDF files trigger the same weird icon too.

curl "http://localhost:8983/solr/update/extract?literal.id=solr-word&commit=true" \
  -F "myfile=@solr-word.pdf"

I wrote a standalone Java program using Tika. That weird icon pops up when
extracting text from all kinds of PDF files :)

I will ask the Tika mailing list about this.

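 import java.io.File;
 import java.io.FileInputStream;
 import java.io.InputStream;
 import org.apache.tika.metadata.Metadata;
 import org.apache.tika.parser.AutoDetectParser;
 import org.apache.tika.parser.ParseContext;
 import org.apache.tika.sax.BodyContentHandler;

 // Body of main(String[] args) throws Exception
 AutoDetectParser _autoParser = new AutoDetectParser();
 File file = new File("solr-word.pdf");
 BodyContentHandler textHandler = new BodyContentHandler();
 Metadata metadata = new Metadata();
 ParseContext context = new ParseContext();
 InputStream input = new FileInputStream(file);

 // Detect the file type and extract its text
 _autoParser.parse(input, textHandler, metadata, context);

 System.out.println("text : " + textHandler.toString());
 input.close();
 // Loop forever so the JVM keeps running and the Dock icon can be observed
 while (true) { }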
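
One thing that might be worth trying, as an assumption not confirmed in this thread: if the icon appears because the PDF parser initialises AWT classes, running the JVM in headless mode may suppress it. The sketch below only shows how to request headless mode and check that it took effect; the class name HeadlessCheck and the method enableHeadless are made up for illustration, and whether this actually removes the Dock icon is untested here.

```java
import java.awt.GraphicsEnvironment;

public class HeadlessCheck {

    /**
     * Requests headless mode via the standard "java.awt.headless" system
     * property. This has to run before any AWT class is initialised,
     * otherwise the setting is ignored.
     */
    static boolean enableHeadless() {
        System.setProperty("java.awt.headless", "true");
        // Reports whether the JVM now considers itself headless.
        return GraphicsEnvironment.isHeadless();
    }

    public static void main(String[] args) {
        System.out.println("headless: " + enableHeadless());
        // Assumption: the Tika parsing code from the program above would go here.
    }
}
```

The same property can be passed on the command line as -Djava.awt.headless=true, which avoids changing the program at all.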
