Re: scanned pdf with solr cell

2012-08-20 Thread Michael Della Bitta
It's pretty easy to accidentally run into the AWT stuff if you're doing anything that involves image processing, which I would expect a generic RTF parser might do. Michael Della Bitta Appinions | 18 East 41st St., Suite 1806 | New York, NY 10017 w

Re: scanned pdf with solr cell

2012-08-19 Thread Lance Norskog
The backstory here is that Tika uses a library that for some crazy reason is inside the Java AWG graphics toolkit. (I think the RTF parser?) On Wed, Aug 15, 2012 at 5:57 AM, Ahmet Arslan wrote: >> You can try passing >> -Djava.awt.headless=true as one of the arguments >> when you start Jetty to s

Re: scanned pdf with solr cell

2012-08-15 Thread Ahmet Arslan
> You can try passing > -Djava.awt.headless=true as one of the arguments > when you start Jetty to see if you can get this to go away > with no ill > effects. I started jetty using : 'java -Djava.awt.headless=true -jar start.jar' and successfully indexed two pdf files. That icon didn't appeared :

Re: scanned pdf with solr cell

2012-08-15 Thread Michael Della Bitta
You can try passing -Djava.awt.headless=true as one of the arguments when you start Jetty to see if you can get this to go away with no ill effects. Michael Della Bitta Appinions | 18 East 41st St., Suite 1806 | New York, NY 10017 www.appinions.com

Re: scanned pdf with solr cell

2012-08-15 Thread Paul Libbrecht
Le 15 août 2012 à 13:03, Ahmet Arslan a écrit : > Hi Paul, thanks for the explanation. So is it nothing to worry about? it is nothing to worry about except to remember that you can't run this step in a daemon-like process. (on Linux, I had to set-up a VNC-server for similar tasks) paul

Re: scanned pdf with solr cell

2012-08-15 Thread Ahmet Arslan
> the dock icon appears when AWT starts, e.g. when a font is > loaded. > You can prevent it using the headless mode but this is > likely to trigger an exception. > Same if your user is not UI-logged-in. Hi Paul, thanks for the explanation. So is it nothing to worry about?

Re: scanned pdf with solr cell

2012-08-15 Thread Paul Libbrecht
Ahmet, the dock icon appears when AWT starts, e.g. when a font is loaded. You can prevent it using the headless mode but this is likely to trigger an exception. Same if your user is not UI-logged-in. hope it helps. Paul Le 15 août 2012 à 01:30, Ahmet Arslan a écrit : > Hi All, > > I have set

Re: scanned pdf with solr cell

2012-08-15 Thread Ahmet Arslan
> When I send a scanned pdf to extraction request > handler, below icon appears in my Dock. > > http://tinypic.com/r/2mpmo7o/6 > http://tinypic.com/r/28ukxhj/6 I found that text-extractable pdf files triggers above weird icon too. curl "http://localhost:8983/solr/update/extract?literal.id=solr-

Re: scanned pdf with solr cell

2012-08-14 Thread Jack Krupansky
ostScript fonts. Try a "normal" PDF for comparison. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Tuesday, August 14, 2012 7:30 PM To: solr-user@lucene.apache.org Subject: scanned pdf with solr cell Hi All, I have set of rich documents. Some of them are scanned

scanned pdf with solr cell

2012-08-14 Thread Ahmet Arslan
Hi All, I have set of rich documents. Some of them are scanned pdf files. When I send a scanned pdf to extraction request handler, below icon appears in my Dock. http://tinypic.com/r/2mpmo7o/6 http://tinypic.com/r/28ukxhj/6 Does anyone know what this is? curl "http://localhost:8983/solr/docum