Hi Hayden,

Thanks for your email! Have you tried the Tika JAXRS server, documented here:

https://issues.apache.org/jira/browse/TIKA-593
http://wiki.apache.org/tika/TikaJAXRS

It first appeared in 1.2 and can be run as a server on a port (9998 by default) to handle cURL interactions.

Cheers,
Chris

On Jul 20, 2012, at 8:17 AM, Mr Havercamp wrote:

> Have been playing around with integrating Tika into my PHP app.
>
> I have had great success with Tika on the command line and also SolrCell.
>
> However, I was wondering if there is some way of running Tika in server mode
> and extracting a document, say, via CURL.
>
> I have had varying degrees of success with:
>
> nc localhost 30000 < /opt/lampp/htdocs/joomla25/tmp/InformationRepository.pdf
>
> but I'm wondering how I pass other params such as for extracting just
> metadata or content in html format.
>
> Any help would be much appreciated.
>
> Cheers
>
>
> Hayden

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: [email protected]
WWW: http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
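For anyone who wants to script this rather than call cURL by hand, here is a rough sketch of the same idea in Python. It assumes the server is listening on localhost:9998 (adjust TIKA_BASE_URL if yours differs) and that it exposes the /tika (content) and /meta (metadata) resources described on the wiki page, each accepting the document as the body of an HTTP PUT. The helper names build_tika_request and extract are made up for this example, and whether an Accept header such as text/html is honoured depends on the server version, so treat that parameter as something to verify against your install.

```python
import urllib.request

# Assumption: tika-server is running locally on this port.
TIKA_BASE_URL = "http://localhost:9998"


def build_tika_request(endpoint, payload, accept=None, base_url=TIKA_BASE_URL):
    """Build (but do not send) an HTTP PUT request for a tika-server resource.

    endpoint: resource name, e.g. "tika" for content or "meta" for metadata.
    payload:  raw bytes of the document to parse.
    accept:   optional Accept header value; support varies by server version.
    """
    headers = {}
    if accept is not None:
        headers["Accept"] = accept
    return urllib.request.Request(
        url=f"{base_url}/{endpoint.lstrip('/')}",
        data=payload,
        headers=headers,
        method="PUT",
    )


def extract(endpoint, path, accept=None):
    """PUT the file at `path` to the server and return the response body as text."""
    with open(path, "rb") as fh:
        request = build_tika_request(endpoint, fh.read(), accept)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8", errors="replace")
```

With the server running, extract("meta", "/opt/lampp/htdocs/joomla25/tmp/InformationRepository.pdf") would ask for metadata only, and extract("tika", ...) for the extracted content. The request-building step is kept separate from the network call so the URL and headers can be inspected without a live server.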
