[ https://issues.apache.org/jira/browse/TIKA-96?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting resolved TIKA-96.
-------------------------------
Resolution: Fixed
Fix Version/s: 0.2-incubating
Assignee: Jukka Zitting (was: Chris A. Mattmann)
Added a simple Tika command line application and the related packaging settings
in revision 615395.
Initial features:
- four output formats (xml, html, text, metadata)
- three input sources (files, URLs, standard input)
- two logging levels (info and debug)
- usage message
- GUI mode
For the record, here's the initial usage message:
usage: tika [option] file

Options:
    -? or --help       Print this usage message
    -v or --verbose    Print debug level messages
    -g or --gui        Start the Apache Tika GUI
    -x or --xml        Output XHTML content (default)
    -h or --html       Output HTML content
    -t or --text       Output plain text content
    -m or --metadata   Output only metadata

Description:
    Apache Tika will parse the file(s) specified on the
    command line and output the extracted text content
    or metadata to standard output.

    Use "-" as the file name to parse the standard
    input stream.

    Using the "--gui" (or "-g") option will start
    the Apache Tika GUI. You can drag and drop files
    from a normal file explorer to the GUI window to
    extract text content and metadata from the files.
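
As a rough illustration of how the options in the usage message could be
mapped onto settings, here is a minimal Java sketch. It is not the code
committed in revision 615395: the class name CliOptionsSketch, the Format
enum and the field names are all hypothetical. It also assumes that the
last format option given wins and that any unrecognized argument is
treated as an input source.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch only; not the actual Tika command line class. */
public class CliOptionsSketch {

    /** The four output formats listed in the usage message. */
    public enum Format { XML, HTML, TEXT, METADATA }

    public Format format = Format.XML;  // --xml is the documented default
    public boolean verbose = false;     // -v: debug level messages
    public boolean gui = false;         // -g: start the GUI
    public boolean help = false;        // -?: print the usage message
    public final List<String> sources = new ArrayList<>();

    /** Maps each argument to a setting; anything else is an input source. */
    public static CliOptionsSketch parse(String[] args) {
        CliOptionsSketch options = new CliOptionsSketch();
        for (String arg : args) {
            switch (arg) {
                case "-?": case "--help":     options.help = true; break;
                case "-v": case "--verbose":  options.verbose = true; break;
                case "-g": case "--gui":      options.gui = true; break;
                case "-x": case "--xml":      options.format = Format.XML; break;
                case "-h": case "--html":     options.format = Format.HTML; break;
                case "-t": case "--text":     options.format = Format.TEXT; break;
                case "-m": case "--metadata": options.format = Format.METADATA; break;
                // Assumption: a file name, a URL, or "-" for standard input.
                default: options.sources.add(arg);
            }
        }
        return options;
    }

    public static void main(String[] args) {
        CliOptionsSketch options = parse(args);
        System.out.println(options.format + " " + options.sources);
    }
}
```

Under these assumptions, parse(new String[] {"-t", "-"}) would select
plain text output with standard input as the only source.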
> Tika CLI
> --------
>
> Key: TIKA-96
> URL: https://issues.apache.org/jira/browse/TIKA-96
> Project: Tika
> Issue Type: New Feature
> Reporter: Jukka Zitting
> Assignee: Jukka Zitting
> Priority: Minor
> Fix For: 0.2-incubating
>
>
> It would be nice to have a Tika-based command line application that takes in
> a document (either via standard input or as a filename or URL argument) and
> outputs the extracted metadata and text content (either as XHTML or plain
> text).