[ 
https://issues.apache.org/jira/browse/TIKA-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15205245#comment-15205245
 ] 

ASF GitHub Bot commented on TIKA-774:
-------------------------------------

GitHub user rgauss opened a pull request:

    https://github.com/apache/tika/pull/92

    TIKA-774: ExifTool Parser

    Contribution of tika-exiftool for review

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/Alfresco/tika tika-exiftool

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/tika/pull/92.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #92
    
----
commit 8eb474b06e1463ca172128b59b713782eb4bece8
Author: rgauss <rga...@rgauss.com>
Date:   2016-03-19T20:37:37Z

    Initial commit of tika-exiftool as is

commit 5ff139d68bebd39382d5ed9626bff42797ece01d
Author: rgauss <rga...@rgauss.com>
Date:   2016-03-19T22:44:00Z

    Added git ignore of properties override

commit c8f4fb062ce809661527c91df89b230da95f592c
Author: rgauss <rga...@rgauss.com>
Date:   2016-03-21T18:49:38Z

    Merge branch 'master' into tika-exiftool

commit e8a2fa30b16f8b947d118b61ca12476420e9bee0
Author: rgauss <rga...@rgauss.com>
Date:   2016-03-21T21:24:29Z

    TIKA-774: ExifTool Parser
      - Moved tika-exiftool from separate project to parsers
      - Updated license headers
      - Removed author Javadoc
      - Fixed a few forbiddenapi violations

commit 37aae337c5ca3b5a45c2e45804e3768e08a8bbb6
Author: rgauss <rga...@rgauss.com>
Date:   2016-03-21T21:31:31Z

    TIKA-774: ExifTool Parser
      - Removed more author Javadocs

commit 90f8550c03aa873a81975dfa10cfd77aa557fc6f
Author: rgauss <rga...@rgauss.com>
Date:   2016-03-21T22:00:00Z

    TIKA-774: ExifTool Parser
      - Renamed ExecutableUtils to ExiftoolExecutableUtils
      - Changed ExifToolImageParserTest to skip when exiftool is not
    available

----


> ExifTool Parser
> ---------------
>
>                 Key: TIKA-774
>                 URL: https://issues.apache.org/jira/browse/TIKA-774
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>    Affects Versions: 1.0
>         Environment: Requires be installed 
> (http://www.sno.phy.queensu.ca/~phil/exiftool/)
>            Reporter: Ray Gauss II
>              Labels: features, new-parser, newbie, patch
>             Fix For: 1.13
>
>         Attachments: testJPEG_IPTC_EXT.jpg, 
> tika-core-exiftool-parser-patch.txt, tika-parsers-exiftool-parser-patch.txt
>
>
> Adds an external parser that calls ExifTool to extract extended metadata 
> fields from images and other content types.
> In the core project:
> An ExifTool interface is added which contains Property objects that define 
> the metadata fields available.
> An additional Property constructor for internalTextBag type.
> In the parsers project:
> An ExiftoolMetadataExtractor is added which does the work of calling ExifTool 
> on the command line and mapping the response to tika metadata fields.  This 
> extractor could be called instead of or in addition to the existing 
> ImageMetadataExtractor and JempboxExtractor under TiffParser and/or 
> JpegParser but those have not been changed at this time.
> An ExiftoolParser is added which calls only the ExiftoolMetadataExtractor.
> An ExiftoolTikaMapper is added which is responsible for mapping the ExifTool 
> metadata fields to existing tika and Drew Noakes metadata fields if enabled.
> An ElementRdfBagMetadataHandler is added for extracting multi-valued RDF Bag 
> implementations in XML files.
> An ExifToolParserTest is added which tests several expected XMP and IPTC 
> metadata values in testJPEG_IPTC_EXT.jpg.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to