[ https://issues.apache.org/jira/browse/TIKA-4693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18071921#comment-18071921 ]
Tilman Hausherr commented on TIKA-4693:
---------------------------------------
Please attach one or more files and explain what you did, what you expected,
and what you got instead.
> TikaFileMetadata shows wrong data for the "dc:subject" metadata property for
> DOC and DOCX files
> ----------------------------------------------------------------------------------------
>
> Key: TIKA-4693
> URL: https://issues.apache.org/jira/browse/TIKA-4693
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 3.2.3
> Reporter: Arpit
> Priority: Major
>
> We are currently using <tika-core.version>3.2.3</tika-core.version>. For DOC
> and DOCX files, the subject attribute returns both the subject and the
> keywords instead of only the subject.
> We read the subject from the dc:subject metadata attribute, and it returns
> subject + keywords.
> A related issue for PDF files is already in the resolved state:
> https://issues.apache.org/jira/browse/TIKA-4444
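
A report like the one above is easier to judge when it also shows what the file
itself stores. The following is a minimal sketch, not part of the original
report, that prints the raw dc:subject and cp:keywords values of a DOCX using
only the Java standard library (no Tika). It assumes the core properties live
in docProps/core.xml, which is where Word normally writes them; strictly
speaking the location is resolved through the package relationships. The class
name DocxCoreProps and its method names are hypothetical, and the sketch does
not cover binary .doc files.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

// Hypothetical helper: dumps the subject and keywords stored in a DOCX.
class DocxCoreProps {
    // Namespace URIs used by the OOXML core-properties part.
    static final String DC = "http://purl.org/dc/elements/1.1/";
    static final String CP =
        "http://schemas.openxmlformats.org/package/2006/metadata/core-properties";

    // Returns the bytes of docProps/core.xml, or null if the entry is absent.
    // Assumption: the part sits at its conventional path inside the ZIP.
    static byte[] readCoreXml(InputStream docx) throws IOException {
        try (ZipInputStream zip = new ZipInputStream(docx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if ("docProps/core.xml".equals(entry.getName())) {
                    return zip.readAllBytes(); // reads the current entry only
                }
            }
        }
        return null;
    }

    // Returns the text of the first element with the given namespace and
    // local name, or null if there is no such element.
    static String property(byte[] coreXml, String ns, String local) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        // Refuse DOCTYPE declarations so untrusted files cannot trigger XXE.
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        Document doc = factory.newDocumentBuilder().parse(new ByteArrayInputStream(coreXml));
        NodeList nodes = doc.getElementsByTagNameNS(ns, local);
        return nodes.getLength() == 0 ? null : nodes.item(0).getTextContent();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java DocxCoreProps <file.docx>");
            return;
        }
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            byte[] xml = readCoreXml(in);
            if (xml == null) {
                System.out.println("no docProps/core.xml found");
                return;
            }
            System.out.println("dc:subject  = " + property(xml, DC, "subject"));
            System.out.println("cp:keywords = " + property(xml, CP, "keywords"));
        }
    }
}
```

Putting this output next to the values Tika returns for dc:subject on the same
file would show whether the keywords are stored in the subject field of the
document or are merged in during parsing.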
--
This message was sent by Atlassian Jira
(v8.20.10#820010)