Tim Allison created TIKA-4133:
---------------------------------

             Summary: Add capture group metadataFilter
                 Key: TIKA-4133
                 URL: https://issues.apache.org/jira/browse/TIKA-4133
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison


There are some cases where it would be useful to run a regex to capture 
specific values in a metadata object.

For example, some users might not want the MIME parameters (e.g. charset) in a 
value such as "text/html; charset=UTF-8" and would prefer to keep only 
"text/html".

Let's start with a simple regex capture group filter.  If we need to capture 
multiple matches, etc., we can add that in a later ticket.
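
As a rough illustration of the idea, here is a minimal sketch of the capture 
step using plain java.util.regex. The class name CaptureGroupSketch, the method 
captureFirstGroup, and the example pattern are hypothetical and are not part of 
Tika's API. Leaving the value unchanged when the regex does not match is also an 
assumption; the eventual filter may handle that case differently.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not Tika's actual metadata filter.
final class CaptureGroupSketch {

    private CaptureGroupSketch() {
    }

    /**
     * Returns the first capture group of the first match of the pattern in
     * the value. Assumption: if there is no match (or group 1 did not
     * participate in the match), the original value is returned unchanged.
     */
    static String captureFirstGroup(Pattern pattern, String value) {
        if (value == null) {
            return null;
        }
        Matcher m = pattern.matcher(value);
        if (m.find() && m.groupCount() >= 1 && m.group(1) != null) {
            return m.group(1);
        }
        return value;
    }
}
```

With the pattern "^\\s*([^;\\s]+)" (written as a Java string literal), the value 
"text/html; charset=UTF-8" would be reduced to "text/html". Only the first 
match is used, in line with starting simple and deferring multiple matches to a 
later ticket.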



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
