[ https://issues.apache.org/jira/browse/TIKA-3454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369234#comment-17369234 ]
Lewis John McGibbney edited comment on TIKA-3454 at 6/25/21, 4:46 AM: ---------------------------------------------------------------------- There are a couple of confusing statements {quote} Configuring Language Identifiers At this time, there is no unified way to configure language identifiers. While the work on that is ongoing, for now you will need to review the Tika Javadocs to see how individual identifiers are configured. Configuring Translators At this time, there is no unified way to configure Translators. While the work on that is ongoing, for now you will need to review the Tika Javadocs to see how individual Translators are configured. {quote} The hyperlinks point to https://tika.apache.org/1.26/api/ which is not particularly useful. I think this is going to take some collective input to arrive at a decent solution. Is anyone else interested in this? was (Author: lewismc): There are a couple of confusing statements {quote} Configuring Language Identifiers At this time, there is no unified way to configure language identifiers. While the work on that is ongoing, for now you will need to review the Tika Javadocs to see how individual identifiers are configured. Configuring Translators At this time, there is no unified way to configure Translators. While the work on that is ongoing, for now you will need to review the Tika Javadocs to see how individual Translators are configured. {quote} I think this is going to take some collective input. Is anyone else interested in this? > Facilitate configuration of translation and transcription impls in > tika-server/tika-docker/tika-helm > ---------------------------------------------------------------------------------------------------- > > Key: TIKA-3454 > URL: https://issues.apache.org/jira/browse/TIKA-3454 > Project: Tika > Issue Type: Bug > Components: docker, helm, server > Reporter: Lewis John McGibbney > Assignee: Lewis John McGibbney > Priority: Major > Fix For: 2.0.0 > > > I need an easy way to configure, for example, the > [AmazonTranscribe|https://github.com/apache/tika/blob/main/tika-parsers/tika-parsers-ml/tika-transcribe-aws/src/main/java/org/apache/tika/parser/transcribe/aws/AmazonTranscribe.java] > implementation when I deploy tika-server (tika-docker) via the Helm chart > into Kubernetes. The same goes for TIka translation implementations. > We have [documentation for configuring tika-server to run via > Docker|https://github.com/apache/tika-docker#custom-config] however > currently, there is [no way to configure translators or > transcribers|https://tika.apache.org/1.26/configuring.html#Configuring_Translators] > > This task will determine a sensible means by which we can configure > translators and transcribers for tika-server such that it can be used further > downstream via Docker and Helm on K8s. -- This message was sent by Atlassian Jira (v8.3.4#803005)