[ 
https://issues.apache.org/jira/browse/TIKA-3454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369234#comment-17369234
 ] 

Lewis John McGibbney edited comment on TIKA-3454 at 6/25/21, 4:46 AM:
----------------------------------------------------------------------

There are a couple of confusing statements 

{quote}
Configuring Language Identifiers

At this time, there is no unified way to configure language identifiers. While 
the work on that is ongoing, for now you will need to review the Tika Javadocs 
to see how individual identifiers are configured.

Configuring Translators

At this time, there is no unified way to configure Translators. While the work 
on that is ongoing, for now you will need to review the Tika Javadocs to see 
how individual Translators are configured.
{quote}

The hyperlinks point to https://tika.apache.org/1.26/api/ which is not 
particularly useful.

I think this is going to take some collective input to arrive at a decent 
solution. Is anyone else interested in this?


was (Author: lewismc):
There are a couple of confusing statements 

{quote}
Configuring Language Identifiers

At this time, there is no unified way to configure language identifiers. While 
the work on that is ongoing, for now you will need to review the Tika Javadocs 
to see how individual identifiers are configured.

Configuring Translators

At this time, there is no unified way to configure Translators. While the work 
on that is ongoing, for now you will need to review the Tika Javadocs to see 
how individual Translators are configured.
{quote}

I think this is going to take some collective input. Is anyone else interested 
in this?

> Facilitate configuration of translation and transcription impls in 
> tika-server/tika-docker/tika-helm
> ----------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-3454
>                 URL: https://issues.apache.org/jira/browse/TIKA-3454
>             Project: Tika
>          Issue Type: Bug
>          Components: docker, helm, server
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Major
>             Fix For: 2.0.0
>
>
> I need an easy way to configure, for example, the 
> [AmazonTranscribe|https://github.com/apache/tika/blob/main/tika-parsers/tika-parsers-ml/tika-transcribe-aws/src/main/java/org/apache/tika/parser/transcribe/aws/AmazonTranscribe.java]
>  implementation when I deploy tika-server (tika-docker) via the Helm chart 
> into Kubernetes. The same goes for TIka translation implementations.
> We have [documentation for configuring tika-server to run via 
> Docker|https://github.com/apache/tika-docker#custom-config] however 
> currently, there is [no way to configure translators or 
> transcribers|https://tika.apache.org/1.26/configuring.html#Configuring_Translators]
>  
> This task will determine a sensible means by which we can configure 
> translators and transcribers for tika-server such that it can be used further 
> downstream via Docker and Helm on K8s.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to