Thanks!
I tried following https://beam
.apache.org/documentation/runtime/environments/ but get a "Custom images
are not yet supported" error message from DataFlow. Perhaps I did something
wrong?
"error": {
> "code": 400,
> "message": "(24f8c9b6e647d55d): The workflow could not be created. Causes:
> (24f8c9b6e647de48): Invalid worker harness container image: my_image.
> Custom images are not yet supported.",
> "status": "INVALID_ARGUMENT"
> }
>
On Wed, 27 Nov 2019 at 18:34, Kyle Weaver wrote:
> You can also configure your own Docker images if you like, instructions
> here: https://beam.apache.org/documentation/runtime/environments/
>
> On Wed, Nov 27, 2019 at 12:38 AM Carl Thomé wrote:
>
>> Hi,
>>
>> I have a Beam pipeline written in the Python SDK that decodes audio files
>> into TFRecord:s. I'd like to run it on DataFlow but I'm missing libsndfile1
>> in the workers.
>>
>> Is there any way of configuring the base image for the DataFlow workers
>> (e.g. Dockerfile + apt install) to get audio decoding working?
>>
>> On a similar note, when it comes to Python dependencies in the DataFlow
>> runtime (like librosa), is there a wish list somewhere on which we can
>> upvote missing Python libraries?
>>
>> Cheers,
>> Carl Thomé
>>
>