Udi Meiri created BEAM-3965:
-------------------------------

             Summary: HDFS read broken
                 Key: BEAM-3965
                 URL: https://issues.apache.org/jira/browse/BEAM-3965
             Project: Beam
          Issue Type: Bug
          Components: sdk-py-core
            Reporter: Udi Meiri
            Assignee: Udi Meiri


When running a command like:

{{python setup.py sdist > /dev/null && python -m apache_beam.examples.wordcount 
--output gs://.../py-wordcount-output --hdfs_host ... --hdfs_port 50070 
--hdfs_user ehudm --runner DataflowRunner --project ... --temp_location 
gs://.../temp-hdfs-int --staging_location gs://.../staging-hdfs-int 
--sdk_location dist/apache-beam-2.5.0.dev0.tar.gz --input hdfs://kinglear.txt}}

I get:

{{Traceback (most recent call last):}}
{{ File "/usr/lib/python2.7/runpy.py", line 174, in _run_module_as_main}}
{{ "__main__", fname, loader, pkg_name)}}
{{ File "/usr/lib/python2.7/runpy.py", line 72, in _run_code}}
{{ exec code in run_globals}}
{{ File ".../beam/sdks/python/apache_beam/examples/wordcount.py", line 136, in 
<module>}}
{{ run()}}
{{ File ".../beam/sdks/python/apache_beam/examples/wordcount.py", line 90, in 
run}}
{{ lines = p | 'read' >> ReadFromText(known_args.input)}}
{{ File "apache_beam/io/textio.py", line 522, in __init__}}
{{ skip_header_lines=skip_header_lines)}}
{{ File "apache_beam/io/textio.py", line 117, in __init__}}
{{ validate=validate)}}
{{ File "apache_beam/io/filebasedsource.py", line 119, in __init__}}
{{ self._validate()}}
{{ File "apache_beam/options/value_provider.py", line 124, in _f}}
{{ return fnc(self, *args, **kwargs)}}
{{ File "apache_beam/io/filebasedsource.py", line 176, in _validate}}
{{ match_result = FileSystems.match([pattern], limits=[1])[0]}}
{{ File "apache_beam/io/filesystems.py", line 159, in match}}
{{ return filesystem.match(patterns, limits)}}
{{ File "apache_beam/io/hadoopfilesystem.py", line 221, in match}}
{{ raise BeamIOError('Match operation failed', exceptions)}}
{{apache_beam.io.filesystem.BeamIOError: Match operation failed with exceptions 
\{'hdfs://kinglear.txt': KeyError('name',)}}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to