[ 
https://issues.apache.org/jira/browse/FLINK-11990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16798221#comment-16798221
 ] 

Aljoscha Krettek commented on FLINK-11990:
------------------------------------------

The reason for this is that truncate support was first introduced in Hadoop 
2.7.0 (HDFS-3107). For versions after 2.7.0 the {{BucketingSink}} will not 
write {{.valid-length}} files but directly truncate the file. The test 
simulates this behaviour by manually enumerating the {{.valid-length}} files by 
looking at the log entries.

For Hadoop 2.8.x {{LOG_LINES}} is empty. It seems bash is a bit strange here 
and will have one iteration of the loop with an empty string, that's why you 
see {{Truncating  to}}, i.e. its truncating nothing. That's also why you see 
the output from {{mv}} and {{rm}}.

> Streaming bucketing end-to-end test fail with hadoop 2.8
> --------------------------------------------------------
>
>                 Key: FLINK-11990
>                 URL: https://issues.apache.org/jira/browse/FLINK-11990
>             Project: Flink
>          Issue Type: Bug
>          Components: Connectors / Hadoop Compatibility
>            Reporter: Yu Li
>            Priority: Critical
>
> As titled, running the {{test_streaming_bucketing.sh}} case with hadoop 2.8 
> bundles always fail, while running with 2.6 bundles could pass.
> Command to run the case:
> {{FLINK_DIR=<flink dir> flink-end-to-end-tests/run-single-test.sh 
> test-scripts/test_streaming_bucketing.sh skip_check_exceptions}}
> The output with hadoop 2.8 
> [bundle|https://repository.apache.org/content/repositories/orgapacheflink-1213/org/apache/flink/flink-shaded-hadoop2-uber/2.8.3-1.8.0/flink-shaded-hadoop2-uber-2.8.3-1.8.0.jar]
>  or [dist|http://archive.apache.org/dist/hadoop/core/hadoop-2.8.5]:
> {noformat}
> Starting taskexecutor daemon on host z05f06378.sqa.zth.
> Waiting for job (905ae10bae4b99031e724b9c29f0ca7b) to reach terminal state 
> FINISHED ...
> Truncating buckets
> Truncating  to
> {noformat}
> The output of the success run with hadoop 2.6 
> [bundle|https://repository.apache.org/content/repositories/orgapacheflink-1213/org/apache/flink/flink-shaded-hadoop2-uber/2.6.5-1.8.0/flink-shaded-hadoop2-uber-2.6.5-1.8.0.jar]
>  or [dist|http://archive.apache.org/dist/hadoop/core/hadoop-2.6.5]:
> {noformat}
> Truncating 
> /home/jueding.ly/flink_rc_check/flink-1.8.0-src/flink-end-to-end-tests/test-scripts/temp-test-directory-06210353709/out/result3/part-3-0
>  to 51250
> 1+0 records in
> 1+0 records out
> 51250 bytes (51 kB) copied, 0.000377998 s, 136 MB/s
> Truncating 
> /home/jueding.ly/flink_rc_check/flink-1.8.0-src/flink-end-to-end-tests/test-scripts/temp-test-directory-06210353709/out/result7/part-3-0
>  to 51250
> 1+0 records in
> 1+0 records out
> 51250 bytes (51 kB) copied, 0.00033118 s, 155 MB/s
> pass Bucketing Sink
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to