[jira] [Updated] (SPARK-23863) Wholetext mode should not add line breaks

2018-04-05 Thread Hyukjin Kwon (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-23863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-23863:
-
Fix Version/s: (was: 2.3.1)

> Wholetext mode should not add line breaks
> -
>
> Key: SPARK-23863
> URL: https://issues.apache.org/jira/browse/SPARK-23863
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 2.3.0
>Reporter: chencheng
>Priority: Major
>
> We merged multiple files in the Text format. When using the select count(1) 
> to count the merged results, we found that the result of the merged wholetext 
> mode was inconsistent with that of the normal mode.
> E.g:
>  Combine 10 text files with a total of 100 lines. When the wholeTextMode 
> parameter of the buildReader method is set to true, the result of select 
> count(1) will be 110.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-23863) Wholetext mode should not add line breaks

2018-04-04 Thread chencheng (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-23863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

chencheng updated SPARK-23863:
--
Description: 
We merged multiple files in the Text format. When using the select count(1) to 
count the merged results, we found that the result of the merged wholetext mode 
was inconsistent with that of the normal mode.

E.g:
 Combine 10 text files with a total of 100 lines. When the wholeTextMode 
parameter of the buildReader method is set to true, the result of select 
count(1) will be 110.

  was:
We merged multiple files in the Text format. When using the select count(*) to 
count the merged results, we found that the result of the merged wholetext mode 
was inconsistent with that of the normal mode.

E.g:
Combine 10 text files with a total of 100 lines. When the wholeTextMode 
parameter of the buildReader method is set to true, the result of select 
count(*) will be 110.


> Wholetext mode should not add line breaks
> -
>
> Key: SPARK-23863
> URL: https://issues.apache.org/jira/browse/SPARK-23863
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 2.3.0
>Reporter: chencheng
>Priority: Major
> Fix For: 2.3.1
>
>
> We merged multiple files in the Text format. When using the select count(1) 
> to count the merged results, we found that the result of the merged wholetext 
> mode was inconsistent with that of the normal mode.
> E.g:
>  Combine 10 text files with a total of 100 lines. When the wholeTextMode 
> parameter of the buildReader method is set to true, the result of select 
> count(1) will be 110.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org