[jira] [Updated] (SPARK-23863) Wholetext mode should not add line breaks

2018-04-04 Thread chencheng (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-23863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

chencheng updated SPARK-23863:
--
Description: 
We merged multiple files in the Text format. When we ran select count(1) on 
the merged results, the count produced in wholetext mode was inconsistent with 
the count produced in normal mode.

E.g.:
Combine 10 text files with a total of 100 lines. When the wholeTextMode 
parameter of the buildReader method is set to true, select count(1) returns 
110, which does not match the 100 lines in the input.

  was:
We merged multiple files in the Text format. When we ran select count(*) on 
the merged results, the count produced in wholetext mode was inconsistent with 
the count produced in normal mode.

E.g.:
Combine 10 text files with a total of 100 lines. When the wholeTextMode 
parameter of the buildReader method is set to true, select count(*) returns 
110, which does not match the 100 lines in the input.


> Wholetext mode should not add line breaks
> -
>
>                 Key: SPARK-23863
>                 URL: https://issues.apache.org/jira/browse/SPARK-23863
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.3.0
>            Reporter: chencheng
>            Priority: Major
>             Fix For: 2.3.1



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-23863) Wholetext mode should not add line breaks

2018-04-03 Thread chencheng (JIRA)
chencheng created SPARK-23863:
-

             Summary: Wholetext mode should not add line breaks
                 Key: SPARK-23863
                 URL: https://issues.apache.org/jira/browse/SPARK-23863
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 2.3.0
            Reporter: chencheng
             Fix For: 2.3.1


We merged multiple files in the Text format. When we ran select count(*) on 
the merged results, the count produced in wholetext mode was inconsistent with 
the count produced in normal mode.

E.g.:
Combine 10 text files with a total of 100 lines. When the wholeTextMode 
parameter of the buildReader method is set to true, select count(*) returns 
110, which does not match the 100 lines in the input.
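
The arithmetic in the example above can be reproduced with a small model in
plain Python. This is only a sketch and does not call Spark: the helper names
(write_sample_files, merge, count_lines) and the extra_line_break flag are
hypothetical, and the even split of 10 lines per file is an assumption. It
assumes, as the issue title suggests, that the surplus rows come from one
line break being appended after each file's content when the file is read as
a single record.

```python
import os
import tempfile


def write_sample_files(directory, n_files=10, lines_per_file=10):
    """Create n_files text files, each with lines_per_file newline-terminated lines."""
    paths = []
    for i in range(n_files):
        path = os.path.join(directory, f"part-{i:02d}.txt")
        # newline="" keeps "\n" untranslated on every platform.
        with open(path, "w", newline="") as f:
            for j in range(lines_per_file):
                f.write(f"file{i}-line{j}\n")
        paths.append(path)
    return paths


def merge(paths, extra_line_break):
    """Read each file as one record and concatenate the records.

    extra_line_break=True models the suspected behavior: a line break is
    appended after every whole-file record before it is written out.
    """
    chunks = []
    for path in paths:
        with open(path, newline="") as f:
            content = f.read()
        chunks.append(content + ("\n" if extra_line_break else ""))
    return "".join(chunks)


def count_lines(text):
    """Count rows the way a line-based reader would see the merged output."""
    return len(text.splitlines())


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        paths = write_sample_files(tmp)
        print(count_lines(merge(paths, extra_line_break=False)))  # 100
        print(count_lines(merge(paths, extra_line_break=True)))   # 110
```

Under these assumptions each of the 10 files contributes one empty extra line,
which accounts for the difference between 100 and 110.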


