[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17572858#comment-17572858 ]

Apache Spark commented on SPARK-38639:
--------------------------------------

User 'caican00' has created a pull request for this issue:
https://github.com/apache/spark/pull/37341

> Support ignoreCorruptRecord flag to ensure querying broken sequence file
> table smoothly
> ------------------------------------------------------------------------
>
>                 Key: SPARK-38639
>                 URL: https://issues.apache.org/jira/browse/SPARK-38639
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.1.2, 3.2.1
>            Reporter: tonydoen
>            Priority: Minor
>             Fix For: 3.2.1
>
>   Original Estimate: 48h
>  Remaining Estimate: 48h
>
> There are existing flags, "spark.sql.files.ignoreCorruptFiles" and
> "spark.sql.files.ignoreMissingFiles", that quietly ignore attempted reads
> from files that have been corrupted or are missing, but queries can still
> fail on sequence files.
>
> Being able to ignore corrupt records is useful when users want to query
> dirty data (for example, mixed schemas in one table) successfully.
>
> We would like to add a "spark.sql.hive.ignoreCorruptRecord" flag to fill
> this gap.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
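The behaviour requested in the issue description can be illustrated with a small, Spark-independent sketch. It assumes that the proposed flag would skip individual records that fail to deserialize instead of failing the whole query. None of this is taken from the linked pull requests: `decode_row`, `read_records`, `CorruptRecordError`, and the `ignore_corrupt_record` parameter are hypothetical names used only for illustration.

```python
from typing import Iterable, Iterator, Tuple


class CorruptRecordError(Exception):
    """Raised by the hypothetical decoder when a record cannot be parsed."""


def decode_row(raw: bytes) -> Tuple[int, str]:
    """Hypothetical decoder: expects a UTF-8 payload of the form b'<int>,<name>'."""
    try:
        key, name = raw.decode("utf-8").split(",")
        return int(key), name
    except (UnicodeDecodeError, ValueError) as exc:
        # Bad encoding, wrong number of fields, or a non-integer key.
        raise CorruptRecordError(raw) from exc


def read_records(
    raw_records: Iterable[bytes],
    ignore_corrupt_record: bool = False,
) -> Iterator[Tuple[int, str]]:
    """Yield decoded rows; skip undecodable ones only when the flag is set."""
    for raw in raw_records:
        try:
            yield decode_row(raw)
        except CorruptRecordError:
            if not ignore_corrupt_record:
                # Default behaviour: one bad record fails the whole read.
                raise
            # Flag enabled: drop the bad record and keep reading.
            continue


if __name__ == "__main__":
    rows = [b"1,alice", b"\xff\xfe", b"2,bob", b"oops"]
    for row in read_records(rows, ignore_corrupt_record=True):
        print(row)
```

With the flag left at its default, the same input raises `CorruptRecordError` on the first undecodable record, which mirrors the query failure described in the issue.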
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17572857#comment-17572857 ]
Apache Spark commented on SPARK-38639:
--------------------------------------
User 'caican00' has created a pull request for this issue:
https://github.com/apache/spark/pull/37341
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513318#comment-17513318 ]
Apache Spark commented on SPARK-38639:
--------------------------------------
User 'TonyDoen' has created a pull request for this issue:
https://github.com/apache/spark/pull/35990
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513316#comment-17513316 ]
Apache Spark commented on SPARK-38639:
--------------------------------------
User 'TonyDoen' has created a pull request for this issue:
https://github.com/apache/spark/pull/35990
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511874#comment-17511874 ]
Apache Spark commented on SPARK-38639:
--------------------------------------
User 'TonyDoen' has created a pull request for this issue:
https://github.com/apache/spark/pull/35963
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511873#comment-17511873 ]
Apache Spark commented on SPARK-38639:
--------------------------------------
User 'TonyDoen' has created a pull request for this issue:
https://github.com/apache/spark/pull/35963
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511855#comment-17511855 ]
Apache Spark commented on SPARK-38639:
--------------------------------------
User 'TonyDoen' has created a pull request for this issue:
https://github.com/apache/spark/pull/35962
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511854#comment-17511854 ]
Apache Spark commented on SPARK-38639:
--------------------------------------
User 'TonyDoen' has created a pull request for this issue:
https://github.com/apache/spark/pull/35962
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511484#comment-17511484 ]
tonydoen commented on SPARK-38639:
----------------------------------
Related PR: https://github.com/apache/spark/pull/35954
[jira] [Commented] (SPARK-38639) Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly
[ https://issues.apache.org/jira/browse/SPARK-38639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511478#comment-17511478 ]
Apache Spark commented on SPARK-38639:
--------------------------------------
User 'TonyDoen' has created a pull request for this issue:
https://github.com/apache/spark/pull/35954