[jira] [Commented] (SPARK-44810) XML: ArrayType and MapType support in from_xml

2024-05-17 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17847373#comment-17847373 ] Sandip Agarwala commented on SPARK-44810: - Yes, we don't plan to support root-level ArrayType or

[jira] [Resolved] (SPARK-44789) XML: Spark connect support

2024-05-12 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandip Agarwala resolved SPARK-44789. - Resolution: Fixed > XML: Spark connect support > -- > >

[jira] [Resolved] (SPARK-46108) XML: keepInnerXmlAsRaw option

2024-05-12 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandip Agarwala resolved SPARK-46108. - Resolution: Not A Problem As discussed in the PR (#44022) comments , this is already

[jira] [Resolved] (SPARK-44810) XML: ArrayType and MapType support in from_xml

2024-05-12 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandip Agarwala resolved SPARK-44810. - Resolution: Fixed > XML: ArrayType and MapType support in from_xml >

[jira] [Commented] (SPARK-47219) XML: Ignore commented row tags in XML tokenizer

2024-04-26 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17841353#comment-17841353 ] Sandip Agarwala commented on SPARK-47219: - Correct. Thanks for pointing it out. I closed it as

[jira] [Resolved] (SPARK-47219) XML: Ignore commented row tags in XML tokenizer

2024-04-26 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-47219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandip Agarwala resolved SPARK-47219. - Resolution: Duplicate > XML: Ignore commented row tags in XML tokenizer >

[jira] [Created] (SPARK-47219) XML: Ignore commented row tags in XML tokenizer

2024-02-28 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-47219: --- Summary: XML: Ignore commented row tags in XML tokenizer Key: SPARK-47219 URL: https://issues.apache.org/jira/browse/SPARK-47219 Project: Spark Issue

[jira] [Created] (SPARK-46954) XML: Perf optimizations

2024-02-01 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46954: --- Summary: XML: Perf optimizations Key: SPARK-46954 URL: https://issues.apache.org/jira/browse/SPARK-46954 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-46952) XML: Limit size of corrupt record

2024-02-01 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46952: --- Summary: XML: Limit size of corrupt record Key: SPARK-46952 URL: https://issues.apache.org/jira/browse/SPARK-46952 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-46667) XML: Throw error on multiple XML data source

2024-01-10 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46667: --- Summary: XML: Throw error on multiple XML data source Key: SPARK-46667 URL: https://issues.apache.org/jira/browse/SPARK-46667 Project: Spark Issue

[jira] [Created] (SPARK-46630) XML: Validate XML element name on write

2024-01-08 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46630: --- Summary: XML: Validate XML element name on write Key: SPARK-46630 URL: https://issues.apache.org/jira/browse/SPARK-46630 Project: Spark Issue Type:

[jira] [Created] (SPARK-46599) XML: Use TypeCoercion.findTightestCommonType for compatibility check

2024-01-04 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46599: --- Summary: XML: Use TypeCoercion.findTightestCommonType for compatibility check Key: SPARK-46599 URL: https://issues.apache.org/jira/browse/SPARK-46599 Project:

[jira] [Created] (SPARK-46587) XML: Fix XSD big integer conversion

2024-01-03 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46587: --- Summary: XML: Fix XSD big integer conversion Key: SPARK-46587 URL: https://issues.apache.org/jira/browse/SPARK-46587 Project: Spark Issue Type:

[jira] [Updated] (SPARK-46153) XML: Add TimestampNTZType support

2023-12-12 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandip Agarwala updated SPARK-46153: Summary: XML: Add TimestampNTZType support (was: XML: Add TimestampNTZType support in

[jira] [Created] (SPARK-46355) XML: Close InputStreamReader on read completion

2023-12-10 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46355: --- Summary: XML: Close InputStreamReader on read completion Key: SPARK-46355 URL: https://issues.apache.org/jira/browse/SPARK-46355 Project: Spark Issue

[jira] [Created] (SPARK-46153) XML: Add TimestampNTZType support in schema inference

2023-11-28 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46153: --- Summary: XML: Add TimestampNTZType support in schema inference Key: SPARK-46153 URL: https://issues.apache.org/jira/browse/SPARK-46153 Project: Spark

[jira] [Created] (SPARK-46152) XML: Add DecimalType support in schema inference

2023-11-28 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-46152: --- Summary: XML: Add DecimalType support in schema inference Key: SPARK-46152 URL: https://issues.apache.org/jira/browse/SPARK-46152 Project: Spark Issue

[jira] [Created] (SPARK-45562) XML: Make 'rowTag' a required option

2023-10-16 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-45562: --- Summary: XML: Make 'rowTag' a required option Key: SPARK-45562 URL: https://issues.apache.org/jira/browse/SPARK-45562 Project: Spark Issue Type:

[jira] [Created] (SPARK-45488) XML: Add support for value in 'rowTag' element

2023-10-10 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-45488: --- Summary: XML: Add support for value in 'rowTag' element Key: SPARK-45488 URL: https://issues.apache.org/jira/browse/SPARK-45488 Project: Spark Issue

[jira] [Created] (SPARK-45399) XML: Add XML Options using newOption

2023-10-03 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-45399: --- Summary: XML: Add XML Options using newOption Key: SPARK-45399 URL: https://issues.apache.org/jira/browse/SPARK-45399 Project: Spark Issue Type:

[jira] [Created] (SPARK-45225) XML: XSD file URL support

2023-09-19 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-45225: --- Summary: XML: XSD file URL support Key: SPARK-45225 URL: https://issues.apache.org/jira/browse/SPARK-45225 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45190) XML: StructType schema issue in pyspark connect

2023-09-17 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-45190: --- Summary: XML: StructType schema issue in pyspark connect Key: SPARK-45190 URL: https://issues.apache.org/jira/browse/SPARK-45190 Project: Spark Issue

[jira] [Created] (SPARK-45186) XML: Refine docstring of from_xml, schema_of_xml

2023-09-16 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-45186: --- Summary: XML: Refine docstring of from_xml, schema_of_xml Key: SPARK-45186 URL: https://issues.apache.org/jira/browse/SPARK-45186 Project: Spark Issue

[jira] [Created] (SPARK-44810) XML: ArrayType and MapType support in from_xml

2023-08-14 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44810: --- Summary: XML: ArrayType and MapType support in from_xml Key: SPARK-44810 URL: https://issues.apache.org/jira/browse/SPARK-44810 Project: Spark Issue

[jira] [Created] (SPARK-44790) XML: to_xml

2023-08-12 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44790: --- Summary: XML: to_xml Key: SPARK-44790 URL: https://issues.apache.org/jira/browse/SPARK-44790 Project: Spark Issue Type: Sub-task Components:

[jira] [Created] (SPARK-44789) XML: Spark connect support

2023-08-12 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44789: --- Summary: XML: Spark connect support Key: SPARK-44789 URL: https://issues.apache.org/jira/browse/SPARK-44789 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-44788) XML: Add pyspark.sql.functions

2023-08-12 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandip Agarwala updated SPARK-44788: Summary: XML: Add pyspark.sql.functions (was: XML Add pyspark.sql.functions) > XML: Add

[jira] [Created] (SPARK-44788) XML Add pyspark.sql.functions

2023-08-12 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44788: --- Summary: XML Add pyspark.sql.functions Key: SPARK-44788 URL: https://issues.apache.org/jira/browse/SPARK-44788 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-44787) XML: Add SQL Expressions

2023-08-12 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44787: --- Summary: XML: Add SQL Expressions Key: SPARK-44787 URL: https://issues.apache.org/jira/browse/SPARK-44787 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-44753) XML: Add Python and sparkR binding including Spark Connect

2023-08-09 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44753: --- Summary: XML: Add Python and sparkR binding including Spark Connect Key: SPARK-44753 URL: https://issues.apache.org/jira/browse/SPARK-44753 Project: Spark

[jira] [Created] (SPARK-44752) XML: Update Spark Docs

2023-08-09 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44752: --- Summary: XML: Update Spark Docs Key: SPARK-44752 URL: https://issues.apache.org/jira/browse/SPARK-44752 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-44751) XML: Implement FIleFormat Interface

2023-08-09 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandip Agarwala updated SPARK-44751: Description: This will also address most of the review comments from the first XML PR:

[jira] [Created] (SPARK-44751) XML: Implement FIleFormat Interface

2023-08-09 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44751: --- Summary: XML: Implement FIleFormat Interface Key: SPARK-44751 URL: https://issues.apache.org/jira/browse/SPARK-44751 Project: Spark Issue Type:

[jira] [Updated] (SPARK-44265) Built-in XML data source support

2023-07-19 Thread Sandip Agarwala (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandip Agarwala updated SPARK-44265: Description: XML is a widely used data format. An external spark-xml package

[jira] [Created] (SPARK-44265) Built-in XML data source support

2023-06-30 Thread Sandip Agarwala (Jira)
Sandip Agarwala created SPARK-44265: --- Summary: Built-in XML data source support Key: SPARK-44265 URL: https://issues.apache.org/jira/browse/SPARK-44265 Project: Spark Issue Type: New