+1 On Sat, 29 Jul 2023 at 22:49, Maciej <mszymkiew...@gmail.com> wrote:
> +1 > > Best regards, > Maciej Szymkiewicz > > Web: https://zero323.net > PGP: A30CEF0C31A501EC > > On 7/29/23 11:28, Mich Talebzadeh wrote: > > +1 for me. > > Though Databriks did a good job releasing the code. > > GitHub - databricks/spark-xml: XML data source for Spark SQL and DataFrames > <https://github.com/databricks/spark-xml> > > > Mich Talebzadeh, > Solutions Architect/Engineering Lead > Palantir Technologies Limited > London > United Kingdom > > > view my Linkedin profile > <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/> > > > https://en.everybodywiki.com/Mich_Talebzadeh > > > > *Disclaimer:* Use it at your own risk. Any and all responsibility for any > loss, damage or destruction of data or any other property which may arise > from relying on this email's technical content is explicitly disclaimed. > The author will in no case be liable for any monetary damages arising from > such loss, damage or destruction. > > > > > On Sat, 29 Jul 2023 at 06:34, Jia Fan <fanjiaemi...@qq.com.invalid> > <fanjiaemi...@qq.com.invalid> wrote: > >> >> + 1 >> >> >> 2023年7月29日 13:06,Adrian Pop-Tifrea <poptifreaadr...@gmail.com> 写道: >> >> +1, the more data source formats, the better, and if the solution is >> already thoroughly tested, I say we should go for it. >> >> On Sat, Jul 29, 2023, 06:35 Xiao Li <gatorsm...@gmail.com> wrote: >> >>> +1 >>> >>> On Fri, Jul 28, 2023 at 15:54 Sean Owen <sro...@gmail.com> wrote: >>> >>>> +1 I think that porting the package 'as is' into Spark is probably >>>> worthwhile. >>>> That's relatively easy; the code is already pretty battle-tested and >>>> not that big and even originally came from Spark code, so is more or less >>>> similar already. >>>> >>>> One thing it never got was DSv2 support, which means XML reading would >>>> still be somewhat behind other formats. (I was not able to implement it.) >>>> This isn't a necessary goal right now, but would be possibly part of >>>> the logic of moving it into the Spark code base. >>>> >>>> On Fri, Jul 28, 2023 at 5:38 PM Sandip Agarwala >>>> <sandip.agarw...@databricks.com.invalid> >>>> <sandip.agarw...@databricks.com.invalid> wrote: >>>> >>>>> Dear Spark community, >>>>> >>>>> I would like to start the vote for "SPIP: XML data source support". >>>>> >>>>> XML is a widely used data format. An external spark-xml package ( >>>>> https://github.com/databricks/spark-xml) is available to read and >>>>> write XML data in spark. Making spark-xml built-in will provide a better >>>>> user experience for Spark SQL and structured streaming. The proposal is to >>>>> inline code from the spark-xml package. >>>>> >>>>> SPIP link: >>>>> >>>>> https://docs.google.com/document/d/1ZaOBT4-YFtN58UCx2cdFhlsKbie1ugAn-Fgz_Dddz-Q/edit?usp=sharing >>>>> >>>>> JIRA: >>>>> https://issues.apache.org/jira/browse/SPARK-44265 >>>>> >>>>> Discussion Thread: >>>>> https://lists.apache.org/thread/q32hxgsp738wom03mgpg9ykj9nr2n1fh >>>>> >>>>> Please vote on the SPIP for the next 72 hours: >>>>> [ ] +1: Accept the proposal as an official SPIP >>>>> [ ] +0 >>>>> [ ] -1: I don’t think this is a good idea because __. >>>>> >>>>> Thanks, Sandip >>>>> >>>> >>