[
https://issues.apache.org/jira/browse/BEAM-9189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17548384#comment-17548384
]
Danny McCormick commented on BEAM-9189:
---------------------------------------
This issue has been migrated to https://github.com/apache/beam/issues/20026
> Add Daffodil IO for Apache Beam
> -------------------------------
>
> Key: BEAM-9189
> URL: https://issues.apache.org/jira/browse/BEAM-9189
> Project: Beam
> Issue Type: New Feature
> Components: sdk-java-core
> Reporter: Brian Hulette
> Priority: P3
> Labels: Clarified, gsoc
>
> From https://daffodil.apache.org/:
> {quote}Daffodil is an open source implementation of the DFDL specification
> that uses these DFDL schemas to parse fixed format data into an infoset,
> which is most commonly represented as either XML or JSON. This allows the use
> of well-established XML or JSON technologies and libraries to consume,
> inspect, and manipulate fixed format data in existing solutions. Daffodil is
> also capable of the reverse by serializing or “unparsing” an XML or JSON
> infoset back to the original data format.
> {quote}
> We should create a Beam IO that accepts a DFDL schema as an argument and can
> then produce and consume data in the specified format. I think it would be
> most natural for Beam users if this IO could produce Beam Rows, but an
> initial version that just operates with Infosets could be useful as well.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)