[ 
https://issues.apache.org/jira/browse/BEAM-9189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17548384#comment-17548384
 ] 

Danny McCormick commented on BEAM-9189:
---------------------------------------

This issue has been migrated to https://github.com/apache/beam/issues/20026

> Add Daffodil IO for Apache Beam
> -------------------------------
>
>                 Key: BEAM-9189
>                 URL: https://issues.apache.org/jira/browse/BEAM-9189
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-java-core
>            Reporter: Brian Hulette
>            Priority: P3
>              Labels: Clarified, gsoc
>
> From https://daffodil.apache.org/:
> {quote}Daffodil is an open source implementation of the DFDL specification 
> that uses these DFDL schemas to parse fixed format data into an infoset, 
> which is most commonly represented as either XML or JSON. This allows the use 
> of well-established XML or JSON technologies and libraries to consume, 
> inspect, and manipulate fixed format data in existing solutions. Daffodil is 
> also capable of the reverse by serializing or “unparsing” an XML or JSON 
> infoset back to the original data format.
> {quote}
> We should create a Beam IO that accepts a DFDL schema as an argument and can 
> then produce and consume data in the specified format. I think it would be 
> most natural for Beam users if this IO could produce Beam Rows, but an 
> initial version that just operates with Infosets could be useful as well.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to