[ https://issues.apache.org/jira/browse/BEAM-9189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Brian Hulette updated BEAM-9189:
--------------------------------
    Priority: P3  (was: P2)

> Add Daffodil IO for Apache Beam
> -------------------------------
>
>                 Key: BEAM-9189
>                 URL: https://issues.apache.org/jira/browse/BEAM-9189
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-java-core
>            Reporter: Brian Hulette
>            Priority: P3
>              Labels: gsoc, stale-P2
>
> From https://daffodil.apache.org/:
> {quote}Daffodil is an open source implementation of the DFDL specification
> that uses these DFDL schemas to parse fixed format data into an infoset,
> which is most commonly represented as either XML or JSON. This allows the use
> of well-established XML or JSON technologies and libraries to consume,
> inspect, and manipulate fixed format data in existing solutions. Daffodil is
> also capable of the reverse by serializing or “unparsing” an XML or JSON
> infoset back to the original data format.
> {quote}
> We should create a Beam IO that accepts a DFDL schema as an argument and can
> then produce and consume data in the specified format. I think it would be
> most natural for Beam users if this IO could produce Beam Rows, but an
> initial version that just operates with Infosets could be useful as well.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)