You can do dfdl:separator="%#x02;" or even dfdl:separator="%STX;" (Section 
6.3.1.2 Table 4 DFDL Entities)

The "%" introduces a DFDL-specific character entity.

I generally recommened people use the DFDL "%" instead of the XML "&"

You are only stuck with dealing with the E000 stuff when those control 
characters appear in the values of elements. Delimiters that are explicit in 
the DFDL schema aren't part of the infoset, (they won't show up in your XML) so 
none of the E000 remapping occurs for those strings.

I hope that helps.

-mike beckerle

________________________________
From: Attila Horvath <[email protected]>
Sent: Thursday, July 22, 2021 1:56 PM
To: [email protected] <[email protected]>
Subject: CSV - hex char separator?

If I have a Character Separated Value [CSV] file, where the character is any 7 
bit hexadecimal character instead of simply 'comma' separated - eg: STX [0x02],
how can that be specified in a '<xs:sequence dfdl:separator="..."' attribute?

I tried '<xs:sequence dfdl:separator="&#xE002;"'. Syntactically it is correct 
but daffodil is not recognizing the STX [0x02] character as a field delimiter.

[image.png]

Thx in advance,

Attila

Reply via email to