[ https://issues.apache.org/jira/browse/DAFFODIL-258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16438222#comment-16438222 ]
Michael Beckerle commented on DAFFODIL-258:
-------------------------------------------
This notion of lowering a delimiter to ISO-8859-1 characters (aka bytes) is
interesting.
Suppose a terminator is encoded in UTF-8. Consider:
{code}
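import java.nio.charset.StandardCharsets
// Encode as UTF-8, then reinterpret each byte as one ISO-8859-1 character.
val terminatorIn8859 = new String(terminator.getBytes(StandardCharsets.UTF_8),
  StandardCharsets.ISO_8859_1)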
{code}
So each multi-byte character in the UTF-8 encoding is "lowered" to multiple
characters (aka bytes) in ISO-8859-1. These lowered characters can then be
intermixed with raw bytes, which are no longer really "raw": every byte value
is a valid character code in ISO-8859-1.
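As a rough sketch of how that intermixing might look (the helper names
lower and rawBytes and the object LoweringSketch are hypothetical, made up
for illustration, and are not part of Daffodil), a delimiter could be built
from raw byte values plus a lowered UTF-8 terminator and then turned back
into the exact bytes to match:

```scala
import java.nio.charset.StandardCharsets

object LoweringSketch {
  // Hypothetical helper: encode s as UTF-8, then reinterpret each
  // resulting byte as one ISO-8859-1 character.
  def lower(s: String): String =
    new String(s.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1)

  // Hypothetical helper: represent raw byte values (0x00 to 0xFF) as
  // ISO-8859-1 characters, one character per byte.
  def rawBytes(bytes: Int*): String =
    new String(bytes.map(_.toByte).toArray, StandardCharsets.ISO_8859_1)

  def main(args: Array[String]): Unit = {
    val terminator = "\u00e9;"        // 'é' (two bytes in UTF-8) then ';'
    val lowered = lower(terminator)   // three ISO-8859-1 characters
    // Raw bytes and lowered characters now live in the same string.
    val delimiter = rawBytes(0xFF, 0x00) + lowered
    // Encoding back to ISO-8859-1 recovers the byte sequence to match.
    val bytes = delimiter.getBytes(StandardCharsets.ISO_8859_1)
    println(bytes.map(b => f"${b & 0xFF}%02X").mkString(" "))
    // prints: FF 00 C3 A9 3B
  }
}
```

Because ISO-8859-1 maps every byte value to exactly one character and back,
the round trip loses nothing, so matching on the lowered string amounts to
matching on bytes.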
> DFDL Entities: Implement Raw Bytes Entities
> -------------------------------------------
>
> Key: DAFFODIL-258
> URL: https://issues.apache.org/jira/browse/DAFFODIL-258
> Project: Daffodil
> Issue Type: New Feature
> Components: DFDL Language, Front End
> Reporter: Taylor Wise
> Priority: Major
> Fix For: deferred
>
>
> Implement raw byte entities.