Hey Chris,

Given that the code generation is a serious piece of work, with a lot of code to go with it, I wanted to highlight just one thing. Lambda expressions might result in messy stack traces. Since frame parsing is in some ways a recursive process, it might result in multiple lambdas on the stack trace. As long as we have conditionality only with primitive fields, it will work. Yet it will be much harder to debug generated code if we chain (de)serialization of complex types.
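
To make the stack trace concern concrete, here is a minimal Java sketch. All names in it (LambdaTraceDemo, readField, parseNested) are hypothetical and not taken from PLC4X; it only assumes that nested parsing is routed through a generic helper taking a lambda. Each nesting level then contributes a synthetic lambda frame (named lambda$... by javac) plus a helper frame to the trace.

```java
import java.util.Arrays;
import java.util.function.Supplier;

// Hypothetical illustration; none of these names come from PLC4X.
class LambdaTraceDemo {

    // Generic helper that runs one parse step supplied as a lambda.
    static <T> T readField(Supplier<T> parser) {
        return parser.get();
    }

    // Nested "complex type" parsing: every level wraps the next one in a lambda
    // and the innermost level fails.
    static int parseNested(int depth) {
        return readField(() -> {
            if (depth == 0) {
                throw new IllegalStateException("parse failed at innermost field");
            }
            return parseNested(depth - 1);
        });
    }

    // Counts the synthetic lambda frames in the trace of a failed nested parse.
    static long countLambdaFrames(int depth) {
        try {
            parseNested(depth);
            return 0;
        } catch (IllegalStateException e) {
            return Arrays.stream(e.getStackTrace())
                .filter(frame -> frame.getMethodName().startsWith("lambda$"))
                .count();
        }
    }
}
```

Calling LambdaTraceDemo.countLambdaFrames(2) should report one lambda frame per nesting level, interleaved with readField and parseNested frames, which is the kind of noise that gets harder to read once complex types are chained.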

A while ago I updated the Java ConversationContext to not rely on lambdas, so stack traces always start with the org.apache.plc4x package. ;-) None of this should hold you back from doing your thing. I believe I will be able to de-lambdize the Java templates if we find it necessary later on.

Best,
Łukasz


On 07.10.2021 19:08, Christofer Dutz wrote:
Hi,

now that I'm sort of almost finished with adjusting the mspec parsers and 
model types, I'm planning to update the code generation templates.

I know that if I do it the normal way, this would excessively blow up the 
complexity and size of the template files as well as that of the produced code.

As this code has become more and more complex over the last year anyway, I 
would like to radically clean up the code here.

For that I had an idea, which I formalized a bit on the Confluence page: 
https://cwiki.apache.org/confluence/display/PLC4X/MSPEC+improvements

TL;DR: I would create static methods that handle the reading/writing of every 
field type and simply use these instead of generating code for every field.
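
As a rough illustration of the idea, the sketch below shows what such static helpers could look like in Java. It is not the design from the Confluence page: the class FieldReaders, the Reader interface, and all method names are assumptions made up for this example, and java.nio.ByteBuffer stands in for whatever read buffer the generated code uses.

```java
import java.nio.ByteBuffer;

// Hypothetical sketch: these names are illustrative, not the actual PLC4X API.
class FieldReaders {

    // Functional shape of "read one value from the buffer".
    interface Reader<T> {
        T read(ByteBuffer buffer);
    }

    // Reads a simple field by delegating to the given reader.
    static <T> T readSimpleField(String name, ByteBuffer buffer, Reader<T> reader) {
        return reader.read(buffer);
    }

    // Reads an optional field; returns null without consuming input when the condition is false.
    static <T> T readOptionalField(String name, ByteBuffer buffer, Reader<T> reader, boolean condition) {
        return condition ? reader.read(buffer) : null;
    }

    // Reads a const field and checks it against the expected value.
    static <T> T readConstField(String name, ByteBuffer buffer, Reader<T> reader, T expected) {
        T actual = reader.read(buffer);
        if (!expected.equals(actual)) {
            throw new IllegalStateException(name + ": expected " + expected + " but got " + actual);
        }
        return actual;
    }

    // Example readers for unsigned 8-bit and 16-bit (big-endian) values.
    static int readUint8(ByteBuffer buffer) {
        return buffer.get() & 0xFF;
    }

    static int readUint16(ByteBuffer buffer) {
        return buffer.getShort() & 0xFFFF;
    }
}
```

With helpers like these, a generated parse method would shrink to one call per field, for example readConstField("type", buffer, FieldReaders::readUint8, 7) followed by readSimpleField("length", buffer, FieldReaders::readUint16), instead of an inlined code block for every field.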

Feel free to comment,

Chris


-----Original Message-----
From: Christofer Dutz <[email protected]>
Sent: Monday, 4 October 2021 08:55
To: [email protected]
Subject: RE: Changes on mspec: parameterized type refs, assert, try, const

Hi all,

so over the weekend Sebastian and I have been working hard on an updated mspec 
version.
All of this is happening in "feature/mspec-ng".

So far the changes:
- We introduced a concept of "attributes", which can be added to fields and types (these 
are name=expression tuples). They are added before any argument block. They can currently be used 
for setting the encoding for string types (possibly even for other types in case we need them), and 
for the upcoming "endianess" attribute, which should allow switching endianness.
- We introduced a new simple type "vstring" for variable-length strings. The 
expression, however, is optional.
- The previous "string" now only accepts a fixed number of bits.
- The encoding for string fields (string and vstring) defaults to "UTF-8", but you can 
override it with an attribute: encoding='"UTF-16"'
- The previous encoding string has been removed from string types.
- We added a new type of "field": "batchSet", which takes a list of attributes. These are 
automatically added to all fields it contains. (Currently this is only available in "type", 
"complexType" elements or inside typeSwitch case elements.)

I think this was about it for now ... still need to implement the attribute 
functionality ... it's currently parsed, but not processed yet.

Chris


-----Original Message-----
From: Christofer Dutz <[email protected]>
Sent: Tuesday, 28 September 2021 14:57
To: [email protected]
Subject: RE: Changes on mspec: parameterized type refs, assert, try, const

Another thought I had last night ... while sort of not being able to sleep (at 
least it had something good)

So I think a "try" field should always be an optional field, as it might be set 
or not; the only difference is the condition.

This got me thinking even further ... how about making the condition of an 
"optional" field optional? ... So in general what Sebastian was proposing with 
a:

[try simple SomeType 'coolField']

Would actually be:

[optional SomeType 'coolField']

This would simplify things even more...

Chris



-----Original Message-----
From: Christofer Dutz <[email protected]>
Sent: Sunday, 26 September 2021 16:03
To: [email protected]
Subject: RE: Changes on mspec: parameterized type refs, assert, try, const

Bringing the thought of the other discussion here too ...

How about adding name-value pairs to the type declarations as well as to the fields? Then the "try" 
flag could be replaced with a name-value pair or even a name=expression pair ... sort of like 
"parse-behaviour=try" or something similar. Then we wouldn't start adding all sorts of 
"flags".

Chris

-----Original Message-----
From: Sebastian Rühl <[email protected]>
Sent: Sunday, 26 September 2021 15:44
To: [email protected]
Subject: Re: Changes on mspec: parameterized type refs, assert, try, const

Hey Łukasz,

the optional after the try is just a coincidence from the BACnet spec. It says 
that the low limit should always appear with a high limit, and that if the id is 
set then the name should not be set. But in itself the try is self-sufficient 
and indeed behaves like an optional without an expression. Other than that you 
can always group dependent fields as a complex... but let's see how this evolves.
These blocks could indeed come in handy as an option to define things...

- Sebastian

On 2021/09/25 20:23:08, Łukasz Dywicki <[email protected]> wrote:
Hey Sebastian,
Not that long ago we had a similar discussion. Back then it was mainly
about optionals. See "[DISCUSS] Change the way we use "optional"".

Back then we did not make any changes to mspec, just ranted about
possible approaches to make better use of optional fields.
I remember that the current mspec/code can't, for example, handle a
BACnet whois without all fields. Looking at your example I guess you
are trying to solve that puzzle.

I don't have any strong feelings here, but why not use an optional
field with an additional "reset" flag?
To me, what rings a bell in your mail is the assumption that try is
always followed by an optional field. This tells me that 'try' should
rather be a block than a field. This way we will be able to pair
these visually as well as handle the situation when there is one try
statement but multiple optional fields.

Best,
Łukasz


On 24.09.2021 23:52, Sebastian Rühl wrote:
Hi all,

I have some exciting changes in the pipeline regarding the mspec:

1. parameters on type refs
      with that change it is now possible to target a discriminated child in advance.
2. assert keyword
      with that change it is possible to throw a ParserAssertException (in Java, or errors in other languages). This field is similar to a const, but instead of a ParseException a ParserAssertException is thrown. In contrast to a const, the check expression can be dynamic (e.g. virtual fields, now working on develop).
3. try keyword to prefix fields
      with that change it is possible to try to parse some content, and in case an assert fails it resets the buffer.
4. const is now extended to type references
      this change allows enums to be used as const values.

All these changes allow encapsulating behavior in complex types, so you don't 
need to repeat yourself (DRY).
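
The interplay of assert and try described above can be sketched in Java roughly as follows. This is only an illustration under assumptions: TryFieldSketch, Reader, tryRead and readTagged are hypothetical names, java.nio.ByteBuffer stands in for the real read buffer, and the nested ParserAssertException is a local stand-in for the exception of the same name mentioned above.

```java
import java.nio.ByteBuffer;

// Hypothetical sketch of "try" semantics; not the generated PLC4X code.
class TryFieldSketch {

    // Local stand-in for the ParserAssertException mentioned in the mail.
    static class ParserAssertException extends RuntimeException {
        ParserAssertException(String message) {
            super(message);
        }
    }

    interface Reader<T> {
        T read(ByteBuffer buffer);
    }

    // Attempts to parse; if an assert fails, rewinds the buffer and returns null.
    static <T> T tryRead(ByteBuffer buffer, Reader<T> reader) {
        int start = buffer.position();
        try {
            return reader.read(buffer);
        } catch (ParserAssertException e) {
            buffer.position(start);
            return null;
        }
    }

    // Reads a one-byte tag number followed by a one-byte value and asserts the tag number.
    static int readTagged(ByteBuffer buffer, int expectedTag) {
        int tag = buffer.get() & 0xFF;
        if (tag != expectedTag) {
            throw new ParserAssertException("tag " + tag + " != " + expectedTag);
        }
        return buffer.get() & 0xFF;
    }
}
```

Trying tag 0 on a buffer that actually holds tag 2 yields null and leaves the buffer position untouched, so the next field can be attempted on the same bytes.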

Here is an example working with BACnet:
          ['0x07' BACnetUnconfirmedServiceRequestWhoHas
              [try simple     BACnetComplexTagUnsignedInteger ['0', 'BACnetDataType.UNSIGNED_INTEGER' ] 'deviceInstanceRangeLowLimit'  ]
              [optional       BACnetComplexTagUnsignedInteger ['1', 'BACnetDataType.UNSIGNED_INTEGER' ] 'deviceInstanceRangeHighLimit' 'deviceInstanceRangeLowLimit != null']
              [try simple     BACnetComplexTagOctetString     ['2', 'BACnetDataType.OCTET_STRING'     ] 'objectIdentifier' ]
              [optional       BACnetComplexTagOctetString     ['3', 'BACnetDataType.OCTET_STRING'     ] 'objectName' 'objectIdentifier == null' ]
          ]

The logic of whether a type matches is asserted in the type itself. The second 
field, an optional, implies that when the first element appears the second must 
be present. The last pair tries to read the first type, and if that fails it 
uses the second type.

Here is the snippet from the parent type:

[discriminatedType 'BACnetComplexTag' [uint 4 'tagNumberArgument', BACnetDataType 'dataType']
      [assert        uint 4           'tagNumber'                 'tagNumberArgument'                                           ]
      [const         TagClass         'tagClass'                  'TagClass.CONTEXT_SPECIFIC_TAGS'                              ]
      [simple        uint 3           'lengthValueType'                                                                         ]
      .....
      [virtual       uint 32          'actualLength'     'lengthValueType == 5 ....']
      [typeSwitch 'dataType'
          ....
          ['OCTET_STRING' BACnetComplexTagOctetString [uint 32 'actualLength']
              // TODO: The reader expects int but uint32 gets mapped to long so even uint32 would easily overflow...
              [virtual    uint    16                           'actualLengthInBit' 'actualLength * 8']
              [simple     string 'actualLengthInBit' 'ASCII'   'theString']
          ]

Would love to hear some opinions! If there are no objections I would push this 
change to develop soon.

- Sebastian

PatchContent:
Index: code-generation/protocol-base-mspec/src/main/antlr4/org/apache/plc4x/plugins/codegenerator/language/mspec/MSpec.g4
<+>UTF-8
===================================================================
diff --git a/code-generation/protocol-base-mspec/src/main/antlr4/org/apache/plc4x/plugins/codegenerator/language/mspec/MSpec.g4 b/code-generation/protocol-base-mspec/src/main/antlr4/org/apache/plc4x/plugins/codegenerator/language/mspec/MSpec.g4
--- a/code-generation/protocol-base-mspec/src/main/antlr4/org/apache/plc4x/plugins/codegenerator/language/mspec/MSpec.g4        (revision ef35531d5a872f29dccddb3a11a135b166958185)
+++ b/code-generation/protocol-base-mspec/src/main/antlr4/org/apache/plc4x/plugins/codegenerator/language/mspec/MSpec.g4        (date 1632518519426)
@@ -34,7 +34,7 @@
    ;
fieldDefinition
- : LBRACKET field (LBRACKET params=multipleExpressions RBRACKET)? RBRACKET
+ : LBRACKET tryParse? field (LBRACKET params=multipleExpressions RBRACKET)? RBRACKET
    ;
dataIoDefinition
@@ -49,6 +49,7 @@
    | discriminatorField
    | enumField
    | implicitField
+ | assertField
    | manualArrayField
    | manualField
    | optionalField
@@ -73,7 +74,7 @@
    ;
constField
- : 'const' type=dataType name=idExpression expected=expression
+ : 'const' type=typeReference name=idExpression expected=expression
    ;
discriminatorField
@@ -88,6 +89,10 @@
    : 'implicit' type=dataType name=idExpression serializeExpression=expression
    ;
+assertField
+ : 'assert' type=typeReference name=idExpression condition=expression
+ ;
+
   manualArrayField
    : 'manualArray' type=typeReference name=idExpression loopType=ARRAY_LOOP_TYPE loopExpression=expression parseExpression=expression serializeExpression=expression lengthExpression=expression
    ;
@@ -129,7 +134,7 @@
    ;
typeReference
- : complexTypeReference=IDENTIFIER_LITERAL
+ : complexTypeReference=IDENTIFIER_LITERAL (LBRACKET params=multipleExpressions RBRACKET)?
    | simpleTypeReference=dataType
    ;
@@ -150,6 +155,10 @@
    | base='dateTime'
    ;
+tryParse
+ : 'try'
+ ;
+
   argument
    : type=typeReference name=idExpression
    ;

