Re: Let's stop parser Hell

Roman D. Boiko Sat, 07 Jul 2012 15:20:35 -0700

On Saturday, 7 July 2012 at 22:03:20 UTC, Chad J wrote:

enum : SyntaxElement
{
  AST_EXPRESSION          = 0x0001_0000_0000_0000,
    AST_UNARY_EXPR        = 0x0000_0001_0000_0000 |

This would cause wasting space (probably a lot). Definitely itwould not be easy to implement in a parser generator, whenvarious language properties are not known beforehand forfine-grained tuning.

This approach of course has shameful nesting limitations, but Ifeel like these determinations could be fairly well optimizedeven for the general case. For example: another approach thatI might be more inclined to take is to give each token/symbol alow-valued index into a small inheritance table.

Depending on implementation, that might introduce the multiplieroverhead of table access per each comparison (and there would bemany in case of searching for nodes of specific type).

I would expect the regex engine to call the isA function as oneof it's operations. Thus placing an AST_EXPRESSION into yourexpression would also match an AST_NEGATE_EXPR too.

But actually it is not so difficult to implement in a verysimilar way to what you described. I was thinking about a lookuptable, but different from a traditional inheritance table. Itwould be indexed by AST node type (integral enum value), andstore various classification information as bits. Maybe this iswhat you meant and I misunderstood you... Example is here:https://github.com/roman-d-boiko/dct/blob/May2012drafts/fe/core.d(sorry, it doesn't show how to do classification, and has adifferent context, but I hope you get the idea). The advantageover storing hierarchical information directly in each token isobviously memory usage.

Re: Let's stop parser Hell

Reply via email to