Limited Semi-PolyMorphic (LSP) structs?

Era Scarecrow Mon, 26 Aug 2013 04:26:12 -0700

Been a while and out of the loop, I need to get my hands dirty in the code again (soon). Anyways, let's get to the matter at hand as I'm thinking about it. I'm working on some code (or will work on code again) that could use a polymorphic type, but at the end it will never be shared externally and can never extend beyond 'Known' types. (and the 'types' are behavior 95% of the time)

True polymorphism will let me compile code to work with say 'Object' and you can plug in any code and pass it an object and it will be happy, or inherit and work with it. These are great as classes. But I want to throw a wrench in the gears, I want to use structs.

I know you're asking, Why? Well the data structure I'll be working with has records in the hundreds of thousands, but most of the time when looking at/sorting them I don't necessarily need to allocate memory (or really even interact with them beyond sorting). Most of the time the OO aspect is just behavior and the data between them never changes (behave this way because you're a type XXX). It can take a very long time to unpack everything (when you never needed to unpack in the first place, or allocate and then de-allocate immediately, probably without recycling).

Alright, structs can't inherit because they don't have an object pointer to go to their parent structures that they are connected to. If we eliminate the pointer and the overhead on some of that, we bring it down to the following:


 To be fully polymorphic you need to:
 * Be able to tell what type it is (Probably enum identifier)

* Extended types cannot be larger than the base/original (or the base has to hold extra data in reserve for its other types).
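A minimal sketch of those two requirements in D, as I read them. The names `Kind`, `Base`, `Text`, `LeftAlignedText` and `asLeftAligned` are made up for illustration, and the pointer cast is only an assumption about how the reinterpretation could be spelled by hand today, not an existing language feature.

```d
import std.stdio;

// Hypothetical tag: tells us which behavior a record currently has.
enum Kind : int { text, leftAlignedText }

// The base reserves enough room for its largest variant.
struct Base
{
    Kind kind = Kind.text;
    int[3] reserve; // baggage space shared by all variants
}

// Variants repeat the tag first, then lay their own fields over the reserve.
struct Text            { Kind kind; int length; }
struct LeftAlignedText { Kind kind; int length; int indent; }

// Checked at compile time: no variant may outgrow or out-align the base.
static assert(Text.sizeof <= Base.sizeof);
static assert(LeftAlignedText.sizeof <= Base.sizeof);
static assert(Text.alignof <= Base.alignof);
static assert(LeftAlignedText.alignof <= Base.alignof);

// Retag a record in place and view it as the wider variant (no allocation).
LeftAlignedText* asLeftAligned(ref Base b, int indent)
{
    b.kind = Kind.leftAlignedText;
    auto view = cast(LeftAlignedText*) &b;
    view.indent = indent;
    return view;
}

void main()
{
    Base b;
    auto view = asLeftAligned(b, 4);
    writeln(view.kind, " indent=", view.indent);
}
```

The static asserts are the part that matters: they turn the "cannot be larger than the base" rule into a compile error instead of silent memory corruption.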



 When can this LSP be applicable?

* When a type can be changed on the fly with no side effects other than intended (This 'text' struct is now 'left aligned text object')
* When allocations/pointers aren't needed (why fully build the object at all?)
* When it won't be extended beyond the current type... (only a few fixed subtypes)
* When teardown/deallocation/~this() isn't needed (never allocates? Never deallocates! Just throw it all away! PODs are perfect!)
* When you have no extra 'data' to append to a class/struct and only override methods/behavior (Encryption type switch from DES to BlowFish! Same API/methods, otherwise nothing really changed)
* When you need a half step up from structs but don't need full classes/objects


 Limitations: Hmmm..
 * LSP's cannot be a template or mixin (they can call on them)
 * Known at compile-time
 * Only one of them can handle setup/teardown.

Now, I wonder. What if we make it so it CAN inherit? We need 3 examples, where something is in A, A&B, and B. We will get to that. iA and iB are the class/structs and A/B/C are the methods/functions. So...


//original based on...
class iA {
  void A();
  void C();
}
class iB : iA {
  void A();
  void B();
}

 Breaking them down for structs we have:

struct base_AB { //the 'manager' that does redirection and has the known size

  char type = 'A'; //polymorphic type identifier, for simplicity
  //shared 'parent' baggage ie -> iA
  //baggage space for iB

  //Exists in both iA & iB
  void A() {
    switch(type) {
      case 'A': (cast(iA) this).A(); break;
      case 'B': (cast(iB) this).A(); break;
      default: throw new Exception("method A() invalid for struct type: " ~ type);
    }
  }

  //B only exists in inherited
  void B() {
    if (type == 'B') (cast(iB) this).B();
    else throw new Exception("method B() invalid for struct type: " ~ type);
  }

  //C exists everywhere
  void C() { (cast(iA) this).C(); }
}

So far the base is easy to make, now however as for how iA and iB interact with each other... Ignoring infinite loop problems:


struct iA {
  char type;
  //baggage - Size must be identical to base_AB

  void A() {
    A();

    //B(); /* iA doesn't really know about B so can't call it from here... not safely anyways */
    C();
  }
}

Now, C() can be called regardless of the type (no checks needed). iB.B() can also be called anywhere in iB with no problems, but it would be simpler if all calls regardless went to the base and let the base handle it. (Optimizations would probably remove unneeded checks later).


struct iB {
  char type;  //and other baggage

  void A() {
    A(); // actually: (cast(base_AB) this).A();
    B(); // actually: (cast(base_AB) this).B();
    C(); // actually: (cast(base_AB) this).C();
  }
  void B() {
    //ditto as A()'s
  }
}

This can probably be easily done by having a different naming set which handles the redirection, but the names and boilerplate seem unnecessarily large and verbose for a simple task; optimization will likely remove the excess unneeded portions in the final binary anyways.


//different name examples:
struct base_AB {
  char type;  //and other baggage

  void A() {
    switch(type) {
      case 'A': (cast(iA) this).realA(); break;
      case 'B': (cast(iB) this).realA(); break;
      default: throw new Exception("method A() invalid for struct type: " ~ type);
    }
  }
  //B() and C() redirect the same way as before
}

struct iB {
  char type;
  //baggage

  //boilerplate
  void A() { (cast(base_AB) this).A(); }
  void B() { (cast(base_AB) this).B(); }
  void C() { (cast(base_AB) this).C(); }

  void realA() {
    A();
    B(); //Path really is now: this.B() -> base_AB.B() -> iB.realB();
    C();
  }

  void realB() {  /* ... */  }
}
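For what it's worth, here is one way the renamed-method layout above could be written so it actually compiles today. This is a sketch under assumptions: the `int[2] baggage` payload, the string return values and `realC()` are invented for illustration, and `cast(iA*) &this` (a pointer reinterpretation) stands in for the `cast(iA) this` shorthand, since a plain struct-to-struct cast does not reinterpret memory in D.

```d
import std.stdio;

struct base_AB
{
    char type = 'A';
    int[2] baggage; // assumed payload, identical in every variant

    string A()
    {
        switch (type)
        {
            case 'A': return (cast(iA*) &this).realA();
            case 'B': return (cast(iB*) &this).realA();
            default: throw new Exception("A() invalid for struct type: " ~ type);
        }
    }

    // B only exists in the 'inherited' variant.
    string B()
    {
        if (type == 'B') return (cast(iB*) &this).realB();
        throw new Exception("B() invalid for struct type: " ~ type);
    }

    // C exists everywhere and is never overridden.
    string C() { return (cast(iA*) &this).realC(); }
}

struct iA
{
    char type;
    int[2] baggage;
    string realA() { return "iA.A"; }
    string realC() { return "iA.C"; }
}

struct iB
{
    char type;
    int[2] baggage;

    // boilerplate: route calls back through the manager
    string B() { return (cast(base_AB*) &this).B(); }
    string C() { return (cast(base_AB*) &this).C(); }

    string realA() { return "iB.A > " ~ B() ~ " > " ~ C(); }
    string realB() { return "iB.B"; }
}

// Size must be identical to base_AB.
static assert(iA.sizeof == base_AB.sizeof);
static assert(iB.sizeof == base_AB.sizeof);

void main()
{
    base_AB[2] items;   // plain values, no allocation per record
    items[1].type = 'B';
    foreach (ref item; items)
        writeln(item.type, ": ", item.A());
}
```

Note that the array holds plain values, so changing `type` is all it takes to change behavior, which is the point of the exercise.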

Now how would one build it without all the extra boilerplate overhead (also preferably with fewer confusing elements)? Three ideas are coming to mind.

1) Template. This can work but it's a lot of overhead, a lot of work and is hard to keep track of. I have an experimental version that semi-works but can't be used seriously. My project sorta relying on this has come to a halt for the moment. It's also very finicky where data (and struct order) are involved and refuses to compile out of a certain order. Also, calling functions that exist in the current struct requires you to forcibly call through 'this' or go around it in order for it to do the redirections elsewhere.

2) UDAs could be possible, if you tag a struct as belonging and then scan/add the appropriate code. But this seems like a lot of work and overhead. Probably won't work, plus a new UDA for each type vs known fixed extensions. Seems like a lot of extra work. Maybe?


  @LSP(iA, iB) struct base_AB {}
  @LSP_base(base_AB) struct iA {}
  @LSP_base(base_AB) struct iB {}

3) Compiler. This seems the most likely (and 95% of it is already in place for classes), but I can't implement it myself easily; plus I am not sure if Walter or Andrei would want/allow it. If you squint your eyes you can see it looks very similar to the original struct/class formula for C++ (without using pointers/new), except that the total size reserved is known/set in advance to incorporate all types (fixed size, no growing/shrinking by casting).
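Short of compiler support, the template route from idea 1) can at least generate the switch itself. The following is only a sketch, not my experimental version: `dispatch`, the `tag` enum member and the `int[2] baggage` payload are assumptions for illustration. It uses `static foreach` to emit one `case` per variant and `__traits(hasMember, ...)` to skip variants that lack the method, so those fall through to the `default` and throw.

```d
import std.stdio;

struct base_AB { char type = 'A'; int[2] baggage; }

struct iA
{
    enum char tag = 'A'; // compile-time tag, takes no space
    char type; int[2] baggage;
    string A() { return "iA.A"; }
    string C() { return "iA.C"; }
}

struct iB
{
    enum char tag = 'B';
    char type; int[2] baggage;
    string A() { return "iB.A"; }
    string B() { return "iB.B"; }
    string C() { return (cast(iA*) &this).C(); } // 'inherited' from iA
}

static assert(iA.sizeof == base_AB.sizeof);
static assert(iB.sizeof == base_AB.sizeof);

// Generates the redirection switch for one method name over a fixed
// list of variants.
string dispatch(string method, Variants...)(ref base_AB b)
{
    switch (b.type)
    {
        static foreach (V; Variants)
        {
            static if (__traits(hasMember, V, method))
            {
                case V.tag:
                    return mixin("(cast(V*) &b)." ~ method ~ "()");
            }
        }
        default:
            throw new Exception(method ~ "() invalid for struct type: " ~ b.type);
    }
}

void main()
{
    base_AB item;
    writeln(dispatch!("A", iA, iB)(item));
    item.type = 'B';
    writeln(dispatch!("A", iA, iB)(item));
    writeln(dispatch!("B", iA, iB)(item));
}
```

This removes the hand-written switch but not the awkward call syntax, which is part of why the compiler route still looks more natural to me.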

Perhaps introducing a third base type (so structs, classes and LSPs)? This might help, but it depends on how much extra complexity would be needed and how many bugs it could introduce... or incorporate it into the normal struct as an experimental/disabled feature unless you apply a certain attribute (or set of attributes?)
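The opt-in-by-attribute part can be approximated with UDAs as they exist now. A rough sketch, assuming a made-up marker struct `LSP` that carries the tag, plus hypothetical helpers `isVariant`, `tagOf` and `become`; only `getUDAs` and `hasUDA` are real (from `std.traits`).

```d
import std.stdio;
import std.traits : getUDAs, hasUDA;

struct LSP { char tag; } // hypothetical marker attribute carrying the tag

struct base_AB { char type = 'A'; int[2] baggage; }

@LSP('A') struct iA { char type; int[2] baggage; }
@LSP('B') struct iB { char type; int[2] baggage; }
struct Plain { char type; int[2] baggage; } // same layout, but not opted in

// A type counts as a variant only if it is tagged and matches the base size.
enum bool isVariant(T) = hasUDA!(T, LSP) && T.sizeof == base_AB.sizeof;

// Reads the tag back out of the attribute at compile time.
enum char tagOf(T) = getUDAs!(T, LSP)[0].tag;

static assert(isVariant!iA && isVariant!iB);
static assert(!isVariant!Plain);
static assert(tagOf!iA == 'A' && tagOf!iB == 'B');

// Retagging is only allowed for types that carry the attribute.
void become(T)(ref base_AB b) if (isVariant!T)
{
    b.type = tagOf!T;
}

void main()
{
    base_AB b;
    become!iB(b);
    writeln(b.type);
    // Untagged types are rejected at compile time.
    static assert(!__traits(compiles, become!Plain(b)));
}
```

This only covers the bookkeeping (who belongs, which tag, size check); the method redirection would still have to be generated separately.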

If anyone has suggestions/ideas on how to make this work and feel more natural (class-like without allocation/pointers), they would be welcome. It's possible this type of feature is just not wanted. But I'm willing to bet there are quite a few places this type of feature/set could be utilized when you consider what it can be used for.
