I was just reading that EXT3 uses an HTREE because it's more efficient than a BTREE; jdbm has an HTREE implementation.
Mark
Edson Tirelli wrote:
Mark,
My feeling is that once someone is using an "ordering" constraint
(>, >=, <, <=), it means the attribute will have many possible
values (not a small discrete set of possible values). So the issue is
not really the overhead of a tree vs. a hashmap, but whether the
overhead of a tree pays off against the overhead of not indexing at
all in cases like that. My feeling is that it does pay off and is
worth at least a try.
I'm saying this because what I think you are referring to when you
say a hashmap+training-data approach (or hints, as suggested by
Michael) will not address the general case.
I mean, when you have a rule like:
when
A( $var : attr )
B( attr2 > $var )
then
Using a hashmap will lead to a bucket-count explosion that may
cause the hashmap to perform worse than the tree. Also, realize that
in the above case a naive approach will add a single B fact to
multiple buckets, since it may match more than one A with
different "attr" values.
So that's why I think a tree is better for the general case of
"ordering" constraints. Specific cases may be handled in a different way.
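To make the tree idea concrete, here is a minimal sketch (not the actual Drools index code; class and fact names are hypothetical) using java.util.TreeMap, whose tailMap view returns every entry with a key strictly greater than the join value, which is exactly the B( attr2 > $var ) lookup:

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import java.util.TreeMap;

public class RangeIndexSketch {

    // Index B facts by attr2 in a sorted tree (a red-black tree in the JDK),
    // one bucket per distinct key, so each fact is stored exactly once.
    static void addFact(TreeMap<Integer, List<String>> index, int attr2, String fact) {
        index.computeIfAbsent(attr2, k -> new ArrayList<>()).add(fact);
    }

    public static void main(String[] args) {
        TreeMap<Integer, List<String>> index = new TreeMap<>();
        addFact(index, 10, "B1");
        addFact(index, 20, "B2");
        addFact(index, 30, "B3");

        // For A( $var : attr ) with attr == 15, find all B with attr2 > 15
        // in O(log n) + output size, without duplicating facts across buckets.
        int var = 15;
        Collection<List<String>> matches = index.tailMap(var, false).values();
        System.out.println(matches); // [[B2], [B3]]
    }
}
```

The point is that the tree gives one bucket per distinct value and answers any range query over them, whereas a hashmap would need a bucket per (fact, matching-$var) pair, which is the explosion described above.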
I know htrees only by name. Any specific feature you think may be
useful for us?
[]s
Edson
Mark Proctor wrote:
I wonder if the overhead of a btree (btw, have you seen htrees?)
compared to a hashmap is worthwhile. I expect it is, as join ordering
can make a big difference.
Mark
Edson Tirelli wrote:
Mark,
The approach I was looking into at that time was not feasible because
we used to keep the order of asserted facts/tuples on a per-node basis. With
the core changes you made in 3.1, we can implement range ordering by
replacing the current hashmap index with a tree index. No need for
training data, in my understanding.
Maybe we will need a composed-index approach for some cases,
but the general solution idea is simple.
[]s
Edson
Mark Proctor wrote:
Actually, I was just thinking about some stuff Edson has done.
With solvers we know the available data and ranges, right? We can
use this to order indexes. I know this was something Edson looked
into, but without training data we couldn't make it worthwhile,
and the same goes for custom indexing. So we can start to incorporate
those to get faster joins for known data sets.
Mark
Geoffrey De Smet wrote:
The more I learn from JCHS (or prolog for that matter),
the more I am starting to think that this is a different way of
solving.
1) JCHS/prolog looks like (or is) declarative solving.
2) Taseree is actually more hybrid; the general idea behind it is:
- Drools (declarative programming) is very easy for evaluation
but very difficult for solving.
- Local/tabu search (procedural programming) is easy for solving
but difficult for evaluation.
Both have their advantages and disadvantages; for example,
local search is generally faster but doesn't recognize the optimal
solution.
To me it seems they are both interesting to implement,
and there must be some common ground too.
Should we hold a conference call about it this weekend?
It would be a good idea to compare JCHS and Taseree on a couple of
problems, like the tt problem:
http://mat.gsia.cmu.edu/TOURN/
---------------------------------------------------------------------
To unsubscribe from this list please visit:
http://xircles.codehaus.org/manage_email