GitHub user abifet opened a pull request:
https://github.com/apache/incubator-samoa/pull/42
SAMOA-52: Fix nominal attribute problem in VHT
1. We don't need to split if the class distribution in a node is pure.
2. We need to reset the best and second-best attributes after each split
attempt.
Each time we decide whether to split, we should not reuse the best and
second-best splits from previous attempts. Instead, on every attempt we
recollect the information from the attributes, pick the best and second-best
splits, and decide using the Hoeffding bound. After that, we reset the best
and second-best splits and start again.
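The split decision described above can be sketched roughly as follows. This
is a minimal, standalone illustration and not the actual VHT code from the
patch: the class name, method names and parameters are all hypothetical, the
per-attribute merits are assumed to be already computed (for example as
information gain), and refinements such as a tie-breaking threshold are left
out. The bound used is the standard Hoeffding bound,
epsilon = sqrt(R^2 * ln(1/delta) / (2n)).

```java
public class SplitDecisionSketch {

    /** Hoeffding bound: sqrt(R^2 * ln(1/delta) / (2n)). */
    static double hoeffdingBound(double range, double delta, double n) {
        return Math.sqrt((range * range * Math.log(1.0 / delta)) / (2.0 * n));
    }

    /** A node is pure when at most one class has a non-zero count. */
    static boolean isPure(double[] classCounts) {
        int nonZero = 0;
        for (double c : classCounts) {
            if (c > 0) {
                nonZero++;
            }
        }
        return nonZero < 2;
    }

    /**
     * Decides whether to split, given fresh per-attribute merits.
     * All names here are illustrative assumptions, not SAMOA's API.
     */
    static boolean shouldSplit(double[] classCounts, double[] attributeMerits,
                               double range, double delta) {
        // Point 1: never split a node whose class distribution is pure.
        if (isPure(classCounts) || attributeMerits.length < 2) {
            return false;
        }
        double n = 0;
        for (double c : classCounts) {
            n += c;
        }
        // Point 2: best and second best are local to this attempt, so
        // nothing is carried over from a previous attempt.
        double best = Double.NEGATIVE_INFINITY;
        double secondBest = Double.NEGATIVE_INFINITY;
        for (double merit : attributeMerits) {
            if (merit > best) {
                secondBest = best;
                best = merit;
            } else if (merit > secondBest) {
                secondBest = merit;
            }
        }
        // Split only if the gap between the two exceeds the bound.
        return best - secondBest > hoeffdingBound(range, delta, n);
    }
}
```

Keeping the best and second-best values as local variables is one simple way
to guarantee the reset: each call starts from scratch, so a stale winner from
an earlier attempt cannot influence the current decision.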
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/abifet/incubator-samoa SAMOA-52
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-samoa/pull/42.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #42
----
commit 2b8da26d2aa0c9c5b2db63c2ec114ae82be77a5e
Author: Albert Bifet <[email protected]>
Date: 2015-12-07T13:20:00Z
SAMOA-52: Fix nominal attributes problem in VHT
----