Greg,

Thank you.
Questions :)

1) I'm getting the following error message:
nms.remove('MolecularFormula')
ValueError: list.remove(x): x not in list

when I check the list of descriptor names indeed I don't see
MolecularFormula,

2) I'm getting the warning message:
[11:27:24] WARNING: The AvailDescriptors module is deprecated. Please
switch to using the Descriptors module.

3) Does the following means that each descriptors is effectively
binarized?
# number of possible values of each descriptor:
nPossible = [0]+[2]*ndescrs+[2]

4) Is this the correct way to build RandomForest? The wiki page sadly
stops at "bag of decision trees", I tried to extrapolate example from
RandomForest for binary fingerprint:
cmp.Grow(pts,attrs=attrs,nPossibleVals=nPossible,nTries=100,randomDescriptors=20,
         buildDriver=CrossValidate.CrossValidationDriver,
treeBuilder=QuantTreeBoot,needsQuantization=False,nQuantBounds=boundsPerVar,maxDepth=100)

I'm specifically concerned that needsQuantization=False (should it be
True in this case?), also maxDepth parameter - from what I understand
the randomForest trees should not be pruned, why is there maxDepth
parameter at all?

5) How do I suppress output of
Cycle:    0
Cycle:   10
...
?

6) The wiki seems to stop abruptly at
"Composite models can be also be pickled to disk and then reloaded to
classify new points:"

7) For some strange reason the model i build in this way predicts
nothing but zeros! The previous model - with fingerprints was predicting
something at least...
Do I still use ClassifyExample similar to as before?
cmp.ClassifyExample([i]+list(test_descrs[i])+[act])

Best regards,
Igor



On Thu, 2011-05-05 at 05:24 -0400, Greg Landrum wrote:
> Igor,
> 
> On Tue, May 3, 2011 at 6:03 PM, Igor Filippov <[email protected]> 
> wrote:
> > Questions:
> >
> > 1) How can I use real-valued descriptors, such as MOE-like descriptors
> > for such modeling? Do I need to pick descriptors one-by-one or is there
> > something like "AllDescriptors" which computes all of them in one pass?
> 
> An initial version of this is now here:
> http://code.google.com/p/rdkit/wiki/BuildingModelsUsingDescriptors1
> 
> -greg
> 
> ------------------------------------------------------------------------------
> WhatsUp Gold - Download Free Network Management Software
> The most intuitive, comprehensive, and cost-effective network 
> management toolset available today.  Delivers lowest initial 
> acquisition cost and overall TCO of any competing solution.
> http://p.sf.net/sfu/whatsupgold-sd
> _______________________________________________
> Rdkit-discuss mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss



------------------------------------------------------------------------------
WhatsUp Gold - Download Free Network Management Software
The most intuitive, comprehensive, and cost-effective network 
management toolset available today.  Delivers lowest initial 
acquisition cost and overall TCO of any competing solution.
http://p.sf.net/sfu/whatsupgold-sd
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to