Hi Suyog,

To my knowledge, canonical smiles string comprision is a good way to go for.
canonical smiles string is unique  string for any compound.

Regards,

Xin



suyog wrote:
> 
>     
> hi... all
>         I have one .sdf file with thousands of molecule in it...
> I want to identify duplicate molecules(having similar structure) from .sdf
> file using CDK...
> 
> How do I do this????
> Which classes should i use?? 
> Or if you have any sample code please paste link....
> 
> Any suggestion/ clue is appreciated.... 
> 
> 
> Thanks & Regards
> Suyog
> 

-- 
View this message in context: 
http://www.nabble.com/How-identify-douplicate-molecules----tp24305861p24307189.html
Sent from the cdk-user mailing list archive at Nabble.com.


------------------------------------------------------------------------------
_______________________________________________
Cdk-user mailing list
Cdk-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to