Hi,

I've taken a longer look at the problem, and I think that it is indeed
the fault of the signature code. For the two versions of the test
molecule we get:

...[C]([C]p[C]p[C]), [C](p[C]p[C]), [C](p[C]p[C]), [C](p[C]p[C]),
[C](p[C]p[C]), [C](p[C]p[C][Cl])...

for "ErnestBupropion" and :

...[C](p[C]p[C]), [C](p[C]p[C]), [C](p[C]p[C]), [C](p[C]p[C]),
[C](p[C]p[C][C]), [C](p[C]p[C][Cl]),...

for "MeteorBupropion". These are just the sublist of signatures for
the atoms that have aromatic bonds (at height 2). The error here is
for "[C]([C]p[C]p[C])" and "[C](p[C]p[C][C])" which should not be
different. The 'p' stands for aromatic, if you didn't already guess :)

So this might take a little while to fix, assuming there is no quick
patch. Perhaps next weekend or so - then I will upload a new version
of the library as a patch to the CDK, and then announce to this list.
Probably discussion of the problem (if any) should go on in cdk-devel
list, although people are welcome to mail me directly, or even fix it
themselves if they want to!

gilleain

On 3/15/13, Patrik Rydberg <[email protected]> wrote:
> Hi John,
>
> Thanks for the tip, but I'm already using the toCanonicalString function, so
> this should not be happening. Which is somewhat troubling.
>
> Gilleain Torrance is looking at the code as well right now.
>
> Patrik
>
> --------------------------------------------------------------------------
> Patrik Rydberg, Associate Professor, Ph.D.
>
> Biostructural Research Group
> Department of Drug Design and Pharmacology
> Faculty of Health and Medical Sciences
> University of Copenhagen
> DK-2100 Copenhagen
>
> http://www.farma.ku.dk/index.php/Patrik-Rydberg/5304/0/
>
> From: John May [mailto:[email protected]]
> Sent: 14. marts 2013 18:30
> To: Patrik Rydberg
> Cc: [email protected]
> Subject: Re: [Cdk-user] problems in the use of the signature class
>
> Hi Patrik,
>
> This might help,
> https://github.com/egonw/cdk/blob/master/src/main/org/openscience/cdk/fingerprint/SignatureFingerprinter.java#L73-L80
>
> I can have a look for if you like, have you got the code?
>
> J
>
> On 14 Mar 2013, at 12:49, Patrik Rydberg
> <[email protected]<mailto:[email protected]>> wrote:
>
>
> Hi CDK users,
>
> Is there anyone who has a working code that creates atomic signatures of
> varying height for a molecule?
>
> I have made my own code, but for some strange reason it can on occasion
> generate slightly different signatures for the same molecular structure
> (same molecule, but two different mol files with different atom ordering).
> I'm quite confused in how this can happen and wondering if the error is in
> my code or in the signature class.
>
> If anyone would be willing to help me find the error I would be very
> grateful.
>
> Best regards,
> Patrik Rydberg
>
>
> --------------------------------------------------------------------------
> Patrik Rydberg, Associate Professor, Ph.D.
>
> Biostructural Research Group
> Department of Drug Design and Pharmacology
> Faculty of Health and Medical Sciences
> University of Copenhagen
> DK-2100 Copenhagen
>
> http://www.farma.ku.dk/index.php/Patrik-Rydberg/5304/0/
>
>
> ------------------------------------------------------------------------------
> Everyone hates slow websites. So do we.
> Make your web apps faster with AppDynamics
> Download AppDynamics Lite for free today:
> http://p.sf.net/sfu/appdyn_d2d_mar_______________________________________________
> Cdk-user mailing list
> [email protected]<mailto:[email protected]>
> https://lists.sourceforge.net/lists/listinfo/cdk-user
>
>

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_mar
_______________________________________________
Cdk-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to