The easiest way would be to do a substructure search as a subquery or
WITH statement first and then do a SMARTS search on the result. This
way you can keep your hydrogens and the speed (most of it).

Adrian

On Thu, Mar 17, 2011 at 19:11, JP <jeanpaul.ebe...@inhibox.com> wrote:
> Could the API possibly be altered to allow for H Removal (or not) ?
>
> On 17 March 2011 18:50, Greg Landrum <greg.land...@gmail.com> wrote:
>>
>> Hi JP,
>>
>> On Thu, Mar 17, 2011 at 2:55 PM, JP <jeanpaul.ebe...@inhibox.com> wrote:
>> > I am using RDKit 2010_12_1, in particular the database cartridge - but I
>> > am
>> > quite positive this is an RDkit core problem.
>>
>> Well, problem... feature.. it's all a matter of perspective. :-S
>>
>> > Now - I am trying a substructure search (using '@>' operator on the
>> > RDkit
>> > molecule table) using smiles [H]N([H])C(=O)C(=O)C (so two explicit
>> > hydrogens
>> > on the N which is bound to a C with double bond to O)  the following
>> > molecule is returned:
>> > I have the following smiles string:
>> > O=C(C(=O)N1CC[NH2+]CC1)c1ccc(cc1C)C
>> > Which RDKit::mol object
>> > Cc1ccc(C(C(N2CC[NH2+]CC2)=O)=O)c(C)c1
>> > But it shouldn't ! (No ?)
>> > Any help will be as usual - much appreciated.
>>
>> Here's what happens: when you construct the query molecule from SMILES
>> the Hs are removed, so the queries for [H]N([H])C(=O)C(=O)C and
>> NC(=O)C(=O)C end up being identical.
>>
>> At the moment, if you want to include the Hs in the query you have to
>> do a SMARTS query. This has the unfortunate side effect of making the
>> query substantially slower.
>>
>> I agree that this behavior is "suboptimal". I'm going to have to think
>> (and read) a bit to come up with a reasonable solution.
>>
>> -greg
>
>
>
> --
>
> Jean-Paul Ebejer
> Early Stage Researcher
> InhibOx Ltd
> Pembroke House
> 36-37 Pembroke Street
> Oxford
> OX1 1BP
> UK
> (+44 / 0) 1865 262 034
>
>
> This email and any files transmitted with it are confidential and intended
> solely for the use of the individual or entity to whom they are addressed.
> Any unauthorised dissemination or copying of this email or its attachments,
> and any use or disclosure of any information contained in them, is strictly
> prohibited and may be illegal.  If you have received this email in error
> please notify the sender and delete all copies from your system.
>
> We and our group companies accept no liability or responsibility for
> personal emails or emails unconnected with our business.
>
> Internet communications including emails and access and use of web sites
> cannot be guaranteed to be secure or error free as information can be
> intercepted, corrupted, lost or arrive late. Furthermore, while we have
> taken steps to control the spread of viruses on our systems, we cannot
> guarantee that this email and any files transmitted with it are virus free.
> No liability is accepted for any errors, omissions, interceptions, corrupted
> mail, lost communications or late delivery arising as a result of receiving
> this message via the Internet or for any virus that may be contained in it.
>
>
> ------------------------------------------------------------------------------
> Colocation vs. Managed Hosting
> A question and answer guide to determining the best fit
> for your organization - today and in the future.
> http://p.sf.net/sfu/internap-sfd2d
> _______________________________________________
> Rdkit-discuss mailing list
> Rdkit-discuss@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>
>

------------------------------------------------------------------------------
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to