Hi Greg,

> What these are telling you is that the second query is not using the index:
> it's a sequential scan, so it has to test all rows of the database. This
> happens because the index is defined for the operator %, but not for the
> function tanimoto_sml(). There may be an approach to get the index set up
> using that function, but there we reach the limits of my expertise.

Well, I will stick to the recommended operator use then!


> One final advanced topic: if you are planning on making regular use of the
> similarity features in the cartridge and are running on a linux system or
> Mac I would recommend recompiling the cartridge with some optimizations for
> tanimoto similarity. To do this, you need to edit the cartridge Makefile
> from:
> PG_CPPFLAGS = -I${BOOSTHOME} -I${RDKIT}/Code -DRDKITVER='"007200"'
> ${INCHIFLAGS} #-DUSE_BUILTIN_POPCOUNT -msse4.2
> 
> to:
> PG_CPPFLAGS = -I${BOOSTHOME} -I${RDKIT}/Code -DRDKITVER='"007200"'
> ${INCHIFLAGS} -DUSE_BUILTIN_POPCOUNT -msse4.2
> 
> (I just removed a comment character here). This speeds the Tanimoto
> calculation up a fair bit (it's still not nearly as fast as Andrew's
> chemfp, but it's better than the default behavior).

I'm on Linux (Ubuntu) and have just rebuilt with the above recommendation.
I'll see what the speeds look like afterwards (out of interest, I presume the
timings in your examples were with this optimisation in place?).

Does this also affect Dice similarity?
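
To make that question concrete, here is a minimal Python sketch of why I would expect both measures to depend on the same bit counting. It is not the cartridge's C code: the function names, the use of plain integers as fingerprints, and the choice to return 0.0 for two empty fingerprints are all assumptions for illustration. Whether the cartridge's Dice code actually goes through the optimised popcount path is exactly what I'm asking.

```python
def popcount(x: int) -> int:
    """Count the set bits of a non-negative integer (illustrative helper)."""
    return bin(x).count("1")


def tanimoto(a: int, b: int) -> float:
    """Tanimoto similarity: |A & B| / (|A| + |B| - |A & B|)."""
    common = popcount(a & b)
    total = popcount(a) + popcount(b) - common
    # Returning 0.0 for two empty fingerprints is this sketch's own convention.
    return common / total if total else 0.0


def dice(a: int, b: int) -> float:
    """Dice similarity: 2 * |A & B| / (|A| + |B|)."""
    common = popcount(a & b)
    total = popcount(a) + popcount(b)
    # Same empty-fingerprint convention as above (an assumption).
    return 2 * common / total if total else 0.0


# Two toy 8-bit fingerprints (hypothetical values).
fp1 = 0b10110110  # 5 bits set
fp2 = 0b10011100  # 4 bits set; 3 bits in common with fp1

print(tanimoto(fp1, fp2))  # 3 / (5 + 4 - 3) = 0.5
print(dice(fp1, fp2))      # 2 * 3 / (5 + 4), about 0.667
```

Both functions need only three bit counts (each fingerprint and their intersection), so a faster popcount should in principle help either one.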

And a final question: after rebuilding the cartridge, does the extension need
to be dropped and re-created in every database, does the PostgreSQL server
need restarting, or neither?


> Hope this helps,
> -greg

It does - thanks!

Kind regards

James

The Vernalis Group of Companies
100 Berkshire Place
Wharfedale Road
Winnersh, Berkshire
RG41 5RD, England
Tel: +44 (0)118 938 0000

_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
