On 27 May 2011, at 2:05am, Frank Chang wrote:

> Simon Slavin, thank you for your suggestion. Our deduper prototype uses 
> fuzzy matching methods such as the Levenshtein distance to detect duplicates. 
> We have found that these fuzzy matching methods are best implemented in C++ 
> to meet our processing-time requirements.

Okay, you are correct.  A distance metric means you can't use the dodge I 
described.  Sorry.

>              We would still like to know your experience with SQLite WAL 
> databases compared to SQLite non-WAL databases. In particular, we are 
> interested in read performance in SQLite WAL databases. Is it possible for 
> SQLite WAL databases to have faster read processing than SQLite non-WAL 
> databases? If so, what method should we use to gain the read improvement?

I hope that other people will answer this.  However, since speed is important 
to you: if only one thread of one process is using SQLite, you can save time 
by using PRAGMAs that turn off the multi-process, multi-user safeguards.  In 
fact you might be able to turn journalling off entirely.
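
As a rough illustration of the kind of PRAGMAs meant here, the sketch below opens a database for a single thread of a single process. It is written in Python only for brevity; the same PRAGMA statements can be issued from C++ through sqlite3_exec(). The function name open_single_user and the particular choice of settings are assumptions made for this example, not a recommendation from the thread. Note that with the journal off, a crash or a failed statement in the middle of a write can leave the database corrupt, and ROLLBACK is no longer reliable.

```python
import os
import sqlite3
import tempfile


def open_single_user(path):
    """Open `path` with settings that assume exactly one thread of one process.

    Hypothetical helper for illustration only. Returns the connection and the
    journal mode SQLite reports after the change.
    """
    conn = sqlite3.connect(path)
    # Hold the file lock for the life of the connection instead of
    # acquiring and releasing it around every transaction.
    conn.execute("PRAGMA locking_mode = EXCLUSIVE")
    # Do not wait for data to reach the disk surface after each write.
    conn.execute("PRAGMA synchronous = OFF")
    # Disable the rollback journal entirely; the PRAGMA returns the new mode.
    mode = conn.execute("PRAGMA journal_mode = OFF").fetchone()[0]
    return conn, mode


if __name__ == "__main__":
    # Use a throwaway file so the example does not touch real data.
    db_path = os.path.join(tempfile.mkdtemp(), "scratch.db")
    conn, mode = open_single_user(db_path)
    print("journal mode:", mode)
    conn.execute("CREATE TABLE names(value TEXT)")
    conn.execute("INSERT INTO names VALUES ('Frank Chang')")
    conn.commit()
    print(conn.execute("SELECT count(*) FROM names").fetchone()[0])
    conn.close()
```

These settings trade safety for speed, so they only make sense when the data can be rebuilt if something goes wrong.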

Simon.
_______________________________________________
sqlite-users mailing list
sqlite-users@sqlite.org
http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users