On Jul 30, Alexmipego <alexmip...@gmail.com> wrote:

> Once the MD5 column was present on the
> distribution everyone would be compatible with my solution. You ask
> about when and where this "matching" would be done, and the beauty of
> this (for you) is that it can be done completely outside imdbpy2sql.

I admit you've more or less persuaded me. ;-)
If I'll have time, this week or the next one, I'll look at what
can be done.

> Btw, I initially thought that I could simply assume the ID you assign
> to each title is sequential, so I could simply assume the first line
> in the CSV was ID 1 and the last would simply match. However, I found
> out that the final sql table has more rows than the raw file and that
> means something is either wrong or some extra processing is done.

That's due to the fact that not every file in the set of the
plain text data files is updated at the same time, and so there
can be incongruences and even errors here and there.
So, every time imdbpy2sql is asked to handle a movie title (or a
person or company name, for instance), it checks if it's already
known or create it otherwise.
This means that imdbpy2sql will run nicely even if the movies.list.gz
file is removed.


Bye,
-- 
Davide Alberani <davide.alber...@gmail.com> [GPG KeyID: 0x465BFD47]
http://www.mimante.net/

------------------------------------------------------------------------------
The Palm PDK Hot Apps Program offers developers who use the
Plug-In Development Kit to bring their C/C++ apps to Palm for a share
of $1 Million in cash or HP Products. Visit us here for more details:
http://p.sf.net/sfu/dev2dev-palm
_______________________________________________
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help

Reply via email to