Here's an annoying one I just came across:

(I finally turned my test loop into a function :-)

#!/usr/bin/python2.5

import imdb

i1 = imdb.IMDb('http')
i2 = imdb.IMDb('sql', 'sqlite:/share/media/imdb/sql.db')

test_title = 'The Motorcycle Diaries'
test_year = '2004'


def do_search(i, title, year):
  s = i.search_movie(title)
  print len(s), 'results'

  for m in s:
    if m['year'] == int(year) or m['year'] == year:
      i.update(m)
      print m['title'], m['year'], m.get('akas')

do_search(i1, test_title, test_year)
do_search(i2, test_title, test_year)



ne...@thune[03:02 PM]~(533)
PYTHONPATH=~/workspace/imdbpy/build/lib.linux-i686-2.5 ./u.py | grep
Diar
Diarios de motocicleta 2004 [u'Carnets de voyage::(France)', u'Die
Reise des jungen Che::(Germany)', u'The Motorcycle Diaries::(USA)',
u'Voyage \xe0 motocyclette::(France) (festival title)', u'Carnets de
voyage::(Canada: French title) [fr]', u'Dagbok fr\xe5n en
motorcykel::(Sweden) [sv]', u'Di\xe1rios de Motocicleta::(Brazil)
[pt]', u'Diari della motocicletta, I::(Italy) [it]', u'Die Reise des
jungen Che::(Austria) [de]', u'Dnevnik s motocikla::(Croatia) [hr]',
u'Dzienniki motocyklowe::(Poland) [pl]', u'Imerologia
motosikletas::(Greece) [el]', u'Jurnal de calatorie::(Romania) [ro]',
u'Moottoripy\xf6r\xe4p\xe4iv\xe4kirja::(Finland) [fi]', u'Motorcykel
dagbogen::(Denmark) [da]']
The Making of 'The Motorcycle Diaries' 2004 None
Diarios de motocicleta 2004 [u'Carnets de voyage (2004)::(France)',
u'Diari della motocicletta, I (2004)::(Italy)', u'Motorcycle Diaries,
The (2004)::(USA)', u'Reise des jungen Che, Die (2004)::(Germany)',
u'Voyage \xe0 motocyclette (2004)::(France) (festival title)']


Note:
http:  u'The Motorcycle Diaries::(USA)'
sql: u'Motorcycle Diaries, The (2004)::(USA)'


So, I wrote a routine to normalize those to get rid of ::(USA) and
(2004)::(USA), great.

But now I have to move that damned article around too?!?

Grrrrr

Hmmm... looks like that's a part of the original data set.

And it wouldn't be trivial to convert the data to put the article first:

Diary 2, The (1999) (TV)
   (aka Cahiers intimes 2 (1999) (TV))  (France) (dubbed version)
   (aka Tagebuch der Lust, Teil 2 (1999) (TV))  (Germany)


It looks like different languages have different ways of moving the
article to the end of the string.  Interesting, I never knew that.


Fortunately for me, I only care about English.

mrc

------------------------------------------------------------------------------
Check out the new SourceForge.net Marketplace.
It is the best place to buy or sell services for
just about anything Open Source.
http://p.sf.net/sfu/Xq1LFB
_______________________________________________
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help

Reply via email to