Hi, any clues?  How can I find & delete the problematic record(s) ?

-----Original Message-----
From: KU Kam-ming [mailto:[email protected]] 
Sent: Thursday, June 18, 2009 8:50 AM
To: 'Tibor Simko'
Cc: 'project-cdsware-users (CDSware users list.)'
Subject: RE: bibindex error

It works fine before, but somehow, it fails after a few more records are
entered.

BibIndex Task Submission
========================
Username: admin
Password:
2009-06-18 08:41:42 --> storing task options ['./bibindex', '-v', '9',
'-f50000', '-s1h']

/usr/lib/python2.4/site-packages/invenio/dbquery.py:227: Warning: Field
'host' doesn't have a default value
  rc = cur.execute(sql, param)
2009-06-18 08:41:42 --> Task #86 submitted.
[root@ecdl bin]# ./bibindex 86
2009-06-18 08:41:47 --> argv=['./bibindex', '-v', '9', '-f50000', '-s1h']
2009-06-18 08:41:47 --> _options={'reindex': False, 'cmd': 'add',
'modified': [], 'collection': [], 'flush': 50000, 'maxmem': 0, 'id': [],
'windex': None}
2009-06-18 08:41:47 --> _task_params={'task_stop_helper_fnc': <function
task_stop_table_close_fnc at 0x8723b54>, 'verbose': 9, 'task_id': 86,
'sleeptime': '1h', 'priority': 0, 'task_specific_name': '', 'version': '$Id:
bibindex_engine.py,v 1.72 2008/06/17 14:34:50 tibor Exp $', 'user': '',
'task_name': 'bibindex', 'runtime': '2009-06-18 08:41:47'}
2009-06-18 08:41:47 --> Task #86 started.
2009-06-18 08:41:47 --> Updating task status to RUNNING.
2009-06-18 08:41:47 --> Updating task status to ERROR.
2009-06-18 08:41:47 --> Task #86 finished but not resubmitted. [ERROR]



>>> Going to detect system details...
* Hostname: ecdl.hkuspace.hku.hk
* Invenio version: 0.99.1
* Python version: 2.4.3 (#1, Jan 14 2008, 18:32:40)  [GCC 4.1.2 20070626
(Red Hat 4.1.2-14)]
* Apache version: Apache/2.2.3 (Red Hat) (Release 11.el5_1.3)
[/usr/sbin/httpd];
                  Apache/2.2.3 (Red Hat) (Release 11.el5_1.3)
[/usr/sbin/httpd.worker]
* mod_python version: 3.2.8
* MySQLdb version: 1.2.1_p2
* MySQL version:
    - version: 5.0.45
    - character_set_client: utf8
    - character_set_connection: utf8
    - character_set_database: utf8
    - character_set_results: utf8
    - character_set_server: latin1
    - character_set_system: utf8
    - collation_connection: utf8_general_ci
    - collation_database: utf8_general_ci
    - collation_server: latin1_swedish_ci
>>> System details detected successfully.







-----Original Message-----
From: Tibor Simko [mailto:[email protected]] 
Sent: Wednesday, June 17, 2009 11:54 PM
To: KU Kam-ming
Cc: project-cdsware-users (CDSware users list.)
Subject: Re: bibindex error

On Wed, 17 Jun 2009, KU Kam-ming wrote:
>     query = query.encode(charset)
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 61:
> ordinal not in range(128)
>
> It seems that it cannot index a record with invalid 'code' , how can I
> determine which record and delete it!?

Please use the verbose option of bibindex:

  $ bibindex -v 9 [...]

Otherwise here are some checklist ideas: what are your versions of CDS
Invenio, MySQL, and MySQLdb?  Is your MySQLdb version at least 1.2.1_p2
as recommended in the INSTALL file?  Are your MySQL tables defined with
DEFAULT CHARACTER SET utf8?  Can you send us the output of ``inveniocfg
--detect-system-details''?  For example, here is an output from a SLC5
(aka RHEL5) box:

$ /opt/cds-invenio/bin/inveniocfg --detect-system-details
>>> Going to detect system details...
* Hostname: foo.bar.com
* Invenio version: 0.99.1.20090602
* Python version: 2.4.3 (#1, Jan 22 2009, 20:23:44)  [GCC 4.1.2 20071124
(Red Hat 4.1.2-42)]
* Apache version: Apache/2.2.3 (Red Hat) (Release 22.el5) [/usr/sbin/httpd];
                  Apache/2.2.3 (Red Hat) (Release 22.el5)
[/usr/sbin/httpd.event];
                  Apache/2.2.3 (Red Hat) (Release 22.el5)
[/usr/sbin/httpd.worker]
* mod_python version: 3.3.1
* MySQLdb version: 1.2.2
* MySQL version:
    - version: 5.0.45-log
    - character_set_client: utf8
    - character_set_connection: utf8
    - character_set_database: utf8
    - character_set_results: utf8
    - character_set_server: latin1
    - character_set_system: utf8
    - collation_connection: utf8_general_ci
    - collation_database: utf8_general_ci
    - collation_server: latin1_swedish_ci
>>> System details detected successfully.

Best regards
-- 
Tibor Simko ** CERN Document Server ** <http://cds.cern.ch/>

Reply via email to