As per your request:
Tibor Simko wrote:
Can you please send me the result of:
$ inveniocfg --detect-system-details
>>> Going to detect system details...
* Hostname: lin32
* Invenio version: 0.99.0
* Python version: 2.3.4 (#1, Feb 7 2008, 19:02:00) [GCC 3.4.5 20051201
(Red Hat 3.4.5-2)]
* Apache version: Apache/2.0.52 (Scientific Linux) (Release 38.sl4.2)
[/usr/sbin/httpd];
Apache/2.0.52 (Scientific Linux) (Release 38.sl4.2)
[/usr/sbin/httpd.worker]
* mod_python version: mod_python/3.1.3
[/usr/lib/httpd/modules/mod_python.so]
* MySQLdb version: 1.2.1_p2
* MySQL version:
- version: 4.1.20
- character_set_client: utf8
- character_set_connection: utf8
- character_set_database: latin1
- character_set_results: utf8
- character_set_server: latin1
- character_set_system: utf8
- collation_connection: utf8_general_ci
- collation_database: latin1_swedish_ci
- collation_server: latin1_swedish_ci
>>> System details detected successfully.
When you have created your database, did you well specified the UTF-8
charset? You can confirm it by doing:
$ echo "SHOW CREATE DATABASE cdsinvenio" | /opt/cds-invenio/bin/dbexec
or:
$ echo "SHOW CREATE TABLE idxWORD09F" | /opt/cds-invenio/bin/dbexec
Yes. That is correct: --default-character-set="utf8"
Also, what is your version of pdftotext?
$ pdftotext -v
pdftotext version 3.02
P.S. BTW, I noticed that your site seems to be called "Invenio Test Site
at TRIUMF", but the home collection is called "Atlantis Institute
of Fictive Science". You may want to edit your invenio-local.conf
as necessary, e.g.:
CFG_SITE_NAME = Invenio Test Site at TRIUMF
CFG_SITE_NAME_INTL_en = Invenio Test Site at TRIUMF
[...]
and then to rerun:
$ inveniocfg --update-all --reset-all
$ webcoll -u admin -f
Oops! Done too.
Thanks,
Hossein
--
_____ _____ _____ _ _ _ _ ____ Hossein Rafighi
|_ _|| _ \ |_ _|| | | || \_/ || __|TRIUMF, 4004 Wesbrook Mall
| | | |_| ) | | | | | || || |__ Vancouver BC, Canada, V6T 2A3
| | | _ / | | | \_/ || \_/ || __|Voice: (604) 222-1047
| | | | \ \ _| |_ | || | | || | Fax: (604) 222-1074
|_| |_| \_\|_____| \___/ |_| |_||_| Website: http://www.triumf.ca