I forgot to mention the error message I receive when I run bibindex:
2008-06-19 10:51:43 --> ...updating 29562 words into idxWORD09F started
2008-06-19 10:51:43 --> Exception caught: (1267, "Illegal mix of
collations (latin1_swedish_ci,IMPLICIT) and (utf8_general_ci,COERCIBLE)
for operation '='")
2008-06-19 10:51:43 --> Unexpected error occurred: (1267, "Illegal mix
of collations (latin1_swedish_ci,IMPLICIT) and (utf8_general_ci,CO
ERCIBLE) for operation '='").
2008-06-19 10:51:43 --> Traceback is:
File "/usr/lib/python2.3/site-packages/invenio/bibtask.py", line 103,
in task_init
if not _task_run(task_run_fnc):
File "/usr/lib/python2.3/site-packages/invenio/bibtask.py", line 377,
in _task_run
if callable(task_run_fnc) and task_run_fnc():
File "/usr/lib/python2.3/site-packages/invenio/bibindex_engine.py",
line 1422, in task_run_core
_last_word_table.put_into_db()
File "/usr/lib/python2.3/site-packages/invenio/bibindex_engine.py",
line 713, in put_into_db
self.put_word_into_db(word)
Hossein Rafighi wrote:
Jerome Caffaro wrote:
Hi Hossein,
I did apply steps provided in both bibharvest and bibconvert admin
pages to my test indico site
and when I search http://docushare1.triumf.ca all I see is stuff
loaded from the demo database.
I don't see any entry for http://docushare1.triumf.ca
I see that your queue is in manual mode, and that there is no
BibHarvest task in the screenshot you have sent. Have you
scheduled/run this task?
No. I run it once as per the instruction:
>/opt/cds-invenio/bin/bibharvest -vListRecords -f2007-11-01
-u2008-06-02 -pmarcxml -o/tmp/z.xml http://docushare1.triumf.ca/oai.py
Starting the harvesting session at 2008-06-19 08:47:56 -->
docushare1.triumf.ca -
verb=ListRecords&from=2007-11-01&until=2008-06-02&metadataPrefix=marcxml
200(OK) : OK :
verb=ListRecords&from=2007-11-01&until=2008-06-02&metadataPrefix=marcxml
Harvesting successfully completed at: 2008-06-19 08:48:10 -->
I don't know how to add it to the scheduled tasks!
If so, I see that there is a failed BibUpload task. Could it be the
task generated by the harvesting? If yes, what does BibSched says when
you select this task and press L? And what about l (lowercase L)?
Pressing L I get:
2008-06-19 08:52:57 --> Task #31 started.
2008-06-19 08:52:57 --> Stemming is disabled for table idxWORD01F
2008-06-19 08:52:57 --> idxWORD01F contains 3622 words from 359 records
2008-06-19 08:52:57 --> idxWORD01F is in consistent state
2008-06-19 08:52:57 --> No new records added. idxWORD01F is up to date
Also after a harvesting WebColl and BibIndex should also run in order
see the records appear in the collections and index them. You might
then want to run them.
Here is a list of tasks I added:
/opt/cds-invenio/bin/bibsched start
/opt/cds-invenio/bin/bibindex -f50000 -s1h
/opt/cds-invenio/bin/bibreformat -oHB -s1h
/opt/cds-invenio/bin/webcoll -v0 -s1h
/opt/cds-invenio/bin/bibrank -f50000 -s1h
/opt/cds-invenio/bin/bibreformat -oHB -s1h
Finally could you send a screenshot of your BibHarvest configuration?
Attached, please find the requested screen shot.
Cheers,
Hossein
------------------------------------------------------------------------
--
_____ _____ _____ _ _ _ _ ____ Hossein Rafighi
|_ _|| _ \ |_ _|| | | || \_/ || __|TRIUMF, 4004 Wesbrook Mall
| | | |_| ) | | | | | || || |__ Vancouver BC, Canada, V6T 2A3
| | | _ / | | | \_/ || \_/ || __|Voice: (604) 222-1047
| | | | \ \ _| |_ | || | | || | Fax: (604) 222-1074
|_| |_| \_\|_____| \___/ |_| |_||_| Website: http://www.triumf.ca