I forgot to mention the error message I receive when I run bibindex:

2008-06-19 10:51:43 --> ...updating 29562 words into idxWORD09F started
2008-06-19 10:51:43 --> Exception caught: (1267, "Illegal mix of collations (latin1_swedish_ci,IMPLICIT) and (utf8_general_ci,COERCIBLE)
for operation '='")
2008-06-19 10:51:43 --> Unexpected error occurred: (1267, "Illegal mix of collations (latin1_swedish_ci,IMPLICIT) and (utf8_general_ci,CO
ERCIBLE) for operation '='").
2008-06-19 10:51:43 --> Traceback is:
File "/usr/lib/python2.3/site-packages/invenio/bibtask.py", line 103, in task_init
   if not _task_run(task_run_fnc):
File "/usr/lib/python2.3/site-packages/invenio/bibtask.py", line 377, in _task_run
   if callable(task_run_fnc) and task_run_fnc():
File "/usr/lib/python2.3/site-packages/invenio/bibindex_engine.py", line 1422, in task_run_core
   _last_word_table.put_into_db()
File "/usr/lib/python2.3/site-packages/invenio/bibindex_engine.py", line 713, in put_into_db
   self.put_word_into_db(word)


Hossein Rafighi wrote:


Jerome Caffaro wrote:
Hi Hossein,

I did apply steps provided in both bibharvest and bibconvert admin pages to my test indico site and when I search http://docushare1.triumf.ca all I see is stuff loaded from the demo database. I don't see any entry for http://docushare1.triumf.ca

I see that your queue is in manual mode, and that there is no
BibHarvest task in the screenshot you have sent. Have you
scheduled/run this task?
No. I run it once as per the instruction:
>/opt/cds-invenio/bin/bibharvest -vListRecords -f2007-11-01 -u2008-06-02 -pmarcxml -o/tmp/z.xml http://docushare1.triumf.ca/oai.py Starting the harvesting session at 2008-06-19 08:47:56 --> docushare1.triumf.ca - verb=ListRecords&from=2007-11-01&until=2008-06-02&metadataPrefix=marcxml 200(OK) : OK : verb=ListRecords&from=2007-11-01&until=2008-06-02&metadataPrefix=marcxml
Harvesting successfully completed at: 2008-06-19 08:48:10 -->

I don't know how to add it to the scheduled tasks!

If so, I see that there is a failed BibUpload task. Could it be the
task generated by the harvesting? If yes, what does BibSched says when
you select this task and press L? And what about l (lowercase L)?
Pressing L I get:
2008-06-19 08:52:57 --> Task #31 started.
2008-06-19 08:52:57 --> Stemming is disabled for table idxWORD01F
2008-06-19 08:52:57 --> idxWORD01F contains 3622 words from 359 records
2008-06-19 08:52:57 --> idxWORD01F is in consistent state
2008-06-19 08:52:57 --> No new records added. idxWORD01F is up to date


Also after a harvesting WebColl and BibIndex should also run in order
see the records appear in the collections and index them. You might
then want to run them.
Here is a list of tasks I added:
/opt/cds-invenio/bin/bibsched start
/opt/cds-invenio/bin/bibindex -f50000 -s1h
/opt/cds-invenio/bin/bibreformat -oHB -s1h
/opt/cds-invenio/bin/webcoll -v0 -s1h
/opt/cds-invenio/bin/bibrank -f50000 -s1h
/opt/cds-invenio/bin/bibreformat -oHB -s1h


Finally could you send a screenshot of your BibHarvest configuration?
Attached, please find the requested screen shot.

Cheers,
Hossein


------------------------------------------------------------------------


--
 _____  _____   _____  _   _  _   _  ____ Hossein Rafighi
|_   _||  _  \ |_   _|| | | || \_/ ||  __|TRIUMF, 4004 Wesbrook Mall
  | |  | |_|  )  | |  | | | ||     || |__ Vancouver BC, Canada, V6T 2A3
  | |  |  _  /   | |  | \_/ || \_/ ||  __|Voice: (604) 222-1047
  | |  | | \ \  _| |_ |     || | | || |   Fax:   (604) 222-1074
  |_|  |_|  \_\|_____| \___/ |_| |_||_|   Website: http://www.triumf.ca


Reply via email to