Hi Andreas,

 

Thanks for your response. It’s very helpful!



As you say, the problem probably is in individual records in the results for 
"[χ.ό.]". Do you see the error in the OPAC immediately after searching, or is 
it on a certain page of results? If it’s on a certain page of results, we might 
be able to narrow down the problem further.

 

When using “show” in yaz-client, are you able to view every single record? You 
might try using “format xml” in yaz-client if you’re not already doing so, as 
that might help problems to surface. 

 

How many records are in your Koha database overall? Depending on how technical 
you are, you might consider trying the MARC checker plugin ( 
<https://github.com/bywatersolutions/koha-plugin-marc-checker> 
https://github.com/bywatersolutions/koha-plugin-marc-checker), or writing your 
own script to iterate through all the MARCXML records in your database and try 
to create MARC::Record objects from them. If the problem is with the record 
itself, that’s a good way of discovering which one(s) are at fault.

 

If you’re having issues with results for "[χ.ό.]", it’s probably a safe 
assumption that there’s other problems with records in the database, so 
scanning all the records is probably a good idea. If you have a very large 
database, you can break it up into chunks using biblionumber. 

 

 

 

David Cook

Systems Librarian

 

Prosentient Systems

72/330 Wattle St

Ultimo, NSW 2007

 

Office: 02 9212 0899

Direct: 02 8005 0595

 

From: Andreas Roussos [mailto:arouss1...@gmail.com] 
Sent: Tuesday, 17 May 2016 5:15 PM
To: David Cook <dc...@prosentient.com.au>
Cc: koha@lists.katipo.co.nz
Subject: Re: [Koha] OPAC searches fail with ICU indexing enabled

 

Hi David,

Thank you for your reply. Please see my answers inline below:

 

On Tue, May 17, 2016 at 4:51 AM, David Cook <dc...@prosentient.com.au 
<mailto:dc...@prosentient.com.au> > wrote:

Hi Andreas,

This problem looks a little familiar. I have a few questions.

You find 335 records using yaz-client. Are you able to view those records using 
"show" in yaz-client? 


Yes, I can view the records using "show", or "show 1", "show 42" etc.

 

Also where are you seeing the following error:

> Error:
> :8: parser error : Input is not proper UTF-8, indicate encoding !
> Bytes: 0xCF 0x3C 0x2F 0x74 ος Τιμόθεον Α΄-Ï€Ï Î¿Ï‚
> Τιμόθεον Β΄-Ï€Ï Î¿Ï‚ Τίτον-Ï€ ^

Is that in a file in your /var/log/koha/imp directory?

 

No, this is actually displayed in my web browser when searching in OPAC.

 

Also, those instructions at 
https://wiki.koha-community.org/wiki/Correcting_Search_of_Arabic_records look a 
bit suboptimal...

Are you using packages? Did you run the following?

sudo koha-restart-zebra {yourinstance}
sudo koha-rebuild-zebra -f {yourinstance}

 

Yes, I'm using packages and I've run both zebra commands.

 

That parser error doesn't look super helpful... using Windows-1251 0xCF is Ï, 
3C is <, / is 2F. With UTF-8, χ is 0xce 0xa7 and ό is 0xce 0x8c. So there isn't 
a clear relation there. If I had to guess, I'd say that Zebra thinks it's using 
ICU and UTF-8 but the data is still stored as Latin-1.

 

What I find odd is that other searches in OPAC for Greek characters work fine 
and return records (for example searching for "[α.β.]", "[α.ό.]"). It looks as 
if there's something contained in the results for "[χ.ό.]" that causes the 
failure.

 

Failing that... I have some other more in-depth troubleshooting ideas.

 

I'd be more than happy to hear those :-)
 

David Cook
Systems Librarian

Prosentient Systems
72/330 Wattle St
Ultimo, NSW 2007

Office: 02 9212 0899
Direct: 02 8005 0595

 

Regards,
Andreas

_______________________________________________
Koha mailing list  http://koha-community.org
Koha@lists.katipo.co.nz
https://lists.katipo.co.nz/mailman/listinfo/koha

Reply via email to