From: "Marvin Humphrey" <[email protected]>
> On Sun, Aug 07, 2011 at 11:56:54PM +0300, Octavian Rasnita wrote:
>> Is Lucy able to index and search for UTF-8 encoded documents? (because if I
>> understood right, KinoSearch can't do that.)
> 
> That problem afflicts KinoSearch 0.1x, but not KinoSearch 0.3x or Lucy.


This is great! I read that KinoSearch 0.3x supports Romanian - the language I 
need right now, and I hope Lucy also supports it.

I have tried to install KinoSearch 0.3, but I wasn't able to do it using CPAN 
because it gave the error below, so I will definitely need to wait for the new 
Lucy release:
error building dll file from 'core/KinoSearch/Test.c' at 
E:/usr/site/lib/ExtUtils/CBuilder/Platform/Windows.pm line 130, <DATA>
CREAMYG/KinoSearch-0.313.tar.gz 


>> And, is it possible to run 2 or more separate processes in parallel that
>> index new data in the same time?
> 
> There is a single write lock, which is held from Indexer->new through
> Indexer->commit.  Multiple processes attempting to write to the same index
> will likely experience lock contention.
> 
> There are multiple strategies for managing this limitation, such as queuing
> or utilizing Lucy::Index::BackgroundMerger, but the limitation remains.


Well, I made a test and I indexed more than 100.000 documents in less than 10 
minutes using KinoSearch 0.1 so it works very fast for indexing too (for my 
needs).
In conclusion, it is not a big problem that a single process can do the index.

Octavian

Reply via email to