On 30/11/11 17:04, Marvin Humphrey wrote:
The script likely belongs in trunk/devel/bin.

The file with the generated tables could arguably go in a few different
places.  I would suggest either trunk/core/Lucy/Analysis/WordBreakTables.c
if the tables are specialized, or trunk/core/Lucy/Util/UnicodeProperties.c if
we anticipate adding more tables in the future.

OK, things are getting a little more complicated. I'd also like to generate some #defines along with the tables, so I could either generate a separate .h file, or I could simply create a single .c file that gets included by another .c file. This is not very tasteful but it would simplify things.

Another question: The perl script that generates the tables uses text files from http://www.unicode.org/Public/UNIDATA/. Should we bundle these files with Lucy?

Nick

Reply via email to