Greetings Apertiumers! It occurred to me recently that one of the slowest parts of dictionaries to compile (regular expressions) tends to overlap the parts that are most often similar across different languages (email addresses, dates, numbers), so I added a tool for merging binary files and have started gathering shareable entries into https://github.com/apertium/apertium-shared
It currently has web stuff (emails, urls, filenames, etc - copied from apertium-cat) and punctuation. If you want to try it out, you can add it to a language with In configure.ac: PKG_CHECK_MODULES(APERTIUM_SHARED, apertium-shared >= 0.0.1) PKG_CHECK_VAR([SHARESRC], [apertium-shared], [srcdir]) In Makefile.am: $(LANG1).automorf.bin: .deps/$(LANG1).automorf.bin $(SHARESRC)/shared-mono-lr.bin lt-append -k $^ $@ And rename existing $(LANG1).automorf.bin to .deps/$(LANG1).automorf.bin and similarly for autogen.bin and shared-mono-rl.bin. Let me know if you have any questions or suggestions (I'll probably add dates and a bidix component, I just haven't gotten there yet). Daniel
_______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff