I just about have this extension running. It sure would benefit from MWSearch's improved unicode handling:
*** WARNING: Funny characters in title SchrauwenD???HaeneVerstraetenEtAl07 ************************ 14000 row(s) processed ************************ 14100 row(s) processed ************************ 14200 row(s) processed *** WARNING: Funny characters in title SerinoGiovagnoliL??davas09 ************************ 14300 row(s) processed ************************ 14400 row(s) processed ************************ 14500 row(s) processed ************************ 14600 row(s) processed ************************ 14700 row(s) processed ************************ 14800 row(s) processed ************************ 14900 row(s) processed ************************ 15000 row(s) processed ************************ 15100 row(s) processed *** WARNING: Funny characters in title StruffertK??hrmannEngelhornEtAl09 ************************ 15200 row(s) processed ************************ 15300 row(s) processed *** WARNING: Funny characters in title TamosiunaiteAsfourW??rg??tter09 ************************ 15400 row(s) processed *** WARNING: Funny characters in title TanakaBalleineO???Doherty08 ************************ 15500 row(s) processed ************************ 15600 row(s) processed ************************ 15700 row(s) processed ************************ 15800 row(s) processed ************************ 15900 row(s) processed *** WARNING: Funny characters in title UrbanoLeznikLlin??s07 *** WARNING: Funny characters in title ValentinDickinsonO???Doherty07 ************************ 16000 row(s) processed ************************ 16100 row(s) processed *** WARNING: Funny characters in title VerstraetenSchrauwenD??HaeneEtAl07 ************************ 16200 row(s) processed ************************ 16300 row(s) processed ************************ 16400 row(s) processed ************************ 16500 row(s) processed ************************ 16600 row(s) processed ************************ 16700 row(s) processed *** WARNING: Funny characters in title WikiPapers/log/Kov????csMehler09 ************************ 16800 row(s) processed *** WARNING: Funny characters in title WikiPapers/log/SerinoGiovagnoliL??davas09 *** WARNING: Funny characters in title WikiPapers/log/SotoFunesGuzm????n-Garc????aEtAl09 *** WARNING: Funny characters in title WikiPapers/log/TanakaBalleineO???Doherty08 *** WARNING: Funny characters in title WikiPapers/log/ValentinDickinsonO???Doherty07 *** WARNING: Funny characters in title WikiPapers/log/WinklerH??denLadinigEtAl *** WARNING: Funny characters in title WinklerH??denLadinigEtAl ************************ 16900 row(s) processed ************************ 17000 row(s) processed ************************ 17100 row(s) processed ************************ 17200 row(s) processed ************************ 17300 row(s) processed ************************ 17400 row(s) processed ************************ 17500 row(s) processed *** WARNING: Funny characters in title BrouilletCond??BealEtAl99.pdf ************************ 17600 row(s) processed *** WARNING: Funny characters in title Carrillo-ReidTecuapetlaIb????ez-SandovalEtAl09.pdf *** WARNING: Funny characters in title CepedaWuAndr??EtAl07.pdf On Thu, May 7, 2009 at 1:33 PM, Chris Reigrut <[email protected]> wrote: > I'd like to announce the first release of EzMwLucene. This project > provides a simplified Lucene search to Mediawiki. It is designed to be > easy to install, configure, and run. It provides real-time, multiple > field indexing and searching as well as text indexing of standard > attachment types (pdf, xls, doc, ppt, vsd). The server is a self > contained Java application (no application server needed), and the > client portion is a standard Mediawiki extension. It is currently in > production on an internal site with over 1000 users running on Mediawiki > 1.13. > > https://sourceforge.net/projects/ezmwlucene/ > > I welcome all feedback: questions, suggestions and offers to help > improve it! > > > _______________________________________________ > MediaWiki-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/mediawiki-l > _______________________________________________ MediaWiki-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
