[issue2027] Module containing C implementations of common text algorithms
Mark Lawrence breamore...@yahoo.co.uk added the comment: I'll close this as suggested in msg106281 in a couple of weeks unless someone objects. -- nosy: +BreamoreBoy status: open -> pending ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue2027 ___
[issue2027] Module containing C implementations of common text algorithms
Changes by Benjamin Peterson benja...@python.org: -- resolution: -> rejected status: pending -> closed
[issue2027] Module containing C implementations of common text algorithms
Matthew Barnett pyt...@mrabarnett.plus.com added the comment: I've started on a module called 'texttools'. So far it has Levenshtein and Porter (both coded in C). If there's interest I'll put it on PyPI. Suggestions for other additions? -- nosy: +mrabarnett
[issue2027] Module containing C implementations of common text algorithms
Changes by Daniel Diniz aja...@gmail.com: -- components: +Extension Modules, Unicode -Library (Lib) priority: normal -> low stage: -> test needed versions: +Python 2.7, Python 3.2 -Python 2.6
[issue2027] Module containing C implementations of common text algorithms
Georg Brandl added the comment: Even PHP includes Levenshtein... ;) -- nosy: +georg.brandl
[issue2027] Module containing C implementations of common text algorithms
Amaury Forgeot d'Arc added the comment: I don't think that this should be part of the core standard library. Did you look at the TextIndexNG project? http://opensource.zopyx.com/projects/TextIndexNG3/ -- nosy: +amaury.forgeotdarc
[issue2027] Module containing C implementations of common text algorithms
Christian Heimes added the comment: I agree with Amaury. Python uses the slogan "batteries included" and not "fusion reactor included". We cannot, and never will, include every library that may be useful for some users. Python core's development cycles are too slow for fast-moving software. Andreas' TXNG3 contains fine implementations for stemming and Levenshtein. -- nosy: +tiran priority: -> normal versions: +Python 2.6
[issue2027] Module containing C implementations of common text algorithms
Matt Chaput added the comment: The Porter stemming and Levenshtein edit-distance algorithms are neither fast-moving nor fusion reactors... they've been around forever, and are simple to implement, but are still useful in various common scenarios. I'd say this is similar to Python including an implementation of digest functions such as SHA: it's useful enough, and compute-intensive enough, to warrant a C implementation. Shipping C extensions is not an option for everyone; it's especially a pain with Windows.
[issue2027] Module containing C implementations of common text algorithms
New submission from Matt Chaput: Add a module to the standard library containing fast (C) implementations of common text/language-related algorithms, to begin with specifically Porter (and perhaps other) stemming and Levenshtein (and perhaps other) edit distance. Both these algorithms are useful in multiple domains, well known and understood, and have sample implementations all over the Web, but are compute-intensive and prohibitively expensive when implemented in pure Python. -- components: Library (Lib) messages: 62134 nosy: mchaput severity: normal status: open title: Module containing C implementations of common text algorithms type: rfe
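For reference, the Levenshtein edit distance named in the submission is the minimum number of single-character insertions, deletions, and substitutions needed to turn one string into another. Below is a minimal pure-Python sketch of the usual dynamic-programming formulation. It is not the code proposed in this issue or shipped by any module mentioned in the thread, and the function name `levenshtein` is chosen here purely for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (illustrative sketch, not from the thread)."""
    # Keep the shorter string on the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a

    # previous[j] holds the distance between the prefix of a seen so far
    # and the first j characters of b. Start with the empty prefix of a.
    previous = list(range(len(b) + 1))

    for i, char_a in enumerate(a, start=1):
        # Turning the first i characters of a into "" takes i deletions.
        current = [i]
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # substitute or match
            ))
        previous = current

    return previous[-1]


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
```

The nested loops do work proportional to len(a) * len(b), all of it in interpreted bytecode, which is the kind of cost the submission argues a C implementation would avoid.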