Fulltext is a simple Python library for converting document and media files to text. Its main purpose is to feed full-text indexing systems.
https://github.com/btimby/fulltext
http://pypi.python.org/pypi/fulltext/0.1-1

For example, to easily extract text from a PDF file:

    > python
    >>> import fulltext
    >>> fulltext.get('resume.pdf')
    'Experience: ...'
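
Since the library is aimed at full-text indexing, here is a minimal sketch of how extracted text might be fed into a toy inverted index. The names build_index and extract are hypothetical and not part of fulltext; the only assumption about the library is that fulltext.get(path) returns the document text as a string, as in the example above.

```python
import re
from collections import defaultdict


def build_index(paths, extract):
    """Map each lowercase word to the set of paths whose text contains it.

    `extract` is any callable that takes a path and returns its text,
    for instance fulltext.get (assumed to return a string).
    """
    index = defaultdict(set)
    for path in paths:
        # Treat a missing or empty extraction result as an empty document.
        text = extract(path) or ""
        for word in re.findall(r"\w+", text.lower()):
            index[word].add(path)
    return index
```

With the library installed, build_index(['resume.pdf'], fulltext.get) would index that file, and index['experience'] would then list the paths containing that word. A real indexing system would add stemming, stop words and persistent storage; this only illustrates where the extraction step fits.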