Package: wnpp Severity: wishlist Owner: Aryan Karamtoth <[email protected]> X-Debbugs-Cc: [email protected]
* Package name : markitdown Version : 0.1.4 Upstream Author: Microsoft * URL : https://github.com/microsoft/markitdown * License : MIT Programming Lang: Python Description : Python tool for converting files and office documents to Markdown. MarkItDown is a lightweight Python utility for converting various files to Markdown for use with LLMs and related text analysis pipelines. To this end, it is most comparable to textract<https://github.com/deanmalmgren/textract>, but with a focus on preserving important document structure and content as Markdown (including: headings, lists, tables, links, etc.) While the output is often reasonably presentable and human-friendly, it is meant to be consumed by text analysis tools -- and may not be the best option for high-fidelity document conversions for human consumption. This is a dependency for Alpaca. Me and Arian Ott will be maintaining this in the Python team

