Package: wnpp Severity: wishlist * Package name : label-studio Version : 1.7.0 Upstream Author : Heartexlabs * URL : https://github.com/heartexlabs/label-studio * License : Apache 2.0 Programming Lang: Python Description : multi-type data labeling and annotation tool with standardized output format
Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats. It can be used to prepare raw data or improve existing training data to get more accurate ML models. In my case I am considering to try it for a project to automate data entry QC (see https://github.com/con/noisseur/issues/1) -- if you know anything like that already, please let me know. Notes: - that git repository is a bit "suboptimal" -- >1GB of .git/objects (likely mistakes made in prior history, checkout tree is only about 200MB with all the docs/ images per each release etc). watchout while cloning, might want a shallow clone