Label Studio is an open source data labeling tool for labeling and exploring multiple types of data. It allows you to do the following:
- Perform different types of labeling with many data formats.
- Integrate Label Studio with machine learning models to supply predictions for labels (pre-labels), or perform continuous active learning. See Set up machine learning with your labeling process.
Label Studio is also available in Enterprise and Cloud editions with additional features. For more information, see the Label Studio features page.
When you upload data to Label Studio, each item in the dataset becomes a labeling task. The following table describes some terms you might encounter as you use Label Studio.
|Dataset||What you import into Label Studio, comprised of individual items, or labeling tasks.|
|Task||A distinct item from a dataset that is ready to be labeled, pre-annotated, or has already been annotated. For example: a sentence of text, an image, or a video clip.|
|Region||The portion of the task identified for labeling. For images, an example region is a bounding box. For text, an example region is a span of text. Often has a label assigned to it.|
|Labels||What you add to each region while labeling a task in Label Studio.|
|Relation||A defined relationship between two labeled regions.|
|Result||A label applied to a specific region as stored in an annotation or prediction. See Label Studio JSON format of annotated tasks.|
|Annotations||The output of a labeling task. Previously called “completions”.|
|Annotations in Label Studio format that machine learning models create for an unlabeled dataset. See import pre-annotations|
|Templates||Example labeling configurations that you can use to specify the type of labeling that you’re performing with your dataset. See all available templates|
|Tags||Configuration options to customize the labeling interface. See more about tags.|
Start and finish a labeling project with Label Studio by following these steps:
- Install Label Studio.
- Start Label Studio.
- Create accounts for Label Studio. Create an account to manage and set up labeling projects.4. [Restrict access to the project](manage_users.html). Set up role-based access control. Only available in Label Studio Enterprise Edition.
- Set up the labeling project. Define the type of labeling to perform on the dataset and configure project settings.
- Set up the labeling interface. Add the labels that you want annotators to apply and customize the labeling interface.
- Import data as labeling tasks.
- Label and annotate the data. 9. [Review the annotated tasks](quality.html). Only available in Label Studio Enterprise Edition.
- Export the labeled data or the annotations.
You can use any of the Label Studio components in your own tools, or customize them to suit your needs. Before customizing Label Studio extensively, you might want to review Label Studio Enterprise Edition to see if it already contains the relevant functionality you want to build. See Label Studio Features for more.
The component parts of Label Studio are available as modular extensible packages that you can integrate into your existing machine learning processes and tools.
|Label Studio Backend||Python and Django||Use to perform data labeling.|
|Machine Learning Backends||Python||Predict data labels at various parts of the labeling process.|
Label Studio collects anonymous usage statistics about the number of page visits and data types being used in labeling configurations that you set up. No sensitive information is included in the information we collect. The information we collect helps us improve the experience of labeling data in Label Studio and helps us plan future data types and labeling configurations to support.