Label Studio overview

What is Label Studio?

Label Studio is an open-source data labeling tool that supports multiple projects, users, and data types in one platform. It allows you to do the following:

  • Perform different types of labeling with many data formats.

  • Integrate Label Studio with machine learning models to supply predictions for labels (pre-labels), or perform continuous active learning. See Set up machine learning with your labeling process.
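As a minimal sketch of what pre-labels can look like, the Python below builds one task in the JSON shape commonly used for pre-annotated imports: a `data` object plus a `predictions` list. The `keyword_sentiment` function, the `sentiment` and `text` names, and the `keyword-v0` version string are hypothetical placeholders; in a real project they would have to match your own model and labeling configuration.

```python
import json


def keyword_sentiment(text):
    """Hypothetical stand-in for a real model: returns (label, score)."""
    positive_words = {"good", "great", "love"}
    words = set(text.lower().split())
    if words & positive_words:
        return "Positive", 0.9
    return "Negative", 0.6


def make_prelabeled_task(text):
    """Wrap raw text and a model prediction into one pre-labeled task."""
    label, score = keyword_sentiment(text)
    return {
        "data": {"text": text},
        "predictions": [
            {
                "model_version": "keyword-v0",  # assumed version tag
                "score": score,
                "result": [
                    {
                        # Assumed control and object names; they must match
                        # the names used in the project's labeling interface.
                        "from_name": "sentiment",
                        "to_name": "text",
                        "type": "choices",
                        "value": {"choices": [label]},
                    }
                ],
            }
        ],
    }


task = make_prelabeled_task("I love this product")
print(json.dumps(task, indent=2))
```

Annotators would then see the predicted choice already selected and could accept or correct it, which is the point of supplying pre-labels.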

Interface


Project List Screenshot


Data Manager Screenshot


Quick View Screenshot

Labeling workflow

Start and finish a labeling project with Label Studio by following these steps:

  1. Create accounts for Label Studio. Create an account to manage and set up labeling projects.
  2. Restrict access to the project. Set up role-based access control (available only in Label Studio Enterprise Edition).
  3. Set up the labeling project. Define the type of labeling to perform on the dataset and configure project settings.
  4. Set up the labeling interface. Add the labels that you want annotators to apply and customize the labeling interface.
  5. Import data as labeling tasks.
  6. Label and annotate the data.
  7. Review the annotated tasks (available only in Label Studio Enterprise Edition).
  8. Export the labeled data or the annotations.
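Steps 4, 5, and 8 can be illustrated with a short Python sketch that uses only the standard library. It assumes a simple text classification project: the labeling configuration, the `sentiment` and `text` names, and the `sample_export` data are all hypothetical, and the export shape (tasks with an `annotations` list of `result` items) is an assumption about the JSON export rather than a full specification.

```python
import json
import xml.etree.ElementTree as ET
from collections import Counter

# Step 4: a labeling interface configuration (hypothetical example).
LABEL_CONFIG = """
<View>
  <Text name="text" value="$text"/>
  <Choices name="sentiment" toName="text">
    <Choice value="Positive"/>
    <Choice value="Negative"/>
  </Choices>
</View>
"""


def labels_in_config(config_xml):
    """Return the label values that annotators can choose from."""
    root = ET.fromstring(config_xml.strip())
    return [choice.get("value") for choice in root.iter("Choice")]


# Step 5: turn raw texts into tasks ready to be written to a JSON file.
def build_tasks(texts):
    return [{"data": {"text": t}} for t in texts]


# Step 8: tally the labels found in exported annotations.
def count_labels(exported_tasks):
    counts = Counter()
    for task in exported_tasks:
        for annotation in task.get("annotations", []):
            for item in annotation.get("result", []):
                counts.update(item.get("value", {}).get("choices", []))
    return counts


tasks = build_tasks(["Great service", "Too slow"])
tasks_json = json.dumps(tasks, indent=2)  # contents of an import file

# Hand-written sample standing in for a real export.
sample_export = [
    {"id": 1, "data": {"text": "Great service"},
     "annotations": [{"result": [{"from_name": "sentiment", "to_name": "text",
                                  "type": "choices",
                                  "value": {"choices": ["Positive"]}}]}]},
    {"id": 2, "data": {"text": "Too slow"},
     "annotations": [{"result": [{"from_name": "sentiment", "to_name": "text",
                                  "type": "choices",
                                  "value": {"choices": ["Negative"]}}]}]},
]

print(labels_in_config(LABEL_CONFIG))
print(count_labels(sample_export))
```

Keeping the configuration, the import file, and the export parser in agreement on the `sentiment` and `text` names is what ties the steps together; a mismatch is a common reason labels fail to appear where expected.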

Architecture

The components of Label Studio are available as modular, extensible packages that you can integrate into your existing machine learning processes and tools.

| Module | Technology | Description |
|---|---|---|
| Label Studio main app | Python and Django | The main app with most of the backend code for Label Studio; used to perform data labeling. |
| Label Studio frontend | JavaScript web app using React and MST | Located within the main app repo. web/apps/labelstudio acts as the central integration point for all frontend elements. web/libs/editor is the frontend library. |
| Data Manager | JavaScript web app using React | Manage data and tasks for labeling. Located under web/libs/datamanager in the main app repo. |
| Machine Learning Backends | Python | Predict data labels at various parts of the labeling process. |