Tasks

Explore ML tasks

Find the right models, datasets, and demos for each ML task area.

542,600 mapped task-model linksLive counts synced

Natural Language Processing

✍️

Text Generation

182k models

Generate text given a prompt, including open-ended generation and conditional generation.

🏷️

Text Classification

74k models

Classify text into predefined categories such as sentiment, topic, or intent.

🔠

Token Classification

22k models

Label individual tokens in a sequence — used for NER, POS tagging, and chunking.

❓

Question Answering

18k models

Extract or generate answers to questions based on a given context passage.

📝

Summarization

9k models

Produce a shorter version of a document while preserving key information.

🌐

Translation

15k models

Convert text from one natural language to another.

🎭

Fill-Mask

11k models

Predict masked tokens in a sequence — the core pre-training objective for BERT-style models.

📐

Sentence Similarity

9k models

Compute semantic similarity scores between pairs of sentences.

🔢

Feature Extraction

31k models

Extract dense vector embeddings from text for downstream tasks like search and clustering.

🎯

Zero-Shot Classification

5k models

Classify text into categories never seen during training using natural language labels.

Computer Vision

🖼️

Image Classification

42k models

Assign one or more labels to an input image from a fixed set of categories.

🔍

Object Detection

10k models

Locate and classify multiple objects within an image with bounding boxes.

✂️

Image Segmentation

8k models

Assign a class label to every pixel in an image (semantic or instance segmentation).

📏

Depth Estimation

2k models

Predict per-pixel depth maps from a single RGB image.

🔄

Image-to-Image

6k models

Transform or enhance an input image — style transfer, super-resolution, inpainting.

🎨

Text-to-Image

38k models

Generate photorealistic or artistic images from natural language prompts.

📄

Image-to-Text

6k models

Generate descriptive text captions or OCR output from images.

🎞️

Video Classification

3k models

Label video clips with action or event categories.

Audio

🎙️

Automatic Speech Recognition

13k models

Transcribe spoken audio to text — the core task for voice assistants and captioning.

🔊

Text-to-Speech

9k models

Convert written text to natural-sounding speech audio.

🔉

Audio Classification

5k models

Classify audio clips into categories such as music genre, speaker, or sound event.

🎵

Audio-to-Audio

2k models

Transform audio signals — noise reduction, speech enhancement, source separation.

Multimodal

🖼️❓

Visual Question Answering

4k models

Answer natural language questions about the content of an image.

📑

Document Question Answering

3k models

Extract answers from structured documents such as PDFs, forms, and charts.

🎬

Image-to-Video

1k models

Animate a still image into a short video clip.

🎥

Text-to-Video

2k models

Generate video from natural language descriptions.

Tabular & Other

📊

Tabular Classification

4k models

Predict categorical labels from structured tabular features.

📈

Tabular Regression

3k models

Predict continuous numerical targets from structured tabular features.

🤖

Reinforcement Learning

4k models

Train agents to maximize rewards through interaction with an environment.

🕸️

Graph Machine Learning

1k models

Learn from graph-structured data — node classification, link prediction, graph classification.