Task Catalog

This page is the searchable public catalog of benchmark tasks.

Public Catalog

The current view is driven directly from the generated task_catalog.json file. It includes only the public test subset and supports lightweight search and domain filtering.
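
As a rough illustration of what search and domain filtering over the catalog could look like, here is a minimal Python sketch. The field names (`id`, `title`, `domain`, `split`), the `public_test` value, and the `filter_catalog` helper are all assumptions made for this example. The actual schema of task_catalog.json is not described on this page and may differ.

```python
import json

# Hypothetical entries for illustration only; the real task_catalog.json
# schema and values are not documented here and may differ.
SAMPLE_CATALOG = json.loads("""
[
  {"id": "task-001", "title": "Parse server logs", "domain": "data", "split": "public_test"},
  {"id": "task-002", "title": "Fix failing build", "domain": "devops", "split": "public_test"},
  {"id": "task-003", "title": "Summarize log anomalies", "domain": "data", "split": "private_test"}
]
""")


def filter_catalog(tasks, query="", domain=None, split="public_test"):
    """Return tasks in `split` that match a substring query and optional domain.

    The query is matched case-insensitively against the (assumed) id and
    title fields. An empty query matches every task.
    """
    needle = query.strip().lower()
    results = []
    for task in tasks:
        # Keep only the requested subset (assumed field name: "split").
        if task.get("split") != split:
            continue
        # Apply the domain filter only when one was requested.
        if domain is not None and task.get("domain") != domain:
            continue
        haystack = f"{task.get('id', '')} {task.get('title', '')}".lower()
        if needle and needle not in haystack:
            continue
        results.append(task)
    return results


if __name__ == "__main__":
    for task in filter_catalog(SAMPLE_CATALOG, query="log", domain="data"):
        print(task["id"], task["title"])
```

A plain substring match keeps the search lightweight: it needs no index and runs in a single pass over the catalog, which is usually enough for a catalog small enough to ship as one JSON file.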

Data Source

The catalog is generated from task_catalog.json, which is built from structured task metadata in the main benchmark repository.