Hub.xyz provides a training data API and infrastructure for managing datasets, annotations, and data pipelines in machine learning workflows. It is best suited to ML teams that need programmatic access to training data: dataset creation, annotation management, and data quality monitoring. Unlike model inference providers, Hub.xyz handles the data preparation stage of the ML lifecycle. One example endpoint is available through Lava’s AI Gateway.
This provider requires your own credentials — connect your API key or OAuth account before use.
This is a catch-all provider: any valid URL under https://api.hub.xyz is supported. Construct the URL as https://api.hub.xyz/{path}. The endpoint below is a curated example.
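As a minimal sketch of the https://api.hub.xyz/{path} pattern described above, the helper below joins a path onto the base URL. The name `hubUrl` is hypothetical and not part of any SDK; it only illustrates how a catch-all URL might be assembled before it is passed to the gateway.

```javascript
// Hypothetical helper (an assumption for illustration, not a Lava or Hub.xyz API).
const HUB_BASE = 'https://api.hub.xyz';

function hubUrl(path) {
  // Drop leading slashes so exactly one separator follows the host.
  const trimmed = String(path).replace(/^\/+/, '');
  // Prefixing the base keeps the result under api.hub.xyz even if `path`
  // itself looks like an absolute URL; the URL constructor also normalizes it.
  return new URL(`${HUB_BASE}/${trimmed}`).toString();
}
```

Both `hubUrl('v1/datasets')` and `hubUrl('/v1/datasets')` produce the same URL as the curated example endpoint.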

Endpoints

Manage training datasets

POST https://api.hub.xyz/v1/datasets — Free / request
const data = await lava.gateway('https://api.hub.xyz/v1/datasets', { body: {"dataset":"support-conversations"} });
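The call above could be wrapped in a small function that checks the dataset name before sending the request. This is only a sketch: `postDataset` is a hypothetical name, the `lava` client is passed in as a parameter, and the only SDK behavior assumed is the `lava.gateway(url, { body })` call shown in the example. The response shape is not documented here, so the sketch returns whatever the gateway call returns.

```javascript
// Hypothetical wrapper (an assumption for illustration, not part of the Lava SDK).
const DATASETS_URL = 'https://api.hub.xyz/v1/datasets';

function postDataset(lava, name) {
  // Reject empty or non-string names before any request is made.
  if (typeof name !== 'string' || name.trim() === '') {
    throw new TypeError('dataset name must be a non-empty string');
  }
  // Same URL and body shape as the curated example above.
  return lava.gateway(DATASETS_URL, { body: { dataset: name } });
}
```

With a configured client, usage would mirror the original call: `const data = await postDataset(lava, 'support-conversations');`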

Next Steps

All Providers

Browse all supported AI providers

Forward Proxy

Learn how to construct proxy URLs and authenticate requests