Minamoto
Public catalog in preparation

Dataset catalog

Our public catalog is in preparation. Until then, tell us what you need — most of our work today is bespoke collection designed for a specific model.

Modalities and use cases we cover

Autonomous driving

Autonomous driving

Roads, signage, vehicles, pedestrians — captured across weather and time of day.

Physical AI & robotics

Physical AI & robotics

Household tasks, manipulation, assembly — first-person video with sensor logs.

LLMs & conversational AI

LLMs & conversational AI

Natural conversation, prompts, and long-form documents.

Speech recognition & TTS

Speech recognition & TTS

Diverse speakers, dialects, expressions — captured at studio quality.

Vision-language & multimodal

Vision-language & multimodal

Paired image-caption and video-transcript data to accelerate VLM training.

Image & video models

Image & video models

Curated data for specific scenes, styles, and subjects.

Request a dataset

Tell us about the dataset you're looking for — modality, scale, target task, timeline — and we'll come back with options.

Request a dataset