Models directory

Open models by workload and deployment path

Browse models by the work they are likely to power: reasoning, coding, multimodal understanding, OCR, local inference, and self-hosted agent stacks.

Pick the workload first

Reasoning, coding, OCR, long context, and multimodal tasks reward different model families.

Check the serving path

Open weights, hosted API access, local runtimes, and self-hosting change cost and data control.

Evaluate on your prompts

Use your own agent traces, latency budget, and license constraints before trusting public rankings.

Reasoning and coding models

Models to test for planning, code changes, technical review, and tool-heavy agent prompts.

Multimodal, OCR, and document models

Models for image understanding, documents, screenshots, OCR, and visual agent workflows.

Local and self-hosted models

Open-weight candidates for teams that care about privacy, reproducibility, and deployment control.

Compact and general-purpose candidates

Useful baselines for cheaper inference, edge experiments, and broad assistant workloads.