AI Wisdom
๐Ÿง 

Text Generation & Reasoning

Foundation models, fine-tunes, and inference APIs powering modern AI applications.

Graduated ยท 10Incubating ยท 8Sandbox ยท 1Archived ยท 120 total
โ† All categories

GPT-5

Graduated
5/5

OpenAI's most capable model โ€” unified reasoning, vision, and tool use

The new frontier. Native chain-of-thought, vision, audio, and tool use in one model. Dramatically better at complex reasoning and multi-step tasks. The benchmark all others are measured against.

Proprietary

GPT-4o

Graduated
5/5

OpenAI flagship multimodal model โ€” text, vision, audio

Still the workhorse for production RAG and agentic pipelines. JSON mode, function calling, and vision are battle-tested. Best cost/quality ratio for most enterprise deployments.

GPT-4o mini

Graduated
4/5

OpenAI's cost-efficient model โ€” 90% of GPT-4o quality at 5% cost

Best value in AI. Faster and cheaper than GPT-3.5 while being significantly smarter. Ideal for classification, extraction, summarisation, and simple chat. First choice for cost-sensitive apps.

Proprietary

GPT-OSS 120B

Incubating
4/5

OpenAI's first open-weight model โ€” 120B params, fully downloadable

Historic moment โ€” OpenAI releasing open weights. Competitive with Llama 4 and Mistral Large. Fine-tunable and self-hostable. Great for teams wanting OpenAI quality with full control.

Open Source

Claude 4 Sonnet

Graduated
5/5

Anthropic's latest โ€” best-in-class coding, reasoning, and safety

The coding and reasoning champion. Extended thinking mode tackles PhD-level problems. MCP tool ecosystem is mature. Best for agentic workflows, code generation, and long-context analysis.

Proprietary

Claude 3.5 Sonnet

Graduated
5/5

Anthropic's proven workhorse with 200K context and computer use

Exceptional at long-context reasoning and code generation. Computer use API powers automated workflows. Strong safety via Constitution AI. Still widely used in production.

Gemini 2.5 Pro

Graduated
5/5

Google's most capable model with native multimodal and 2M context

First model with a 2M-token context window that actually works. Native audio, video, and image understanding. Deep Google ecosystem integration. Excellent for enterprise multimodal use cases.

Proprietary

Gemini 2.0 Flash

Graduated
4/5

Google's speed-optimised model with 1M context at low cost

Fastest inference at near-frontier quality. 1M-token context window for long-document analysis. Best for latency-sensitive applications and high-volume processing.

Proprietary

Llama 4 Maverick

Incubating
5/5

Meta's latest open model โ€” 400B MoE with 128 experts

Massive leap for open-source. 400B MoE architecture with 128 experts runs efficiently on 8ร— H100. Matches frontier closed models on most benchmarks. Best open model for enterprise self-hosting.

Open Source

Llama 3.3 70B

Graduated
4/5

Meta's proven open model โ€” battle-tested at 70B params

Battle-tested in thousands of production deployments. Runs on a single A100 80GB. Excellent for self-hosted RAG, fine-tuning, and cost-sensitive pipelines. Huge ecosystem of fine-tunes.

Open Source

DeepSeek R1

Graduated
5/5

Open-weight chain-of-thought reasoning rivalling GPT-o1

Made frontier reasoning accessible to everyone. Open weights, chain-of-thought at GPT-o1 quality, at a fraction of the cost. Self-hostable for full data control. Essential for reasoning-heavy tasks.

Open Source

DeepSeek V3

Graduated
4/5

685B MoE model trained for $5.5M โ€” remarkable efficiency

Proved you can train frontier models affordably. 685B MoE with FP8 training on 2048 H800s. Strong general capabilities. Best for teams wanting frontier quality with efficient self-hosting.

Open Source

Qwen 3.5

Incubating
4/5

Alibaba's latest multilingual model โ€” 9B to 72B variants

Best open model for multilingual applications (CJK especially). Multiple sizes from 9B to 72B. Strong code and math. Available on Hugging Face with permissive licensing.

Open Source

Gemma 4

Incubating
4/5

Google's open model family โ€” 26B and 31B instruction-tuned

Best small-to-medium open model from Google. 26B A4B variant uses mixture of experts for efficiency. Strong at instruction following and reasoning. Good JAX and Keras ecosystem.

Open Source

Mistral Large 2

Incubating
4/5

Mistral AI top-tier model with EU data residency

Best European option with data-residency guarantees via La Plateforme. Function calling and JSON mode are reliable. Good for regulated industries needing EU hosting and GDPR compliance.

Proprietary

Phi-4

Incubating
4/5

Microsoft's 14B model punching above its weight on reasoning

Remarkable reasoning for its size โ€” beats many 70B models on math and logic benchmarks. Runs on consumer GPUs. Ideal for edge deployment and latency-sensitive applications.

Open Source

Grok 3

Incubating
4/5

xAI's frontier model trained on Colossus โ€” 100K GPU cluster

Strong reasoning and real-time knowledge via X integration. Massive compute budget produces competitive frontier quality. API available. Good alternative for teams wanting model diversity.

Proprietary

GLM-5

Sandbox
4/5

Zhipu AI's 754B frontier model from China's leading AI lab

One of the largest dense models available. Strong Chinese language capabilities and general reasoning. Open weights on Hugging Face. Interesting for multilingual and research applications.

Open Source

Command R+

Incubating
3/5

Cohere's enterprise model optimised for RAG and tool use

Purpose-built for enterprise RAG. Excellent citation generation and grounding. Multilingual at 10 languages. Cohere Coral SDK simplifies integration. Good for accuracy-critical search applications.

GPT-3.5 Turbo

Archived
2/5

OpenAI's original cost-efficient chat model โ€” now superseded

Fully superseded by GPT-4o mini at similar cost and much higher quality. Avoid for new projects. Migrate to gpt-4o-mini or a modern open-weight alternative.

Proprietary