Skip to main content

Guide category

Model Guides

15 guides

Explore Verdent guides for AI models, coding agents, benchmark interpretation, and practical model evaluation workflows.

Model

Claude Opus 4.5

A complete guide to Claude Opus 4.5 — what's new, how it performs on coding tasks, and how it compares to Claude Opus 4.7 and GPT-5 for agentic workflows.

Model

Codex CLI

A hands-on guide to OpenAI Codex CLI — how to install it, what it can do, and how it compares to Claude Code and Verdent for agentic software development.

Model

Gemini 2.5 Pro

Everything you need to know about Gemini 2.5 Pro — coding benchmarks, 1M context window, pricing, and how to use it inside Verdent for agentic software development.

Model

Gemini 3 Flash

A practical guide to Gemini 3 Flash — speed benchmarks, SWE-bench 78%, free tier access, and how to use it for fast agentic coding inside Verdent.

Model

Gemini 3 Pro

Everything you need to know about Gemini 3 Pro — Deep Think mode, 1M context, SWE-bench 78%, and how to use it with Verdent for agentic coding workflows.

Model

Gemini 3.5

A complete guide to the Gemini 3.5 series — starting with Gemini 3.5 Flash (I/O 2026). Outperforms Gemini 3.1 Pro on coding and agentic benchmarks, 4x faster, and Gemini 3.5 Pro confirmed coming next month.

Model

Gemini Omni

A first look at Gemini Omni — Google's new multimodal flagship announced at I/O 2026. Create anything from any input, with breakthrough video understanding, generation, and world modeling capabilities.

Model

Gemma 3

A complete guide to Google's Gemma 3 — benchmarks, local deployment, and how it compares to Gemma 4 and Llama 4 for coding and agentic tasks.

Model

Google Antigravity

A complete breakdown of Google Antigravity — what it does, how it compares to Verdent and Cursor, and which AI coding tool is right for your workflow.

Model

GPT-5.1 Codex

A developer's guide to GPT-5.1 Codex — long-horizon coding tasks, API setup, real benchmarks, and how it compares to Claude Code and Verdent for agentic workflows.

Model

GPT-OSS 20B

A practical guide to GPT-OSS 20B — OpenAI's first open-weight model. Benchmarks, deployment options, and how it compares to DeepSeek V3.2 and Llama 4.

Model

Grok 4

A complete guide to Grok 4 — benchmarks, coding capabilities, pricing, and how it compares to GPT-5 and Claude. See how Verdent uses Grok 4 for parallel agentic coding.

Model

Grok 4.1

A complete review of Grok 4.1 — LMArena #1 Elo rating, 65% lower hallucination rate, coding benchmarks, and how it compares to Claude Sonnet 4.6 and GPT-5.5.

Model

Kimi K2 Thinking

A deep dive into Kimi K2's Thinking mode — how it compares to DeepSeek R1, when to enable it, and how it powers agentic coding workflows inside Verdent.

Model

Phi-4

Everything about Microsoft Phi-4 — how a 14B model beats larger LLMs on reasoning, local deployment options, and when to use Phi-4 vs GPT-5 for your coding tasks.