6 posts

#LLM

xAI grok-build-0.1 API Public Beta: Token Costs and SDK Support

xAI grok-build-0.1 API Public Beta: Token Costs and SDK Support

xAI's coding model exits the $299 CLI gate. Here's what the public API beta actually offers developers.

Anthropic 0.105.0 Under the Hood: Output Attribution and File Caps

Anthropic 0.105.0 Under the Hood: Output Attribution and File Caps

v0.105.0 adds granular output-type attribution and configurable upload caps—here's what they do and when to use them.

Gemini 3.5 Flash: Benchmarks, Pricing, and API Changes for 2026

Gemini 3.5 Flash: Benchmarks, Pricing, and API Changes for 2026

Gemini 3.5 Flash is GA: 1M-token context, a breaking thinking_level change, and full pricing breakdown.

Google Managed Agents API: Sandbox, Skills, and Agentic Stack Analysis

Google Managed Agents API: Sandbox, Skills, and Agentic Stack Analysis

One API call provisions a hosted Linux agent with persistent state and GCS mounts. Here's what developers need to know.

Gemini 3.5 Flash vs Pro: Model Selection Guide 2026

Gemini 3.5 Flash vs Pro: Model Selection Guide 2026

Flash is GA. Pro isn't. Here's the benchmark data and decision framework developers need before choosing or migrating.

Claude Opus 4.8: Coding Benchmarks and Agentic Upgrades 2026

Claude Opus 4.8: Coding Benchmarks and Agentic Upgrades 2026

Anthropic ships Opus 4.8 with 69.2% SWE-Bench Pro, mid-conversation system messages, and adaptive thinking.

Showing 6 of 6 posts