All
August 13, 2025
Forge v0.106.0 Release: Plan Progress Tracking and Reliability ImprovementsForge v0.106.0 introduces plan progress tracking for better task management and reliability improvements to enhance your development workflow.
August 12, 2025
Coding Agents Showdown: VSCode Forks vs. IDE Extensions vs. CLI AgentsThe AI coding assistant landscape is fragmenting into three distinct ways to integrate AI into your development workflow. Here's an objective analysis of what each approach reveals about the future of software development.
August 10, 2025
Claude Sonnet 4 vs Kimi K2 vs Gemini 2.5 Pro: Which AI actually ships production code?I ran Claude Sonnet 4, Kimi K2, and Gemini 2.5 Pro on the same Next.js app and measured cost, speed, and whether the code actually shipped without follow-ups.
July 27, 2025
Graduating from Early Access: New Pricing Tiers Now AvailableHow our explosive early access growth shaped our pricing strategy and what's now available for developers at every scale.
July 26, 2025
Kimi K2 vs Grok 4: Which AI Model Codes Better?A deep dive into Kimi K2 and Grok 4 for real-world coding, comparing their performance across bug fixing, feature implementation, tool use, and cost efficiency. See which model stands out and when to choose each for your dev workflow.
July 23, 2025
Kimi K2 vs Qwen-3 Coder: Testing Two AI Models on Coding TasksI tested Kimi K2 and Qwen-3 Coder on 13 Rust development tasks across a 38k-line codebase and 2 Frontend refactor tasks. The results reveal differences in code quality, instruction following, and development capabilities.
July 18, 2025
Forge Performance RCA: Root Cause Analysis of Quality Degradation on July 12, 2025A detailed root cause analysis of the Forge AI coding assistant's quality degradation incident on July 12, 2025, including the impact of aggressive conversation compaction and steps taken for future prevention and stability improvements.
July 17, 2025
Grok 4 Initial Impressions: Is xAI's New LLM the Most Intelligent AI Model Yet?A deep dive into Grok 4's benchmarks, architecture, and community impressions. Is xAI's latest LLM a breakthrough towards AGI, and is it worth integrating into your AI development workflow?
July 10, 2025
Claude 4 Opus vs Grok 4: Which Model Dominates Complex Coding Tasks?I pitted Claude 4 Opus against Grok 4 in a series of challenging coding tasks. The results highlight trade-offs in speed, cost, accuracy, and frustration factors that every dev should know.
July 7, 2025
Forge v0.98.0: Integrated Authentication and Developer Experience ImprovementsForge v0.98.0 release brings browser-based authentication, AI safety limits, and enhanced file operations for AI coding assistants. Streamline your terminal development workflow with improved reliability and developer experience.
July 1, 2025
MCP 2025-06-18 Spec Update: AI Security, Structured Output, and User Elicitation for LLMsReal talk about MCP Spec update (v2025-06-18), including important changes, security implications and what developers should actually care about.
June 27, 2025
Simple Over Easy: Architectural Constraints for Maintainable AI-Generated CodeDiscover how applying Rich Hickey's 'Simple Made Easy' principles can solve the 'AI 90/10 problem', leading to more maintainable and reviewable AI-generated code by constraining architectural choices.