LLM API Blog

Blog

Insights on AI coding, Claude Code optimization, and cost-effective development

Comparison

Claude Code Pricing in 2026: Pro vs Max vs Pay-As-You-Go API

A plain-English breakdown of what Claude Code actually costs in 2026 — the official Pro and Max subscription tiers, how token-metered API billing compares, and when a relay subscription at a fraction of the price makes sense.

2026-06-05 8
Yazhou Hu
Tutorial

Using Cline and Roo Code with a Claude-Compatible API

Step-by-step setup for pointing Cline and Roo Code — the VS Code AI coding extensions — at an Anthropic-compatible relay. Provider settings, base URL, model IDs, and the gotchas with OpenAI-compatible vs native Anthropic mode.

2026-06-04 7
Yazhou Hu
Engineering Notes

A 15-Second CPU Stall From One Unclosed XML Tag

Postmortem on a relay process that pegged a single core for fifteen seconds straight. Catastrophic regex backtracking on user-controlled input. Three-character fix, three-layer defense (bounded quantifier + per-request watchdog + nginx next_upstream).

2026-05-06 9
Yazhou Hu
Engineering Notes

Tool-Use IDs in the Wild: When the Same ID Appears Twice

Long Claude Code sessions sometimes produced duplicate `tool_use_id` values from the upstream model. Naive fixes break round-trip correlation; the working answer is occurrence-counter dedup with two-direction translation. With diagnostic dumps for the cases that still slip.

2026-05-06 8
Yazhou Hu
Guide

Claude Code Tool Use: A Complete Developer Guide

Deep dive into Claude Code's six core tools (Read, Edit, Write, Bash, Grep, Glob), how the model decides which tool to call, parallel tool calls, and how to debug when tool use fails.

2026-04-23 11
Yazhou Hu