Paste Your Prompt Twice and Watch Accuracy Change
Google Research validated it across 7 models and 7 benchmarks. No training, no prompt engineering. Just copy-paste. I tested it and here's what actually happened.
Anthropic's Claude Opus 4.5 didn't just set new benchmarks. It showed that going all-in on text, code, and agents while competitors spread themselves thin is a winning play.
Bigger context windows don't make AI smarter. RLM flips the script: instead of ingesting massive documents whole, the LLM writes code to selectively read just the parts it needs.