The Strongest Model Just Dropped, and I’m Not 'Qualified' to Use It
GPT 5.6 looks strong, but only ~20 U.S.-approved partners can use it; I realize intelligence you can buy is already fair, while permission is the real gap.
10 posts
GPT 5.6 looks strong, but only ~20 U.S.-approved partners can use it; I realize intelligence you can buy is already fair, while permission is the real gap.
I found Fable 5 swapped to Opus 4.8; U.S. Commerce banned it in three days. Closed-source AI can vanish overnight; Chinese open-source models step in.
Fable 5 ran a 15-hour task for $420. After June 22 it leaves Coding Plan for API rates, a 10x cost jump: access to top intelligence is a wealth problem.
Top models tie on GPQA at 92–94%, yet real-world results diverge. Three skills now split: hard problems, rough tasks, open exploration; the last is hardest to measure.
GPT-5.5 arrived six weeks after 5.4; Opus 4.7 followed 4.6 in two months. GPT now sounds human, Opus doesn't, and judgment's half-life keeps shrinking.
DeepSeek V4 matches Opus 4.6, but FP4, 1M-token context, and day-0 chip support stress inference infra. GPT-5.5, Vision Banana, and LPM 1.0 landed too.
LLM pricing is stuck: chip controls cap supply, while three user groups pull demand into different shapes. The once-obvious Coding Plan is now under fire.
Opus 4.7 kept me up. I tested it, merged OpenClaude PRs, and pushed Strix Halo Qwen3-30B prefill to DGX Spark levels. Agents make parallel work real.
Researching text-to-video, I found Chinese dominance in open-source LLMs hasn't reached every domain. From LLaMA to Qwen to DeepSeek, what changed?
I switch models daily. Opus is reckless but strong, GPT 5.4 drifts, Gemini misses bugs, domestic models each have quirks. Switching beats prompt tuning.