Trump Signs Executive Order Seeking Oversight of A.I. Models
The White House reversed its hands-off stance on A.I., asking tech companies to voluntarily submit new models for a 30-day government review.
Per a thread blowing up on Reddit's r/artificial, Chinese AI lab MiniMax has released M3, a model that scores 83.5 on BrowseComp — beating Claude Opus 4.7's 79.3 on the web-navigation benchmark that has become a key barometer for agentic performance. The model ships with a live API, a one-million-token context window, and open weights reportedly in the pipeline, a combination that would rank it among the most capable open releases of 2026.
The PostTrainBench picture is more nuanced: M3 lands third there, behind Opus 4.7 and GPT-5.5. But benchmarks are a selective mirror, and for developers building agents that browse the web autonomously, BrowseComp is the number that matters.
Open weights would be the larger story. If MiniMax follows through, enterprise teams will have a genuine frontier alternative they can self-host — cutting off the per-token revenue streams Western labs depend on.
Chinese frontier labs have gone from benchmark chasers to benchmark setters in under two years. Washington is paying attention.
All comments are reviewed before appearing. Keep it respectful.
The White House reversed its hands-off stance on A.I., asking tech companies to voluntarily submit new models for a 30-day government review.
Fusion startup Xcimer activated a record-breaking laser this week, targeting the power crisis threatening AI's next generation of training runs.
A viral breakdown explains why slick agent demos never ship as products: authentication, identity, and persistent state are nobody's problem to solve.