June 15, 2026 09:22 AM
Anthropic said it disabled Fable 5 and Mythos 5 for all users after receiving a US government export-control directive tied to national security concerns and reported jailbreak risks.
Conversations between Amazon chief executive Andy Jassy and US officials prompted the Trump administration to halt all foreign use of Anthropic's most capable AI models. Researchers at Amazon had used a series of prompts to get Anthropic's Fable 5 model to provide them with information that could be used to aid cyberattacks. White House officials asked Anthropic to fix the vulnerabilities or take down the model. Anthropic has shut down access to its Mythos and Fable models to comply, but says that the vulnerabilities flagged by Amazon are relatively basic and that other publicly available models are also capable of discovering them.
GLM-5.2, a new flagship model from Z.ai, is now available to all GLM Coding Plan users. It delivers powerful coding capabilities, usable 1M-context support, and continued strengths in long-horizon tasks. API and chatbot services will launch next week. The model will be open-sourced under the MIT License.
Google is integrating its products under Gemini Enterprise with a new "Skills Marketplace" tab for pre-defined, Google-optimized skills. The feature, aimed at helping teams develop dashboards and reporting tools without long engineering delays, comprises a Skills Management UI, a Skills Builder, and the Marketplace itself.
You need the following information to work out the dollar price-per-user: GPU hardware specs, context length, active parameter count of the model, and product-specific factors. The specifics of the model architecture matter surprisingly little, unless it's something entirely different, like diffusion. This post shows how to work out the math on paper. The exercise should reveal how various optimizations in inference engines help SaaS products remain profitable.
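As a rough illustration of the kind of paper math described above, here is a minimal sketch in Python. It is not taken from the post: it assumes decoding is limited by memory bandwidth (each step reads the active weights once per batch plus every sequence's KV cache), and all names and numbers are hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass
class Gpu:
    # Hypothetical hardware description; plug in real specs for a real estimate.
    mem_bandwidth_bytes_per_s: float
    dollars_per_hour: float


def decode_tokens_per_second(gpu, active_params, bytes_per_param,
                             kv_bytes_per_token, context_len, batch_size):
    """Estimate batch decode throughput, assuming a memory-bandwidth-bound step."""
    # Active weights are read once per step and shared by the whole batch.
    weight_bytes = active_params * bytes_per_param
    # Each sequence in the batch reads its own KV cache.
    kv_bytes = kv_bytes_per_token * context_len * batch_size
    step_seconds = (weight_bytes + kv_bytes) / gpu.mem_bandwidth_bytes_per_s
    # One step produces one token per sequence in the batch.
    return batch_size / step_seconds


def dollars_per_user_per_month(gpu, tokens_per_second, tokens_per_user_per_month):
    """Convert throughput and GPU rental price into a per-user monthly cost."""
    dollars_per_token = gpu.dollars_per_hour / 3600 / tokens_per_second
    return dollars_per_token * tokens_per_user_per_month


if __name__ == "__main__":
    # Made-up round numbers chosen only to keep the arithmetic easy to follow.
    gpu = Gpu(mem_bandwidth_bytes_per_s=1e12, dollars_per_hour=3.6)
    tps = decode_tokens_per_second(gpu, active_params=1e10, bytes_per_param=1,
                                   kv_bytes_per_token=1e5, context_len=1e4,
                                   batch_size=10)
    cost = dollars_per_user_per_month(gpu, tps, tokens_per_user_per_month=1e6)
    print(f"{tps:.0f} tokens/s, ${cost:.2f} per user per month")
```

With these placeholder inputs the estimate comes to about two dollars per user per month. The sketch ignores prefill compute, utilization, and product-specific factors, which a real estimate would need to add.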
Networks of smaller AI models are outperforming every frontier AI system on speed, accuracy, and cost. Everyone in the 1960s was wrong about the mainframe computer, and everyone is now wrong about centralized AI. The future is a network of neural networks.
Rafa Schwinger reverse-engineers Claude Mythos and Fable, arguing that the moat is not architecture but the environment foundry. Capability decomposes as base foundation times the gradeable signal extracted on top, and verifiable reward becomes the scarce, decisive input now that text and raw compute no longer are. The recipe stacks dense pretraining; GRPO-style verifier RL, where reward-hacking soundness is the actual binding constraint; long-horizon process rewards with learned context-folding that beats million-token windows at 32K active; and best-of-N test-time compute exposed as an effort dial.
Kimi K2.7 Code is a coding-focused agentic model that offers stronger end-to-end task completion across complex software engineering workflows and better token efficiency than Kimi K2.6. The Mixture-of-Experts model has 1 trillion total parameters. It can be accessed through Moonshot's OpenAI- and Anthropic-compatible API. The model works best with Kimi Code CLI as its agent framework.
The Open Knowledge Format is an open specification that formalizes the LLM-wiki pattern into a portable, interoperable format. It is vendor-neutral and agent- and human-friendly. The standard can represent the metadata, context, and curated knowledge that modern AI systems need. The specification uses familiar patterns with no complex compression scheme, new runtime, or required SDK.
olmo-eval is a new evaluation workbench designed for iterative LLM development. Enhancing the OLMES standard, it streamlines adding benchmarks, supports agentic and multi-turn evaluations, and facilitates analysis by comparing changes across model checkpoints. Unlike Harbor, olmo-eval offers flexibility, minimizing resource use and focusing on development rather than public benchmarking.
MiniMax Sparse Attention is a sparse attention architecture that uses group-specific Top-k block selection to scale long-context inference while preserving model quality. On a 109B multimodal model, it matched GQA performance while cutting attention compute by ~30x at 1M tokens.
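To make the idea of Top-k block selection concrete, here is a minimal single-query sketch in plain Python. It is not MiniMax's implementation: the block-scoring rule (dot product with each block's mean key) is an assumption chosen for simplicity, the group-specific part is omitted, and the function name is hypothetical.

```python
import math


def topk_block_attention(query, keys, values, block_size, top_k):
    """Attend only to the top_k key blocks, ranked by a cheap per-block score."""
    dim = len(query)
    scale = 1.0 / math.sqrt(dim)

    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    # Split key positions into contiguous blocks.
    blocks = [range(start, min(start + block_size, len(keys)))
              for start in range(0, len(keys), block_size)]

    # Assumed scoring rule: query dotted with the mean key of each block.
    block_scores = []
    for block in blocks:
        mean_key = [sum(keys[i][d] for i in block) / len(block) for d in range(dim)]
        block_scores.append(dot(query, mean_key))

    # Keep the top_k highest-scoring blocks, then restore positional order.
    chosen = sorted(range(len(blocks)), key=lambda b: block_scores[b], reverse=True)[:top_k]
    positions = [i for b in sorted(chosen) for i in blocks[b]]

    # Standard softmax attention, restricted to the selected positions.
    logits = [dot(query, keys[i]) * scale for i in positions]
    peak = max(logits)
    weights = [math.exp(logit - peak) for logit in logits]
    total = sum(weights)
    output = [sum(w * values[i][d] for w, i in zip(weights, positions)) / total
              for d in range(len(values[0]))]
    return output, positions
```

Exact attention runs only over `top_k * block_size` positions instead of the full sequence, which is where the savings at long context come from. Scoring the blocks still touches every key in this sketch; a practical kernel would cache block summaries.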
Object counting is still fragmented across domain-specific data sets and task formulations. Existing counting models are often tailored to specific scenarios and struggle to generalize across categories, visual domains, object scales, and density distributions. This paper presents a generalist model for text-guided object counting that achieves strong accuracy and multi-domain generalization.
OpenAI and Anthropic have decided on different approaches to context management. OpenAI uses compaction, compressing everything and retaining only the relevant information. This results in one long thread that maintains a lot of coherence. Anthropic splits context windows across various agents, where each agent executes on the sub-problem within its own context window. Sub-agents do a large amount of work, then pass back only the relevant information to the parent agent. Anthropic's approach can result in sub-agents doing duplicate work, forgetting information, and generally wasting more tokens.
The iOS 27 beta contains an Extensions system for third-party AI. The system includes a settings panel and a dedicated App Store section. Both have been built, but are toggled off in the backend. Apple was in discussions with major AI providers about granting entitlements for the framework, but it appears the company has decided against announcing the feature for now.
Devin's engineering system guarantees more output than cost, with $10M pledged per customer. The system's effectiveness is validated using independent data. This bold claim aims to bolster confidence in their engineering productivity.
NVIDIA's Blackwell Ultra NVL72 platform leads the AgentPerf benchmark by delivering 20x more agent throughput per megawatt than NVIDIA Hopper.
Ramp released its own private SWE-Bench built from real engineering problems faced at Ramp, giving the team a way to evaluate coding models inside its actual financial software ecosystem.
Microsoft CEO Satya Nadella argues every company needs to build both human capital and token capital.
This thread explores a theory that DeepSeek's focus on open-source models, research sharing, and infrastructure development is part of a broader strategy aimed at becoming foundational AI infrastructure rather than competing directly on consumer products.