May 20, 2026 09:19 AM
Google has introduced Gemini 3.5 Flash, a new model focused on agentic workflows, coding, and long-horizon task execution. The release also expanded Gemini access across Search, enterprise tools, Android Studio, and Google's developer platforms.
Read MoreMay 20, 2026 09:19 AM
OpenAI's new Guaranteed Capacity offering allows customers to secure long-term access to compute to power AI products, agents, and workflows. Customers can choose between one-, two-, and three-year-long commitments, with discounts based on how long the commitment is. The company will offer Guaranteed Capacity until it sells out of its current allocation. It plans to offer it again in the future.
Read MoreMay 20, 2026 09:19 AM
Andrej Karpathy announced he has joined Anthropic, citing the next few years at the LLM frontier as especially formative for his return to R&D. Karpathy noted he remains passionate about education and plans to resume that work later, signaling the move is research-focused rather than a permanent pivot away from teaching.
Read MoreMay 20, 2026 09:19 AM
At I/O 2026, Google outlined how Gemini models were being integrated across consumer products, creative tools, and developer platforms. The company also shared that monthly token usage across its AI systems had grown to more than 3.2 quadrillion.
Read MoreMay 20, 2026 09:19 AM
Model half-life - the idea that model releases will keep getting faster and faster - doesn't really make sense under consideration. Model releases have definitely increased in pace, but the release time isn't halving every six months. This post looks at model release dates for several of the most well-known models and lays out predictions for when the next models might come out.
Read MoreMay 20, 2026 09:19 AM
HTML's richness allows it to convey complex information more effectively than Markdown, including layouts, data tables, and interactive elements. It enhances readability by organizing specs into well-structured, easily navigable documents and offers better sharing and interaction capabilities. Claude Code uses HTML to efficiently ingest context from various sources, aiding in specs, design prototyping, and creating custom editing interfaces with improved engagement and clarity.
Read MoreMay 20, 2026 09:19 AM
OlmoEarth v1.1, a new model family, reduces compute costs by up to 3X while maintaining performance, making planet-scale mapping more affordable. The models efficiently process remote sensing data by optimizing token sequence lengths, crucial for reducing computational costs. Methodological improvements allow similar performance to the original version with significantly less compute, benefiting developers and enhancing scientific research in remote sensing.
Read MoreMay 20, 2026 09:19 AM
NVIDIA's LongLive 1.0 is a framework for interactive long-form video generation that supports sequential prompting and real-time user-guided editing using streaming attention and KV-cache optimization techniques.
Read MoreMay 20, 2026 09:19 AM
Oz is a multi-harness control plane for cloud agents, supporting Claude Code, Codex, and Warp Agent. It offers automatic multi-agent orchestration, cross-harness Agent Memory, and improved cost and usage controls. Additionally, Oz provides expanded self-hosting options and enhanced governance features, streamlining agent management and deployment.
Read MoreMay 20, 2026 09:19 AM
Six new state-of-the-art CrossEncoder Ettin rerankers built on Ettin ModernBERT encoders have been released, offering models from 17M to 1B parameters. These models, trained with pointwise MSE distillation from a strong 1.54B parameter teacher, provide significant accuracy improvements over legacy models while enhancing speed, especially with Flash Attention 2. The models are particularly notable for their efficiency in retrieve-then-rerank systems and outperform models like ms-marco-MiniLM-L12-v2 on MTEB and NanoBEIR benchmarks.
Read MoreMay 20, 2026 09:19 AM
Index is a platform for content owners that helps them understand how AI agents use their work and earn revenue when they do.
Read MoreMay 20, 2026 09:19 AM
Kimi K2.6, a trillion-parameter model, has the fastest frontier model performance ever measured by Artificial Analysis at around 1,000 tokens per second.
Read MoreMay 20, 2026 09:19 AM
Learn more
Read MoreMay 20, 2026 09:19 AM
OpenAI strengthens content provenance by implementing C2PA standards and Google DeepMind's SynthID watermarking for AI-generated images.
Read MoreMay 20, 2026 09:19 AM
AI is about to generate hundreds of billions in new philanthropic funding.
Read More