February 17, 2026 09:17 AM
Qwen3.5-397B-A17B is the first model in the Qwen3.5 series. The native vision-language model demonstrates outstanding results in reasoning, coding, agent capabilities, and multimodal understanding. It uses an innovative hybrid architecture that fuses linear attention with a sparse mixture-of-experts. While it contains 397 billion parameters, only 17 billion are activated per forward pass. The model supports 201 languages and dialects.
Read MoreFebruary 17, 2026 09:17 AM
Manus Agents is a new way to access and use Manus directly inside messaging apps. Telegram is currently the only supported app, with more platforms coming soon. The agent features few reasoning, tools, and multi-step task execution. The feature makes agents accessible wherever users are.
Read MoreFebruary 17, 2026 09:17 AM
Microsoft is testing new Researcher and Analyst agents integrated into Copilot's upcoming "Tasks" feature. This feature will allow users to schedule complex prompts, leveraging OpenAI and o3-mini models for research and data analysis. The addition of an "Auto" mode aims to streamline task automation, potentially differentiating Copilot in productivity use cases.
Read MoreFebruary 17, 2026 09:17 AM
Anthropic's CEO, Dario Amodei, expects 'geniuses in a data center' to show up within a few years. While Anthropic's actions do not seem to fully reflect this optimism, its caution is necessary. This article contains notes from a recent podcast where Amodei discusses China, export controls, democracy, AI policy, AI risks, and continual learning.
Read MoreFebruary 17, 2026 09:17 AM
AGI is likely possible, but it probably won't come from Transformer-based models. Transformers are very powerful, but they have fundamental limitations. Solving these limitations could take decades. This isn't to say that LLMs aren't useful - the current technology is already fundamentally changing society.
Read MoreFebruary 17, 2026 09:17 AM
Inference costs may not be that much of a bottleneck for AI progress. The cost to reach a given capability level falls fast, so the inference cost burden is more transient than it might appear from looking at only frontier models at launch. The data on RL scaling is still thin, so it is difficult to draw conclusions yet. It will be interesting to see how quickly cheaper models catch up to frontier capability levels, and how inference costs for fixed tasks decrease over time.
Read MoreFebruary 17, 2026 09:17 AM
Benchmark performance gives biased estimates of out-of-distribution generalization if LLM training data is polluted with benchmark test data. Typical decontamination filters fail to detect semantic duplicates. This suggests that recent benchmark gains are confounded - the prevalence of soft contamination means gains reflect both genuine compatibility improvements and the accumulation of test data and effective test data in the growing training corpora.
Read MoreFebruary 17, 2026 09:17 AM
Alibaba's ZVEC is an open-source, in-process vector database enabling rapid, scalable similarity searches using Alibaba's PROXIMA engine. It supports dense and sparse vectors with hybrid searches and can be deployed across various platforms, including notebooks and edge devices. Installation is straightforward via Python or Node.js, offering a lightweight solution for handling vector data efficiently.
Read MoreFebruary 17, 2026 09:17 AM
Spreadsheet Arena is an open platform for evaluating LLM-generated spreadsheets. Formatting and structure often influence user preference more than formula complexity. There are significant differences in domain-specific preferences, with academic models suffering from heavy formatting and finance models benefiting from professional color coding. Crowd preferences often diverge from expert ratings, particularly in color coding and formatting.
Read MoreFebruary 17, 2026 09:17 AM
Micron Technology, the largest American maker of memory chips, is rushing to add manufacturing capacity to break the memory bottleneck. The company is spending $50 billion to more than double the size of its 450-acre campus. It will build two new chip factories, the first of which is expected to start production of DRAM in mid-2027. Micron also recently broke ground on a $100 billion fab complex in New York, and it announced a $9.6 billion fab investment in Japan last year.
Read MoreFebruary 17, 2026 09:17 AM
Flapping Airplanes aims to revolutionize AI by developing data-efficient training methods, reducing reliance on vast datasets. Backed by $180 million, the founders emphasize diverging from traditional methods, drawing inspiration from the human brain without replicating it exactly. They focus on creativity and fresh perspectives, employing a team oriented towards groundbreaking research rather than incremental improvements.
Read MoreFebruary 17, 2026 09:17 AM
Anthropic has a big consumer marketing problem. Its story, while well known in Silicon Valley, isn't widely heard or legible to the greater cultural conscience. It is puzzling how a company so good at aesthetics and narrative engineering can be so bad at this.
Read MoreFebruary 17, 2026 09:17 AM
LLMs can assist with decompiling Nintendo 64 games up to a certain point - this post describes a developer's attempt, how their workflow evolved as the project matured, what helped, and where they're currently stuck.
Read MoreFebruary 17, 2026 09:17 AM
US productivity increased roughly 2.7% for 2025, a near doubling from the sluggish 1.4% annual average that characterized the past decade.
Read MoreFebruary 17, 2026 09:17 AM
Nearly all models tested responded that they should walk.
Read MoreFebruary 17, 2026 09:17 AM
AIs optimized as “reward-seekers” might be influenced not just by local training incentives but also by distant retroactive rewards or simulated scenarios administered later or by powerful actors.
Read MoreFebruary 17, 2026 09:17 AM
AI is constrained by silicon, supply chains, and economics.
Read MoreFebruary 17, 2026 09:17 AM
Model labs have a structural cost advantage that pure inference providers will struggle to match.
Read More