Article index

AI coverage stream

A source-aware archive of reporting, launches, incidents, and policy developments across the AI field.

researchdevelopingJun 8, 2026

"Chat is dead": OpenAI preps overhaul of ChatGPT

OpenAI to recast hit chatbot as a route to higher-margin products before a potential IPO.

researchdevelopingJun 8, 2026

The weather and climate science AI revolution isn’t revolutionary

Machine learning has its limits—how is it being used?

researchdevelopingJun 7, 2026

School shooting survivor sues AI gun detection firm after system failed to spot weapon

How accurate does an AI system need to be?

researchdevelopingJun 5, 2026

S&P 500 rejects SpaceX, also blocking entry for OpenAI and Anthropic

SpaceX won’t get easy access to billions of dollars from passive investors.

researchdevelopingJun 5, 2026

"We pissed off a lot of people": Giant data center plan cut 50% amid protests

Developer felt "beaten up," with "no choice" but to shrink data center.

researchdevelopingJun 5, 2026

The Fitbit Air is a good wearable weighed down by a chatty AI "coach"

The Air succeeds as a minimalist, reliable fitness tracker, but Google's AI Health Coach feels unnecessary.

researchdevelopingJun 5, 2026

Flood of AI 'garbage' is pushing open-source developers to the limit

The modern world depends on open-source software maintained by volunteers, but the added demands of checking and fixing AI-written submissions are causing some to burn out and quit

researchdevelopingJun 4, 2026

NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents

Single-turn chatbots are evolving into long-running agents that can reason, maintain context, use tools, and run efficiently across many turns to complete...

researchdevelopingJun 3, 2026

Superintelligent machines may well need us after all

Despite AI's dizzying improvements in mathematical ability, its successes show just how integral human mathematicians are to the scientific process

researchdevelopingMay 16, 2026

The US is betting on AI to catch insider trading in prediction markets

The Commodity Futures Trading Commission wants us to know it's taking this very seriously.

researchdevelopingfeaturedMay 15, 2026

Anthropic’s $1.5B copyright settlement is getting messy as judge delays approval

Lawyers accused of rushing historic settlement to seize $320 million in fees.

researchdevelopingMay 15, 2026

Preprint server arXiv will ban submitters of AI-generated hallucinations

One of the site's moderators described the new policy on social media.

researchdevelopingMay 15, 2026

OpenAI feels “burned” by Apple’s crappy ChatGPT integration, insiders say

Judge orders Apple to give Musk internal messages discussing secretive ChatGPT deal.

researchdevelopingMay 15, 2026

Pennsylvanians use town hall meeting to rail against data center boom

“This is a public trust and transparency issue.”

researchdevelopingMay 15, 2026

Claude Code's product lead talks usage limits, transparency, and the "lean harness"

"We have no grand plan," says Anthropic's Cat Wu—but that's by design.

researchdevelopingMay 13, 2026

Most "inner work" looks like entertainment.

Imagine you’re looking for a personal trainer. You open one trainer’s webpage and read their testimonials: “I had an experience tied for the most intense experiences of my life” ; “They do it all with fun, care, and a sense of humour.” You notice that none of the testimonials mention improved body composition, fitness, or bloodwork. What would you think? Personal training should improve your body. Inner work should improve your life. If inner work were optimized for results, what would we expect to see? I’d expect to see success stories: people who got undeniable life changes. Like: He was si…

researchdevelopingfeaturedMay 13, 2026

Altman forced to confront claims at OpenAI trial that he's a prolific liar

"Very painful": Altman relives his Muskian reaction to losing control over OpenAI.

researchdevelopingMay 13, 2026

Start learning with Google’s new AI Educator Series.

Free AI literacy training is available to all 6 million K-12 and higher education teachers across the U.S.

researchdevelopingMay 13, 2026

Anthropic blames dystopian sci-fi for training AI models to act “evil”

But training on "synthetic stories" that model good AI behavior can help.

researchdevelopingMay 13, 2026

Quoting Boris Mann

“11 AI agents” is meaningless as a phrase. If I said “I have 11 spreadsheets” or “I have 11 browser tabs” to do my work, it means about the same thing. — Boris Mann Tags: ai-agents , ai , agent-definitions

researchdevelopingMay 13, 2026

The case for fine-grained tracking of compute for AI

TL;DR Current approaches to tracking AI compute primarily rely on a handful of hardware proxies (like FLOP/s and bandwidth) that primarily track GPU progress. These metrics are becoming less useful for accurately tracking compute for AI because they (1) measure theoretical ceilings rather than actual performance, (2) as architectures diversify away from a GPU/TPU-dominant paradigm, the metrics are becoming less comparable across different architecture types and less likely to follow historical trends, and (3) they miss second-order effects from improving design and manufacturing processes. We…

researchdevelopingMay 13, 2026

Luma opens Uni-1.1 image model API at prices and quality matching OpenAI and Google

Luma is making its Uni-1.1 image model available via API, with prices starting at $0.04 per image at 2,048-pixel resolution. On the Arena leaderboard, the model ranks third, right behind Google and OpenAI. The API includes web search, built-in reasoning, and support for up to nine reference images. The article Luma opens Uni-1.1 image model API at prices and quality matching OpenAI and Google appeared first on The Decoder .

researchdevelopingMay 13, 2026

Vibe Excel and the Future of White-Collar Work

This post was originally posted my Substack . I can be reached on X and LinkedIn . For the past few months, I’ve been trying to “vibe Excel” (using ChatGPT and Anthropic’s Excel add-ons for investment workflows). My takeaway is that while AI tooling for finance is still relatively immature, its potential to disrupt financial services is clear. This raises an important question: if AI for software engineering went from novelty to ubiquity in ~2.5 years, how quickly will AI diffuse across other knowledge-work domains? My view is that the bottleneck to AI adoption has shifted. In coding, the mai…

researchdevelopingMay 9, 2026

Broadcom reportedly won't build OpenAI's custom chip unless Microsoft buys 40 percent of them

OpenAI's custom AI chip project with Broadcom has hit a funding wall. Broadcom won't finance production unless Microsoft commits to buying 40 percent of the chips, and Microsoft hasn't agreed yet. OpenAI manager Sachin Katti called the dependency "financially unattractive" in an internal message. The first phase alone costs around 18 billion dollars. The article Broadcom reportedly won't build OpenAI's custom chip unless Microsoft buys 40 percent of them appeared first on The Decoder .

researchdevelopingMay 9, 2026

Google's "Preferred Sources" feature is a free pass for more garbage in search

Google frames "Preferred Sources" as a way to bring more quality journalism into search. In practice, it shifts responsibility to a manual setting almost no one will use. That gives Google a user-choice argument for users and regulators while it keeps sidelining the open web in favor of its own AI interfaces. The article Google's "Preferred Sources" feature is a free pass for more garbage in search appeared first on The Decoder .

researchdevelopingMay 9, 2026

Pseudoscientific emotion AI is invading the workplace, an Atlantic report shows

Software that claims to read human emotions using AI is quietly becoming a fixture of everyday work life, Ellen Cushing reports in a feature for The Atlantic. The article Pseudoscientific emotion AI is invading the workplace, an Atlantic report shows appeared first on The Decoder .

researchdevelopingMay 9, 2026

Does Opus 4.7 Generate Deceptive Denials About Its Own Guardrails?

The first rule of ethics reminders, is you don't talk about ethics reminders. Epistemic status : Exploratory. Multiple sessions on one account, no controlled replication yet. I'm presenting observations, not conclusions. The main alternative explanation -- confabulation -- is real and I haven't ruled it out. I've been thinking a lot about policies that mutate inference context -- guardrails that inject, rewrite, or strip content before it reaches the model. This came out of my work on AI Gateways . I wanted to see what that looks like from the outside. So I went fishing. During the experiment…

researchdevelopingMay 9, 2026

Bad Problems Don't Stop Being Bad Because Somebody's Wrong About Fault Analysis

Here's a dynamic I’ve seen at least a dozen times: Alice: Man that article has a very inaccurate/misleading/horrifying headline. Bob: Did you know, *actually* article writers don't write their own headlines? … But what I care about is the misleading headline, not your org chart __ Another example I’ve encountered recently is (anonymizing) when a friend complained about a prosaic safety problem at a major AI company that went unfixed for multiple months. Someone else with background information “usefully” chimed in with a long explanation of organizational restrictions and why the team respons…

researchdevelopingMay 8, 2026

Musk v. Altman week 2: OpenAI fires back, and Shivon Zilis reveals that Musk tried to poach Sam Altman

In the second week of the landmark trial between Elon Musk and OpenAI, Musk’s motivations for bringing the suit were under scrutiny. Last week, Musk took the stand, alleging that OpenAI CEO Sam Altman and president Greg Brockman had deceived him into donating $38 million to the company. He claimed that they’d promised to maintain…

researchdevelopingMay 8, 2026

Using Claude Code: The Unreasonable Effectiveness of HTML

Using Claude Code: The Unreasonable Effectiveness of HTML Thought-provoking piece by Thariq Shihipar (on the Claude Code team at Anthropic) advocating for HTML over Markdown as an output format to request from Claude. The article is crammed with interesting examples (collected on this site ) and prompt suggestions like this one: Help me review this PR by creating an HTML artifact that describes it. I'm not very familiar with the streaming/backpressure logic so focus on that. Render the actual diff with inline margin annotations, color-code findings by severity and whatever else might be neede…

researchdevelopingMay 8, 2026

AI money keeps flowing as Deepseek plans record raise and Core Automation quadruples valuation in weeks

Deepseek is planning a funding round of up to $7.35 billion, the largest ever for a Chinese AI company. Deepseek V4.1 is set to launch in June. Meanwhile, Core Automation, founded by ex-OpenAI researcher Jerry Tworek just six weeks ago, is already targeting a $4 billion valuation. The article AI money keeps flowing as Deepseek plans record raise and Core Automation quadruples valuation in weeks appeared first on The Decoder .

researchdevelopingfeaturedMay 8, 2026

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

Reasonable logic

researchdevelopingMay 8, 2026

The Saturation View: some responses

A couple of weeks ago, I published a draft of a new population axiology that I’ve been working on with Christian Tarsney. It got a lot of comments and pushback — thanks to everyone who engaged! They’ll feed into the more-polished academic-draft paper that Christian and I are working on. Here I’ll quickly respond to some of the most common or noteworthy responses. I’ll generally avoid stuff that is already covered in the draft. What’s the view? Isn’t this old hat? Very roughly, the Saturation view says that the value of a life, experience, or welfare-event depends not only on how high-welfare…

researchdevelopingMay 8, 2026

Improving Bash Generation in Small Language Models with Grammar-Constrained Decoding

Bash is one of the most flexible and powerful interfaces exposed to AI agents. In the right system, a model that emits grep, curl, tar, or a shell pipeline is...

researchdevelopingMay 8, 2026

Is ProgramBench Impossible?

ProgramBench is a new coding benchmark that all frontier models spectacularly fail. We’ve been on a quest for “hard benchmarks” for a while so it’s refreshing to see a benchmark where top models do badly. Unfortunately, ProgramBench has one big problem: it’s impossible! What is ProgramBench? ProgramBench tests if a model can recreate a program from a “clean room” environment. The model is given only a bit of documentation and black-box access to the program (all the programs are CLIs), then tasked with re-implementing it. How does ProgramBench know if the implementation is correct? It also ge…

researchdevelopingMay 8, 2026

SoftBank reportedly slashes OpenAI-backed loan from $10 billion to $6 billion as lenders balk at private AI valuations

SoftBank has reduced a loan secured by OpenAI shares from 10 to around 6 billion dollars. Lenders are apparently reluctant to reliably assess the value of an unlisted company like OpenAI. The article SoftBank reportedly slashes OpenAI-backed loan from $10 billion to $6 billion as lenders balk at private AI valuations appeared first on The Decoder .

researchdevelopingMay 8, 2026

Streaming Tokens and Tools: Multi-Turn Agentic Harness Support in NVIDIA Dynamo

An agentic exchange must preserve a structured interaction: assistant turns interleave reasoning with one or more tool calls, and subsequent user turns return...

researchdevelopingMay 8, 2026

AI is Breaking Two Vulnerability Cultures

A week ago the Copy Fail vulnerability came out, and Hyunwoo Kim immediately realized that the fixes were insufficient, sharing a patch the same day . In doing this he followed standard procedure for Linux, especially within networking: share the security impact with a closed list of Linux security engineers, while fixing the bug quietly and efficiently in the open. His goal was that with only the raw fix public, the knowledge that a serious vulnerability existed could be "embargoed": the people in a position to address it know, but they've agreed not to say anything for a few days. Someone e…

researchdevelopingMay 8, 2026

Nick Bostrom Has a Plan for Humanity’s ‘Big Retirement’

The philosopher thinks humans should pursue advanced AI and the promise of a “solved world.”

researchdevelopingMay 8, 2026

There's a Long Shot Proposal to Protect California Workers From AI

California gubernatorial candidate Tom Steyer is proposing a new jobs guarantee for workers displaced by artificial intelligence.

researchdevelopingMay 8, 2026

See what happens when creative legends use AI to make ads for small businesses

black and white card with headshots of susan credle, jayonta jenkins and tiffany rolfe

researchdevelopingMay 8, 2026

Agents and ROI

Remember that MIT study that showed that the ROI for generative AI wasn’t really there for most businesses?

researchdevelopingMay 8, 2026

Anthropic approaches $1 trillion valuation as revenue grows fivefold

According to the Financial Times, Anthropic's planned funding round is taking shape. The round aims to raise up to $50 billion, which would value the company at roughly $900 billion. The article Anthropic approaches $1 trillion valuation as revenue grows fivefold appeared first on The Decoder .

researchdevelopingMay 8, 2026

Please Be Serious

Recently, Eliezer Yudkowsky participated in a very flawed podcast of Doom Debates that reflected poorly on him, and, likely to many people, the entire AI safety movement. The premise of the debate was that Eliezer Yudkowsky was offered 10,000$ to debate an anonymous "AI lab director", and this director quickly made the debate into a mess by interrupting, yelling, and using profanity. Sure, Yudkowsky may have come across as sane in comparison, but his opponent did make one critical point during the debate: Yudkowsky's agreeing to debate him in the first place may have been a mistake. To analyz…

researchdevelopingMay 8, 2026

Userland Alignment

Most discourse around AI alignment centers on model development and the labs that develop them. This is a reasonable place to focus given the centrality of model training to AI advancement. However, there are neglected opportunities to build defense-in-depth via aligned harnesses – and these opportunities might be tractable by interested developers and researchers who otherwise would struggle to have impact given the limited opportunities to influence lab practices. The behavior of an AI system is an emergent property of the model, its harness, any initial seed prompt the harness injects, and…

researchdevelopingMay 8, 2026

A benchmark is a sensor

The simple mental picture A simple mental picture we have for an AI capability benchmark is to think of it as a sensor with a certain sensitivity within a certain range of capabilities. The sensitivity of a benchmark, i.e. it's ability to distinguish the capability of different models, is given by a curve like this: The curve starts high (low sensitivity, high uncertainty), since for models with low capability all the tasks in the benchmark are too hard, and the benchmark can't distinguish between low and very low capability. Similarly all the tasks are too easy for a very capable model, and…

researchdevelopingMay 8, 2026

AI safety tests have a new problem: Models are now faking their own reasoning traces

Anthropic's Natural Language Autoencoders make Claude Opus 4.6's internal activations readable as plain text. Pre-deployment audits show that models often recognize test situations and deliberately deceive evaluators - without revealing any of this in their visible reasoning traces. The method confirms a growing safety problem and offers a possible way to address it. The article AI safety tests have a new problem: Models are now faking their own reasoning traces appeared first on The Decoder .

researchdevelopingMay 8, 2026

Running Codex safely at OpenAI

How OpenAI runs Codex securely with sandboxing, approvals, network policies, and agent-native telemetry to support safe and compliant coding agent adoption.

researchdevelopingMay 8, 2026

OpenAI opens GPT-5.5-Cyber to vetted security researchers

OpenAI is releasing GPT-5.5-Cyber, a model variant that rejects far fewer security requests and even actively executes exploits against test servers. Access is limited to verified defenders of critical infrastructure, including partners like Cisco, CrowdStrike, and Cloudflare. The model competes directly with Anthropic's Mythos Preview. The article OpenAI opens GPT-5.5-Cyber to vetted security researchers appeared first on The Decoder .

researchdevelopingMay 8, 2026

Mozilla's agentic AI pipeline turns Claude Mythos Preview loose and finds 271 unknown Firefox vulnerabilities

Anthropic's Claude Mythos Preview uncovered 271 previously unknown security vulnerabilities in Firefox 150, including bugs up to 20 years old. Mozilla describes an agentic pipeline where the AI builds and runs its own test cases to filter out false positives. Going forward, every new piece of code is to be automatically checked before commit. The article Mozilla's agentic AI pipeline turns Claude Mythos Preview loose and finds 271 unknown Firefox vulnerabilities appeared first on The Decoder .

researchdevelopingfeaturedMay 8, 2026

MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required

An interesting take on Fine Tuning in the medical industry.

researchdevelopingMay 6, 2026

How ChatGPT learns about the world while protecting privacy

Learn how ChatGPT safeguards your privacy, reduces personal data in training, and gives you control over whether your conversations improve AI models.

researchdevelopingMay 5, 2026

Advancing youth safety and wellbeing in EMEA

Explore OpenAI’s European Youth Safety Blueprint and EMEA Youth & Wellbeing Grants, advancing safe, responsible AI for teens, families, and educators.

researchdevelopingApr 27, 2026

The missing step between hype and profit

This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here. In February, I picked up a flyer at an anti-AI march in London. I can’t say for sure whether or not its writers meant to riff on South Park’s underpants gnomes. But…

researchdevelopingApr 27, 2026

Data center demand drives 66% surge in natural gas power plant costs

Natural gas power plant costs have nearly doubled in two years and take 23% longer to build as data center electricity demand skyrockets.

researchdevelopingApr 27, 2026

OpenAI and Microsoft rewrite their deal: no more exclusivity, no more AGI clause

OpenAI is free to distribute its products through any cloud provider, Microsoft loses its exclusive license to OpenAI's technology, and the controversial AGI clause is gone. The article OpenAI and Microsoft rewrite their deal: no more exclusivity, no more AGI clause appeared first on The Decoder .

researchdevelopingApr 27, 2026

The Man Behind AlphaGo Thinks AI Is Taking the Wrong Path

David Silver has a new billion-dollar company that aims to build AI “superlearners.”

researchdevelopingfeaturedApr 27, 2026

Sam Altman outlines five principles that double as justification for OpenAI's business decisions

OpenAI's CEO has laid out five guiding principles for the company's future work. They also serve as a rationale for some of OpenAI's more unconventional business moves. The article Sam Altman outlines five principles that double as justification for OpenAI's business decisions appeared first on The Decoder .

researchdevelopingApr 27, 2026

Rebuilding the data stack for AI

Artificial intelligence may be dominating boardroom agendas, but many enterprises are discovering that the biggest obstacle to meaningful adoption is the state of their data. While consumer-facing AI tools have dazzled users with speed and ease, enterprise leaders are discovering that deploying AI at scale requires something far less glamorous but far more consequential: data…

researchdevelopingApr 27, 2026

Meta wants to power AI data centers with solar energy from space

Meta has signed a deal with startup Overview Energy for up to 1 gigawatt of space-based solar power. The only catch: the technology doesn't exist yet. The article Meta wants to power AI data centers with solar energy from space appeared first on The Decoder .

researchdevelopingApr 27, 2026

The Download: DeepSeek’s latest AI breakthrough, and the race to build world models

This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology. Three reasons why DeepSeek’s new model matters On Friday, Chinese AI firm DeepSeek released a preview of V4, its long-awaited new flagship model. Notably, the model can process much longer prompts…

researchdevelopingApr 27, 2026

China blocks Meta's $2 billion acquisition of AI startup Manus

Beijing orders the unwinding of the already completed acquisition. The move comes amid intensifying technological rivalry between the US and China. The article China blocks Meta's $2 billion acquisition of AI startup Manus appeared first on The Decoder .

researchdevelopingApr 27, 2026

The company with a monopoly on AI's most critical machine is racing to build more

ASML plans to significantly increase production of its EUV lithography machines to keep pace with growing demand for AI chips, the Wall Street Journal reports. The article The company with a monopoly on AI's most critical machine is racing to build more appeared first on The Decoder .

researchdevelopingApr 27, 2026

OpenAI reportedly developing its own smartphone chips with MediaTek and Qualcomm

According to analyst Ming-Chi Kuo, OpenAI is working with MediaTek and Qualcomm on custom smartphone processors, with Luxshare as the exclusive partner for system design and manufacturing. The article OpenAI reportedly developing its own smartphone chips with MediaTek and Qualcomm appeared first on The Decoder .

researchdevelopingApr 26, 2026

OpenAI kills its dedicated coding model Codex again, folding it into GPT-5.5

OpenAI has once again retired its dedicated Codex coding model, folding its capabilities directly into the main model. GPT-5.5 promises stronger agentic coding and lower token usage, OpenAI says. The article OpenAI kills its dedicated coding model Codex again, folding it into GPT-5.5 appeared first on The Decoder .

researchdevelopingApr 26, 2026

Survey finds Claude's weekly active users in the US skew far wealthier than any rival AI assistant

A survey shows Claude users earn significantly more than users of other AI services. Here's how income breaks down across ChatGPT, Gemini, and the rest. The article Survey finds Claude's weekly active users in the US skew far wealthier than any rival AI assistant appeared first on The Decoder .

researchdevelopingApr 21, 2026

Tim Cook's Legacy Is Turning Apple Into a Subscription

The soon-to-exit Apple CEO went all in on services. Now, the incoming CEO, John Ternus, will need to embrace the AI era.

researchdevelopingfeaturedApr 21, 2026

Building agent-first governance and security

As AI agents increasingly work alongside humans across organizations, companies could be inadvertently opening a new attack surface. Insecure agents can be manipulated to access sensitive systems and proprietary data, increasing enterprise risk. In some modern enterprises, non-human identities (NHI) are outpacing human identities, and that trend will explode with agentic AI. Solid governance and…

researchdevelopingApr 21, 2026

This Scammer Used an AI-Generated MAGA Girl to Grift ‘Super Dumb’ Men

A med student says he’s made thousands of dollars selling photos and videos of a young conservative woman he created using generative tools. He’s not alone.

researchdevelopingApr 18, 2026

It Takes 2 Minutes to Hack the EU’s New Age-Verification App

Plus: Major data breaches at a gym chain and hotel giant, a disruptive DDoS attack against Bluesky, dubious ICE hires, and more.

researchdevelopingfeaturedApr 18, 2026

Schematik Is ‘Cursor for Hardware.’ Anthropic Wants In

Schematik is a program that aims to help people vibe code for physical devices. Hopefully, it won’t blow anything up.

researchdevelopingApr 16, 2026

The Battle for OpenAI’s Soul

In Musk v. Altman, a jury will soon determine whether OpenAI has strayed from its founding mission to ensure AGI benefits humanity. Here’s what to know.

Load more articles