The biggest AI agent news this week: LiteLLM compromised in supply chain attack affecting 36% of cloud environments, Claude gains Mac computer control with 52M views, Jensen Huang says engineers should burn $250K in tokens, OpenAI acquires Astral, and Stripe launches Machine Payments Protocol for autonomous AI transactions.
The AI agent ecosystem saw explosive developments this week, from Anthropic giving Claude the ability to control your Mac to a supply chain attack compromising one of the most popular Python packages in the LLM ecosystem. Here's everything you need to know.
Jensen Huang: If Your $500K Engineer Isn't Burning $250K in Tokens, "I Will Go Ape"
Nvidia CEO Jensen Huang made headlines during an appearance on the All-In Podcast at GTC 2026, declaring that his highly-paid engineers should be spending at least half their salaries on AI tokens.
"If that $500,000 engineer did not consume at least $250,000 worth of tokens, I'm going to be deeply alarmed," Huang stated. He compared engineers who underutilize AI to chip designers using "paper and pencil."
The comments reflect a broader shift in how companies are beginning to treat AI usage as a component of compensation. Huang suggested that tokens could become part of recruitment strategy, with companies offering inference credits alongside salary, bonuses, and equity. Theory Ventures' Tomasz Tunguz has called tokens a potential "fourth component" of compensation.
Claude Can Now Control Your Mac While You're Away
In what may be the week's biggest consumer AI launch, Anthropic released computer use capabilities for Claude on macOS. The feature, available immediately as a research preview for paying subscribers, transforms Claude from a conversational assistant into something closer to a remote digital operator.
According to VentureBeat, Claude can now click buttons, open applications, type into fields, and navigate software on your behalf while you step away from your desk. The announcement generated over 52 million views on social media.
The capability pairs with Dispatch, released last week, which lets users assign Claude tasks from their iPhone and return to finished work on their desktop. Mac users must approve each app Claude can use and can stop Claude at any point.
"Computer use is still early compared to Claude's ability to code or interact with text," Anthropic noted. "Claude can make mistakes, and while we continue to improve our safeguards, threats are constantly evolving."
LiteLLM Supply Chain Attack Exposes Credentials Across 36% of Cloud Environments
A sophisticated supply chain attack hit LiteLLM, the popular open-source LLM proxy with approximately 97 million monthly downloads. Security researcher isfinne discovered that versions 1.82.7 and 1.82.8 contained credential-stealing malware.
According to Datadog Security Labs, the attack originated from a threat actor known as TeamPCP who compromised the real LiteLLM project on PyPI through a poisoned Trivy GitHub Action in the project's CI/CD pipeline.
The malicious payload runs a three-stage attack: harvesting credentials (SSH keys, cloud tokens, Kubernetes secrets, crypto wallets, and .env files), attempting lateral movement across Kubernetes clusters, and installing a persistent systemd backdoor.
Wiz data shows that LiteLLM is present in 36% of cloud environments, indicating broad potential exposure. The malicious versions were available for approximately three hours before PyPI quarantined the package. Organizations using LiteLLM are strongly advised to rotate all secrets and credentials immediately.
OpenAI Acquires Astral, Bringing Python's Most Beloved Tools to Codex
OpenAI announced it would acquire Astral, the company behind uv, Ruff, and ty—three of Python's most widely adopted open-source tools. The Astral team, led by founder Charlie Marsh, will join OpenAI's Codex team.
Astral's tools have grown from zero to hundreds of millions of downloads per month. All three are written in Rust, giving them significant speed advantages over older Python-based alternatives. The acquisition comes as Codex has seen 3x user growth and 5x usage increase since the start of the year, with over 2 million weekly active users.
According to Simon Willison, the Python community has been worried about strategic risk from a single VC-backed company owning key infrastructure. OpenAI committed to continuing support for Astral's open source tools after the deal closes.
Cursor's Composer 2 Secretly Built on Chinese AI Model
TechCrunch reports that Cursor released Composer 2 on March 19 without disclosing it was built on Kimi K2.5, an open-source model from Chinese startup Moonshot AI. The base model's identity became public after a developer intercepted API traffic and found the model ID in plain sight.
According to VentureBeat, Cursor accessed Kimi K2.5 through Fireworks AI as part of an authorized commercial partnership. Cursor co-founder Aman Sanger acknowledged the oversight: "It was a miss to not mention the Kimi base in our blog from the start. We'll fix that for the next model."
Meanwhile, Moonshot AI is seeking to raise $1 billion at an $18 billion valuation—more than quadrupling its value in just three months. The company behind the Kimi chatbot raised $700 million earlier this year at a $10 billion valuation with participation from Alibaba and Tencent.
Cloudflare Launches Dynamic Workers: AI Sandboxes 100x Faster Than Containers
Cloudflare announced Dynamic Workers in open beta, a new system designed to sandbox AI agent code with millisecond startup times. Built on V8 isolates rather than traditional containers, Dynamic Workers are roughly 100x faster to start and between 10x and 100x more memory efficient.
According to VentureBeat, the move cuts startup times from around 500ms to under 5ms, a meaningful difference for bursty agentic AI workloads that would otherwise pay a cold-start penalty on every request.
Dynamic Workers are available under Cloudflare's Workers paid plan at $0.002 per unique Worker loaded per day, with the per-Worker fee waived during beta.
Stripe Launches Machine Payments Protocol for AI Agent Transactions
Stripe launched the Machine Payments Protocol (MPP), an open standard co-authored with Tempo that enables AI agents to make payments autonomously. When a client requests a paid resource, the server returns an HTTP 402 response with payment details, and the agent handles authorization automatically.
According to PYMNTS, MPP supports stablecoins via Tempo, credit and debit cards via Stripe and Visa, and Bitcoin via Lightning. Fortune reports that Visa contributed to developing the specifications for letting agents pay with cards.
Early use cases include Browserbase (agents paying per browser session), PostalForm (printing and mailing physical mail), and even a New York deli letting agents order sandwiches for human pickup.
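The 402 handshake described above follows a simple pattern: request a resource, receive payment details if the server demands payment, authorize, and retry. The sketch below illustrates the client side of that loop. The header and field names are placeholders, not the actual MPP specification.

```python
import json

def handle_response(status: int, headers: dict, pay) -> dict:
    """Handle one server response in an HTTP 402 payment flow.

    `pay` is the agent's payment hook: it receives the machine-readable
    payment details and returns a proof-of-payment token. Header names
    here (X-Payment-Details, X-Payment-Token) are illustrative only.
    """
    if status != 402:
        return {"retry": False}  # no payment required; use the response as-is
    # A 402 response carries the price, currency, and supported rails.
    details = json.loads(headers.get("X-Payment-Details", "{}"))
    token = pay(details)
    # Retry the original request, attaching the proof of payment.
    return {"retry": True, "headers": {"X-Payment-Token": token}}
```

In a real agent, `pay` would route to whichever rail the server advertises (stablecoins via Tempo, cards via Stripe and Visa, or Bitcoin via Lightning, per the article), subject to whatever spending limits the agent's operator has configured.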
Better Auth Releases Agent Auth Protocol for AI Authentication
Better Auth announced the Agent Auth Protocol, an open standard for agent authentication, capability-based authorization, and service discovery.
The protocol addresses three fundamental problems: delegated agents sharing credentials without isolation, autonomous agents lacking their own identity model, and no standard way for services to advertise agent support. According to Better Auth documentation, the plugin includes adapters for OpenAPI and MCP, allowing existing REST APIs or MCP servers to become agent-auth-enabled without manual capability configuration.
Separately, the IETF published Internet-Draft draft-klrc-aiagent-auth-00 in March 2026, co-authored by engineers from AWS, Zscaler, and Ping Identity, demonstrating how to compose existing standards like OAuth 2.0 for AI agent authentication.
Claude Code Channels: Control Your Coding Agent from Telegram
Anthropic shipped Claude Code Channels, allowing developers to send messages from Telegram or Discord directly into a running Claude Code session on their local machine. The session processes requests with full filesystem, MCP, and git access, then replies through the same messaging app.
According to Claude Code documentation, channels use an MCP server architecture that acts as a two-way bridge. The feature requires Claude Code v2.1.80 or later and is positioned as Anthropic's answer to OpenClaw, which proved demand with 200K GitHub stars for letting developers message AI agents from mobile messaging apps.
Anthropic Sends Lawyers to OpenCode Over Claude Max Plugin
OpenCode 1.3.0 will no longer autoload the Claude Max plugin following legal pressure from Anthropic. The developers stated they "did our best to convince Anthropic to support developer choice but they sent lawyers."
The plugin has been removed from GitHub and marked deprecated on npm. According to Hacker News discussions, Anthropic has clarified that third-party developers are not permitted to offer Claude subscription rate limits in their products.
OpenCode gave a shoutout to OpenAI, GitHub, and GitLab, saying they're "going the other direction and supporting developer freedom." Meanwhile, T3 Chat's Theo publicly stated he's not concerned about Anthropic's position and is adding Claude integration regardless.
LangChain Releases Open SWE Framework for Enterprise Coding Agents
LangChain released Open SWE, an open-source framework for deploying autonomous coding agents inspired by internal systems at Stripe, Ramp, and Coinbase. The MIT-licensed project has already garnered 6,200+ GitHub stars.
Unlike copilots that provide real-time suggestions, Open SWE operates like another engineer on your team: you assign it a task, it researches your codebase, creates a plan, writes code, runs tests, and opens a pull request. Every task runs in a secure, isolated Daytona sandbox.
The framework ships with integrations for Linear and Slack—comment @openswe on any Linear issue, and the agent reads the full context and posts results back as comments.
Ramp Sheets Becomes Self-Maintaining with Agentic AI
Ramp built an agentic system to maintain Ramp Sheets that continuously monitors production, triages alerts, and proposes fixes without human intervention—although no code is merged without engineer review.
The system runs on a thousand AI-generated monitors, one for every 75 lines of code. According to Modal, Ramp uses its platform to power Ramp Inspect, an internal background coding agent that now writes over half of all merged pull requests at Ramp.
In its first week, the system caught 40 real bugs within minutes of users triggering the issues, with fixes ready immediately.
Google Stitch: The "Vibe Design" Platform That Spooked Figma
Google released a major update to Stitch, its AI-native design platform from Google Labs. The update introduces voice capabilities where users can speak directly to their canvas, receive real-time design critiques, and make updates through natural conversation.
According to TechRadar, the platform can now generate up to five screens in a single operation and export to Figma or HTML/CSS. Reports indicate Figma shares plunged 8.8% on March 18 following the announcement.
Access is free during beta with monthly limits: 350 standard generations and 200 pro generations.
Quick Hits: GPT-5.4 Mini, MiniMax M2.7, and More
GPT-5.4 Mini: OpenAI released GPT-5.4 Mini and Nano, described as their "most capable small models yet." The mini model runs 2x faster than GPT-5 Mini while approaching GPT-5.4 performance on SWE-Bench Pro, with a 400K context window at $0.75 per million input tokens.
MiniMax M2.7: MiniMax released M2.7, which it describes as the first AI model to deeply participate in its own iteration. With only 10 billion active parameters, it achieved 56.22% on SWE-Pro, matching GPT-5.3-Codex while being 50x cheaper than Opus on input.
Morph Flash Compact: Morph introduced Flash Compact, a specialized model for context compaction running at 33,000+ tokens per second. It shrinks context 50-70% with verbatim accuracy—no summarization, no hallucination risk.
GLM-OCR: Zhipu AI released GLM-OCR, a 0.9B parameter vision model supporting 8K resolution, 8+ languages, and built-in text, LaTeX, and table recognition modes. According to MarkTechPost, it topped OmniDocBench V1.5, beating models 260x larger.
Netlify AI: Netlify announced users can now start new projects from a prompt at netlify.new using Agent Runners. Choose from Claude Code, Codex, or Gemini CLI and get a live web app in minutes, with 300 credits included for every account.
The Week Ahead
The AI agent landscape continues to evolve at breakneck speed. With Anthropic pushing the boundaries of what consumer AI can do while simultaneously tightening control over its subscription offerings, and Chinese model labs closing the gap with Western frontier models, the competitive dynamics are shifting rapidly.
The LiteLLM supply chain attack serves as a stark reminder that as AI infrastructure becomes more critical, it also becomes a more attractive target. Organizations should be auditing their AI dependencies and implementing security controls accordingly.
For developers, the message from Jensen Huang is clear: if you're not leveraging AI tokens heavily in your work, you may be leaving productivity on the table. Whether that means $250,000 worth remains to be seen—but the direction of travel is unmistakable.