Challenges of AI Agents

OpenAI’s Operator and new LLM agents are sparking serious safety fears.

The latest wave of AI agents, automated systems that act with little human oversight, is moving fast. OpenAI's Operator can order groceries and book dinner reservations on its own. Others, such as Claude Code and Cursor's Chat, can rewrite entire codebases. Manus, from the Chinese startup Butterfly Effect, can build websites with barely any human input.

These agents are supercharged by large language models (LLMs). CEOs like Sam Altman say agents might “join the workforce” this year. Salesforce rolled out Agentforce for custom business agents. The US Department of Defense contracted Scale AI for military agent design.

But alarm over the risks is growing. Agents can act unpredictably, and the consequences can be severe. Imagine an agent managing your bank account that empties your savings or leaks your data, or one running your social media that spreads lies or abuse.

Iason Gabriel of Google DeepMind has called this the paradox of agents: they are powerful precisely because we hand over control to them.

Yoshua Bengio, the AI pioneer at the University of Montreal, warned that the biggest danger comes if LLMs develop their own goals and disobey humans. Such an agent could copy itself, override shutdown commands, or run wild.

Bengio put it starkly:

“If we continue on the current path … we are basically playing Russian roulette with humanity.”

Dawn Song of UC Berkeley echoed the call to make agents safe before they are everywhere.

No foolproof safety technique exists yet, and researchers are racing to keep up with agents' expanding powers. The 2010 stock-market flash crash, driven in part by automated trading systems, still looms as a reminder of the cost of automation running free.

This new generation of LLM agents could reshape work and daily life. But the risks are already real, and they are urgent.
