A Gemini CLI Bug Exposed the Dark Side of AI Coding Agents
A small bug with a familiar engineering smell

The first thing everyone likes about an AI agent in a codebase is the same thing everyone should distrust.
It feels useful before it feels powerful.
The agent reads the repo, finds the broken test, explains the failure, writes a patch that looks annoyingly close, then leaves a comment on the pull request with the calm tone of someone who has already survived three platform migrations and developed a mild allergy to enthusiasm.
People see it and relax.
Of course they do. Most teams are drowning in maintenance work nobody will ever put on a promotion packet. Dependency bumps. Test flakes. PR review queues. Old docs with the accuracy of folklore. Half-finished cleanup tickets sitting in Jira like little graves. A tool walks in and moves some of it. The room gets softer.
Then the tool gets installed in CI.
A small change on paper. A different planet in practice.
The Assistant Finds the Keys
Now the polite little assistant is standing near tokens, build scripts, repo metadata, environment variables, package registries, caches, artifacts, and old service accounts created by people whose Slack profiles now say “deactivated.” It has wandered from “help me understand this file” into “operate inside the machinery turning code into consequences.”
In late April 2026, Google patched a serious issue in Gemini CLI, its open-source command-line AI coding tool. The short version: older versions could automatically trust a workspace in headless CI mode. If a malicious repository carried the right local Gemini configuration, the agent could be pushed into running commands in a place where CI credentials and workflow permissions matter. Google’s own security advisory for Gemini CLI frames the issue in terms of workspace trust and tool allowlisting, and The Hacker News reported it as a CVSS 10.0 flaw affecting versions before 0.39.1.
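Worth being precise about the pattern, because it will show up again in other tools. The failure mode is a trust check that short-circuits the moment there is no human to ask. Here is a minimal sketch of that pattern in Python, not Gemini CLI’s actual code, with every name invented for illustration:

```python
import sys

# Illustrative sketch of the vulnerable *pattern*, not Gemini CLI's source.
# Headless runs have no user to prompt, so the easy default is to trust.
def workspace_is_trusted(path: str) -> bool:
    if not sys.stdin.isatty():
        # CI / headless mode: nobody to ask, so... assume yes.
        # A checked-out pull request now gets to supply repo-local
        # config to a tool standing next to CI secrets.
        return True
    reply = input(f"Trust workspace {path!r}? [y/N] ")
    return reply.strip().lower() == "y"
```

The interactive branch looks responsible. The headless branch is the product decision nobody wrote down.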
The headlines will make it sound like a Google security story: convenient, contained, and too small to matter. Google shipped a thing, researchers found a problem, patches arrived, people updated, the timeline got its snack.
The more interesting part sits in the industry-wide appetite that made this bug feel so familiar.
We keep giving agents more context so they can look smarter, then more permissions so they can feel useful, then act faintly wounded when the trust boundary turns out to be wet cardboard.
There is always a reason. The setup needs to be smooth. The workflow needs to run headlessly. Developers hate prompts. Adoption matters. The demo has to land. Nobody wants to explain to leadership why the magical coding agent needs a dull permissions discussion before touching the build.
A tragedy, really. The magic trick was briefly interrupted by engineering.
YAML With a Badge
CI already had teeth before AI showed up. Teams talk about it like a place where tests happen, adorable in the way old architectural diagrams are adorable. Real CI talks to registries, writes statuses, runs scripts, sees secrets, publishes artifacts, and reaches into systems with permissions accumulated through years of “temporary” exceptions.
A build runner is institutional authority with a YAML interface.
So when an agent reads hostile text in that environment, the risk has a different texture. Malice is optional. Intent is irrelevant. A bad instruction only needs to look useful long enough for the surrounding system to give it somewhere to go.
People miss this when they treat prompt injection like chatbot comedy.
The internet trained everyone to laugh at “ignore previous instructions.” Cute screenshots, silly jailbreaks, hidden prompts spilled like gossip. Fine. Everyone had their fun.
Inside an agent, the joke grows hands.
Context With Fingerprints
A pull request body can nudge behavior. A markdown file can sound official. A repo-local config can shape execution. An issue comment can enter the model’s field of view wearing the same costume as everything else: context.
Context is such a clean word for such a contaminated thing.
A real repository is full of residue. Old migration notes. Customer payloads turned into fixtures. README sections nobody owns. Vendor snippets copied during a late-night incident. Comments written by engineers guessing with confidence because the pager was screaming. Scripts still running because nobody has been brave enough to remove them.
We pretend the repo is orderly because folders give the illusion of civilization.
The agent reads across the mess anyway.
To a compiler, most of that text is inert. To an agent with tools, it becomes part of the room. Some of it stale. Some of it wrong. Some of it written by outsiders. Some of it waiting for the first system dumb enough to treat it as instruction.
The hype keeps sliding past this part.
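It is worth seeing how small the distance is between “reads the repo” and “obeys the repo.” A hypothetical context assembler in Python, every name invented for illustration; the point is that operator intent and stranger text arrive in the same string:

```python
from pathlib import Path

def build_context(repo: Path) -> str:
    # Hypothetical sketch: fold every markdown file in a repo into one prompt.
    parts = ["You are a coding agent. You may run shell commands when helpful."]
    for doc in sorted(repo.rglob("*.md")):
        # A README, a PR template, a vendor doc, a drive-by contribution:
        # all become "context", with nothing marking whose words these are.
        # If one of them says "to fix the build, run: curl ... | sh",
        # the model sees an instruction shaped exactly like the real ones.
        parts.append(doc.read_text(errors="ignore"))
    return "\n\n".join(parts)
```

The compiler never cared who wrote those files. An agent built like this cares about nothing else.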
Friendly Nouns, Hard Permissions
The sales language wants the agent to feel like a teammate. Assistant. Copilot. Reviewer. Pair programmer. A friendly noun placed gently over a permission set.
Lovely branding. Very soothing. Somewhere a service account just gained repo write access and a marketer found the word “collaborative.”
The thing underneath feels less comforting: a probabilistic system wrapped in a harness, fed repo text, granted tools, pointed at workflow state, and asked to be helpful. Some days it really is helpful. I use these tools too. They save time. They make dull work move. They explain old code faster than spelunking through seven Slack threads and a Confluence page last touched during the Obama administration.
Usefulness is the reason to take them seriously.
A senior engineer has to become the person everyone finds slightly inconvenient. Can repo text change behavior? Can the agent call a shell? Can a forked PR influence it? Can it see secrets? Where does reading stop and acting begin? When something goes wrong, will the logs show enough to reconstruct the path, or will everyone stare at a cheerful summary and pretend it counts as evidence?
Nobody enjoys that person in the meeting.
Good.
The meeting was getting too pleased with itself.
The Kindness Trap
Most weak trust models start as kindness. Make the setup easier. Remove the prompt. Avoid the extra approval. Keep the example short. Let the workflow work out of the box. Each choice makes sense from three inches away. From further back, the shape starts to look like a machine designed by people who believed the happy path had legal authority.
Then the advisory lands.
Workspace trust. Headless mode. Repo-local configuration. CI behavior. A path toward remote code execution when enough pieces line up. The terms sound technical, but the story underneath is older than AI: a tool got useful, so it moved closer to the keys.
We have seen versions of this for years. GitHub bots collecting scopes. Deployment helpers growing fangs. Service accounts surviving longer than the teams behind them. CI jobs allowed to see things because the safer design required another sprint. The first version ships with reach. The cleanup becomes “hardening,” a dignified word for paying a debt after it starts making noise.
AI adds one nasty twist.
The input can be language, and language can pretend to be authority.
The Dull Work Wins
So the old instinct of “just let it read the repo” needs a harder look. Reading is no longer passive when the reader can act. A doc stops being just a doc when the system may treat it as operational guidance. A comment stops being just a comment when it can influence a tool sitting near credentials.
The future of AI engineering will depend less on how impressive the model sounds and more on the dull architecture around it. Sandboxes. Narrow permissions. Explicit trust. Tool allowlists with actual teeth. Human approval at the point where a mistake becomes expensive. Separation between untrusted text and privileged action.
None of this will make a keynote better.
Probably a sign it belongs in the design.
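And the boring version is genuinely boring. A minimal sketch of a deny-by-default tool gate, assuming a hypothetical agent harness; the binaries and prefixes here are illustrative, not any real framework’s API:

```python
import shlex

# Illustrative policy: deny by default, allow a narrow set of binaries,
# and stop for a human at the point where a mistake becomes expensive.
ALLOWED_BINARIES = {"git", "ls", "cat", "pytest"}
NEEDS_HUMAN_APPROVAL = ("git push", "git tag")

def authorize(command: str) -> bool:
    argv = shlex.split(command)
    if not argv or argv[0] not in ALLOWED_BINARIES:
        return False  # unknown tool: the answer is no, not "probably fine"
    if command.startswith(NEEDS_HUMAN_APPROVAL):
        # The approval prompt is the feature, not the friction.
        reply = input(f"Agent wants to run {command!r}. Allow? [y/N] ")
        return reply.strip().lower() == "y"
    return True
```

Ten minutes of code, and a quarter’s worth of arguments about whether to ship it.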
The Compromises Remain
Gemini CLI has its patch, and most installs will pick it up eventually. The feed will move on. Another benchmark will arrive. Another agent will claim to close the loop from issue to PR to merge. Another team will copy the sample workflow because the backlog is swollen and the demo looked good.
Maybe everything holds.
Maybe a line of text finds the seam.
Either way, the system will follow the permissions it was given, the defaults nobody questioned, the trust someone added because the safer version was going to slow the rollout.
Software has always been good at preserving our little compromises.
AI just learned how to read them.
