There is almost guaranteed going to be an attack along the lines of prompt-injec...

WXLCKNO · 2025-07-17T21:54:05 1752789245

In the system I'm building the main agent doesn't have access to tools and must call scoped down subagents who have one or two tools at most and always in the same category (so no mixed fetch and calendar tools). They must also return structured data to the main agent.

I think that kind of isolation is necessary even though it's a bit more costly. However since the subagents have simple tasks I can use super cheap models.

__jonas · 2025-07-17T23:56:20 1752796580

What isolation is there? If a compromised sub agent returns data that gets inserted into the main agents context (structured or not) then the end result is the same as if the main agent was directly interacting with the compromising resource is it not?

itsalotoffun · 2025-07-18T10:13:10 1752833590

Exactly. You can't both give the model access AND enforce security. You CAN convince yourself you've done it though. You see it all the time, including in this thread.

seunosewa · 2025-07-18T19:05:23 1752865523

Perhaps a reference to the data can be inserted in prompt. thee key or filename

clbrmbr · 2025-07-18T12:01:35 1752840095

And the way Google calendar works right now, it automatically shows invites on your calendar, even if they are spam. That does not bode well for prompt injection.