
> They know all the flags and are generally better at interpreting tool output than I am.

In the toy example, you explicitly restrict the agent to supply just a `host`, and hard-code the rest of the command. Is the idea that you'd instead give a `description` something like "invoke the UNIX `ping` command", and a parameter described as constituting all the arguments to `ping`?



Honestly, I didn't think very hard about how to make `ping` do something interesting here, and in serious code I'd give it all the `ping` options (and also run it in a Fly Machine or Sprite where I don't have to bother checking to make sure none of those options gives code exec). It's possible the post would have been better had I done that; it might have come up with an even better test.
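
For illustration, a version that hands the model the whole argument list might look roughly like this; the `PING_TOOL`/`run_ping` names and the exact schema shape are mine, not from the post:

  import subprocess

  PING_TOOL = {
      "name": "ping",
      "description": "Invoke the UNIX ping command with arbitrary arguments.",
      "input_schema": {
          "type": "object",
          "properties": {
              "args": {
                  "type": "array",
                  "items": {"type": "string"},
                  "description": "Arguments passed verbatim to ping, e.g. ['-c', '3', 'example.com']"
              }
          },
          "required": ["args"]
      }
  }

  def run_ping(args):
      # No option filtering on purpose: the assumption is this runs inside a
      # disposable sandbox, so an option that gives code exec can't do lasting damage.
      return subprocess.run(["ping", *args], capture_output=True, text=True).stdout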

I was telling a friend online that they should bang out an agent today, and the example I gave her was `ps`; like, I think if you gave a local agent every `ps` flag, it could tell you super interesting things about usage on your machine pretty quickly.
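
A minimal loop for that, sketched here with the Anthropic Python SDK as one example (the tool name, prompt, and model choice are placeholders; any tool-capable model API works the same way):

  import subprocess, anthropic

  client = anthropic.Anthropic()
  tools = [{
      "name": "ps",
      "description": "Run ps with any flags to inspect processes on this machine.",
      "input_schema": {
          "type": "object",
          "properties": {"flags": {"type": "array", "items": {"type": "string"}}},
          "required": ["flags"]
      }
  }]
  messages = [{"role": "user", "content": "What's hogging resources on this box?"}]

  while True:
      resp = client.messages.create(model="claude-sonnet-4-20250514",
                                    max_tokens=1024, tools=tools, messages=messages)
      messages.append({"role": "assistant", "content": resp.content})
      if resp.stop_reason != "tool_use":
          break
      results = []
      for block in resp.content:
          if block.type == "tool_use":
              out = subprocess.run(["ps", *block.input["flags"]],
                                   capture_output=True, text=True).stdout
              results.append({"type": "tool_result",
                              "tool_use_id": block.id, "content": out})
      messages.append({"role": "user", "content": results})

  # Assuming the final block is text once the model stops asking for tools.
  print(resp.content[-1].text)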


Or have the agent strace a process and describe what's going on as if you're a 5-year-old (because I actually need that to understand strace output)


Iterated strace runs are also interesting because they generate large amounts of data, which means you actually have to do context programming.
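
One way that context programming shakes out in practice, as a sketch (the budget number and helper name are made up): cap what one tool call is allowed to put into the context and tell the model what was cut, so it can ask for a narrower re-run.

  MAX_CHARS = 20_000   # arbitrary per-result budget; tune to your context window

  def strace_tool_result(raw_output):
      # Rather than dumping a multi-megabyte trace into the context, keep the
      # head and tail and tell the model what was dropped, so it can ask for a
      # narrower re-run (e.g. strace -e trace=network).
      if len(raw_output) <= MAX_CHARS:
          return raw_output
      head = raw_output[:MAX_CHARS // 2]
      tail = raw_output[-(MAX_CHARS // 2):]
      dropped = len(raw_output) - MAX_CHARS
      return head + f"\n... [{dropped} characters elided; re-run with a filter to see more] ...\n" + tail

The head-and-tail trick is only one option; summarizing the trace with a cheaper model or grepping for syscall families before it hits the context are others.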


What is Sprite in this context?


I'm guessing the Fly Machine they're referring to is a container running on fly.io; perhaps the sprite is what the Spritely Institute calls a goblin.


Also to be clear: are the schemas for the JSON data sent and parsed here specific to the model used? Or is there a standard? (Is that the P in MCP?)


It's JSON Schema, which is well standardized and predates LLMs: https://json-schema.org/
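
Concretely, the parameters a tool accepts are described with an ordinary JSON Schema object; for the host-only example upthread it would look something like this (an illustration, not the post's exact schema):

  {
    "type": "object",
    "properties": {
      "host": {"type": "string", "description": "Hostname or IP to ping"}
    },
    "required": ["host"]
  }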


Ah, so I can specify how I want it to describe the tool request? And it's been trained to just accommodate that?


Most LLMs have tool-calling patterns trained into them now, which are then managed for you by the API layer the model developers run on top of the models.

But... you don't have to use that at all. You can use pure prompting with ANY good LLM to get your own custom version of tool calling:

  Any time you want to run a calculation, reply with:
  {{CALCULATOR: 3 + 5 + 6}}
  Then STOP. I will reply with the result.
Before LLMs had tool calling, we called this the ReAct pattern - I wrote up an example of implementing that in March 2023 here: https://til.simonwillison.net/llms/python-react-pattern
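
For anyone who wants the other half of that, a minimal sketch of the plumbing (the regex, `ask_model`, and the toy `eval` are mine; any chat API slots in behind `ask_model`):

  import re

  SYSTEM_PROMPT = (
      "Any time you want to run a calculation, reply with:\n"
      "{{CALCULATOR: 3 + 5 + 6}}\n"
      "Then STOP. I will reply with the result."
  )
  CALC_RE = re.compile(r"\{\{CALCULATOR:\s*(.+?)\}\}")

  def run_conversation(ask_model, user_question):
      # ask_model(list_of_strings) -> assistant text; stands in for whatever
      # chat API you're using. Everything else is just string plumbing.
      history = [SYSTEM_PROMPT, user_question]
      while True:
          reply = ask_model(history)
          history.append(reply)
          m = CALC_RE.search(reply)
          if not m:
              return reply                        # no tool request: final answer
          # Toy evaluator for the demo; don't eval model output in real code.
          result = eval(m.group(1), {"__builtins__": {}}, {})
          history.append(f"Result: {result}")     # hand the result back, let it continue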



