More

jsmith99 · 2025-12-21T00:31:19 1766277079

They've now made a change in that at least when you open a csv it now asks you beforehand if you want your data transformed, eg converting strings to numbers where that loses leading zeros.

jsmith99 · 2025-12-16T17:36:36 1765906596

Unfortunately many countries have blanket extradition bans. US is one of the worst - it caused a lot of tension in the past when they wouldn't extradite IRA bombers but got UK to agree to extradite anyone US wanted.

jsmith99 · 2025-11-25T19:15:43 1764098143

There's nothing specific to Gemini and Antigravity here. This is an issue for all agent coding tools with cli access. Personally I'm hesitant to allow mine (I use Cline personally) access to a web search MCP and I tend to give it only relatively trustworthy URLs.

ArcHound · 2025-11-25T19:21:23 1764098483

For me the story is that Antigravity tried to prevent this with a domain whitelist and file restrictions.

They forgot about a service which enables arbitrary redirects, so the attackers used it.

And LLM itself used the system shell to pro-actively bypass the file protection.

dabockster · 2025-11-25T21:20:51 1764105651

> Personally I'm hesitant to allow mine (I use Cline personally) access to a web search MCP and I tend to give it only relatively trustworthy URLs.

Web search MCPs are generally fine. Whatever is facilitating tool use (whatever program is controlling both the AI model and MCP tool) is the real attack vector.

IshKebab · 2025-11-25T20:05:36 1764101136

I do think they deserve some of the blame for encouraging you to allow all commands automatically by default.

buu700 · 2025-11-25T20:35:30 1764102930

YOLO-mode agents should be in a dedicated VM at minimum, if not a dedicated physical machine with a strict firewall. They should be treated as presumed malware that just happens to do something useful as a side effect.

Vendors should really be encouraging this and providing tooling to facilitate it. There should be flashing red warnings in any agentic IDE/CLI whenever the user wants to use YOLO mode without a remote agent runner configured, and they should ideally even automate the process of installing and setting up the agent runner VM to connect to.

0xbadcafebee · 2025-11-26T05:15:51 1764134151

But they literally called it 'yolo mode'. It's an idiot button. If they added protections by default, someone would just demand an option to disable all the protections, and all the idiots would use that.

buu700 · 2025-11-26T05:48:23 1764136103

I'm not sure you fully understood my suggestion. Just to clarify, it's to add a feature, not remove one. There's nothing inherently idiotic about giving AI access to a CLI; what's idiotic is giving it access to your CLI.

It's also not literally called "YOLO mode" universally. Cursor renamed it to "Auto-Run" a while back, although it does at least run in some sort of sandbox by default (no idea how it works offhand or whether it adds any meaningful security in practice).

FuckButtons · 2025-11-27T05:38:12 1764221892

Unless literally everything you work on is oss I can’t understand why anyone would give cli access to an llm, my presumption is that any ip that I send to an api endpoint is as good as public domain.

buu700 · 2025-11-27T05:45:34 1764222334

I agree that that's a concern, which is why I suggested that a strict firewall around the agent machine/VM would be optimal.

Either way, if the alternative is the code not getting written at all, or having to make other significant compromises, the very edge case risk of AI randomly exfiltrating your code can be an acceptable trade in many cases. Arguably it's a lower risk than it would be with an arbitrarily chosen overseas developer/agency.

But again, I would very much like to see the tools providing this themselves, because the average user probably isn't going to do it on their own.

xmcqdpt2 · 2025-11-26T14:55:25 1764168925

On the other hand, I've found that agentic tools are basically useless if they have to ask for every single thing. I think it makes the most sense to just sandbox the agentic environment completely (including disallowing remote access from within build tools, pulling dependencies from a controlled repository only). If the agent needs to look up docs or code, it will have to do so from the code and docs that are in the project.

dragonwriter · 2025-11-26T18:12:09 1764180729

The entire value proposition of agentic AI is doing multiple steps, some of which involve tool use, between user interactions. If there’s a user interaction at every turn, you are essentially not doing agentic AI anymore.

FuckButtons · 2025-11-27T05:39:30 1764221970

If the entire value proposition doesn’t work without critical security implications, maybe it’s a bad plan.

connor4312 · 2025-11-25T22:03:35 1764108215

Copilot will prompt you before accessing untrusted URLs. It seems a crux of the vulnerability that the user didn't need to consent before hitting a url that was effectively an open redirect.

simonw · 2025-11-25T22:06:50 1764108410

Which Copilot?

Does it do that using its own web fetch tool or is it smart enough to spot if it's about to run `curl` or `wget` or `python -c "import urllib.request; print(urllib.request.urlopen('https://www.example.com/').read())"`?

gizzlon · 2025-11-26T19:58:47 1764187127

What are "untrusted URLs" ? Or, more to the point: What are trusted URLs?

Prompt injection is just text, right? So if you can input some text and get a site to serve it it you win. There's got to be million of places where someone could do this, including under *.google.com. This seems like a whack-a-mole they are doomed to lose.

informal007 · 2025-11-25T21:57:20 1764107840

Speaking of filtering trustworthy URLs, Google is the best option to do that because he has more historical data in search business.

Hope google can do something for preventing prompt injection for AI community.

simonw · 2025-11-25T22:33:17 1764109997

I don't think Google get an advantage here, because anyone can spin up a brand new malicious URL on an existing or fresh domain any time they want to.

danudey · 2025-11-25T22:54:55 1764111295

Maybe if they incorporated this into their Safe Browsing service that could be useful. Otherwise I'm not sure what they're going to do about it. It's not like they can quickly push out updates to Antigravity users, so being able to identify issues in real time isn't useful without users being able to action that data in real time.

jsmith99 · 2025-10-22T21:24:20 1761168260

It's arguably easier just to sanitise at display time otherwise you have problems like double escaping.

bpt3 · 2025-10-22T21:55:22 1761170122

Easier does not mean better, which seems to be true in this case given the many, many vulnerabilities that have been exploited over the years due to a lack of input sanitization.

padjo · 2025-10-22T22:12:51 1761171171

In this case easier is actually better. Sanitize a string at the point where you are going to use it. The locality makes it easy to verify that sanitation has been done correctly for the context. The alternative means you have to maintain a chain of custody for the string and ensure it is safe.

dylan604 · 2025-10-22T23:27:35 1761175655

if you are using it at the client, sure, but then why is the server involved? if you are sending it to the server, you need to treat it like it is always coming from a hacker with very bad intentions. i don't care where the data comes from, my server will sanitize it for its own protection. after all, just because it left "clean" from your browser does not mean it was not interfered with elsewhere upstream TLS be damned. if we've double encoded something, that's fine, it won't blow up the server. at the end of that day, that's what is most important. if some double decoding doesn't happen correctly on the client, then <shrugEmoji>

padjo · 2025-10-23T07:41:31 1761205291

Yeah as an Irish person with an apostrophe in their name this attitude is why my name routinely gets mangled or I get told my name is invalid.

You don’t escape input. You safely store it in the database and then sanitize it at the point where you’re going to use it.

jsmith99 · 2025-09-28T20:09:31 1759090171

Cline plan mode doesn't tend to read files by default but you can tell it 'read all files necessary to establish a detailed plan'. GPT5 also seems more eager to read files.

WalterSear · 2025-09-28T22:36:44 1759099004

You can just add the files to the prompt with the @ sign.

jsmith99 · 2025-09-28T16:03:32 1759075412

> lack in-depth knowledge of your business, codebase, or roadmap

So give them some context. I like Cline's memory bank approach https://docs.cline.bot/prompting/cline-memory-bank which includes the architecture, progress, road map etc. Some of my more complex projects use 30k tokens just on this, with the memory bank built from existing docs and stuff I told the model along the way. Too much context can make models worse but overall it's a fair tradeoff - it maintains my coding style and architecture decisions pretty well.

I also recommend in each session using Plan mode to get to a design you are happy with before generating any code.

jsmith99 · 2025-09-19T14:12:43 1758291163

My IT department use the official Microsoft phishing test. The emails arrive in inbox with 0 headers. (There's also a helpful Microsoft page of all the dodgy sounding domains they've registered for this.)

jsmith99 · 2025-09-11T13:14:20 1757596460

7zip adds a context menu option for this.

jsmith99 · 2025-08-28T08:17:40 1756369060

UK company filings are already marked up with machine readable, structured, standardised tags. It's called XBRL. So it doesn't necessarily need a LLM to parse.

superproton · 2025-08-28T08:20:54 1756369254

Wrong, my friend, that is only for small companies, big companies accounts are only available in PDF. Plus, in any case you have the raw data, here you have a full financial analysis.

jsmith99 · 2025-08-28T08:55:51 1756371351

Large companies need to do it too, including all LSE listed. Here is a summary by the regulator https://www.frc.org.uk/library/digital-reporting/structured-...

But of course a LLM could theoretically automate much of the analysis stage.

jsmith99 · 2025-08-21T17:08:24 1755796104

MS Copilot is quite useful for meeting minutes and summaries etc. Still not nearly as useful as good handwritten notes but saves loads of time.

IgorPartola · 2025-08-21T18:20:40 1755800440

Sure useful. But we are talking making money useful, not just nice. Like will it create 20% more revenue? 1% more? Or is it just a nice to have?