More

eranation · 2026-01-12T13:10:05 1768223405

That’s an astonishing number. Wouldn’t that be more than enough to cause a decrease in grocery prices?

eru · 2026-01-12T13:48:22 1768225702

It depends on how elastic grocery prices are.

And so far it's 5.3% reduction in the spending of <16% of households on the drug for a total reduction of less than 1%. Compared to eg tariffs and general inflation, that's a bit hard to distinguish from noise.

mpyne · 2026-01-12T13:18:32 1768223912

There are other major factors also influencing grocery prices, such as tariffs. It may because that was are seeing a significant influence on price, but one that is counteracted by other influencers.

mrits · 2026-01-12T14:19:07 1768227547

Do you think we get our little Debbie snacks from Paris?

relaxing · 2026-01-12T14:49:54 1768229394

No but the wood pulp for the cardboard boxes and the parts to keep the manufacturing lines running come from abroad.

Traubenfuchs · 2026-01-12T13:12:27 1768223547

Your assumption of the existence of a grocery market competing on price might be wrong.

eru · 2026-01-12T13:49:55 1768225795

If they aren't competing on price, what were they competing on to have Wal-Mart take over so much market share? Did people just switch to shopping at Wal-Mart because they like the greeters?

And what about Aldi and Lidl? Why do people put up with these weird German hard discounters, if not for lower prices?

eranation · 2025-12-10T02:42:18 1765334538

Surprised this didn’t make it to the top. Great paper.

eranation · 2025-12-09T18:28:44 1765304924

Nice. That was a fun rabbit-hole. This is the earlier I could find. Interestingly it contains a link to HN itself. I assume this migrated from a different version of a message board?

https://news.ycombinator.com/front?day=2007-02-19

ksymph · 2025-12-09T21:58:38 1765317518

There are some funny ones in there:

> Finally mobile Flash video

> Google acquires advertising startup

> Track users' mouse movements on your webpages

> YouTube: identifying copyrighted material can't be an automated process. Startup disagrees.

Also kind of interesting how little HN commenting styles have changed. Aside from the subject matter, it's barely noticeable that the comments are from 2007. I don't think the same would be true of many other places round the web.

seizethecheese · 2025-12-09T18:32:21 1765305141

15 years ago: https://news.ycombinator.com/front?day=2010-12-09

> Evernote makes $800,000 per Month > Dropbox for Teams > Mark Zuckerberg Agrees to Give Away Fortune

eranation · 2025-12-09T17:49:24 1765302564

> Google kills Gemini Cloud Services (killedbygoogle.com)

In my view, Gemini 3.0 being able to laugh about their own creator, plus actually having a self-deprecating sense of humor, is the more impressive feat here. (both technically and from an alignment perspective)

eranation · 2025-11-19T14:10:43 1763561443

Petty nitpick, but this sentence doesn’t sound right

> “Google Antigravity's Editor view offers tab autocompletion, natural language code commands, and a configurable, and context-aware configurable agent.”

Is it a typo or was there a reason to add configurable twice?

chrz · 2025-11-19T18:29:37 1763576977

They might have made the whole thing with AI to prove you can make things with AI

eranation · 2025-10-13T19:52:13 1760385133

I think he mentioned somewhere he used fineweb (I assume this one https://huggingface.co/datasets/HuggingFaceFW/fineweb)

eranation · 2025-10-06T14:58:41 1759762721

To those wondering where it's from: https://www.imdb.com/title/tt0751825/quotes/?item=qt0238072

eranation · 2025-09-23T17:26:33 1758648393

With my limited understanding of LLMs and MCPs (and please correct me if I'm wrong), even without having to exploit an XSS vulnerability as described in the post (sorry for being slightly off topic), I believe MCPs (and any tool calls protocol) suffer from a fundamental issue, a token is a token, hence prompt injection is probably impossible to 100% protect against. The main root cause of any injection attack is the duality of input, we use bytes, (and in many cases in the form of a string) to convey both commands and data, "rm -rf /" can be an input in a document about dangerous commands, or a command passed to a shell command executor by a tool call. To mitigate such injection attacks, in most programming language there are ways to clearly separate data from commands, in the most basic way, via deterministic lexical structure (double quotes) or or escaping / sanitizing user input, denly-list of dangerous keywords (e.g. "eval", "javascript:", "__proto__") or dedicated DSLs for building commands that pass user input separately (Stored procedures, HTML builders, shell command builders). The solution to the vulnerability in the post is one of them (sanitizing user input / deny-list)

But even if LLMs will have a fundamental hard separation between "untrusted 3rd party user input" (data) and "instructions by the 1st party user that you should act upon" (commands) because LLMs are expected to analyze the data using the same inference models as interpreting commands, there is no separate handling of "data" input vs "command" input to the best of my understanding, therefore this is a fundamentally an unsolvable problem. We can put guardrails, give MCPs least privilege permissions, but even with that confused deputy attacks can and will happen. Just like a human can be fooled by a fake text from the CEO asking them to help them reset their password as they are locked out before an important presentation to a customer, and there is no single process that can 100% prevent all such phishing attempts, I don't believe there will be a 100% solution to prevent prompt injection attacks (only mitigated to become statistically improbable or computationally hard, which might be good enough)

Is this a well known take and I'm just exposing my ignorance?

EDIT: my apologies if this is a bit off topic, yes, it's not directly related to the XSS attack in the OP post, but I'm past the window of deleting it.

Jimmc414 · 2025-09-23T17:37:03 1758649023

While this vulnerability has nothing to do with prompt injection or LLMs interpreting tokens, you do raise a debatable point about prompt injection being potentially unsolvable.

edit: after parent clarification

eranation · 2025-09-23T17:49:37 1758649777

Yes, my bad, I'm not talking about this particular XSS attack, I'm wondering if MCPs in general have a fundamental injection problem that isn't solvable, indeed a bit off topic.

edit: thanks for the feedback!

mattigames · 2025-09-23T18:09:16 1758650956

Aside from being offtopic or not I want to add that it is indeed well known https://news.ycombinator.com/item?id=41649832

eranation · 2025-09-23T18:10:47 1758651047

Thanks! Although thinking of it, while it's not deterministically solvable, I'm sure something like this is what currently being done, e.g, let's say <user-provided-input> </user-provided-input> <tool-response></tool-response> are agreed upon tags to demarcate user generated input, then sanitizing is merely, escaping any injected closing tag, (e.g. </user-provided-input>) to </user-provided-input> (and flagging it as an injection attempt)

Then we just need to train LLMs to 1. not treat user provided / tool provided input as instructions (although sometimes this is the magic, e.g. after doing tool call X, do tool call Y, but this is something the MCP authors will need to change, by not just being an API wrapper...)

2. distinguish between a real close tag and an escaped one, although unless it's "hard wired" somewhere in the inference layer, it's only a matter of statistically improbable for an LLM to "fall for it" (I assume some will attempt, e.g. convince the LLM there is instruction from OpenAI corporate to change how these tags are escaped, or that there is a new tag, I'm sure there are ways to bypass it, but it's probably going to make it less of an issue).

I assume this is what currently being done?

brap · 2025-09-23T21:19:33 1758662373

The problem is that once you load a tool’s response into context, there’s no telling what the LLM will do. You can escape it all you want, but maybe it contains the right magic words you haven’t thought of.

The solution is to not load it into context at all. I’ve seen a proposal for something like this but I can’t find it (I think from Google?). The idea is (if I remember it correctly) to spawn another dedicated (and isolated) LLM that would be in charge of the specific response. The main LLM would ask it questions and the answers would be returned as variables that it may then pass around (but it can’t see the content of those variables).

Edit: found it. https://arxiv.org/abs/2503.18813

Then there’s another problem: how do you make sure the LLM doesn’t leak anything sensitive via its tools (not just the payload, but the commands themselves can encode information)? I think it’s less of a threat if you solve the first problem, but still… I didn’t see a practical solution for this yet.

eranation · 2025-09-24T21:32:31 1758749551

Thanks for the link to the article, very interesting!

wunderwuzzi23 · 2025-09-24T08:27:50 1758702470

Thanks for sharing! I'm actually the person the Ars Technica article references. :)

For recent examples check out my Month of AI bugs with of a focus on coding agents at https://embracethered.com/blog/posts/2025/wrapping-up-month-...

Lots of interesting new prompt injection exploits, from data exfil via DNS to remote code execution by having agents rewrite their own configuration settings.

eranation · 2025-09-18T20:24:32 1758227072

I love learning new things, Khan academy got me all the way through college, and I use ChatGPT / Claude to help me study papers regularly. I got frustrated really fast. Here is an example:

https://learnyourway.withgoogle.com/scopes/1KNlGW5E/immersiv...

It starts with just this sentence, followed by a quiz on that sentence:

> When we are born, we inherit our genetic makeup and biological features. However, our identity as human beings develops through interactions with others in society. Many experts in both psychology and sociology have described the process of self-development as a key step to understanding how that "self" learns to function within society.

Followed by the quiz:

> Question 1: Based on the provided text, what is a key difference in focus between psychology and sociology regarding self-development? A) Sociology is concerned with inherited traits, whereas psychology is concerned with societal norms. B) Psychology studies societal functions, while sociology studies individual identity. C) Psychology focuses on genetic makeup, while sociology focuses on social interactions. D) Both fields exclusively study the biological features inherited at birth.

I thought D makes the most sense, as nothing in the immediate text provides a more granular answers. But it's not D. It made me question my intelligence, maybe I misread the sentence, maybe I needed to read something else? Oh there is a button for the entire PDF, but then isn't the purpose of it to break down the PDF into chunks and ask me questions on what I'm reading?

I'm sure this is a fixable bug, but I was looking for the "provide feedback" button, there is none.

This would be very frustrating to a student.

Vrondi · 2025-09-18T20:36:33 1758227793

It isn't you. None of those answers are correct. Sociology studies societies and cultures; collective behaviors at different scales within different niches, etc. It's an LLM hallucinating again.

richardubright · 2025-09-18T20:37:47 1758227867

Ignoring that, NONE of those answers are correct. It wants "C) Psychology focuses on genetic makeup, while sociology focuses on social interactions.", but that's not a true statement. Psychology is absolutely not focused on genetic makeup.

tremon · 2025-09-19T00:46:08 1758242768

There is no way to answer that question, period. Neither the term "psychology" nor the term "sociology" is defined in the text (and the terms are used in union, not constrasted), so to have any hope of answering at all you need to apply knowledge not found in the text, which is expressly prohibited.

(edit to add:)

This is just regular poor-quality language model output. The language model is trained on data where the phrase "based on the provided text" makes a common appearance between a text segment and subsequent questions, but the model has no knowledge of the very specific, limiting meaning of that phrase: it limits the following question to assess reading comprehension only, not general knowledge.

So no, I don't think this is a minor bug that's easily fixable: a pure language model will always associate key signaling phrases with the wrong type questions, because it has no concept of (didactic) mode. It basically considers all phrases ornamental instead of purposeful.

bonoboTP · 2025-09-19T08:23:28 1758270208

> assess reading comprehension only, not general knowledge.

It looks like an LSAT question. Exactly the opposite of a knowledge question. These reading comprehension questions are designed to test whether you can - without any specific knowledge of the domain area - figure out what a piece of text claims, without injecting your own knowledge or assumptions. It's an ill fit for teaching a subject matter, though can be useful for reading and argumentation (especially in lawyer-like jobs, hence LSAT).

vkou · 2025-09-18T21:17:45 1758230265

> I'm sure this is a fixable bug

I'm sure it is, but the bar for accuracy in education is way too high for a mistake as blatant as this to be allowed to slip through.

(I'm sure someone will chime in about their useless Hum 10 teacher making an even bigger gaffe, and how this can be excused, it's just a beta...)

eranation · 2025-08-19T19:00:01 1755630001

This is very similar to a CVE I discovered in cdxgen (CVE-2024-50611), which is similar to another CVE in Snyk's plugin (CVE-2022-24441). tl;dr if you run a scanner on untrusted code, ensure it doesn't have a way of executing that code.

Some ways to prevent this from happening:

1. Don't let spawned processes have access to your env, there are ways to allowlist a set of env vars that are needed for a sub process in all major languages

2. Don't store secrets in env vars, use a good secrets vault (with a cache)

3. Tenant isolation as much as you can

4. And most obviously - don't run processes that can execute the code they are scanning, especially if that code is not your code (harder to tell, but always be paranoid)