That’s interesting, particularly since, as far as I can tell, nothing in userland really bothers to make use of its GPU. I’d really like to understand why, since I have a whole bunch of Pis and it seems like their GPUs can’t be used for much of anything (not really much for transcoding nor for AI).
> their GPUs can’t be used for much of anything (not really much for transcoding nor for AI)
It's both funny and sad to me that we're at the point where someone would (perhaps even reasonably) describe using the GPU only for the "G" in its name as not "much of anything".
The Raspberry Pi GPU has one of the better open source GPU drivers as far as SBCs go. It's limited in performance, but it's definitely being used for rendering.
There is a Vulkan API, and they can run some compute; at least the 4 and 5 can: https://github.com/jdonald/vulkan-compute-rpi. No idea if it's worth the bus latency, though. I'd love to know the answer to that.
I'd also love to see the same done on the Zero 2, where the CPU is far less beefy and the trade-off might go a different way. It's an older generation of GPU, though, so the same code won't work.
One (obscure) example I know of is the RTLSDR-Airband[1] project, which uses the GPU to do FFT computation on older, less powerful Pis, through the GPU_FFT library[2].
Seems like it would be an easy target for the government (or really anyone) to DoS, right? Presumably there's no good way for the nation-wide intranet to exclude government actors? I'm just thinking out loud; I'm glad to hear something is being done and I wish the Iranian people the best.
LoRa is fine if you want to send a very short message. It's not useful for much else.
It's also not a prevalent technology compared to the general internet or mobile phones.
Organising resistance with it is the pipe dream of those who play with chips and antennas, but it's not something that's going to happen when crowds and mobs form up in a situation like this. Not least because the hardware is not accessible to your average citizen.
There are real-world examples of non-internet networks being created in authoritarian regimes. One example I've read about is in Cuba [1], but I presume there are others.
Yeah, that makes sense. I'm curious whether there are sneakernet schemes for passing messages between nearby mobile devices? Something that uses existing hardware and is actually used in practice.
I've got to think it's easy to find Starlink receivers--I know they use a directed beam but they must give off a bunch of lateral noise, right? Or does Starlink use the same frequency bands as other common equipment, such that it would be difficult to distinguish Starlink signals from others? If the government was motivated, they could surely start finding these receivers, right?
> What does it mean that only 3B parameters are active at a time?
In a nutshell: LLMs generate tokens one at a time. "Only 3B parameters active at a time" means that for each of those tokens only 3B parameters need to be fetched from memory, instead of all of them (30B).
Then I don't understand why it would matter. Or does it really mean that for each input token only 10% of the total network runs, and then a different 10% for the next token, rather than running all ten 10% chunks for every token? If so, any idea or pointer to how the selection works?
Yes, for each token only, say, 10% of the weights are necessary, so you don't have to fetch the remaining 90% from memory, which makes inference much faster (if you're memory bound; if you're doing single batch inference then you're certainly memory bound).
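To put rough numbers on that (back-of-the-envelope only; the fp16 weights and ~100 GB/s bandwidth figure below are assumptions for illustration, not from this thread):

```python
# Single-batch, memory-bound estimate: token rate is limited by how many
# bytes of weights must be streamed from memory per generated token.
BYTES_PER_PARAM = 2          # assume fp16 weights
BANDWIDTH = 100e9            # assume ~100 GB/s of memory bandwidth

dense_bytes = 30e9 * BYTES_PER_PARAM  # dense model: all 30B params per token
moe_bytes = 3e9 * BYTES_PER_PARAM     # MoE: only ~3B "active" params per token

print(BANDWIDTH / dense_bytes)  # ~1.7 tokens/s
print(BANDWIDTH / moe_bytes)    # ~17 tokens/s
```

Same arithmetic either way, just a 10x difference in how much weight data each token has to pull through the memory bus.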
As to how the selection works - each mixture-of-experts layer in the network has essentially a small subnetwork called a "router" which looks at the input and calculates scores for each expert; then the best-scoring experts are picked and the inputs are only routed to them.
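A minimal sketch of that routing step (NumPy, with made-up sizes; the top-2 choice and dimensions are illustrative, not taken from any particular model):

```python
import numpy as np

def moe_layer(x, router_w, experts, top_k=2):
    """x: (d_model,) token activation; router_w: (n_experts, d_model);
    experts: list of callables, each a small feed-forward net."""
    scores = router_w @ x                 # one score per expert
    probs = np.exp(scores - scores.max())
    probs /= probs.sum()                  # softmax over experts
    chosen = np.argsort(probs)[-top_k:]   # keep only the best-scoring experts
    # Only the chosen experts' weights need to be fetched for this token;
    # the rest of the layer's parameters are never touched.
    return sum(probs[i] * experts[i](x) for i in chosen)

# Toy usage: 8 experts, 2 of them active per token.
d = 16
experts = [(lambda W: (lambda x: W @ x))(np.random.randn(d, d)) for _ in range(8)]
out = moe_layer(np.random.randn(d), np.random.randn(8, d), experts)
```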
I asked Gemini about it the other day (I'm dumb and shameless). Apparently it means that the model branches into a bunch of 3B sections in the middle and joins at both ends, totaling 30B parameters. So the computational footprint drops to (bottom "router" parts + 3B + top parts), effectively ~5B or whatever that specific model's "3B" implies, rather than the full 30B.
MoE models still operate on a token-by-token basis, i.e. "pot/at/o" -> "12345/7654/8472". "Experts" are selected on a per-token basis, not per-iteration, so the "expert" naming might be a bit of a misnomer, or marketing.
I’m also curious whether an AI could process the screen feed quickly enough to compete in first-person shooter games. Seems like it would be difficult without extremely high-end hardware for the foreseeable future?
Thank you for educating me. How does OpenCV work from the perspective of recognizing things in an image? Is there some kind of underlying model there that learns what a target looks like or not?
There are already models specifically for things like identifying players in Counter-Strike 2, including which team they're on.
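For a rough, hedged idea of what that looks like mechanically: OpenCV's dnn module can run a pretrained ONNX detector over a captured frame. The file names, input size, and output-parsing note below are placeholders; the real details depend entirely on which detector you use:

```python
import cv2

# Placeholder model: any ONNX object detector exported for OpenCV's dnn module.
net = cv2.dnn.readNetFromONNX("players.onnx")

frame = cv2.imread("screenshot.png")   # or a frame grabbed from screen capture
blob = cv2.dnn.blobFromImage(frame, scalefactor=1/255.0, size=(640, 640), swapRB=True)
net.setInput(blob)
outputs = net.forward()                # raw detections; layout is model-specific

# Post-processing depends on the detector: for a YOLO-style head you'd filter
# rows by confidence, apply non-max suppression, and scale boxes back to the
# original frame size.
print(outputs.shape)
```

So the "model" part the parent asks about is the trained network weights; OpenCV itself is mostly the plumbing (capture, preprocessing, and running the net).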
Someone has even rigged a setup like that to a TENS unit, stimulating the nerves in their arm and hand to move the mouse in the correct direction and fire when the crosshair is over the enemy.
How does this work? Do you give the AI read permissions on your system, or is it just running arbitrary commands? In the latter case, is it prompting you before each one?
Relatedly, I've been seeing some people buying up old domains and squatting on them with AI-generated content. Not even ads, but content that seems like something that might actually show up in a rare Google search query. Not really sure what the play is, or why this is better than advertising the domain for sale (do registrars punish overt squatting these days?).
Iran has protests all the time, as do lots of other countries _including the US_, France, and others, and they don't herald US-led regime change. But if you have evidence for your claim that the US and Israel are planning a war in Iran, that would be major international news.
Smaller than containers seems unlikely since a container doesn't have any kernel at all, while these microvms have to reproduce at least the amount of kernel they would otherwise need (e.g., a networking stack). I'm sure some will be inclined to compare an optimized microvm to an application binary slapped into an Ubuntu container image, but that's obviously apples/oranges.
Faster might be possible without the context switching between kernel and app? And maybe additional opportunities for the compiler to optimize the entire thing (e.g., LTO)?