Hacker Newsnew | past | comments | ask | show | jobs | submit | xd1936's commentslogin

https://annas-archive.li/robots.txt

https://annas-archive.li/llms.txt

robots.txt is a machine-parsed standard with defined syntax. llms.txt is a proposal for a more nebulous set of text instructions, in Markdown.

https://llmstxt.org/


This rules! What a fun idea.

I couldn't get the "Walk Here"/"Taxi Here" tooltip pop-up to stay open on Firefox when clicking on a train station, but it's working in Chrome.


Sorry about that. Should be fixed now

What alternative did you switch to?

I went with CachyOS. It's based off of Arch and compiles packages with flags for more modern CPUs. They also build extra packages that aren't normally provided by Arch (always nice to rely a little less on the AUR). And their default scheduler is supposed to provide a smoother gaming experience, though they have a selection of schedulers if you don't like that one.

I tried it but couldn't get it stable on my system. Ended up with EndeavorOS which is arch-based with a much better-install process.

I never tried Bazzite, but have been using Jovian, which is a NixOS-based gaming setup.

I have heard that CachyOS (Arch) and Nobara (Fedora) are two other decent options.



It ought to be illegal to buy ads against trademarks (+/- some reasonable edit distance).

Long ago, Google search used to be its own product. Now it's the URL bar for 91% of internet users. This is no longer fair.

Google gets to not only tax every brand, but turn every brand into a biding war.

International laws need to be written against this.

Searching for "Claude" brings up a ton of competition in the first spot, and Google gets to fleece Anthropic and OpenAI, yet get its own products featured for free.

Searching "{trademark} vs" (or similar) should be the only way to trigger ads against a trademark.


> It ought to be illegal to buy ads against trademarks (+/- some reasonable edit distance).

I get the intention here, but how do you limit the collateral damage? (Or do you not care about it / see reducing the ability to advertise as a positive?)

There are a lot of trademarks, and they have to be scoped to specific goods and services, but Google has no way of knowing if you're actually looking for something related to that trademark.

e.g. doing a quick trademark search, I see active, registered trademarks for "elevator", "tower", "collision", "cancer sucks", "steve's", "local", "best", "bus", "eco", "panel", "motherboard", "grass", etc. etc. I'm not familiar with any of those brands, but that's just a small sample of the fairly generic terms that would no longer be able to be advertised on.


Google has a way of knowing. They can ask for documentation on who their customers are and what markets they operate in, and do some due diligence. Just like they have ways of knowing whether the ads they run are for blatant scams.

I'm not saying Google doesn't know if a company is in a particular market, I'm saying that a) Google doesn't know what market I'm searching for something from and b) even if they know both from context, it puts them in some awkward positions.

e.g. Vice Media has a trademark on "motherboard" that covers the tech news blog website service.

Is it now impossible for Asus to place an ad for the official Asus motherboard blog on the search term "motherboard"?

Is it legal to advertise for "motherboard" for any good or service other than a tech news blog website?

Is it now illegal to advertise a website featuring in-depth motherboard reviews using the term "motherboard"?

If I search for "motherboard website", what is Google allowed to show me for ads, given they don't know if I'm looking for the Vice website, or motherboard reviews, or the Asus homepage?

If a plain search for "motherboard" results in Vice's website not being in the top results, is Vice allowed to advertise on their own trademark to put it above other results? (Either above organic results, or above paid results for motherboard manufacturers, depending on whether you're allowing the latter.)


> Is it legal to advertise for "motherboard" for any good or service other than a tech news blog website?

Roughly speaking (modulo dilution which doesn't seem like it'd apply here), that's my understanding of trademark law. So your questions are all basically trivially answered, and those things are fine. A human should be able to review such cases.


There should be no ads on the internet.

Yeah, and like, I commiserate with that view, I think it would make the internet/world a better place, but I don't think "no ads for trademarks" is helpful way to reach for that goal.

I don't agree. If I search for "leatherman" it seems totally reasonable to give competitors a chance. I generally think brand recognition is too powerful. If there is another high quality multitool on the market for a better price, why shouldn't I know about it?

Disclaimer: See my sibling comments for some my general thoughts on the problems with banning trademark ads.

But for your specific example - I get where you're coming from, but I'm skeptical that the ad market is even that functional.

Firstly, if I google "leatherman", every sponsored result for Leatherman brand multitools anyway. (And no amount of refreshes or re-searches gives me anything other than Leathermans.)

Secondarily, I'm not convinced that the set of advertisers (not counting Leatherman itself) that will advertise for "leatherman" are actually on average a better products for the consumer. (e.g. as opposed to lower-quality, higher-priced knockoffs.)


These are both fair points (generally, the consumer market is pretty dysfunctional and not behaving at all like economists would like it to), but the comment I was replying to ("It ought to be illegal to buy ads against trademarks") seems both too heavy-handed and unlikely to actually do any good.

I agree it's a bit perverse, but the problem predates Google. People do the real world equivalent all the time. When there are big conferences for specific companies, rivals buy up local ad space on billboards and subways.

That has caused some companies hosting conferences to pay for some of those ad spaces in advance.


Ads on billboards and subways actually bother me far more than search ads.

It's visual and cognitive pollution on public space that I've never consented to - I find it viscerally offensive.

We don't accept billboards on hiking trails, or in elementary classrooms, or in courtrooms (as far as I'm aware, though I wouldn't be surprised if someone turns up a real-life grotesque examples) - we shouldn't accept them in other public spaces either.


> It ought to be illegal to buy ads against trademark

this was one of the biggest problems of AdWords from beginning on: You could do brand-bidding unlimited, even today you see it every day: Search for brand X and competitor Y will show up with same words



I've also had the exact opposite experience with tone. Claude Code wants to build with me, and Codex wants to go off on its own for a while before returning with opinions.

Its likely that both are steering towards the middle from their current relative extremes and converging to nearly the same place.

also my experience in using these two models. they are trying to recover from oversteer perhaps.

well with the recent delays i can easily find claude code going off on it's own for 20 minutes and have no idea what it's going to come back with. but one time it overflowed it's context on a simple question, and then used up the rest of my session window. in a way a lot of ai assistants have ime have this awkward thing where they complicate something in a non-visible and think about it for a long time burning up context before coming up with a summary based upon some misconception.

The key is a well defined task with strong guardrails. You can add these to your agents file over time or you can probably just find someone's online to copy the basics from. Any time you find it doing something you didn't expect or don't like, add guardrails to prevent that in future. Claude hooks are also useful here, along with the hookify plugin to create them for you based on the current conversation.

I have started using openspec for this. I find it works far better to have a proposal and a list of tasks the ai stays more focused.

https://openspec.dev/


For complex tasks I ask ChatGPT or Grok to define context then I take it to Claude for accurate execution. I also created a complete pipeline to use locally and enrich with skills, agents, RAG, profiles. It is slower but very good. There is no magic, the richer the context window the more precise and contained the execution.

In terms of 'tone', I have been very impressed with Qwen-code-next over the last 2 days, especially as I have it running locally on a single modest 4090.

Did you set that up following a guide or anything you could share?

Easiest way I know is to just use LMStudio. Just download and press play :). Optional, but recommended, increase the context length to 262144 if you have the DRAM available. It will definitely get slower as your interaction prolongs, but (at least for me) still tolerable speed.

not OP, but I got it running on my 4090 (and RAM) by following this guide: https://unsloth.ai/docs/models/qwen3-coder-next

I see around 30 t/s


Same here, CC gives me options to pick direction after the planning stage.

I was trying to set up OpenClaw and broke it. My bad guys.


POST https://fjtwtlaryvoqohkwnbwd.supabase.co/functions/v1/transf... 402 (Payment Required)

Function error: FunctionsHttpError: Edge Function returned a non-2xx status code

:(


Fun! I wish WebTorrent had caught on more. I've always thought it had a worthy place in the modern P2P conversation.

In 2020, I messed around with a PoC for what hosting and distributing Linux distros could look like using WebTorrent[1]. The protocol project as a whole has a lovely and brilliant design but has stayed mostly stagnant in recent years. There are only a couple of WebRTC-enabled torrent trackers that have remained active and stable.

1. https://github.com/leoherzog/LinuxExchange


I think the issue has generally been that web torrent doesn't work enough like the real thing to do its job properly. There are huge bit torrent based streaming media networks out there, illicit, sure, but its a proven technology. If browsers had real torrent clients we would be having a very different conversation imo

I don't remember the web torrent issue numbers off the top of my head, but there are a number of long standing issues that seem blocked on webrtc limitations.


I think we still have the same blocker as we had back when WebTorrent first appeared; browsers cannot be real torrent clients and open connections without some initial routing for the discovery, and they cannot open bi-directional unordered connections between two browsers.

If we could say do peer discovery via Bluetooth, and open sockets directly from a browser page, we could in theory have local-first websites running in the browser, that does P2P connections straight between browsers.


Could you run some kind of hybrid DHT where part of it was Webrtc and part was plain HTTP(S) / WebSocket?

There are some nodes (desktop clients with UPNP, dedicated servers) that can accept browser connections. Those nodes could then help you exchange offers/answers to give you connections with the Webrtc-only ones, and those could facilitate offer/answer exchanges with their peers in turn.

It'd be dog-slow compared to the single-udp-packet-in, single-udp-packet-out philosophy of traditional mainline DHT, but I don't see why the idea couldn't work in principle.

I think a much bigger problem is content discovery and update distribution. You can't really do decentralized search because it'd very quickly get sybil-attacked to death. You'd always need some kind of centralized, trusted content index, but not necessarily one hosted on a centralized server. If you could have a reliable way to go from a pubkey to the latest hash signed by that pubkey in a decentralized way, + E.G. a Sqlite extension to get pages on-demand via WebTorrent, that would get you a long way towards solving the problem.


That was you ask exists; it updates through a version counter. It just works on mainline DHT btw.


If a tracker could be connected to via WebRTC and had additional STUN functionality, would that suffice? Are there additional WebRTC limitations?

> they cannot open bi-directional unordered connections between two browsers.

Last I checked, DataChannels were bidirectional


Yes, but it's STUN that sucks. If the software ships with a public (on the internet) relay/STUN server for connecting the two clients, it won't work if either aren't connected to the internet, even though the clients could still be on the same network and reach each other.


/? STUN: https://hn.algolia.com/?dateRange=all&page=0&prefix=false&qu...

There is a Native Sockets spec draft that only Chrome implements;

"Direct Sockets API": https://developer.chrome.com/docs/iwa/direct-sockets :

> The Direct Sockets API addresses this limitation by enabling Isolated Web Apps (IWAs) to establish direct TCP and UDP connections without a relay server. With IWAs, thanks to additional security measures—such as strict Content Security Policy (CSP) and cross-origin isolation— this API can be safely exposed.

Though there's UPNP XML, it lacks auth for port forwarding permissions. There's also IPV6.

Similar: "Breaking the QR Limit: The Discovery of a Serverless WebRTC Protocol – Magarcia" https://news.ycombinator.com/item?id=46829296 re: Quick Share, Wi-Fi Direct, Wi-Fi Aware, BLE Beacons, BSSIDs and the Geolocation API


That seems like a nonissue for the purposes of this discussion though, in terms of user uptake. Tiktok and Facebook and other websites aren't exactly focused on serving to people on the same network.


"If browsers had real torrent clients we would be having a very different conversation imo"

The elinks text-only browser has a "real" torrent client




Can't seem to find any mentions of this online from over a week ago, not much commentary either, mostly stuff that smells like advertising / astroturfing. Hmm...


huh?


I had never heard of this! Pretty cool!


Oh wow


Was there ever a web-based Jigdo?


This was a fun rabbit hole. Thanks for educating me!

And they reference that announcement and related information in the second line.


Which announcement are you looking at? I see no references to llama-cpp in either Ollama's blog post or this project's github page.


I've used em dashes for decades — They're great.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: