More

ssabev · 2025-05-21T20:49:03 1747860543

Came here to agree.

Initially I thought it's a bloody stupid idea, however at this stage I reckon we need it or a lot of boomers are going to be ones hotted into singing away all their wealth away.

Henchman21 · 2025-05-21T21:56:39 1747864599

Aaah, now you see the plan clearly.

ssabev · 2025-03-16T13:38:25 1742132305

love grok3? can't wait for official api?

now you can run an openai-compatible proxy for all of your "local model" needs.

currently using it as my custom provider in repoprompt

pkg includes: client, cli, and proxy

ssabev · on June 21, 2024

Would recommend just picking up a gateway that you can deploy and act as an OpenAI compatible endpoint.

We built something like this for ourselves here -> https://www.npmjs.com/package/@kluai/gateway?activeTab=readm....

Documentation is a bit sparse but TL;DR - deploy it in a cloudflare worker and now you can access about 15 providers (the one that matter - OpenAI, Cohere, Azure, Bedrock, Gemini, etc) all with the same API without any issues.

refulgentis · on June 21, 2024

Wow; this is really nice work, I wish you deep success.

refulgentis · on June 21, 2024

Coming back to write something more full-throated: Klu.ai is a rare thing in the LLM space, well-thought out, has the ancillary tools you need, is beautiful, and isn't a giveaway from a BigCo that is a privacy nightmare: ex. Cloudflare has some sort of halfway similar nonsense that, in all seriousness, logs all inputs/outputs.

I haven't tried it out in code, it's too late for me and I'm doing native apps, but I can tell you this is a significant step up in the space.

Even if you don't use multiple LLMs yet, and your integration is working swell right now, you will someday. These will be commodities, valuable commodities, but commodities. It's better to get ahead of it now.

Ex. If you were using GPT-4 2 months ago, you'd be disappointed by GPT-4o, and it'd be an obvious financial and quality decision to at least _try_ Claude 3.5 Sonnet.

It's a weird one. Benchmarks great. Not bad. Pretty damn good. But ex. It's now the only provider I have to worry about for RAG. Prompt says "don't add footnotes, pause at the end silently, and I will provide citations", and GPT-4o does nonsense like saying "I am now pausing silently for citations: markdown formatted divider"

ssabev · on Dec 16, 2023

The ELO leaderboard you mean?

ssabev · on Dec 16, 2023

Spoiler: it's fast, cheap, overly protective, and has Kafkaesque DX

ssabev · on Nov 10, 2023

We benchmarked retrieval, GPT-4 turbo vs GPT-4, and fine-tuned several models. You can use the result of one here https://huberman.klu.ai/

ssabev · on Sept 22, 2023

Hey folks, any of you tried with this stream=True? It works in the playground, but it appears that you cannot actually stream completions from this model via the python openai package.

Wondering if it's a weird PEBKAC or someone else has had the same experience?

ssabev · on Aug 22, 2023

Absolutely mad... Saw this and thought - what the hell. I have to say moving away is starting to feel really tempting

ssabev · on Aug 17, 2023

Absolutely agree here. We should be intolerant of intolerance in democratic societies and reciprocate with the more questionable counter parties from countries with .. shall we say different moral norms.

ssabev · on Aug 12, 2023

But I was told China was winning. De-dollarisation to a completely "free-floating" RMB was happening. The official numbers foretold it! /s