Hacker News | tsenart's comments

This was missing in the Go world.


Proprietary.


Author here, indeed a variation of bloom filters: https://x.com/lemire/status/1971279371131646063
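For reference, the classic Bloom filter that such variants build on looks roughly like this (a sketch, not the package's actual code): k bit positions are derived per key via double hashing, set on insert, and checked on lookup. False positives are possible; false negatives are not.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// Bloom is a minimal Bloom filter: m bits, k hash functions.
type Bloom struct {
	bits []uint64
	m, k uint64
}

func NewBloom(m, k uint64) *Bloom {
	return &Bloom{bits: make([]uint64, (m+63)/64), m: m, k: k}
}

// indexes derives k bit positions via double hashing: h_i = h1 + i*h2.
func (b *Bloom) indexes(key string) []uint64 {
	h := fnv.New64a()
	h.Write([]byte(key))
	h1 := h.Sum64()
	h.Write([]byte(key)) // feed the key again to get a second, distinct hash
	h2 := h.Sum64() | 1  // force odd so the stride covers the table
	idx := make([]uint64, b.k)
	for i := uint64(0); i < b.k; i++ {
		idx[i] = (h1 + i*h2) % b.m
	}
	return idx
}

func (b *Bloom) Add(key string) {
	for _, i := range b.indexes(key) {
		b.bits[i/64] |= 1 << (i % 64)
	}
}

func (b *Bloom) MayContain(key string) bool {
	for _, i := range b.indexes(key) {
		if b.bits[i/64]&(1<<(i%64)) == 0 {
			return false
		}
	}
	return true
}

func main() {
	f := NewBloom(1024, 3)
	f.Add("hello")
	fmt.Println(f.MayContain("hello")) // true
}
```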


Ok. I have blocked X at the router level here since Elon went certifiable so I can't read that link but I will happily take your word for it.


It's funny how this comment chain is about how names stick to ideas in somewhat arbitrary ways, and you are using "Elon" to explain a personal policy for information grooming.


I think 'don't give your data to assholes' is a pretty good policy, regardless of whether it is personal or business.


Yes! A typical use case is to efficiently implement ORDER BY LIMIT N in SQL databases in a way that doesn’t require sorting the entire column just to get those first N items.


I assume this Go code runs in the client, since Postgres doesn't support Go server-side. Why would client-side ordering be faster than doing it in the database?


This is to implement a database, not use one.


Author here! Will do eventually.


Do share your findings!


Author here. Agree 100%! It's often what didn't work that is omitted. But there's so much juice in failed experiments — it's important to share with others.


Our Go ULID package has millisecond precision + monotonic random bytes for disambiguation while preserving ordering within the same millisecond. https://github.com/oklog/ulid


This, please! Native support for read replicas would be awesome. Ideally it would know whether a query is read-only without application changes.


For a variety of reasons this is incredibly difficult. Functions and the like can make SELECT queries perform writes; it's not just UPDATE/DELETE.

It's a lot easier for your application to know what a write is and just establish connections to 2 separate poolers (or hosts on the same poolers) and direct the reads/writes appropriately.
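The application-side approach can be sketched like this (pool names are hypothetical, and the SQL classifier is deliberately naive: real code should route based on what the code path does, not on parsing SQL, precisely because SELECTs can perform writes via functions):

```go
package main

import (
	"fmt"
	"strings"
)

// isWrite is a naive statement classifier based on the leading keyword.
// It exists only to illustrate routing; it would misclassify e.g.
// "SELECT write_audit_log()" as a read.
func isWrite(query string) bool {
	q := strings.ToUpper(strings.TrimSpace(query))
	for _, kw := range []string{"INSERT", "UPDATE", "DELETE", "CREATE", "ALTER", "DROP"} {
		if strings.HasPrefix(q, kw) {
			return true
		}
	}
	return false
}

// route picks one of two pools, e.g. two *sql.DB handles in a real app:
// one connected to the primary, one to a read replica.
func route(query string) string {
	if isWrite(query) {
		return "primary-pool"
	}
	return "replica-pool"
}

func main() {
	fmt.Println(route("SELECT * FROM users"))        // replica-pool
	fmt.Println(route("UPDATE users SET name = $1")) // primary-pool
}
```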


There's already a working part of the libpq protocol for this: target_session_attrs. But the problem with target_session_attrs is that it just takes too long to discover the new primary after failover. We want to fix this within Odyssey.
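For reference, target_session_attrs goes in the libpq connection string: with a multi-host URL the client tries each listed host until it finds one in the requested role (host names here are placeholders):

```
# connect to whichever listed host is currently the primary
postgres://db1.example.com:5432,db2.example.com:5432/mydb?target_session_attrs=read-write

# prefer a standby for read-only work (values like prefer-standby require PostgreSQL 14+)
postgres://db1.example.com:5432,db2.example.com:5432/mydb?target_session_attrs=prefer-standby
```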


How does it compare to https://github.com/tdunning/t-digest?


Author here! Some benchmarks on insertion:

---

BenchmarkMetrics/Add/streadway/quantile-8          5000000   358 ns/op
BenchmarkMetrics/Add/bmizerany/perks/quantile-8    5000000   291 ns/op
BenchmarkMetrics/Add/dgrisky/go-gk-8               5000000   363 ns/op
BenchmarkMetrics/Add/influxdata/tdigest-8          5000000   250 ns/op
BenchmarkMetrics/Add/axiom/quantiles-8            10000000   208 ns/op

---

I think it's the fastest for insertion.

Querying needs finalization of state; after that it's pretty fast, but I'll comment once I can get the API into a friendlier state :D


Aren't the goals of t-digest a little bit different?

T-digest seeks to have a bounded size and an error proportional to q*(1-q), hence it gives up quantile accuracy in the middle of the distribution under load. This algorithm seems to provide a total bounded error with a small but unbounded size.


Could you elaborate on the differences a bit deeper? I’m really interested in understanding.


http://web.cs.ucla.edu/~weiwang/paper/SSDBM07_2.pdf is the paper it's mostly based on. Figure 1 actually describes how big the data structure can get: it keeps getting bigger the more data you feed it.

