> The Postgres extension model to capture the metrics (we also experimented with eBPF, but it causes too many kernel-user space context switches when you can do the same in an extension without them), and a small sidecar to push the metrics out via a standardized protocol like OTEL.
The extension model is great, but it doesn't work with existing Postgres providers (RDS, Aurora, etc.), unless one such extension becomes standard enough that all providers support it. That would be ideal, IMO.
To be clear, I don't mean pg_stat_statements, that is standard enough, but an extension that pushes the actual queries in real-time.
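To illustrate the distinction: pg_stat_statements exposes aggregated statistics per normalized query, not a live stream of individual executions. A quick look at what it gives you (a sketch; column names are from the pg_stat_statements view in Postgres 13+):

```sql
-- Requires the pg_stat_statements extension to be loaded and created:
--   CREATE EXTENSION pg_stat_statements;
-- Shows per-query aggregates (counts and timings), with literals
-- normalized away -- there is no record of individual executions.
SELECT query, calls, mean_exec_time, total_exec_time
FROM pg_stat_statements
ORDER BY total_exec_time DESC
LIMIT 10;
```

An extension that pushes actual queries in real time would need to hook into the executor instead of exposing a cumulative view like this.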
> If it's a network hop, then adds milliseconds, and not microseconds.
Are you talking about connection establishment time or per-query delay? I think it should normally be under a millisecond for the latter.
> The extension model is great, but it doesn't work with existing postgres providers (RDS, Aurora, etc.). Unless one such extension becomes standard enough that all providers will support it. That would be ideal, IMO.
That's true, but that's a problem for the PGaaS providers to fix by providing the best functionality available. I'm planning on following this route on a pure OSS basis.
> Are you talking about connection establishment time or per-query delay? I think it should normally be under a millisecond for the latter.
Network trip. If the proxy is not co-located with the database but rather a network hop away, that's usually adding at least 1ms there, could be more.
Even then, though, it needs to run on the server, so it's hard to guarantee it won't impact performance and availability. There are many Postgres/MySQL proxies used for connection pooling and such, so at least we understand their impact pretty well (and it tends to be minimal).
Others have mentioned similar solutions but I’d like to add one: a database solution with CoW branching and PII anonymisation solves the db part in a safe way.
This is really cool and I love to see the interest in fast clones / branching here.
We've built Xata with this idea of using copy-on-write database branching for staging and testing setups, where you need to use testing data that's close to the real data. On top of just branching, we also do things like anonymization and scale-to-zero, so the dev branches are often really cheap. Check it out at https://xata.io/
> The source database can't have any active connections during cloning. This is a PostgreSQL limitation, not a filesystem one. For production use, this usually means you create a dedicated template database rather than cloning your live database directly.
This is a key limitation to be aware of. A way to workaround it could be to use pgstream (https://github.com/xataio/pgstream) to copy from the production database to a production replica. Pgstream can also do anonymization on the way, this is what we use at Xata.
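The template-database approach from the quote can be sketched like this (a minimal example; the database names are made up, and `CREATE DATABASE ... TEMPLATE` will fail if any other session is connected to the source):

```sql
-- Make sure nothing else is connected to the template database,
-- since Postgres refuses to clone a database with active connections.
SELECT pg_terminate_backend(pid)
FROM pg_stat_activity
WHERE datname = 'app_template'
  AND pid <> pg_backend_pid();

-- Clone it. On Postgres 15+ you can pick the copy strategy explicitly;
-- FILE_COPY copies at the file level rather than WAL-logging every block.
CREATE DATABASE app_dev_branch
  TEMPLATE app_template
  STRATEGY = FILE_COPY;
```

Keeping a dedicated, rarely-connected template database sidesteps the limitation while the live database keeps serving traffic.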
Lots of good improvements. My favorites are OAuth, NOT NULL constraints with NOT VALID, uuidv7, and RETURNING with access to old/new rows. And I think the async I/O will bring performance benefits, although maybe not so much immediately.
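Two of those can be tried directly in psql on Postgres 18 (a sketch; the `accounts` table here is made up for illustration):

```sql
-- uuidv7() produces time-ordered UUIDs, new in Postgres 18.
SELECT uuidv7();

-- RETURNING can now reference the old and new row versions.
CREATE TABLE accounts (id int PRIMARY KEY, balance numeric);
INSERT INTO accounts VALUES (1, 500);

UPDATE accounts
SET balance = balance - 100
WHERE id = 1
RETURNING old.balance AS balance_before,  -- 500
          new.balance AS balance_after;   -- 400
```

The old/new support in RETURNING is handy for audit-style logging without a trigger.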
Besides DDL changes, pgstream can also do on-the-fly data anonymization and data masking. It can stream to other stores, like Elasticsearch, but the current main focus is on PG to PG replication with DDL changes and anonymization.
We've been using it in Xata to power our `xata clone` functionality, which creates a "staging replica" on the Xata platform, with anonymized data that closely resembles production. Then one gets fast copy-on-write branching from this anonymized staging replica. This is great for creating dev branches and ephemeral environments.
One note, this shouldn't be confused with https://github.com/xataio/pgstream which does logical replication/CDC with DDL changes.