🔥 The DB Grill 🔥

Where database blog posts get flame-broiled to perfection

Integrating Citus with Patroni: Sharding and High Availability Together
Originally from percona.com/blog/feed/
August 4, 2025 • Roasted by Alex "Downtime" Rodriguez

Alright, so the latest hotness is "Citus, a robust PostgreSQL extension that aids in scaling data distribution and provides a solid sharding mechanism." Pause for effect. A deep, tired sigh. Oh, bless your heart, you sweet summer child. You think an extension is going to save us from the inherent complexities of distributed systems? I've got a drawer full of vendor stickers from "robust" and "solid" database solutions that are now gathering dust right next to my Beanie Babies collection – remember those? Thought they were the future too.

"Scaling a single-host PostgreSQL," they say. That's like putting a spoiler on a bicycle and calling it a race car. You're still starting with a bicycle, and you're just adding more points of failure and configuration overhead. And "enriches features like distributed tables, reference tables, columnar storage, schema-based sharding, etc." Yeah, "etc." is right. That "etc." is where my 3 AM phone calls live.

Let's break down this masterpiece of marketing jargon, shall we?

And don't even get me started on the monitoring. You know how this goes. The dashboards will be green. Glorious, vibrant green. Meanwhile, half your users are getting 500 errors because one specific shard, serving one specific customer, is silently melting down due to a SELECT * without limits. The "initial setup part" is always easy. It's the "day 2 operations" that send you spiraling into the existential void. It's the "how do I find a rogue transaction that's locking up a distributed query across 12 nodes when the application logs are useless?" It's the "oh, the extension itself has a memory leak on the coordinator node."
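And for the record, when that 3 AM call comes, the first thing I reach for on the coordinator is something like the sketch below. This assumes a reasonably recent Citus, where the coordinator ships the `citus_lock_waits` and `citus_stat_activity` views; exact column names have shifted between Citus versions, so treat this as a starting point, not gospel:

```sql
-- On the coordinator: Citus's cluster-wide view of which distributed
-- transactions are blocked and which ones are doing the blocking.
-- (Available in recent Citus versions; column names vary by version.)
SELECT *
FROM citus_lock_waits;

-- Then hunt the offending backend across all workers without
-- ssh-ing into 12 nodes one by one:
SELECT *
FROM citus_stat_activity
WHERE state <> 'idle'
ORDER BY query_start;
```

If your Citus is old enough that those views don't exist, you're back to querying pg_stat_activity and pg_locks on every worker by hand – which is exactly the "day 2" misery I'm talking about.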

So, here's my prediction: Sometime around 3 AM on the Saturday of a long holiday weekend – probably Memorial Day, because that's when the universe likes to mock me – someone will push a seemingly innocuous change. It'll cause a data rebalance that deadlocks half the nodes, because an indexing operation on one shard clashes with a write on another, or a create_distributed_table call throws an unexpected error. Or perhaps the "robust" extension will decide it needs to re-index all the distributed tables due to a minor version upgrade, locking everything up for hours. My phone will ring, I'll stumble out of bed, past my collection of "Cassandra is Web-Scale!" and "MongoDB is Document-Oriented!" stickers, and I'll spend the next eight hours trying to piece together why your "solid sharding mechanism" became a pile of broken shards. And when I'm done, I'll just be adding another vendor's sticker to the "Lessons Learned the Hard Way" collection. But hey, at least you got to write a blog post about it.