Where database blog posts get flame-broiled to perfection
Ah, wonderful. A history lesson on the "evolution" of Redis. Just what I needed while triaging PagerDuty alerts for a cluster that decided its primary node was merely a suggestion. Thank you, Peter, for reminding me of the simple days of 2009, back when a key-value store was just a key-value store, before it got a liberal arts degree and decided it wanted to be a "multi-model platform."
You say "evolution" like itās a beautiful, natural process. From my trench, it looks more like a series of hastily-grafted limbs that twitch unpredictably. We went from a tool that did one thing exceptionally wellābeing a lightning-fast cacheāto something that also wants to do vector searches. Oh, terrific. Now my cache, the thing that's supposed to reduce database load, is running its own little AI film festival. What could possibly go wrong with letting the cache do CPU-intensive math? I'm sure there are no performance implications there at all.
My favorite part of this "evolution" is always the migration path. It's sold to us in a slick PowerPoint as a "seamless, zero-downtime" experience. Sure it is. Just like my last "painless" root canal. It always starts with a simple directive from architecture:
"We need to leverage the new Geo-search-vector-JSON-timeseries capabilities. The vendor says it's just a minor version bump. Alex's team can handle it over the weekend."
That "minor version bump" will inevitably involve a completely new replication protocol, a configuration file format thatās now YAML instead of .conf for synergy, and one critical flag thatās been deprecated with no clear replacement. The documentation for this will be a single, unanswered Stack Overflow question from 2018.
So we'll schedule the change for 2 AM on a Saturday. Labor Day weekend, probably. The first five nodes will upgrade perfectly. The sixth will hang. The entire cluster will fail to elect a new master because the new and old nodes are communicating in what I can only assume is panicked screaming. And just like that, "zero-downtime" becomes all the downtime. The application will start throwing errors, and my on-call engineer will page me with the five words that strike fear into the heart of every Ops person: "Hey... you seeing this, too?"
And how are we supposed to watch this glorious new beast? Oh, don't you worry. There's a "brand-new observability plugin." Which is another way of saying, "we remembered you need to monitor this at the last possible second, so here's a half-baked Grafana dashboard that only shows you when the CPU is on fire, not why." All our existing alerting, our painstakingly crafted runbooks? Obsolete. The new metrics have names like vector_index_ion_flux_capacitor_utilization and absolutely no one knows what a "normal" value is. The vendor dashboard will show a big, happy green checkmark while the website is serving 503 errors to every customer.
I have a graveyard on my laptop lid for projects like this. It's a collection of vendor stickers. I've got one for RethinkDB, a lovely one from Couchbase's first failed rebranding, and a holographic sticker from that one that promised "infinitely scalable ACID transactions across the multiverse." They went out of business six months after their IPO. This Redis "platform" is earning its spot right between them.
So please, keep writing these articles about the four distinct eras. It's cute. My eras are more like:
But no, really, this is a great retrospective. It's inspiring to see how far the project has come. Keep evolving. We'll be here... with the runbooks, a pot of stale coffee, and a deep, world-weary sigh. Don't worry about us. We're used to it.