Daily Database Roasts

Covering index for $group/$sum in MongoDB aggregation (with hint)

Originally from dev.to/feed/franckpachot

November 1, 2025 • Roasted by Sarah "Burnout" Chen Read Original Article

Alright, team, gather ‘round for the latest gospel from the Church of Next-Gen Data Solutions. I’ve just finished reading this... inspiring piece on how to make our lives easier with MongoDB, and my eye has developed a permanent twitch. They’ve discovered a revolutionary new technique called “telling the database how to do its job.” I’m filled with the kind of joy one only feels at 3 AM while watching a data migration fail for the fifth time.

Here are just a few of my favorite takeaways from this blueprint for our next inevitable weekend-long incident.

First, we have the majestic know-it-all query planner that, after you painstakingly create the perfect index, decides to ignore it completely. It’s like paving a new six-lane highway and watching the GPS route all the traffic down a dirt path instead. But don’t worry, it’s not a bug, it’s a feature! We get the privilege of manually intervening with a hint. Because what every developer loves more than writing business logic is littering their code with brittle, database-specific directives that will absolutely, positively never be forgotten or become obsolete during the next “painless” upgrade.
I’m also thrilled by the concept of Covering Indexes, the database equivalent of putting a sticky note over a warning light on your car's dashboard. The solution to slow queries caused by fetching massive documents is… don’t fetch the massive documents! Groundbreaking. This is sold as a clever optimization, but it feels more like an admission that your data model is a monster you can no longer control. So now, instead of one source of truth, we have two: the actual document and the shadow-world of indexes we have to carefully curate, lest we summon the COLLSCAN demon.
Let’s talk about the solution to our willfully ignorant query planner: the hint. This is not a tool; it’s a promise of future suffering. I can see it now. Six months from today, a fresh-faced junior engineer, full of hope and a desire to “clean up the code,” will see { hint: { groupme: 1 } } and think, “What’s this magic comment doing here?” They’ll delete it. And at 2:17 AM on a Saturday, my phone will scream, and I’ll be staring at a PagerDuty alert telling me the main aggregation pipeline is timing out, all because we’re building our core performance on what is essentially a glorified code comment.

The most important factor is ensuring the index covers the fields used by the $group stage... you typically need to use a hint to force their use, even when there is no filter or sort.

Of course. It’s so simple. We just have to manually ensure every index for every aggregation query is perfectly crafted and then manually force the database to use it. This is not engineering; this is database whispering. It’s a dark art. This article is less of a technical guide and more of a page from a grimoire on how to appease angry machine spirits.
And the grand finale: we learn that under memory pressure—a totally hypothetical scenario that never happens in a real startup—the actual order of the keys in your index suddenly matters. So the thing that didn’t matter a second ago is now the only thing that matters when the server is on fire. Fantastic. We’ve replaced a predictable problem (“this query is slow”) with a much more exciting, context-dependent one (“this query is fast, except on Tuesdays during a full moon when the cache is cold and Jenkins is running a build”).

So, yes, I am thrilled to implement this. We’ll spend the next sprint sprinkling hints throughout the codebase like salt on a cursed battlefield. It will all work beautifully until the day our traffic doubles, every aggregation starts spilling to disk, and we realize the magical index order we chose is wrong. I’ll see you all at 4 AM for the post-mortem. There will be coffee and existential dread.

🔥 The DB Grill 🔥