Cohort analysis is one of the most powerful tools Shopify merchants have for understanding whether their business is actually improving over time — or just growing louder while the same problems repeat. Here is how to run it properly and what to do with what you find.

What Cohort Analysis Actually Measures

A cohort is simply a group of customers who share a common starting point — most often the month they made their first purchase. Instead of looking at your entire customer base as one flat pool, cohort analysis lets you track each group's behavior week by week after acquisition: how many came back, how much they spent on return visits, and how that compares to cohorts acquired in earlier or later periods.

For Shopify merchants, this matters because aggregate metrics lie. Your overall repeat purchase rate might look stable at 28%, but cohort analysis could reveal that customers acquired through paid social in Q4 retain at 18%, while organic search customers retain at 40%. Without separating those groups, you would never know which acquisition channel is actually building a sustainable business.

The three metrics worth tracking per cohort are:

  • Repeat purchase rate — the percentage of customers who buy again within a defined window (4 weeks, 8 weeks, 90 days)
  • Average order value (AOV) — does it grow, shrink, or hold steady on repeat purchases?
  • Cumulative revenue per customer — how much has each cohort generated in total, per head, by week N?

How Much Data You Need Before Results Are Reliable

This is where most merchants go wrong: they pull cohort data too early and make decisions on noise.

The minimum threshold is 12 weeks of customer data per cohort. If a cohort is only two weeks old, retention will appear near zero for every group — that is not a signal, it is just recency. You need enough time for repeat purchase behavior to actually emerge.

Equally important is cohort size. If a cohort contains fewer than 100 customers, your repeat purchase rate will swing wildly with just a handful of orders. Aim for 200 or more customers per cohort before drawing conclusions. For smaller stores still building volume, this means you may need to use 60-day or 90-day cohort windows instead of monthly ones, simply to accumulate enough customers per group to get a readable signal.

If your store has only two weeks of data, you can set up the framework, but do not act on the numbers yet. Wait 8 to 12 weeks before treating any retention figure as meaningful.

How to Structure Your Cohorts Operationally

Calendar month is the right default. Group customers by the month they were first acquired, then track their behavior week by week from that starting point. This approach has a practical advantage: it ties directly to your marketing calendar. A January cohort maps to your January campaigns, your January promotions, and January's competitive environment. That makes it far easier to diagnose why one cohort outperforms another.

Avoid cohorting by customer lifetime in isolation (e.g., "customers in their first 30 days" regardless of when they joined). While useful for some lifecycle analyses, it obscures the campaign and seasonality context that makes cohorts actionable for operators.

Handle seasonal products with year-over-year comparisons. If you sell products with strong seasonal demand — winter apparel, holiday gifts, back-to-school supplies — never compare a January cohort to a June cohort. Those customers entered your store in completely different buying contexts, and their retention patterns will reflect that, not your product or marketing quality. Instead, compare January 2025 to January 2024. Year-over-year cohort comparisons give you a like-for-like benchmark that accounts for seasonality and lets you measure genuine improvement.

Segmenting Beyond Acquisition Month

Once you have the basic monthly cohort structure working, layer in acquisition source as a secondary dimension. Run separate cohorts for:

  • Paid social (Meta, TikTok)
  • Paid search (Google Shopping, Performance Max)
  • Organic search and SEO
  • Email or SMS list-driven first purchases
  • Referral or influencer traffic

This segmentation reveals which channels are buying customers versus building customers — a distinction that changes how you allocate budget.

What "Good" Looks Like and How to Benchmark

Repeat purchase rate benchmarks vary by category and price point, but here are the thresholds worth using as a starting framework:

  • 40% or above at 4 weeks — excellent retention; your product and post-purchase experience are working
  • 25–35% at 4 weeks — solid; room to improve but not a crisis
  • Below 20% at 4 weeks — a retention problem worth diagnosing before scaling acquisition spend

For high-ticket products priced at $500 or more, adjust your expectations. A 15–20% repeat purchase rate is acceptable in that range because the purchase cycle is naturally longer and customers deliberate more before returning. Comparing a $600 product's retention to a $40 consumable's retention is not a useful exercise.

When you find a cohort underperforming, the next question is where in the post-purchase journey the drop-off occurs. Is retention low at week 2 (suggesting a fulfillment or first-impression problem) or at week 8 (suggesting the product did not generate enough ongoing need or desire)? The timing of the drop shapes the fix.

Key Takeaways

  • Run cohorts by calendar month of acquisition so results tie directly to campaigns and seasonality.
  • Wait for at least 12 weeks of data and 200+ customers per cohort before treating retention figures as reliable.
  • Use year-over-year comparisons for seasonal products — never compare January to June.
  • Benchmark repeat purchase rate at 4 weeks: 25–35% is solid, 40%+ is excellent, below 20% signals a retention problem (with higher thresholds acceptable for $500+ products).
  • Segment cohorts by acquisition source to identify which channels are building long-term customer value versus generating one-time buyers.

Cohort analysis does not require sophisticated tooling to start — a well-structured export from Shopify's orders data and a spreadsheet will get you most of the way there. The discipline is in waiting for enough data, comparing the right groups, and then acting on what you find rather than just reporting it.