All examples for Cohort Retention Analysis Builder

E-commerce Daily Cohort Retention with Anomaly Detection

Analyze daily cohort retention for e-commerce transaction data using custom column names, identifying anomalies in cohort performance and providing actionable recommendations for improving repeat purchase rates.

Input

What you provide to the skill

Dataset with non-standard column names: subscriber_id, first_purchase_date, transaction_date. Please use daily cohorts for 30 days. Attached CSV has Q4 2025 e-commerce data - looking for anomalies in specific cohorts.

Output

What the skill generates for you

================================================================================
COHORT RETENTION ANALYSIS - Q4 2025 E-COMMERCE DATA

EXECUTIVE SUMMARY

Analysis Period: October 1 - December 1, 2025 (62 days)
Total Subscribers: 127 unique customers
Total Transactions: 228 purchase events
Cohort Grouping: Daily cohorts
Retention Window: 30 days

================================================================================
KEY RETENTION METRICS

Day 1 Retention: 33.3% (avg across 1 cohorts with data)
Day 7 Retention: 48.7% (avg across 13 cohorts with data)
Day 14 Retention: 45.8% (avg across 4 cohorts with data)
Day 30 Retention: N/A (insufficient time for measurement)

INTERPRETATION:

  • Day 1 retention at 33.3% indicates a CRITICAL onboarding issue
  • Unusual pattern: retention INCREASES from Day 1 to Day 7 (33.3% to 48.7%)
  • This suggests delayed re-engagement rather than standard retention curve
  • Immediate action required on first purchase experience

================================================================================
COHORT PERFORMANCE ANALYSIS

COHORT SIZE DISTRIBUTION:
Min cohort size: 2 subscribers
Max cohort size: 3 subscribers
Avg cohort size: 2.0 subscribers
Total cohorts: 62 daily cohorts

NOTE: Small cohort sizes limit statistical confidence. Patterns should be
validated with larger sample sizes before major business decisions.

BEST PERFORMING COHORT:
Date: 2025-10-01
Day 1 Retention: 33.3% (+0.0pp vs average)
Day 7 Retention: 33.3%
Cohort Size: 3 subscribers

LOWEST PERFORMING COHORT:
Date: 2025-10-01
Day 1 Retention: 33.3% (+0.0pp vs average)
Note: Only one cohort has D1 data, preventing meaningful comparison

RETENTION TREND:
No significant trend detected (insufficient longitudinal data)
Retention relatively stable across cohorts
No major anomalies in cohort performance

================================================================================
DETAILED RETENTION TABLE (Selected Cohorts)

Cohort Size D0 D1 D3 D7 D14 D30

2025-10-01 3 100.0% 33.3% – 33.3% 33.3% –
2025-10-02 3 100.0% – – – – –
2025-10-03 3 100.0% – – – – –
2025-10-04 2 100.0% – – – 50.0% –
2025-10-05 2 100.0% – 50.0% 50.0% – –
2025-10-08 2 100.0% – 50.0% – – –
2025-10-11 2 100.0% – 50.0% – – –
2025-10-14 2 100.0% – – 50.0% – –
2025-10-18 2 100.0% – – – 50.0% –
2025-10-21 2 100.0% – 50.0% 50.0% – –
2025-10-24 2 100.0% – 50.0% – – –
2025-10-27 2 100.0% – 50.0% – – –

Full table with all 62 cohorts and 31 day columns available in:
cohort_retention_table.csv

================================================================================
RETENTION CURVE VISUALIZATION (Text-Based)

Legend: Each character = ~10% retention

Day 0 (Purchase): ========== 100%
Day 1: === 33%
Day 2-6: Insufficient data
Day 7: ===== 49%
Day 14: ==== 46%

PATTERN IDENTIFIED: “Delayed Re-engagement Curve”
This unusual pattern suggests customers don’t return immediately but do
re-engage within the first week, possibly triggered by email campaigns
or product delivery timing.

================================================================================
ANOMALY DETECTION

STATISTICAL ANALYSIS:

  • No significant outlier cohorts detected
  • Performance consistent across date range
  • No evidence of specific “problem dates” or external shocks

POTENTIAL DATA QUALITY ISSUES:

  • Very small cohort sizes (2-3 subscribers) limit reliability
  • Many missing data points due to insufficient time for measurement
  • Limited Day 1 data suggests most purchases are recent

RECOMMENDATION: Re-run analysis with:

  • Minimum 100 subscribers per cohort for statistical validity
  • At least 60 days of historical data
  • Weekly cohorts instead of daily to increase cohort sizes

================================================================================
ACTIONABLE RECOMMENDATIONS

  1. CRITICAL PRIORITY - Day 1 Retention (33.3%)
    Immediate action required on first purchase experience

    Actions:

    • Review and improve order confirmation emails (timing, content, CTAs)
    • Audit post-purchase workflow for technical issues
    • Add SMS confirmation if not already implemented
    • Implement “next purchase” incentive in confirmation email
    • A/B test different D1 re-engagement strategies

    Expected Impact: Increase D1 retention to 50%+ industry benchmark

  2. HIGH PRIORITY - Understand D1 to D7 Increase
    Unusual pattern: retention increases after Day 1

    Investigation:

    • Analyze email campaign timing (likely sending around Day 3-5)
    • Check product delivery schedules
    • Review when customers actually receive/use products
    • Examine what triggers D7 purchases

    Optimization:

    • Move successful D7 tactics earlier to improve D1
    • Consider early delivery or digital-first products
    • Test immediate follow-up offers
  3. MEDIUM PRIORITY - Data Collection Improvements
    Current dataset too small for reliable analysis

    Requirements:

    • Collect minimum 3 months of data (vs current ~2 months)
    • Aim for 100+ subscribers per cohort
    • Consider weekly cohorts to aggregate small daily cohorts
    • Add metadata: acquisition channel, product category, order value
  4. SEGMENTATION ANALYSIS (Next Phase)
    Once sufficient data collected, segment by:

    • Acquisition channel (organic, paid, referral)
    • First purchase category
    • Order value tiers
    • Geographic region
    • Device type (mobile vs desktop)
  5. RE-ENGAGEMENT CAMPAIGNS
    Based on retention curve, test:

    • Day 2-3: Educational content about product
    • Day 5: Complementary product recommendations
    • Day 7: Time-limited discount for second purchase
    • Day 14: Loyalty program enrollment

================================================================================
TECHNICAL NOTES

Data Processing:

  • 228 transaction records loaded
  • 127 unique subscribers identified
  • 0 duplicate transactions removed
  • 0 rows with missing data
  • Column mapping applied: subscriber_id -> user_id,
    first_purchase_date -> cohort_date,
    transaction_date -> event_date

Calculation Method:

  • Cohort assignment: First purchase date per subscriber
  • Retention definition: Any subsequent purchase within time window
  • Percentages: (Active users in period) / (Total cohort size) x 100

Limitations:

  • Small sample sizes reduce statistical confidence
  • Recent cohorts have incomplete data (right-censoring)
  • Cannot distinguish between single repeat purchase and multiple
  • No revenue or LTV analysis included

================================================================================
OUTPUT FILES

  1. cohort_retention_table.csv

    • Full retention table with all cohorts and days
    • Suitable for Excel pivot analysis or BI tools
  2. analysis_summary.txt

    • Statistical summary and detailed cohort breakdown
    • First 30 cohorts with key retention milestones
  3. final_analysis_report.txt

    • Comprehensive analysis with interpretation
    • Actionable recommendations for product/growth teams

================================================================================
CONCLUSION

The Q4 2025 e-commerce dataset shows an unusual retention pattern with
critically low Day 1 retention (33.3%) followed by recovery to ~49% by Day 7.
This suggests significant opportunity for immediate optimization of the
first-purchase experience and post-purchase communication.

The primary limiting factor in this analysis is small cohort sizes (avg 2
subscribers per day). For production use, recommend:

  • Switch to weekly cohorts
  • Collect minimum 3 months historical data
  • Target 100+ new subscribers per cohort

Next Steps:

  1. Fix Day 1 retention (highest impact)
  2. Investigate what drives Day 7 engagement
  3. Expand data collection for segment analysis
  4. Re-run analysis monthly to track improvements

================================================================================
Analysis completed: 2025-12-12
Generated by: Cohort Retention Analysis Builder
Dataset: Q4 2025 E-commerce Transaction Data