All examples for Dataset Explorer

Sales Pipeline Forecast Readiness Assessment

Quality assessment of an 8-deal sales pipeline before building forecast model. Identifies critical blocker (25% missing expected_close_date values preventing time-based forecasting), strong deal value-pipeline age correlation (r=0.88), and $2.5M outlier deal requiring validation. Provides stage-weighted analysis and forecast-specific recommendations.

Input

What you provide to the skill

Profile /tmp/sales_pipeline.csv - need data quality check before forecast

Output

What the skill generates for you

Dataset Explorer - Sales Pipeline Quality Check Complete

Iโ€™ve completed a comprehensive data quality assessment on your sales pipeline dataset (/tmp/sales_pipeline.csv). Hereโ€™s the executive summary:


๐ŸŽฏ FORECAST READINESS: โ›” NOT READY

Critical Blocker: 25% of deals (2 out of 8) are missing expected_close_date - these cannot be included in time-based forecasts.


๐Ÿ“Š Key Findings

Pipeline Overview

  • Total Deals: 8
  • Total Pipeline Value: $3,110,000
  • Weighted Pipeline Value: $1,629,750 (adjusted by probability)
  • Data Completeness: 96.4% (2 missing values)

Distribution Highlights

  • Deal Values: Heavily right-skewed (median: $90K, but one $2.5M outlier)
  • Win Probability: Average 54% (ranging from 20-100%)
  • Pipeline Age: Average 58 days (one deal at 180 days - potential stall risk)
  • Lead Sources: Evenly distributed across Referral, Website, Outbound, Partner, and Event

Critical Correlation

โš ๏ธ Deal Value โ†” Days in Pipeline (r=0.88): Strong positive correlation suggests larger deals naturally take longer, BUT this could also indicate velocity issues.


๐Ÿšจ Data Quality Issues

  1. CRITICAL: Missing close dates on deals D003 and D006 ($80K total value)
  2. WARNING: One deal (D004) has been in pipeline 180 days - 3x the average
  3. INFO: Deal D004 ($2.5M) is a significant outlier - verify if legitimate enterprise deal

๐Ÿ“ˆ Stage-Weighted Analysis

Stage Deals Total Value Weighted Value Avg Probability
Proposal 3 $2,725K $1,393K 58%
Qualification 2 $135K $36K 28%
Negotiation 1 $125K $100K 80%
Closed Won 1 $95K $95K 100%
Discovery 1 $30K $6K 20%

Insight: 88% of weighted value is in Proposal stage or later - healthy pipeline maturity.


โœ… Immediate Actions Before Forecasting

  1. Add expected_close_date to deals D003 ($50K) and D006 ($30K)
  2. Investigate D004 ($2.5M, 180 days) - re-engage or update probability
  3. Validate probabilities - ensure they reflect historical win rates by stage
  4. Add historical closed/lost data - needed to calculate accurate win rates

๐Ÿ”ฎ Forecasting Recommendations

Once data is cleaned:

  1. Segment by lead source - different channels may have different conversion patterns
  2. Apply time decay - deals >90 days should have reduced probabilities
  3. Use probability weighting - you have good probability data, leverage it
  4. Build confidence intervals - small sample size means uncertainty ranges are important
  5. Track stage velocity - monitor average days in each stage for early warning signals

๐Ÿ“ Generated Files

  • /tmp/sales_pipeline_analysis.png - 6-panel visualization dashboard
  • /tmp/sales_pipeline_correlations.png - Correlation heatmap
  • /tmp/profile_sales_pipeline.py - Reusable profiling script

Estimated cleanup time: 1-2 business days to address missing dates and validate outliers.

The dataset structure is solid and has the right fields for forecasting, but the missing close dates are a blocker. Once those 2 deals are updated, youโ€™ll have an 87.5% forecast-ready pipeline ($2,645K of $3,110K total value).