Something went wrong!
Hang in there while we get back on track
Sales Pipeline Forecast Readiness Assessment
Quality assessment of an 8-deal sales pipeline before building forecast model. Identifies critical blocker (25% missing expected_close_date values preventing time-based forecasting), strong deal value-pipeline age correlation (r=0.88), and $2.5M outlier deal requiring validation. Provides stage-weighted analysis and forecast-specific recommendations.
Input
What you provide to the skill
Profile /tmp/sales_pipeline.csv - need data quality check before forecast
Output
What the skill generates for you
Dataset Explorer - Sales Pipeline Quality Check Complete
Iโve completed a comprehensive data quality assessment on your sales pipeline dataset (/tmp/sales_pipeline.csv). Hereโs the executive summary:
๐ฏ FORECAST READINESS: โ NOT READY
Critical Blocker: 25% of deals (2 out of 8) are missing expected_close_date - these cannot be included in time-based forecasts.
๐ Key Findings
Pipeline Overview
- Total Deals: 8
- Total Pipeline Value: $3,110,000
- Weighted Pipeline Value: $1,629,750 (adjusted by probability)
- Data Completeness: 96.4% (2 missing values)
Distribution Highlights
- Deal Values: Heavily right-skewed (median: $90K, but one $2.5M outlier)
- Win Probability: Average 54% (ranging from 20-100%)
- Pipeline Age: Average 58 days (one deal at 180 days - potential stall risk)
- Lead Sources: Evenly distributed across Referral, Website, Outbound, Partner, and Event
Critical Correlation
โ ๏ธ Deal Value โ Days in Pipeline (r=0.88): Strong positive correlation suggests larger deals naturally take longer, BUT this could also indicate velocity issues.
๐จ Data Quality Issues
- CRITICAL: Missing close dates on deals D003 and D006 ($80K total value)
- WARNING: One deal (D004) has been in pipeline 180 days - 3x the average
- INFO: Deal D004 ($2.5M) is a significant outlier - verify if legitimate enterprise deal
๐ Stage-Weighted Analysis
| Stage | Deals | Total Value | Weighted Value | Avg Probability |
|---|---|---|---|---|
| Proposal | 3 | $2,725K | $1,393K | 58% |
| Qualification | 2 | $135K | $36K | 28% |
| Negotiation | 1 | $125K | $100K | 80% |
| Closed Won | 1 | $95K | $95K | 100% |
| Discovery | 1 | $30K | $6K | 20% |
Insight: 88% of weighted value is in Proposal stage or later - healthy pipeline maturity.
โ Immediate Actions Before Forecasting
- Add expected_close_date to deals D003 ($50K) and D006 ($30K)
- Investigate D004 ($2.5M, 180 days) - re-engage or update probability
- Validate probabilities - ensure they reflect historical win rates by stage
- Add historical closed/lost data - needed to calculate accurate win rates
๐ฎ Forecasting Recommendations
Once data is cleaned:
- Segment by lead source - different channels may have different conversion patterns
- Apply time decay - deals >90 days should have reduced probabilities
- Use probability weighting - you have good probability data, leverage it
- Build confidence intervals - small sample size means uncertainty ranges are important
- Track stage velocity - monitor average days in each stage for early warning signals
๐ Generated Files
/tmp/sales_pipeline_analysis.png- 6-panel visualization dashboard/tmp/sales_pipeline_correlations.png- Correlation heatmap/tmp/profile_sales_pipeline.py- Reusable profiling script
Estimated cleanup time: 1-2 business days to address missing dates and validate outliers.
The dataset structure is solid and has the right fields for forecasting, but the missing close dates are a blocker. Once those 2 deals are updated, youโll have an 87.5% forecast-ready pipeline ($2,645K of $3,110K total value).
About This Skill
Instantly profile CSV/Excel datasets to understand structure, data types, distributions, quality issues, and correlations. Generates comprehensive statistics, visualizations, and actionable insights.
View Skill DetailsMore Examples
Customer Revenue Analysis with Segmentation
Analyzing a small customer dataset (10 rows) to understand revenue patterns and data quality. Demonstrates detection of strong age-revenue correlation (r=0.76), identification of missing emails (30%), and Enterprise vs SMB segmentation insights showing 10x revenue difference between segments.
Transaction Dataset Pattern and Quality Analysis
Comprehensive profiling of 100-row transaction dataset revealing regional performance differences, category distribution, and customer demographics. Discovers North region paradox (lowest volume but highest average transaction value at $3,151), identifies 4 missing customer ages (4%), and provides symmetric transaction value distribution indicating healthy business mix.