Something went wrong!
Hang in there while we get back on track
Sales Data with Skewed Distributions
Analyzing sales metrics including revenue, units sold, profit margin, and discount rate. Demonstrates handling of mixed distribution types - right-skewed revenue requiring log transformation, normally distributed profit margins, and moderate skew requiring square root transformation.
Input
What you provide to the skill
Create and analyze a sales dataset with revenue, units_sold, profit_margin, and discount_rate (1000 rows) - include some skewed distributions
Output
What the skill generates for you
Distribution Analysis Report
Summary Statistics
| Variable | Count | Mean | Median | Std | Min | Max | Q1 | Q3 | IQR | Skewness | Kurtosis | Missing | Missing% |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| revenue | 1000 | 6473.79 | 4511.85 | 6226.43 | 545.66 | 82722.3 | 2650.16 | 7954.29 | 5304.13 | 3.6226 | 26.7905 | 0 | 0 |
| units_sold | 1000 | 28.592 | 24 | 20.6861 | 0 | 129 | 14 | 38 | 24 | 1.3744 | 2.2091 | 0 | 0 |
| profit_margin | 1000 | 0.2794 | 0.2864 | 0.0958 | -0.0219 | 0.4865 | 0.2149 | 0.351 | 0.1361 | -0.3653 | -0.2861 | 0 | 0 |
| discount_rate | 1000 | 0.1543 | 0.0325 | 0.2053 | 0 | 0.6939 | 0.0125 | 0.3256 | 0.313 | 1.1136 | -0.3546 | 0 | 0 |
Distribution Shapes
| Variable | Distribution | Skewness | Kurtosis |
|---|---|---|---|
| revenue | Right-skewed, heavy-tailed | 3.6226 | 26.7905 |
| units_sold | Right-skewed, heavy-tailed | 1.3744 | 2.2091 |
| profit_margin | Approximately normal | -0.3653 | -0.2861 |
| discount_rate | Right-skewed | 1.1136 | -0.3546 |
Normality Tests
| Variable | Test | Statistic | p-value | Result |
|---|---|---|---|---|
| revenue | Shapiro-Wilk | 0.7061 | 0 | Non-normal |
| units_sold | Shapiro-Wilk | 0.8934 | 0 | Non-normal |
| profit_margin | Shapiro-Wilk | 0.9864 | 0 | Non-normal |
| discount_rate | Shapiro-Wilk | 0.7209 | 0 | Non-normal |
Transformation Recommendations
| Variable | Recommendation |
|---|---|
| revenue | Log transformation (strong right skew) |
| units_sold | Square root transformation |
| profit_margin | Z-score standardization if needed |
| discount_rate | Square root transformation |
Saved: distribution_plots.png
About This Skill
Automated distribution analysis for numeric dataset variables with statistics, visualizations, and transformation recommendations.
View Skill DetailsMore Examples
Customer Demographics Dataset
Analyzing a customer dataset with age, income, and spending score columns. Demonstrates basic distribution analysis with symmetric distributions and appropriate standardization recommendations for variables with heavy tails.
IoT Sensor Data with Large Sample
Analyzing sensor readings with 6000 observations to trigger Kolmogorov-Smirnov test instead of Shapiro-Wilk. Demonstrates correct test selection for large samples, handling of zero-inflated error counts, and identification of normally distributed sensor measurements alongside skewed operational metrics.