Something went wrong!
Hang in there while we get back on track
IoT Sensor Data with Large Sample
Analyzing sensor readings with 6000 observations to trigger Kolmogorov-Smirnov test instead of Shapiro-Wilk. Demonstrates correct test selection for large samples, handling of zero-inflated error counts, and identification of normally distributed sensor measurements alongside skewed operational metrics.
Input
What you provide to the skill
Analyze a sensor readings dataset with temperature, humidity, vibration, and error_count (6000 rows) to test large-sample normality detection
Output
What the skill generates for you
Distribution Analysis Report
Summary Statistics
| Variable | Count | Mean | Median | Std | Min | Max | Q1 | Q3 | IQR | Skewness | Kurtosis | Missing | Missing% |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| temperature | 6000 | 71.9906 | 72.0059 | 2.9952 | 62.2762 | 83.7787 | 70.0094 | 73.9706 | 3.9612 | 0.0069 | 0.0392 | 0 | 0 |
| humidity | 6000 | 44.9284 | 44.8502 | 8.0399 | 13.6208 | 73.2324 | 39.5512 | 50.3567 | 10.8055 | -0.0154 | 0.0523 | 0 | 0 |
| vibration | 6000 | 0.0412 | 0.0289 | 0.041 | 0 | 0.3798 | 0.0124 | 0.056 | 0.0436 | 2.0395 | 6.1968 | 0 | 0 |
| error_count | 6000 | 0.4598 | 0 | 1.2954 | 0 | 10 | 0 | 0 | 0 | 3.1302 | 9.9363 | 0 | 0 |
Distribution Shapes
| Variable | Distribution | Skewness | Kurtosis |
|---|---|---|---|
| temperature | Approximately normal | 0.0069 | 0.0392 |
| humidity | Approximately normal | -0.0154 | 0.0523 |
| vibration | Right-skewed, heavy-tailed | 2.0395 | 6.1968 |
| error_count | Right-skewed, heavy-tailed | 3.1302 | 9.9363 |
Normality Tests
| Variable | Test | Statistic | p-value | Result |
|---|---|---|---|---|
| temperature | K-S | 0.0071 | 0.922159 | Normal |
| humidity | K-S | 0.0061 | 0.976811 | Normal |
| vibration | K-S | 0.1576 | 0 | Non-normal |
| error_count | K-S | 0.4964 | 0 | Non-normal |
Transformation Recommendations
| Variable | Recommendation |
|---|---|
| temperature | None needed - approximately normal |
| humidity | None needed - approximately normal |
| vibration | Log transformation (strong right skew) |
| error_count | Shift + Log transformation |
About This Skill
Automated distribution analysis for numeric dataset variables with statistics, visualizations, and transformation recommendations.
View Skill DetailsMore Examples
Customer Demographics Dataset
Analyzing a customer dataset with age, income, and spending score columns. Demonstrates basic distribution analysis with symmetric distributions and appropriate standardization recommendations for variables with heavy tails.
Sales Data with Skewed Distributions
Analyzing sales metrics including revenue, units sold, profit margin, and discount rate. Demonstrates handling of mixed distribution types - right-skewed revenue requiring log transformation, normally distributed profit margins, and moderate skew requiring square root transformation.