Something went wrong!
Hang in there while we get back on track
Missing Data Pattern Analyzer
Analyze missing data patterns in research datasets and recommend appropriate handling strategies with statistical tests and implementation code.
What You Get
Get complete missing data analysis including mechanism testing, strategy recommendations, R/Python code templates, and publication-ready methods text - replacing hours of manual analysis with statistically rigorous guidance.
The Problem
The Solution
How It Works
- 1 Calculate missing data summary statistics including percentage missing per variable and complete case counts
- 2 Classify missingness pattern as univariate, monotone, or arbitrary and generate visualization code
- 3 Test missingness mechanism using Little's MCAR test and logistic regression for predictors
- 4 Determine mechanism (MCAR/MAR/MNAR) by synthesizing test results with theoretical reasoning
- 5 Recommend handling strategy (deletion, multiple imputation, or FIML) with justification
- 6 Provide implementation code templates in R (mice, lavaan) and Python (sklearn)
- 7 Generate publication-ready methods section text and sensitivity analysis recommendations
What You'll Need
- Dataset with missing values (CSV or described tabular data)
- Variable names, types, and sample size
- Missing counts per variable
- Research context (survey, longitudinal, clinical trial)
Get This Skill
Requires Pro subscription ($9/month)
Have Feedback?
Help us improve this skill by sharing your ideas and suggestions.
Request ImprovementsExamples
Cross-Sectional Employee Survey Analysis
Analyze missing data in an employee survey with concentrated missingness in income variable (15.2%), recommending multiple imputation with m=20 and providing R/Python implementation code.
4-Wave Longitudinal Depression Study
Analyze monotone attrition pattern in a longitudinal study with 29.5% cumulative dropout, recommending FIML growth curve modeling with lavaan and providing complete implementation code.
RCT with Differential Dropout
Analyze differential attrition in a clinical trial (30% control vs 15% treatment dropout), recommending multiple imputation with m=40 for ITT analysis with tipping point sensitivity analysis.