Decision Path Comparisons: Similar Datasets, Different Methods
==============================================================
This document provides side-by-side comparisons of how CAIS selects different methods for similar datasets with slight variations in characteristics. Understanding these comparisons helps illustrate the decision tree logic and method selection criteria.
Overview
--------
Small changes in dataset characteristics can lead to dramatically different method selections. This document shows:
- How minor data differences affect method choice
- Why certain methods are preferred over others
- What happens when key assumptions are violated
- How to interpret method selection decisions
Comparison 1: Randomized vs. Observational Education Data
---------------------------------------------------------
**Scenario**: Evaluating the impact of a tutoring program on student test scores
Randomized Version
~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Students randomly assigned to tutoring program
- Balanced baseline characteristics
- Rich covariate information available
- Perfect compliance with assignment
.. mermaid::
flowchart TD
A[Randomized Tutoring Study] --> B{Is this randomized?}
B -->|Yes ✓| C{Are covariates available?}
C -->|Yes ✓| D[Linear Regression
with Covariates]
style A fill:#e3f2fd
style B fill:#e8f5e8
style C fill:#fff3e0
style D fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Linear Regression with Covariates
Reasoning:
✓ Randomization ensures causal identification
✓ Covariates improve precision (reduce standard errors)
✓ No selection bias concerns
✓ Straightforward interpretation
Expected Results:
- Unbiased treatment effect estimate
- Narrow confidence intervals (high precision)
- Clear causal interpretation
Observational Version
~~~~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Students self-select into tutoring program
- Systematic differences in baseline characteristics
- Same rich covariate information available
- Good overlap in covariate distributions
.. mermaid::
flowchart TD
A[Observational Tutoring Study] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|No ✗| D{Running variable?}
D -->|No ✗| E{Binary treatment?}
E -->|Yes ✓| F{Instrumental variable?}
F -->|No ✗| G{Rich covariates?}
G -->|Yes ✓| H{Good overlap?}
H -->|Yes ✓| I[Propensity Score
Matching]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#ffebee
style D fill:#ffebee
style E fill:#fff3e0
style F fill:#ffebee
style G fill:#fff3e0
style H fill:#fff3e0
style I fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Propensity Score Matching
Reasoning:
❌ No randomization (selection bias present)
✓ Rich covariates available for matching
✓ Good covariate overlap enables valid matches
✓ Can control for observed confounders
Expected Results:
- Potentially biased if unobserved confounders exist
- Wider confidence intervals (less precision)
- Requires strong unconfoundedness assumption
**Side-by-Side Comparison**:
.. list-table:: Randomized vs. Observational Comparison
:header-rows: 1
:widths: 25 35 40
* - Aspect
- Randomized Version
- Observational Version
* - Method Selected
- Linear Regression + Covariates
- Propensity Score Matching
* - Identification
- Randomization
- Unconfoundedness assumption
* - Bias Risk
- None (randomized)
- Possible (unobserved confounders)
* - Precision
- High (uses all data)
- Lower (matched sample only)
* - Assumptions
- Minimal
- Strong (no unmeasured confounding)
---
Comparison 2: Cross-Sectional vs. Panel Policy Data
---------------------------------------------------
**Scenario**: Evaluating the impact of minimum wage increases on employment
Cross-Sectional Version
~~~~~~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Single time point after policy implementation
- States with and without minimum wage increases
- Rich economic and demographic controls
- No pre-policy baseline data
.. mermaid::
flowchart TD
A[Cross-Sectional Policy Data] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|No ✗| D{Running variable?}
D -->|No ✗| E{Binary treatment?}
E -->|Yes ✓| F{Instrumental variable?}
F -->|No ✗| G{Rich covariates?}
G -->|Yes ✓| H{Good overlap?}
H -->|Yes ✓| I[Propensity Score
Methods]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#ffebee
style D fill:#ffebee
style E fill:#fff3e0
style F fill:#ffebee
style G fill:#fff3e0
style H fill:#fff3e0
style I fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Propensity Score Methods
Reasoning:
❌ No randomization (policy endogenous)
❌ No panel data (single time point)
❌ No clear running variable
✓ Rich covariates for matching/weighting
Limitations:
⚠️ Cannot control for unobserved state characteristics
⚠️ Policy adoption may be endogenous
⚠️ Strong unconfoundedness assumption required
Panel Version
~~~~~~~~~~~~~
**Dataset Characteristics**:
- Multiple time periods before and after policy
- Staggered implementation across states
- Same rich controls available
- Clear treatment timing variation
.. mermaid::
flowchart TD
A[Panel Policy Data] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|Yes ✓| D{Treatment timing varies?}
D -->|Yes ✓| E[Difference-in-Differences]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#fff3e0
style D fill:#fff3e0
style E fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Difference-in-Differences
Reasoning:
❌ No randomization (policy endogenous)
✓ Panel data with timing variation
✓ Can control for time-invariant confounders
✓ Exploits policy timing for identification
Advantages:
✓ Controls for unobserved state characteristics
✓ Handles time trends
✓ More credible identification than cross-sectional
**Side-by-Side Comparison**:
.. list-table:: Cross-Sectional vs. Panel Comparison
:header-rows: 1
:widths: 25 35 40
* - Aspect
- Cross-Sectional Version
- Panel Version
* - Method Selected
- Propensity Score Methods
- Difference-in-Differences
* - Identification
- Unconfoundedness
- Parallel trends
* - Controls For
- Observed confounders only
- Time-invariant unobservables
* - Key Assumption
- No unmeasured confounding
- Parallel trends
* - Credibility
- Lower (strong assumptions)
- Higher (weaker assumptions)
---
Comparison 3: Sharp vs. Fuzzy Discontinuity
-------------------------------------------
**Scenario**: Evaluating scholarship program effects on college enrollment
Sharp Discontinuity Version
~~~~~~~~~~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Test score determines scholarship eligibility
- Sharp cutoff at score = 1200
- All students above cutoff get scholarship
- No students below cutoff get scholarship
.. mermaid::
flowchart TD
A[Sharp RDD Data] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|No ✗| D{Running variable with cutoff?}
D -->|Yes ✓| E{Sharp discontinuity?}
E -->|Yes ✓| F[Sharp Regression
Discontinuity]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#ffebee
style D fill:#fff3e0
style E fill:#fff3e0
style F fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Sharp Regression Discontinuity
Reasoning:
✓ Clear running variable (test score)
✓ Sharp cutoff at 1200
✓ Treatment probability jumps from 0 to 1
✓ Local randomization around cutoff
Implementation:
- Compare students just above/below cutoff
- Estimate local treatment effect
- Check continuity assumptions
Fuzzy Discontinuity Version
~~~~~~~~~~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Same test score running variable
- Same cutoff at score = 1200
- Scholarship probability increases but doesn't reach 100%
- Some students below cutoff still get scholarships
.. mermaid::
flowchart TD
A[Fuzzy RDD Data] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|No ✗| D{Running variable with cutoff?}
D -->|Yes ✓| E{Sharp discontinuity?}
E -->|No ✗| F{Fuzzy discontinuity?}
F -->|Yes ✓| G[Fuzzy Regression
Discontinuity]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#ffebee
style D fill:#fff3e0
style E fill:#ffebee
style F fill:#fff3e0
style G fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Fuzzy Regression Discontinuity
Reasoning:
✓ Clear running variable (test score)
✓ Discontinuous jump in treatment probability
❌ Treatment probability doesn't reach 100%
✓ Can use IV approach with cutoff as instrument
Implementation:
- First stage: cutoff predicts scholarship probability
- Second stage: predicted scholarship affects enrollment
- Estimate local average treatment effect (LATE)
**Side-by-Side Comparison**:
.. list-table:: Sharp vs. Fuzzy RDD Comparison
:header-rows: 1
:widths: 25 35 40
* - Aspect
- Sharp RDD
- Fuzzy RDD
* - Method Selected
- Sharp RDD
- Fuzzy RDD (IV approach)
* - Treatment Assignment
- Deterministic at cutoff
- Probabilistic at cutoff
* - Identification
- Direct comparison
- Instrumental variables
* - Interpretation
- Average treatment effect
- Local average treatment effect
* - Complexity
- Simpler
- More complex (two-stage)
---
Comparison 4: Strong vs. Weak Instrument
----------------------------------------
**Scenario**: Evaluating the effect of education on earnings
Strong Instrument Version
~~~~~~~~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Distance to college as instrument for education
- Strong first-stage relationship (F > 50)
- Credible exclusion restriction
- Large sample size
.. mermaid::
flowchart TD
A[Strong IV Data] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|No ✗| D{Running variable?}
D -->|No ✗| E{Binary treatment?}
E -->|No ✗| F{Continuous treatment}
F --> G{Instrumental variable?}
G -->|Yes ✓| H{Strong instrument?}
H -->|Yes ✓| I[Instrumental Variables]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#ffebee
style D fill:#ffebee
style E fill:#ffebee
style F fill:#fff3e0
style G fill:#fff3e0
style H fill:#fff3e0
style I fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Instrumental Variables
Reasoning:
✓ Strong first-stage relationship (F = 52.3)
✓ Credible exclusion restriction
✓ Handles unmeasured confounding
✓ Large sample provides adequate power
Expected Results:
- Consistent estimates
- Reasonable precision
- Valid inference
Weak Instrument Version
~~~~~~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Same distance to college instrument
- Weak first-stage relationship (F < 10)
- Same exclusion restriction
- Same sample size
.. mermaid::
flowchart TD
A[Weak IV Data] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|No ✗| D{Running variable?}
D -->|No ✗| E{Binary treatment?}
E -->|No ✗| F{Continuous treatment}
F --> G{Instrumental variable?}
G -->|Yes ✓| H{Strong instrument?}
H -->|No ✗| I[⚠️ Weak Instrument
Consider Alternatives]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#ffebee
style D fill:#ffebee
style E fill:#ffebee
style F fill:#fff3e0
style G fill:#fff3e0
style H fill:#ffebee
style I fill:#fff3e0
**Agent Decision**:
.. code-block:: text
⚠️ Weak Instrument Detected: Consider Alternatives
Problems with Weak IV:
❌ First-stage F-statistic = 8.2 (< 10 threshold)
❌ Biased estimates in finite samples
❌ Invalid inference (confidence intervals too narrow)
❌ Sensitive to small violations of exclusion restriction
Recommended Alternatives:
1. Find stronger instruments
2. Use limited information maximum likelihood (LIML)
3. Consider observational methods with rich controls
4. Collect more data to improve first-stage power
**Side-by-Side Comparison**:
.. list-table:: Strong vs. Weak IV Comparison
:header-rows: 1
:widths: 25 35 40
* - Aspect
- Strong Instrument
- Weak Instrument
* - First-Stage F
- 52.3 (strong)
- 8.2 (weak)
* - Method Selected
- Standard IV
- Alternative methods recommended
* - Bias Risk
- Low
- High (finite sample bias)
* - Inference
- Valid
- Invalid (undersized tests)
* - Sensitivity
- Robust
- Highly sensitive
---
Comparison 5: Good vs. Poor Covariate Overlap
---------------------------------------------
**Scenario**: Evaluating job training program effectiveness
Good Overlap Version
~~~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Observational data with selection bias
- Rich set of baseline characteristics
- Good overlap in covariate distributions
- Treated and control units across full covariate range
.. mermaid::
flowchart TD
A[Good Overlap Data] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|No ✗| D{Running variable?}
D -->|No ✗| E{Binary treatment?}
E -->|Yes ✓| F{Instrumental variable?}
F -->|No ✗| G{Rich covariates?}
G -->|Yes ✓| H{Good overlap?}
H -->|Yes ✓| I[Propensity Score
Matching]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#ffebee
style D fill:#ffebee
style E fill:#fff3e0
style F fill:#ffebee
style G fill:#fff3e0
style H fill:#fff3e0
style I fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Propensity Score Matching
Reasoning:
✓ Rich covariates available
✓ Excellent covariate overlap (common support)
✓ Can find good matches for most treated units
✓ Transparent balance assessment
Expected Results:
- High-quality matches
- Good balance on observables
- Credible causal estimates (if unconfoundedness holds)
Poor Overlap Version
~~~~~~~~~~~~~~~~~~~~
**Dataset Characteristics**:
- Same observational data structure
- Same rich baseline characteristics
- Poor overlap in covariate distributions
- Treated units concentrated in one region of covariate space
.. mermaid::
flowchart TD
A[Poor Overlap Data] --> B{Is this randomized?}
B -->|No ✗| C{Panel data available?}
C -->|No ✗| D{Running variable?}
D -->|No ✗| E{Binary treatment?}
E -->|Yes ✓| F{Instrumental variable?}
F -->|No ✗| G{Rich covariates?}
G -->|Yes ✓| H{Good overlap?}
H -->|No ✗| I[Propensity Score
Weighting]
style A fill:#e3f2fd
style B fill:#ffebee
style C fill:#ffebee
style D fill:#ffebee
style E fill:#fff3e0
style F fill:#ffebee
style G fill:#fff3e0
style H fill:#ffebee
style I fill:#e8f5e8
**Agent Decision**:
.. code-block:: text
🎯 Selected Method: Propensity Score Weighting
Reasoning:
✓ Rich covariates available
❌ Poor covariate overlap (limited common support)
❌ Matching would discard many observations
✓ Weighting can handle poor overlap better
Caveats:
⚠️ Extrapolation required (poor overlap)
⚠️ High variance in weights possible
⚠️ Results may be sensitive to specification
**Side-by-Side Comparison**:
.. list-table:: Good vs. Poor Overlap Comparison
:header-rows: 1
:widths: 25 35 40
* - Aspect
- Good Overlap
- Poor Overlap
* - Method Selected
- Propensity Score Matching
- Propensity Score Weighting
* - Common Support
- Excellent
- Limited
* - Sample Usage
- High (good matches)
- Full sample (with weights)
* - Extrapolation
- Minimal
- Substantial
* - Variance
- Lower
- Higher (extreme weights)
Key Learning Points
------------------
Decision Tree Sensitivity
~~~~~~~~~~~~~~~~~~~~~~~~~
Small changes in data characteristics can lead to dramatically different method selections:
1. **Randomization Status**: Completely changes the analysis approach
2. **Data Structure**: Panel vs. cross-sectional determines method families
3. **Instrument Strength**: Weak instruments invalidate IV approaches
4. **Overlap Quality**: Affects choice between matching and weighting
Method Hierarchy
~~~~~~~~~~~~~~~~
CAIS follows a clear hierarchy of method preferences:
1. **Experimental Methods**: Always preferred when randomization is available
2. **Natural Experiments**: RDD and strong IV are next best
3. **Quasi-Experiments**: DiD with credible parallel trends
4. **Observational Methods**: Matching/weighting with rich covariates
5. **Regression Methods**: Last resort with strong assumptions
Assumption Importance
~~~~~~~~~~~~~~~~~~~~
Different methods rely on different assumptions:
- **Randomization**: Minimal assumptions, strongest identification
- **Parallel Trends**: Moderate assumptions, good identification
- **Exclusion Restriction**: Strong assumptions, requires careful validation
- **Unconfoundedness**: Very strong assumptions, often untestable
Practical Implications
~~~~~~~~~~~~~~~~~~~~~
Understanding these comparisons helps with:
1. **Study Design**: Plan data collection to enable better methods
2. **Method Selection**: Understand why CAIS chooses specific approaches
3. **Result Interpretation**: Know the limitations of your selected method
4. **Robustness Checking**: Test sensitivity across similar methods
Next Steps
----------
1. **Apply to Your Data**: Use these comparisons to understand your method selection
2. **Design Better Studies**: Plan data collection to enable stronger methods
3. **Validate Assumptions**: Check key assumptions for your selected method
4. **Explore Alternatives**: Consider how small data changes might improve identification
**Related Resources**:
- :doc:`../case_studies/index` - Detailed case studies by domain
- :doc:`../../methods/decision_tree` - Complete decision tree documentation
- :doc:`dataset_method_gallery` - Visual method selection examples