Method Selection and Decision-Making
This section explains how the CAIS agent selects the most appropriate causal inference method for your data and research question. Understanding this process helps you interpret the agent’s decisions and assess the credibility of your results.
The Method Selection Challenge
Choosing the right causal inference method is one of the most critical decisions in any analysis. The wrong method can lead to:
Biased estimates: Results that systematically over- or under-estimate true effects
Invalid inferences: Conclusions that don’t hold up to scrutiny
Wasted resources: Time and money spent on unreliable analysis
Traditional Approach: Researchers manually review methodological literature, assess data characteristics, and make subjective decisions about method appropriateness.
CAIS Approach: The agent systematically evaluates all available methods, tests assumptions, and selects the approach with the strongest identification strategy for your specific context.
The Agent’s Decision Framework
The agent uses a multi-stage decision process that considers:
Identification Strength: How convincingly does the method establish causation?
Assumption Plausibility: How likely are the method’s assumptions to hold?
Data Requirements: Does your data satisfy the method’s requirements?
Robustness: How sensitive are results to assumption violations?
Decision Tree Overview
The agent follows a structured decision tree with multiple pathways:
flowchart TD
A[Data Analysis] --> B{Treatment Assignment}
B -->|Random| C[Experimental Methods]
B -->|As-Good-As-Random| D[Quasi-Experimental Methods]
B -->|Non-Random| E[Observational Methods]
C --> C1[Randomized Controlled Trial]
C --> C2[Natural Experiment]
D --> D1{Panel Data?}
D1 -->|Yes| D2{Policy Change?}
D2 -->|Yes| D3[Difference-in-Differences]
D2 -->|No| D4{Discontinuity?}
D4 -->|Yes| D5[Regression Discontinuity]
D4 -->|No| D6[Panel Methods]
D1 -->|No| D7{Instrument Available?}
D7 -->|Yes| D8[Instrumental Variables]
D7 -->|No| D9{Discontinuity?}
D9 -->|Yes| D5
D9 -->|No| E
E --> E1[Propensity Score Methods]
E --> E2[Backdoor Adjustment]
E --> E3[Linear Regression]
Key Insight: The agent doesn’t just follow rigid rules. It uses contextual reasoning to navigate edge cases and make nuanced decisions.
Stage 1: Treatment Assignment Analysis
The first and most crucial step is understanding how treatment was assigned.
Random Assignment (Experimental)
- Indicators the Agent Looks For:
Variables named “random_assignment”, “treatment_group”, “randomized”
Balanced baseline characteristics between treatment and control
Study documentation mentioning randomization
Equal treatment probabilities across subgroups
Agent Decision Process:
Agent: "I detect a 'randomization_indicator' variable and observe
balanced baseline characteristics (p-values > 0.1 for all covariates).
This appears to be a randomized experiment. I'll use experimental
methods and focus on intention-to-treat effects."
- Selected Methods:
Simple difference in means
Regression with baseline controls (for precision)
Instrumental variables (if non-compliance exists)
As-Good-As-Random Assignment (Quasi-Experimental)
- What the Agent Looks For:
Policy changes or natural experiments
Discontinuities in treatment assignment
Temporal variation in treatment timing
Geographic variation in treatment rollout
Example Scenarios:
Policy Change: .. code-block:: text
Agent: “I see a policy implemented in 2018 with pre- and post-policy data across multiple states. Some states implemented earlier than others. This is a difference-in-differences setup.”
Discontinuity: .. code-block:: text
Agent: “Treatment assignment changes sharply at age 65, with a clear cutoff. This suggests regression discontinuity design is appropriate.”
Natural Experiment: .. code-block:: text
Agent: “Treatment varies based on lottery numbers or random assignment to bureaucrats. This creates as-good-as-random variation suitable for instrumental variables.”
Non-Random Assignment (Observational)
- When This Occurs:
Individuals self-select into treatment
Treatment assigned based on characteristics
No clear source of exogenous variation
Agent Response: .. code-block:: text
Agent: “Treatment appears to be chosen by individuals based on their characteristics. I’ll need to use methods that control for selection bias, such as propensity score matching or backdoor adjustment.”
Stage 2: Data Structure Analysis
The agent analyzes your data structure to determine which methods are feasible.
Panel Data Detection
- What the Agent Looks For:
Multiple observations per unit over time
Time-varying treatments or outcomes
Variables indicating time periods or waves
- Implications for Method Selection:
Enables difference-in-differences methods
Allows for unit fixed effects
Supports dynamic treatment effect analysis
Agent Decision: .. code-block:: text
Agent: “I detect panel structure with individuals observed over 5 years. This enables me to control for time-invariant confounders using fixed effects methods.”
Cross-Sectional Data
- Characteristics:
Single observation per unit
No temporal variation
Limited to methods that don’t require time dimension
- Available Methods:
Instrumental variables (if instruments available)
Regression discontinuity (if discontinuity exists)
Propensity score methods
Linear regression with controls
Time Series Data
- Special Considerations:
Temporal dependencies in data
Potential for dynamic effects
Need for time series methods
Agent Approach: .. code-block:: text
Agent: “This is time series data with a single treated unit. I’ll consider synthetic control methods to construct an appropriate counterfactual.”
Stage 3: Method Prioritization
The agent ranks methods based on identification strength and assumption plausibility.
Identification Strength Hierarchy
- Tier 1: Experimental Methods
Randomized controlled trials
Natural experiments with random assignment
- Tier 2: Strong Quasi-Experimental Methods
Sharp regression discontinuity
Difference-in-differences with parallel trends
Strong instrumental variables
- Tier 3: Moderate Quasi-Experimental Methods
Fuzzy regression discontinuity
Difference-in-differences with concerns
Weak instrumental variables
- Tier 4: Observational Methods
Propensity score methods with good overlap
Backdoor adjustment with rich controls
Linear regression
Agent Prioritization Logic:
Agent: "Multiple methods are applicable:
1. RDD (Tier 2): Strong identification, clear discontinuity
2. IV (Tier 3): Instrument available but exclusion restriction uncertain
3. Matching (Tier 4): Good overlap but selection on unobservables concern
I'll implement RDD as primary method with IV and matching as robustness checks."
Assumption Assessment
For each potential method, the agent evaluates assumption plausibility:
Difference-in-Differences Example: .. code-block:: text
Agent: “Assessing DiD assumptions: ✓ Parallel trends: Pre-treatment trends similar (p=0.23) ✓ No spillovers: Geographically separated treatment/control ? Stable composition: Some migration between regions
Overall assessment: STRONG identification with minor composition concerns. Recommendation: Include robustness check excluding high-migration areas.”
Instrumental Variables Example: .. code-block:: text
Agent: “Assessing IV assumptions: ✓ Relevance: F-stat = 47.3 (strong instrument) ? Exclusion restriction: Instrument may affect outcome through other channels ✓ Monotonicity: Likely satisfied based on institutional context
Overall assessment: MODERATE identification due to exclusion restriction concerns. Recommendation: Use as robustness check, not primary method.”
Stage 4: Robustness and Sensitivity Analysis
The agent doesn’t just select one method—it implements a comprehensive robustness strategy.
Multiple Method Implementation
Primary Method: Strongest identification strategy Secondary Methods: Alternative approaches for robustness Sensitivity Analyses: Tests of key assumptions
Example Strategy: .. code-block:: text
Agent: “Implementing comprehensive analysis:
PRIMARY: Difference-in-differences - Main specification with state and year fixed effects - Clustered standard errors at state level
ROBUSTNESS CHECKS: - Alternative time windows (exclude early/late periods) - Placebo tests (fake treatment dates) - Synthetic control method - Event study specification
SENSITIVITY ANALYSES: - Different control group definitions - Varying treatment intensity measures - Subgroup analysis by key characteristics”
Assumption Testing Protocol
The agent systematically tests method assumptions:
- Pre-Analysis Tests:
Balance tests for randomized experiments
Parallel trends tests for difference-in-differences
Density tests for regression discontinuity
Instrument strength tests for IV methods
- Post-Analysis Validation:
Placebo tests with fake treatments
Sensitivity to specification changes
Outlier influence analysis
Cross-validation with alternative methods
Example Testing Sequence: .. code-block:: text
Agent: “Running assumption tests:
Parallel trends test: PASSED (p=0.34)
Placebo test (pre-treatment): PASSED (effect ≈ 0)
Sensitivity to controls: ROBUST (effect stable)
Outlier analysis: ROBUST (no influential observations)
Conclusion: Strong evidence for causal interpretation.”
Stage 5: Method Selection Decision
The agent synthesizes all information to make final method selection.
Decision Integration
- The agent considers:
Statistical test results
Assumption plausibility assessments
Data quality and sample size
Robustness across specifications
External validity considerations
Final Decision Process: .. code-block:: text
Agent: “FINAL METHOD SELECTION:
PRIMARY METHOD: Difference-in-Differences - Strong identification strategy - Assumptions well-supported by tests - Robust across multiple specifications - Good external validity for policy evaluation
CONFIDENCE LEVEL: HIGH - Multiple assumption tests passed - Consistent results across robustness checks - Large sample size provides precise estimates
LIMITATIONS: - Results specific to this policy context - Cannot rule out all unobserved confounders - Effect may vary across subgroups”
Communicating Uncertainty
The agent transparently communicates the strength of identification:
High Confidence: Strong experimental or quasi-experimental design with assumptions well-supported by tests
Moderate Confidence: Good identification strategy but some assumption concerns or limited robustness checks
Low Confidence: Observational methods with strong assumptions or failed assumption tests
Example Communication: .. code-block:: text
Agent: “CONFIDENCE ASSESSMENT: HIGH
This analysis uses a strong quasi-experimental design (difference-in-differences) with well-supported assumptions. The parallel trends assumption is validated through pre-treatment tests, and results are robust across multiple specifications. While we cannot completely rule out all confounders, the identification strategy provides credible evidence for a causal interpretation.
Key strengths: - Clear policy discontinuity creates quasi-random variation - Rich data allows comprehensive assumption testing - Results consistent across robustness checks
Remaining limitations: - External validity limited to similar policy contexts - Cannot detect very small effect heterogeneity - Long-term effects beyond study period unknown”
Common Decision Scenarios
Scenario 1: Clear Randomized Experiment .. code-block:: text
Data: Randomization indicator, balanced baseline characteristics Agent Decision: Simple difference in means with baseline controls Confidence: HIGH
Scenario 2: Policy Change with Panel Data .. code-block:: text
Data: Policy implemented at different times across states Agent Decision: Difference-in-differences with staggered adoption methods Confidence: HIGH (if parallel trends hold)
Scenario 3: Age-Based Eligibility Cutoff .. code-block:: text
Data: Sharp discontinuity in treatment at specific age Agent Decision: Sharp regression discontinuity design Confidence: HIGH (if no manipulation of running variable)
Scenario 4: Observational Data with Rich Controls .. code-block:: text
Data: Self-selected treatment, many baseline characteristics Agent Decision: Propensity score matching with overlap assessment Confidence: MODERATE (depends on selection on observables assumption)
Scenario 5: Weak Identification .. code-block:: text
Data: Observational with limited controls, no clear instruments Agent Decision: Multiple methods with extensive sensitivity analysis Confidence: LOW (strong assumptions required)
Best Practices for Users
Provide Context: Include variable descriptions and study background to help the agent make better decisions
Review Decisions: Understand why the agent selected specific methods and assess whether the reasoning makes sense in your context
Consider Limitations: Pay attention to the agent’s confidence assessments and stated limitations
Validate Results: Use the agent’s analysis as a starting point, but consider additional robustness checks based on your domain knowledge
Seek Expert Review: For high-stakes decisions, have domain experts review the agent’s method selection and results
The agent’s method selection process represents a systematic approach to one of the most challenging aspects of causal inference, but human judgment remains essential for interpreting results in context and making policy decisions.