Yggdrasil
MCP ServersMCP 伺服器 SKILLs技能 PlugIns解決方案 Asgard AI SolutionAsgard AI 方案 Submit Listing申請上架 GitHub

追蹤資料分析 (Panel Data Analysis) 追蹤資料分析

Released已發布
theory theory

Apply panel data analysis with fixed effects, random effects, and dynamic GMM to exploit longitudinal variation and control for unobserved heterogeneity. Use this skill when the user has repeated observations over time for multiple entities, needs to choose between FE and RE via Hausman test, or when they ask 'how do I control for firm-specific effects', 'fixed or random effects', or 'how to handle endogeneity in panels'.

學術研究技能:追蹤資料分析 (Panel Data Analysis) 分析與應用。

View on GitHub在 GitHub 查看

Overview概述

Panel data analysis exploits both cross-sectional and temporal variation to estimate causal effects while controlling for unobserved heterogeneity. Fixed effects eliminate time-invariant confounders through within-entity demeaning, while random effects assume unobserved heterogeneity is uncorrelated with regressors, yielding more efficient estimates when valid.

When to Use使用時機

  • Data has repeated observations for the same entities (firms, individuals, countries) over time
  • Unobserved time-invariant factors likely confound the relationship of interest
  • Testing whether a policy or treatment effect varies across time periods
  • Dynamic models where the lagged dependent variable is a regressor (use GMM)

When NOT to Use不適用時機

  • Pure cross-sectional data with no time dimension
  • Interest is in estimating the effect of time-invariant variables (FE eliminates these)
  • Panel is extremely short (T = 2) with many endogenous regressors
  • Attrition is non-random and creates survivorship bias

Assumptions前提假設

IRON LAW: Fixed effects ONLY controls for TIME-INVARIANT unobservables —
time-varying confounders remain a threat. FE does not solve all
endogeneity problems.

Key assumptions:

  1. Strict exogeneity for FE/RE: past, current, and future errors are uncorrelated with regressors
  2. No serial correlation in idiosyncratic errors (or use cluster-robust SEs)
  3. RE additionally assumes individual effects are uncorrelated with regressors
  4. For dynamic GMM: instruments are valid and not too many (instrument proliferation)

Framework 框架

Step 1 — Explore Panel Structure

Report N (entities), T (time periods), balance status. Check within vs between variation for key variables. Visualize entity-level trends.

Step 2 — Estimate FE and RE Models

Run fixed effects (within estimator) and random effects (GLS). Include time fixed effects if common shocks exist. Use cluster-robust standard errors at the entity level.

Step 3 — Hausman Test for Model Selection

Test H₀: RE is consistent (individual effects uncorrelated with regressors). Rejection favors FE. See references/ for test statistic derivation.

Step 4 — Dynamic Extensions (if needed)

If lagged DV is included, use Arellano-Bond or System GMM. Report AR(1), AR(2) tests and Hansen/Sargan test for instrument validity. Monitor instrument count.

Output Format輸出格式

Gotchas注意事項

  • FE discards all between-entity variation; if most variation is between, FE estimates are imprecise
  • Hausman test has low power in small samples — insignificance does not validate RE
  • Dynamic panel GMM with too many instruments causes overfitting and weakens the Hansen test
  • Nickell bias afflicts FE estimates with a lagged DV when T is small
  • Two-way FE (entity + time) is often necessary but rarely the default in software
  • Cluster-robust standard errors require a sufficient number of clusters (N ≥ 50 as guideline)

References參考資料

  • Wooldridge, J. M. (2010). Econometric Analysis of Cross Section and Panel Data (2nd ed.). MIT Press.
  • Arellano, M., & Bond, S. (1991). Some tests of specification for panel data. Review of Economic Studies, 58(2), 277-297.
  • Baltagi, B. H. (2013). Econometric Analysis of Panel Data (5th ed.). Wiley.

Tags標籤

panel-datafixed-effectsrandom-effectsHausman-test