Shruti
← Back to Projects

Disaster Insurance Coverage Gaps

A Python-driven analysis using EM‑DAT to study how much economic loss from natural disasters is actually insured, which regions are most under‑covered, and how extreme losses behave in the tail.

Domain: Risk / Insurance Data: EM‑DAT natural disasters (2010–2025, focus 2020–2025) Tech: Python • pandas • NumPy • seaborn • matplotlib • SciPy • scikit‑learn • machine‑learning • shap

Abstract

This project examines recent natural disasters to see where economic losses are insured and where major protection gaps remain. Using EM‑DAT records for 1,985 events worldwide between 2020 and 2025, including 530 disasters with damage data and 52 with both total and insured losses, it applies descriptive statistics, hypothesis tests, portfolio risk measures and simple machine‑learning models to study patterns over time, across regions and by disaster type.

Results show that storms and floods dominate both event counts and total losses, while insurance data is mainly recorded for very large disasters in the Americas and parts of Asia. Many frequent hazards – especially floods and droughts in lower‑income regions – have little or no recorded coverage. Loss‑distribution fitting, Value at Risk and a simple reinsurance layer indicate that a small number of extreme events drive most of the portfolio loss, suggesting that available insurance data understates true disaster risk and that institutions need to account for missing data and heavy‑tailed losses when planning protection and capital.

What I did

Key findings

What this shows about me