Dataset logo

Historical Air Quality

Dataset logo
Dataset logo

Historical Air Quality

By US EPA

Annual summary data from 1980 to present

Product type

Dataset

Update frequency

Annually

Updated

May 15, 2023

Delivery method

Download

Metadata is available for each file version of the dataset. A JSON file can be downloaded once you subscribe to the dataset. The information below is an example of the metadata included in each JSON file.

Dataset file name: annual_aqi_by_cbsa_2003.zip

Key Statistics

Total records

568

Missing values

1%

Duplicate rows

0%

Columns

Columns

19

Categorical

0

Numeric

18

Date/time

0

Correlations Correlations are columns within the dataset that have a linear dependency on each other and can be dropped or filtered out when creating models

Correlations

7

Correlated columns

Moderate Days and Unhealthy for Sensitive Groups Days, 90th Percentile AQI and Moderate Days, 90th Percentile AQI and Median AQI, Median AQI and Moderate Days, 90th Percentile AQI and Unhealthy for Sensitive Groups Days, Max AQI and Unhealthy for Sensitive Groups Days, Max AQI and Unhealthy Days

Column name

Type

Unique values

Min value

Max value

CBSA Code

numeric5681010049740

Good Days

numeric2752360

Moderate Days

numeric1990271

Unhealthy for Sensitive Groups Days

numeric790189

Unhealthy Days

numeric31098

Very Unhealthy Days

numeric11051

Hazardous Days

numeric5018

Max AQI

numeric173616515

90th Percentile AQI

numeric1116209

Days with AQI

numeric1532365

Median AQI

numeric820118

Days CO

numeric480198

Days Ozone

numeric2130365

Days SO2

numeric1180365

Days PM2.5

numeric1900365

Days NO2

numeric800183

Year

numeric120032003

Days PM10

numeric990365

CBSA

text568846