Data Coverage & Methodology - FARS API by farsapi.com

What years of FARS data are available through the API?

FARS API covers 2017 through 2023: seven full years of NHTSA FARS fatal-crash data, about 255,000 crash records across all 50 states plus DC. Data is refreshed annually when NHTSA publishes a new year; publication lags the crashes by roughly 18 months because NHTSA waits for all states to finalize their files. Historical data back to 1975 is available directly from NHTSA.

What Data is Included

The API includes every fatal motor vehicle crash in the United States from 2017 through 2023. A crash is included if it involved a motor vehicle traveling on a public road and resulted in at least one fatality within 30 days of the crash.

Year	Crashes	Fatalities	Vehicles	Persons
2017	34,560	37,473	53,128	85,840
2018	33,919	36,835	52,286	84,344
2019	33,487	36,355	51,623	82,843
2020	35,935	39,007	54,552	86,396
2021	39,785	43,230	61,802	97,511
2022	39,422	42,721	60,765	96,186
2023	37,769	41,025	58,508	92,768
Total	254,877	276,646	392,664	625,888

Data Source

Crash data is sourced from NHTSA's Fatality Analysis Reporting System (FARS), maintained by the National Highway Traffic Safety Administration, part of the US Department of Transportation. FARS has collected data on every fatal motor vehicle crash in the US since 1975.

Traffic volume (AADT) and road segment data is sourced from FHWA's Highway Performance Monitoring System (HPMS) 2017 public release, maintained by the Federal Highway Administration. We spatially join each FARS crash to its nearest HPMS road segment within an 80-meter radius.

Traffic volume / road segment join

204,751 of 254,877 fatal crashes (80.3%) matched an FHWA HPMS 2017 road segment with non-zero AADT. Matched crashes carry the segment's AADT (annual average daily traffic), AADT_truck, functional class (Interstate, Principal Arterial, etc.), lane count, posted speed limit, and the snap distance from the crash GPS point to the segment centerline (median 2.5 m, 95th percentile 22 m).
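The nearest-segment match can be sketched in a few lines. This is a minimal illustration, not the production join: it assumes segments are plain lists of (lon, lat) points, uses a local equirectangular projection instead of a spatial database, and the names `nearest_segment`, `snap_distance_m`, and `MAX_SNAP_M` are hypothetical. Only the 80-meter cutoff comes from the text above.

```python
import math
from typing import Dict, Optional, Sequence, Tuple

EARTH_RADIUS_M = 6_371_000.0
MAX_SNAP_M = 80.0  # match radius stated in the text

Point = Tuple[float, float]  # (lon, lat) in degrees


def _to_local_xy(p: Point, origin: Point) -> Tuple[float, float]:
    """Project p to meters on a flat plane centered on origin (adequate at ~100 m scale)."""
    lon, lat = p
    lon0, lat0 = origin
    x = math.radians(lon - lon0) * EARTH_RADIUS_M * math.cos(math.radians(lat0))
    y = math.radians(lat - lat0) * EARTH_RADIUS_M
    return x, y


def _dist_origin_to_line(a: Tuple[float, float], b: Tuple[float, float]) -> float:
    """Distance in meters from (0, 0) to the straight line piece a-b."""
    ax, ay = a
    dx, dy = b[0] - ax, b[1] - ay
    len2 = dx * dx + dy * dy
    if len2 == 0.0:
        return math.hypot(ax, ay)
    # Clamp the projection parameter so the closest point stays on the piece.
    t = max(0.0, min(1.0, -(ax * dx + ay * dy) / len2))
    return math.hypot(ax + t * dx, ay + t * dy)


def snap_distance_m(crash: Point, polyline: Sequence[Point]) -> float:
    """Shortest distance from the crash point to a segment centerline."""
    pts = [_to_local_xy(p, crash) for p in polyline]
    if not pts:
        return math.inf
    if len(pts) == 1:
        return math.hypot(*pts[0])
    return min(_dist_origin_to_line(a, b) for a, b in zip(pts, pts[1:]))


def nearest_segment(
    crash: Point, segments: Dict[str, Sequence[Point]]
) -> Optional[Tuple[str, float]]:
    """Return (segment_id, snap distance) for the closest segment within 80 m, else None."""
    best: Optional[Tuple[str, float]] = None
    for seg_id, polyline in segments.items():
        d = snap_distance_m(crash, polyline)
        if d <= MAX_SNAP_M and (best is None or d < best[1]):
            best = (seg_id, d)
    return best
```

A real pipeline would use a spatial index rather than scanning every segment, but the matching rule (closest centerline, rejected beyond 80 m) is the same.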

Per-state match rate:

Coverage	States (incl. DC)	Examples
≥85% (excellent)	11	Texas 91.7%, DC 94.1%, Massachusetts 90.6%, California 85.8%
75–85% (good)	24	Florida 84.7%, New York 83.4%, Georgia 80.0%, Illinois 77.5%
65–75% (fair)	14	Kentucky 67.2%, New Jersey 72.5%, Iowa 72.6%
<65% (poor)	2	Delaware 63.2%, North Carolina 64.8%
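For readers who want to reproduce the tiers, the bucketing can be written as a small function. This is a sketch: the function name `coverage_tier` is hypothetical, and treating a rate of exactly 75% or 65% as the higher tier is an assumption, since the table does not say which side the boundaries fall on.

```python
def coverage_tier(match_rate_pct: float) -> str:
    """Map a state's HPMS match rate (in percent) to the tier labels in the table."""
    if match_rate_pct >= 85.0:
        return "excellent"
    if match_rate_pct >= 75.0:  # assumption: boundary value counts as "good"
        return "good"
    if match_rate_pct >= 65.0:  # assumption: boundary value counts as "fair"
        return "fair"
    return "poor"


# Rates taken from the examples column of the table.
examples = {"Texas": 91.7, "Georgia": 80.0, "Kentucky": 67.2, "Delaware": 63.2}
tiers = {state: coverage_tier(rate) for state, rate in examples.items()}
```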

The two low-coverage states (North Carolina and Delaware) reflect a known issue: some state DOT submissions to HPMS leave out parts of the federal-aid road network, so a meaningful share of fatal crashes happens on roads not represented in HPMS at all. We document this and degrade gracefully (return road_exposure: null for unmatched crashes) rather than guessing.
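The null-on-no-match behavior might look like the following on the server side. This is an illustrative sketch only: `road_exposure` is the field named above, while `crash_response`, `road_exposure_payload`, and the inner key names are hypothetical and may not match the actual response schema.

```python
import json
from typing import Optional


def road_exposure_payload(match: Optional[dict]) -> Optional[dict]:
    """Return exposure fields for a matched crash, or None when there is no usable match."""
    # A missing match, or a segment with missing/zero AADT, yields None (JSON null).
    if match is None or not match.get("aadt"):
        return None
    return {
        "aadt": match["aadt"],
        "functional_class": match.get("functional_class"),
        "snap_distance_m": match.get("snap_distance_m"),
    }


def crash_response(crash_id: str, match: Optional[dict]) -> dict:
    """Assemble a response body; unmatched crashes carry road_exposure: null."""
    return {"crash_id": crash_id, "road_exposure": road_exposure_payload(match)}


unmatched_json = json.dumps(crash_response("example-1", None))
matched_json = json.dumps(
    crash_response(
        "example-2",
        {"aadt": 41000, "functional_class": "Interstate", "snap_distance_m": 2.5},
    )
)
```

Clients should therefore check for a null `road_exposure` before reading any traffic-volume field.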

Per-road-class national baselines are computed from the joined FARS+HPMS population and used to anchor Expected vs Actual claims:

Road class	Fatal crashes per 100M VMT
Interstate	0.32
Principal Arterial - Other Freeway	0.42
Principal Arterial - Other	1.34
Minor Arterial	1.55
Major Collector	1.89
Local	0.98

Interstates are the safest road class per vehicle mile traveled, despite carrying the most traffic. This matches well-established findings in transportation safety literature.
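The page does not spell out the baseline formula, so the following is a sketch of the conventional definition, not a statement of the exact method used: vehicle miles traveled (VMT) is the sum of AADT times segment length in miles, scaled by 365 days and the number of years, and the rate is crashes divided by VMT, times 100 million. The function names and the sample segment values are hypothetical.

```python
from typing import Iterable, Tuple


def total_vmt(
    segments: Iterable[Tuple[float, float]], years: int = 1, days_per_year: int = 365
) -> float:
    """Sum vehicle miles traveled over (aadt, length_miles) pairs for the given period."""
    daily_vmt = sum(aadt * length_mi for aadt, length_mi in segments)
    return daily_vmt * days_per_year * years


def crashes_per_100m_vmt(fatal_crashes: int, vmt: float) -> float:
    """Fatal crashes per 100 million vehicle miles traveled."""
    if vmt <= 0:
        raise ValueError("VMT must be positive to compute a rate")
    return fatal_crashes / vmt * 1e8


# Made-up segments: (AADT, length in miles). Not real HPMS values.
sample_segments = [(50_000, 10.0), (20_000, 5.0)]
sample_vmt = total_vmt(sample_segments, years=1)  # 600,000 daily VMT * 365
sample_rate = crashes_per_100m_vmt(1, sample_vmt)
```

Because only matched crashes carry AADT, a baseline computed this way reflects the matched population, not all fatal crashes.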

What Each Record Contains

Crash Record

Date, time, GPS coordinates, state, county, road type, weather, light conditions, manner of collision, speed limit, number of fatalities, number of vehicles, drunk driver involvement, hit-and-run flag.

Vehicle Record

Make, model, model year, body type, number of occupants, travel speed, speed limit, rollover, fire, hit-and-run, driver drinking status.

Person Record

Age, sex, person type (driver/passenger/pedestrian/cyclist), seat position, injury severity, restraint use, air bag deployment, ejection, alcohol test result.

Update Frequency

NHTSA publishes FARS data annually, typically in Q2 of the year following the crash year. The lag is approximately 18 months measured from the start of the crash year: 2024 data is expected to be available by mid-2025. farsapi.com ingests new data within one week of NHTSA publication.

The 2023 dataset is currently labeled as "Initial Release" by NHTSA. A final version with corrections will replace it when available. farsapi.com labels data release status in API responses.

Known Limitations

Fatal crashes only. FARS covers crashes with at least one fatality. Non-fatal crashes are tracked by NHTSA's CRSS (Crash Report Sampling System), which is not included in this API.
18-month data lag. This is a limitation of the source data, not the API.
GPS precision. Approximately 1% of records have missing or imprecise coordinates. Older years (pre-2015, not currently loaded) have higher rates of missing coordinates (~15-20%).
PII removed. NHTSA strips personally identifiable information before publication. No names, license plates, or VINs are included.
Year-over-year schema changes. NHTSA occasionally changes variable codes and column names between years. FARS API normalizes these, but some fields may have slightly different coverage across years.

Methodology

FARS API downloads annual CSV files from NHTSA's FTP server, parses the ACCIDENT, VEHICLE, and PERSON tables, translates numeric codes to human-readable labels using NHTSA's coding manuals, computes derived fields (e.g., drunk driver counts from vehicle-level data), and loads the normalized data into a PostgreSQL database with geographic indexing for radius queries.
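The parse, decode, and derive steps can be illustrated with the standard library alone. This is a simplified sketch under stated assumptions: the CSV text is inlined rather than downloaded, the weather codebook is a three-entry excerpt that should be checked against NHTSA's coding manual, the output key names are hypothetical, and the database load is omitted. ST_CASE, WEATHER, VEH_NO, and DR_DRINK are used here as FARS column names.

```python
import csv
import io
from collections import Counter
from typing import Dict, List

# Illustrative excerpt of a code-to-label lookup; verify against the coding manual.
WEATHER_LABELS = {"1": "Clear", "2": "Rain", "10": "Cloudy"}

# Tiny inline stand-ins for the ACCIDENT and VEHICLE files (made-up rows).
ACCIDENT_CSV = "ST_CASE,WEATHER\n10001,1\n10002,2\n"
VEHICLE_CSV = "ST_CASE,VEH_NO,DR_DRINK\n10001,1,1\n10001,2,0\n10002,1,0\n"


def parse_table(text: str) -> List[Dict[str, str]]:
    """Parse one CSV table into a list of row dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def drunk_driver_counts(vehicle_rows: List[Dict[str, str]]) -> Counter:
    """Derived field: number of vehicles per crash whose driver was flagged as drinking."""
    counts: Counter = Counter()
    for row in vehicle_rows:
        if row["DR_DRINK"] == "1":
            counts[row["ST_CASE"]] += 1
    return counts


def normalize_crashes(
    accident_rows: List[Dict[str, str]], vehicle_rows: List[Dict[str, str]]
) -> List[dict]:
    """Translate numeric codes to labels and attach the vehicle-derived count."""
    drunk = drunk_driver_counts(vehicle_rows)
    return [
        {
            "case_id": row["ST_CASE"],
            "weather": WEATHER_LABELS.get(row["WEATHER"], "Unknown"),
            "drunk_drivers": drunk[row["ST_CASE"]],  # Counter yields 0 when absent
        }
        for row in accident_rows
    ]


crashes = normalize_crashes(parse_table(ACCIDENT_CSV), parse_table(VEHICLE_CSV))
```

In a multi-year load the case identifier would need to be combined with the year, since ST_CASE values are only unique within a single year's files.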
