j40-cejst-2/data/data-pipeline/data_pipeline/etl/sources/national_risk_index
Billy Daly f0900f7b69
Adds National Risk Index data to ETL pipeline (#549)
* Adds dev dependencies to requirements.txt and re-runs black on codebase

* Adds test and code for national risk index etl, still in progress

* Removes test_data from .gitignore

* Adds test data to nation_risk_index tests

* Creates tests and ETL class for NRI data

* Adds tests for load() and transform() methods of NationalRiskIndexETL

* Updates README.md with info about the NRI dataset

* Adds to dos

* Moves tests and test data into a tests/ dir in national_risk_index

* Moves tmp_dir for tests into data/tmp/tests/

* Promotes fixtures to conftest and relocates national_risk_index tests:
The relocation of national_risk_index tests is necessary because tests 
can only use fixtures specified in conftests within the same package

* Fixes issue with df.equals() in test_transform()

* Files reformatted by black

* Commit changes to other files after re-running black

* Fixes unused import that caused lint checks to fail

* Moves tests/ directory to app root for data_pipeline
2021-09-07 20:51:34 -04:00
..
__init__.py Adds National Risk Index data to ETL pipeline (#549) 2021-09-07 20:51:34 -04:00
etl.py Adds National Risk Index data to ETL pipeline (#549) 2021-09-07 20:51:34 -04:00
README.md Adds National Risk Index data to ETL pipeline (#549) 2021-09-07 20:51:34 -04:00

FEMA National Risk Index

Description

The National Risk Index is a new, online mapping application from FEMA that identifies communities most at risk to 18 natural hazards. This application visualizes natural hazard risk metrics and includes data about expected annual losses from natural hazards, social vulnerability and community resilience.

The National Risk Index's interactive web maps are at the county and Census tract level and made available via geographic information system (GIS) services for custom analyses. For this project, we've utilized the NRI data collected at the Census tract level

Data Transformation Summary

The following transformations were applied to the NRI data during the ETL process:

  • The TRACTFIPS column was renamed to GEOID10_TRACT to match the name of columns that hold the Census Tract FIPS code in other data sets
  • The NRI score values for each Census tract were applied to each of the Census block groups inside of that Census tract so that the unit of analysis would match that of other datasets like the American Communities Survey