j40-cejst-2/data/data-pipeline/data_pipeline/etl/score
Matt Bowen 876655d2b2
Add tests for all non-census sources (#1899)
* Refactor CDC life-expectancy (1554)

* Update to new tract list (#1554)

* Adjust for tests (#1848)

* Add tests for cdc_places (#1848)

* Add EJScreen tests (#1848)

* Add tests for HUD housing (#1848)

* Add tests for GeoCorr (#1848)

* Add persistent poverty tests (#1848)

* Update for sources without zips, for new validation (#1848)

* Update tests for new multi-CSV but (#1848)

Lucas updated the CDC life expectancy data to handle a bug where two
states are missing from the US Overall download. Since virtually none of
our other ETL classes download multiple CSVs directly like this, it
required a pretty invasive new mocking strategy.

* Add basic tests for nature deprived (#1848)

* Add wildfire tests (#1848)

* Add flood risk tests (#1848)

* Add DOT travel tests (#1848)

* Add historic redlining tests (#1848)

* Add tests for ME and WI (#1848)

* Update now that validation exists (#1848)

* Adjust for validation (#1848)

* Add health insurance back to cdc places (#1848)

Ooops

* Update tests with new field (#1848)

* Test for blank tract removal (#1848)

* Add tracts for clipping behavior

* Test clipping and zfill behavior (#1848)

* Fix bad test assumption (#1848)

* Simplify class, add test for tract padding (#1848)

* Fix percentage inversion, update tests (#1848)

Looking through the transformations, I noticed that we were subtracting
a percentage that is usually between 0-100 from 1 instead of 100, and so
were endind up with some surprising results. Confirmed with lucasmbrown-usds

* Add note about first street data (#1848)
2022-09-19 15:17:00 -04:00
..
config Add tests for all non-census sources (#1899) 2022-09-19 15:17:00 -04:00
schemas Add FUDS ETL (#1817) 2022-08-16 13:28:39 -04:00
tests Issue 1831: missing life expectancy data from Maine and Wisconsin (#1887) 2022-09-09 20:35:01 -04:00
__init__.py Data directory should adopt standard Poetry-suggested python package structure (#457) 2021-08-05 15:35:54 -04:00
constants.py Issue 1831: missing life expectancy data from Maine and Wisconsin (#1887) 2022-09-09 20:35:01 -04:00
etl_score.py Removing low pop tracts from FEMA population loss (#1898) 2022-09-12 13:48:38 -04:00
etl_score_geo.py Remove no land tracts from map (#1894) 2022-09-08 14:55:00 -04:00
etl_score_post.py Pipeline tile tests (#1864) 2022-09-01 13:07:14 -04:00
etl_utils.py 1831 Follow up (#1902) 2022-09-15 17:46:01 -04:00