j40-cejst-2/data/data-pipeline/data_pipeline
Matt Bowen 876655d2b2
Add tests for all non-census sources (#1899)
* Refactor CDC life-expectancy (1554)

* Update to new tract list (#1554)

* Adjust for tests (#1848)

* Add tests for cdc_places (#1848)

* Add EJScreen tests (#1848)

* Add tests for HUD housing (#1848)

* Add tests for GeoCorr (#1848)

* Add persistent poverty tests (#1848)

* Update for sources without zips, for new validation (#1848)

* Update tests for new multi-CSV but (#1848)

Lucas updated the CDC life expectancy data to handle a bug where two
states are missing from the US Overall download. Since virtually none of
our other ETL classes download multiple CSVs directly like this, it
required a pretty invasive new mocking strategy.

* Add basic tests for nature deprived (#1848)

* Add wildfire tests (#1848)

* Add flood risk tests (#1848)

* Add DOT travel tests (#1848)

* Add historic redlining tests (#1848)

* Add tests for ME and WI (#1848)

* Update now that validation exists (#1848)

* Adjust for validation (#1848)

* Add health insurance back to cdc places (#1848)

Ooops

* Update tests with new field (#1848)

* Test for blank tract removal (#1848)

* Add tracts for clipping behavior

* Test clipping and zfill behavior (#1848)

* Fix bad test assumption (#1848)

* Simplify class, add test for tract padding (#1848)

* Fix percentage inversion, update tests (#1848)

Looking through the transformations, I noticed that we were subtracting
a percentage that is usually between 0-100 from 1 instead of 100, and so
were endind up with some surprising results. Confirmed with lucasmbrown-usds

* Add note about first street data (#1848)
2022-09-19 15:17:00 -04:00
..
comparison_tool Imputing income using geographic neighbors (#1559) 2022-08-11 12:33:45 -04:00
content updated to show T/F/null vs T/F for AML and FUDS (#1866) 2022-08-24 20:22:59 -04:00
data Starting Tribal Boundaries Work (#1736) 2022-07-30 01:13:10 -04:00
etl Add tests for all non-census sources (#1899) 2022-09-19 15:17:00 -04:00
files Add files via upload (#1656) 2022-05-31 13:19:01 -04:00
ipython just testing that the boolean is preserved on gha (#1867) 2022-08-31 12:55:03 -04:00
score tribal tiles fix (#1874) 2022-09-01 10:19:13 -04:00
tests Add tests for all non-census sources (#1899) 2022-09-19 15:17:00 -04:00
tile Score tests (#1847) 2022-08-26 15:23:20 -04:00
__init__.py Data directory should adopt standard Poetry-suggested python package structure (#457) 2021-08-05 15:35:54 -04:00
application.py Add FUDS ETL (#1817) 2022-08-16 13:28:39 -04:00
config.py Issue 1831: missing life expectancy data from Maine and Wisconsin (#1887) 2022-09-09 20:35:01 -04:00
utils.py Score tests (#1847) 2022-08-26 15:23:20 -04:00