j40-cejst-2/data/data-pipeline/data_pipeline
Matt Bowen d5fbb802e8
Add FUDS ETL (#1817)
* Add spatial join method (#1871)

Since we'll need to figure out the tracts for a large number of points
in future tickets, add a utility to handle grabbing the tract geometries
and adding tract data to a point dataset.
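
A minimal sketch of the idea, not the utility added in this commit: a plain-Python ray-casting test stands in for a GIS library's spatial join, and every name here (`point_in_polygon`, `add_tract_to_points`, `TRACTS`, `SITES`, the `GEOID` values) is hypothetical. It assumes simple polygons without holes and points that do not sit exactly on a tract boundary.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Point = Tuple[float, float]   # (x, y), e.g. (longitude, latitude)
Polygon = Sequence[Point]     # ring of vertices, no holes (assumption)


def point_in_polygon(point: Point, polygon: Polygon) -> bool:
    """Ray casting: flip `inside` each time a rightward ray crosses an edge."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point count.
        if (y1 > y) != (y2 > y):
            # x coordinate where the edge meets that horizontal line
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def add_tract_to_points(
    points: List[dict], tracts: Dict[str, Polygon]
) -> List[dict]:
    """Return copies of the point records with a GEOID key added.

    Points that fall in no tract get GEOID=None (a left join).
    """
    joined = []
    for record in points:
        geoid: Optional[str] = next(
            (
                tract_id
                for tract_id, polygon in tracts.items()
                if point_in_polygon((record["x"], record["y"]), polygon)
            ),
            None,
        )
        joined.append({**record, "GEOID": geoid})
    return joined


# Hypothetical data: two square "tracts" side by side and three sites.
TRACTS = {
    "TRACT_A": [(0, 0), (2, 0), (2, 2), (0, 2)],
    "TRACT_B": [(2, 0), (4, 0), (4, 2), (2, 2)],
}
SITES = [
    {"site": "s1", "x": 1.0, "y": 1.0},
    {"site": "s2", "x": 3.0, "y": 0.5},
    {"site": "s3", "x": 9.0, "y": 9.0},  # outside every tract
]
JOINED = add_tract_to_points(SITES, TRACTS)
```

In this sketch unmatched points are kept with a missing tract ID instead of being dropped, so the caller decides how to handle points that land outside every tract.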

* Add FUDS, also jupyter lab (#1871)

* Add YAML configs for FUDS (#1871)

* Allow input geoid to be optional (#1871)

* Add FUDS ETL, tests, test-data notebook (#1871)

This adds the ETL class for Formerly Used Defense Sites (FUDS). This is
different from most other ETLs since these FUDS are not provided by
tract, but instead by geographic point, so we need to assign FUDS to
tracts and then do calculations from there.

* Floats -> Ints, as I intended (#1871)

* Formatting fixes (#1871)

* Add test false positive GEOIDs (#1871)

* Add gdal binaries (#1871)

* Refactor pandas code to be more idiomatic (#1871)

Per Emma, the more pandas-y way of doing my counts is using np.where to
add the values I need, then groupby and size. It is definitely more
compact, and also I think more correct!
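
A rough sketch of that pattern, assuming a point dataset that already carries a tract ID. The column names (`GEOID`, `ELIGIBILITY`, `status`) and the sample rows are made up for illustration and are not taken from the ETL itself.

```python
import numpy as np
import pandas as pd

# Hypothetical input: one row per site, already joined to a tract.
sites = pd.DataFrame(
    {
        "GEOID": ["T1", "T1", "T1", "T2", "T3"],
        "ELIGIBILITY": [
            "Eligible", "Ineligible", "Eligible", "Ineligible", "Eligible",
        ],
    }
)

# np.where adds the label to count on, in one vectorized pass.
sites["status"] = np.where(
    sites["ELIGIBILITY"] == "Eligible", "eligible", "ineligible"
)

# groupby + size counts rows per (tract, status); unstack spreads the
# statuses into columns, filling combinations that never occur with 0.
counts = (
    sites.groupby(["GEOID", "status"])
    .size()
    .unstack(fill_value=0)
    .astype(int)
    .reset_index()
)
```

Here `counts` ends up with one row per tract and one integer column per status, which avoids looping over tracts by hand.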

* Update configs per Emma suggestions (#1871)

* Type fixed! (#1871)

* Remove spurious import from vscode (#1871)

* Snapshot update after changing col name (#1871)

* Move up GDAL (#1871)

* Adjust geojson strategy (#1871)

* Try running census separately first (#1871)

* Fix import order (#1871)

* Cleanup cache strategy (#1871)

* Download census data from S3 instead of re-calculating (#1871)

* Clarify pandas code per Emma (#1871)
2022-08-16 13:28:39 -04:00
comparison_tool Imputing income using geographic neighbors (#1559) 2022-08-11 12:33:45 -04:00
content adds UST indicator (#1786) 2022-08-11 12:33:46 -04:00
data Starting Tribal Boundaries Work (#1736) 2022-07-30 01:13:10 -04:00
etl Add FUDS ETL (#1817) 2022-08-16 13:28:39 -04:00
files Add files via upload (#1656) 2022-05-31 13:19:01 -04:00
ipython Add FUDS ETL (#1817) 2022-08-16 13:28:39 -04:00
score updated to fix linting errors (#1818) 2022-08-11 16:34:56 -04:00
tests Add FUDS ETL (#1817) 2022-08-16 13:28:39 -04:00
tile Setting zoom levels for tribal map (#1810) 2022-08-09 13:56:03 -04:00
__init__.py Data directory should adopt standard Poetry-suggested python package structure (#457) 2021-08-05 15:35:54 -04:00
application.py Add FUDS ETL (#1817) 2022-08-16 13:28:39 -04:00
config.py Data directory should adopt standard Poetry-suggested python package structure (#457) 2021-08-05 15:35:54 -04:00
utils.py NRI dataset and initial score YAML configuration (#1534) 2022-08-09 16:37:10 -04:00