j40-cejst-2/data/data-pipeline/data_pipeline/score
Lucas Merrill Brown aca226165c
Issue 1900: Tribal overlap with Census tracts (#1903)
* working notebook

* updating notebook

* wip

* fixing broken tests

* adding tribal overlap files

* WIP

* WIP

* WIP, calculated count and names

* working

* partial cleanup

* partial cleanup

* updating field names

* fixing bug

* removing pyogrio

* removing unused imports

* updating test fixtures to be more realistic

* cleaning up notebook

* fixing black

* fixing flake8 errors

* adding tox instructions

* updating etl_score

* suppressing warning

* Use projected CRSes, ignore geom types (#1900)

I looked into this a bit, and in general the geometry type mismatch
changes very little about the calculation; we have a mix of
multipolygons and polygons. The fastest thing to do is just not keep
geom type; I did some runs with it set to both True and False, and
they're the same within 9 digits of precision. Logically we just want the
overlaps, regardless of how the actual geometries are encoded between
the frames, so we can in this case ignore the geom types and feel OKAY.

I also moved to projected CRSes, since we are actually trying to do area
calculations, and those should be done in a projected CRS. Again, the
change is small in magnitude but logically more sound.

* Readd CDC dataset config (#1900)

* adding comments to fips code

* delete unnecessary loggers

Co-authored-by: matt bowen <matthew.r.bowen@omb.eop.gov>
2022-09-20 14:53:12 -04:00
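
The commit note about projected CRSes can be illustrated with a minimal sketch. This is not the pipeline's code: it assumes axis-aligned lon/lat boxes instead of real tract and tribal geometries, a spherical Earth, and hypothetical names (`box_intersection`, `overlap_share`, `tract`, `tribal_area`). It compares an overlap share computed from raw degrees with one computed from true surface area, which is what an equal-area projection preserves.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius; spherical approximation (assumption)


def box_intersection(a, b):
    """Overlap of two (min_lon, min_lat, max_lon, max_lat) boxes, or None if disjoint."""
    min_lon, min_lat = max(a[0], b[0]), max(a[1], b[1])
    max_lon, max_lat = min(a[2], b[2]), min(a[3], b[3])
    if min_lon >= max_lon or min_lat >= max_lat:
        return None
    return (min_lon, min_lat, max_lon, max_lat)


def area_square_degrees(box):
    """Naive area in a geographic CRS: treats degrees as planar units."""
    return (box[2] - box[0]) * (box[3] - box[1])


def area_square_meters(box):
    """Area of a lon/lat box on a sphere, as an equal-area projection would report it."""
    dlon = math.radians(box[2] - box[0])
    dsin = math.sin(math.radians(box[3])) - math.sin(math.radians(box[1]))
    return EARTH_RADIUS_M**2 * dlon * dsin


def overlap_share(tract, other, area_fn):
    """Fraction of the tract's area that falls inside the other box."""
    overlap = box_intersection(tract, other)
    if overlap is None:
        return 0.0
    return area_fn(overlap) / area_fn(tract)


# Hypothetical example geometries, not taken from the pipeline's data.
tract = (-100.0, 40.0, -98.0, 42.0)
tribal_area = (-101.0, 41.0, -97.0, 43.0)

share_degrees = overlap_share(tract, tribal_area, area_square_degrees)
share_equal_area = overlap_share(tract, tribal_area, area_square_meters)
print(f"share from degrees:    {share_degrees:.4f}")
print(f"share from equal-area: {share_equal_area:.4f}")
```

In this example the two shares differ only in the third decimal place, which is consistent with the commit's remark that the change is small in magnitude: the northern half of the tract covers slightly less ground than the southern half, and only the area-preserving calculation accounts for that.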
schemas Marshmallow Schemas for YAML files (#1497) 2022-03-31 13:56:10 -04:00
adding_variables_to_score.md Adding first street foundation data (#1823) 2022-08-17 10:14:23 -04:00
field_names.py Issue 1900: Tribal overlap with Census tracts (#1903) 2022-09-20 14:53:12 -04:00
score.py Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
score_a.py Various updates to Definition L (#850) 2021-11-04 12:17:45 -04:00
score_b.py Various updates to Definition L (#850) 2021-11-04 12:17:45 -04:00
score_c.py Add NATA cancer risk and respiratory hazard to definition L (#1001) 2021-12-07 12:45:45 -05:00
score_d.py Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
score_f.py Add NATA cancer risk and respiratory hazard to definition L (#1001) 2021-12-07 12:45:45 -05:00
score_g.py Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
score_h.py Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
score_i.py Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
score_k.py Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
score_l.py Issue 1141: Definition M (#1151) 2022-01-18 14:56:55 -05:00
score_m.py Adding booleans for FE to display (#1393) 2022-03-29 20:17:10 -04:00
score_narwhal.py Issue 1900: Tribal overlap with Census tracts (#1903) 2022-09-20 14:53:12 -04:00
score_runner.py Updates backend constants to N (#1854) 2022-08-23 16:19:00 -04:00
utils.py Add donut hole calculation to score (#1828) 2022-08-18 12:04:46 -04:00