j40-cejst-2/data/data-pipeline/data_pipeline
Shelby Switzer 7bd1a9e59e
Big ole score refactor (#815)
* WIP

* Create ScoreCalculator

This calculates all the factors for score L for now (with placeholder
formulae because this is a WIP). I think ideallly we'll want to
refactor all the score code to be extracted into this or  similar
classes.

* Add factor logic for score L

Updated factor logic to match score L factors methodology.
Still need to get the Score L field itself working.

Cleanup needed: Pull field names into constants file, extract all score
calculation into score calculator

* Update thresholds and get score L calc working

* Update header name for consistency and update comparison tool

* Initial move of score to score calculator

* WIP big refactor

* Continued WIP on score refactor

* WIP score refactor

* Get to a working score-run

* Refactor to pass df to score init

This makes it easier to pass df around within a class with multiple
methods that require df.

* Updates from Black

* Updates from linting

* Use named imports instead of wildcard; log more

* Additional refactors

* move more field names to field_names constants file
* import constants without a relative path (would break docker)
* run linting
* raise error if add_columns is not implemented in a child class

* Refactor dict to namedtuple in score c

* Update L to use all percentile field

* change high school ed field in L back

Co-authored-by: Shelby Switzer <shelby.switzer@cms.hhs.gov>
2021-11-02 14:12:53 -04:00
..
data AWS Sync Public Read (#508) 2021-08-12 14:17:25 -04:00
etl Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
files added new PDF file (#745) 2021-09-23 13:34:50 -04:00
ipython Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
score Big ole score refactor (#815) 2021-11-02 14:12:53 -04:00
tests Add ETL Contract Checks (#619) 2021-10-13 15:54:15 -04:00
tile Prototype G (#672) 2021-09-14 10:48:11 -04:00
__init__.py Data directory should adopt standard Poetry-suggested python package structure (#457) 2021-08-05 15:35:54 -04:00
application.py Combine + Tilefy (#806) 2021-11-01 18:05:05 -04:00
config.py Data directory should adopt standard Poetry-suggested python package structure (#457) 2021-08-05 15:35:54 -04:00
utils.py Data sources from S3 (#769) 2021-10-13 16:00:33 -04:00