Jorge Escobar
1b17af84c8
Combine + Tilefy ( #806 )
...
* init
* score-post
* added score csv s3 download; remove poetry cmds from readme
* working census tile fetch
* PR review
* Github Actions Work
2021-11-01 18:05:05 -04:00
Jorge Escobar
3b04356fb3
Data sources from S3 ( #769 )
...
* Started 535
* Data sources from S3
* lint
* remove breakpoints
* PR comments
* lint
* census data completed
* lint
* renaming data source
2021-10-13 16:00:33 -04:00
Jorge Escobar
fc5ed37fca
dependabot bump pillow ( #681 )
...
* dependabot bump pillow
* updated poetry
* adding encoding to file open
2021-09-14 17:28:59 -04:00
Lucas Merrill Brown
65ceb7900f
Score F, testing methodology ( #510 )
...
* fixing dependency issue
* fixing more dependencies
* including fraction of state AMI
* wip
* nitpick whitespace
* etl working now
* wip on scoring
* fix rename error
* reducing metrics
* fixing score f
* fixing readme
* adding dependency
* passing tests
* linting/black
* removing unnecessary sample
* fixing error
* adding verify flag on etl/base
Co-authored-by: Jorge Escobar <jorge.e.escobar@omb.eop.gov>
2021-08-24 16:40:54 -04:00
Nat Hillard
ec19d86f6f
Adding back census to list of potential datasets, but separating out from standard list ( #484 )
...
Error this addresses:
File "/Users/lucas/Documents/usds/repos/justice40-tool/data/data-pipeline/data_pipeline/etl/runner.py", line 71, in etl_runner
f"data_pipeline.etl.sources.{dataset['module_dir']}.etl"
TypeError: 'NoneType' object is not subscriptable
2021-08-09 09:52:06 -04:00
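The TypeError quoted in ec19d86f6f is what Python raises when a dataset lookup returns None and the result is then subscripted with ['module_dir']. The following is a minimal sketch of that failure mode and of keeping census separate from the standard list, as the commit title describes. The names DATASET_LIST, CENSUS_INFO, find_dataset, and module_path, and the example dataset entries, are hypothetical and are not taken from the repository; only the module path format comes from the traceback.

```python
# Hypothetical registry: the entries and names here are illustrative assumptions,
# not the repository's actual dataset list.
DATASET_LIST = [
    {"name": "ejscreen", "module_dir": "ejscreen"},
    {"name": "housing_and_transportation", "module_dir": "housing_and_transportation"},
]

# Census is kept out of the standard list and only considered on request.
CENSUS_INFO = {"name": "census", "module_dir": "census"}


def find_dataset(name, include_census=False):
    """Return the matching dataset dict, or None when nothing matches."""
    candidates = DATASET_LIST + ([CENSUS_INFO] if include_census else [])
    return next((d for d in candidates if d["name"] == name), None)


def module_path(name, include_census=False):
    """Build the ETL module path, failing with a clear message on a miss."""
    dataset = find_dataset(name, include_census)
    if dataset is None:
        # Without this guard, dataset['module_dir'] below would raise
        # TypeError: 'NoneType' object is not subscriptable.
        raise ValueError(f"Unknown dataset: {name!r}")
    return f"data_pipeline.etl.sources.{dataset['module_dir']}.etl"
```

In this sketch, module_path("census") raises ValueError because census is absent from the standard list, while module_path("census", include_census=True) resolves normally.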
Jorge Escobar
f51b0d69d9
Poetry updates for application ( #483 )
2021-08-06 16:24:30 -04:00
Nat Hillard
9d962eb5d9
Moving from relative imports to absolute to enable poetry run python data-pipeline/application.py [command] ( #476 )
2021-08-06 11:41:28 -04:00
Nat Hillard
45a8b1c026
Census ETL should use standard ETL form ( #474 )
...
* Fixes #473
Census ETL should use standard ETL form
* linter fixes
2021-08-06 11:01:51 -04:00
Nat Hillard
9f3b2f056b
Fixes #467 : ( #470 )
...
If the census download task is run more than once,
us.csv doubles in size and all data is removed from dataframe
2021-08-05 16:20:18 -04:00
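One plausible way a file such as us.csv doubles in size on a second run is that the task appends to an existing file instead of replacing it. The commit message in 9f3b2f056b does not state the cause, so the sketch below is an assumption that only illustrates the difference between append mode and write mode. The helper names write_rows and count_rows and the sample rows are hypothetical.

```python
import csv
import tempfile
from pathlib import Path


def write_rows(path, rows, mode):
    """Write rows to a CSV file using the given open() mode ('a' or 'w')."""
    with open(path, mode, newline="", encoding="utf-8") as f:
        csv.writer(f).writerows(rows)


def count_rows(path):
    """Count the rows currently stored in a CSV file."""
    with open(path, newline="", encoding="utf-8") as f:
        return sum(1 for _ in csv.reader(f))


# Illustrative sample data, not the real contents of us.csv.
rows = [["01", "Alabama"], ["02", "Alaska"]]

with tempfile.TemporaryDirectory() as tmp:
    appended = Path(tmp) / "appended.csv"
    overwritten = Path(tmp) / "overwritten.csv"

    # Simulate running the same download task twice.
    for _ in range(2):
        write_rows(appended, rows, "a")     # append mode: content accumulates
        write_rows(overwritten, rows, "w")  # write mode: content is replaced

    appended_count = count_rows(appended)
    overwritten_count = count_rows(overwritten)
```

Opening the output in write mode, or deleting it before the task starts, keeps repeated runs idempotent; which approach the actual fix took is not recorded in this log.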
Nat Hillard
c1568e87c0
Data directory should adopt standard Poetry-suggested python package structure ( #457 )
...
* Fixes #456 - Our data directory should adopt standard python package structure
* a few missed references
* updating readme
* updating requirements
* Running Black
* Fixes for flake8
* updating pylint
2021-08-05 15:35:54 -04:00