j40-cejst-2/data/data-pipeline/data_pipeline/etl/sources/census_decennial
Travis Newby 6f39033dde
Add ability to cache ETL data sources (#2169)
* Add a rough prototype allowing a developer to pre-download data sources for all ETLs

* Update code to be more production-ish

* Move fetch to Extract part of ETL
* Create a downloader to house all downloading operations
* Remove unnecessary "name" in data source

* Format source files with black

* Fix issues from pylint and get the tests working with the new folder structure

* Clean up files with black

* Fix unzip test

* Add caching notes to README

* Fix tests (linting and case sensitivity bug)

* Address PR comments and add API keys for census where missing

* Merging comparator changes from main into this branch for the sake of the PR

* Add note on using cache (-u) during pipeline
2023-03-03 12:26:24 -06:00
__init__.py [ISS-723] Load Census Data for 4 Territories (#816) 2021-11-09 16:32:46 -05:00
etl.py Add ability to cache ETL data sources (#2169) 2023-03-03 12:26:24 -06:00
README.md [ISS-723] Load Census Data for 4 Territories (#816) 2021-11-09 16:32:46 -05:00

Decennial Census of Island Areas

This ETL folder loads data from the Decennial Census of Island Areas, because the Island Areas aren't included in the American Community Survey (ACS).

The 2010 Decennial Census API publishes a separate summary file for each of the four Island Areas (American Samoa, Guam, the Commonwealth of the Northern Mariana Islands, and the US Virgin Islands):

  1. https://api.census.gov/data/2010/dec/as.html
  2. https://api.census.gov/data/2010/dec/gu.html
  3. https://api.census.gov/data/2010/dec/mp.html
  4. https://api.census.gov/data/2010/dec/vi.html
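
As a minimal sketch of how these four datasets might be queried, the snippet below builds a Census API query URL for each one. It is illustrative only and is not taken from `etl.py`. The dataset codes (`as`, `gu`, `mp`, `vi`) come from the URLs above and the FIPS codes are the standard state-level codes for each territory. The helper name `build_query_url`, the default `NAME` variable, and the county-level geography are assumptions chosen for the example.

```python
from urllib.parse import urlencode

# Base path shared by the four summary-file URLs listed above.
BASE_URL = "https://api.census.gov/data/2010/dec"

# Dataset code (from the URLs above) -> state-level FIPS code.
ISLAND_AREAS = {
    "as": "60",  # American Samoa
    "gu": "66",  # Guam
    "mp": "69",  # Commonwealth of the Northern Mariana Islands
    "vi": "78",  # US Virgin Islands
}


def build_query_url(dataset, variables=("NAME",)):
    """Build a query URL for one Island Area dataset (hypothetical helper).

    Assumption: the requested variables exist in the dataset and
    county-level geography is sufficient for the example.
    """
    fips = ISLAND_AREAS[dataset]
    query = urlencode(
        {
            "get": ",".join(variables),
            "for": "county:*",
            "in": f"state:{fips}",
        },
        safe=":*,",  # keep the API's own delimiters unescaped
    )
    return f"{BASE_URL}/{dataset}?{query}"


if __name__ == "__main__":
    for code in ISLAND_AREAS:
        print(build_query_url(code))
```

The sketch only constructs URLs and performs no network requests. A real extract step would also need to pick the variables and geography it actually requires, and may need to supply a Census API key.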