Public 12 Reporting
COVID-19 Case Surveillance Public Use Data Utility Summary
Users should consider the level of completeness, including suppression levels, when planning their analyses and use of public datasets. Privacy protections suppress field values to reduce reidentification risks. Completeness varies by jurisdiction (i.e., state, local, and territorial) and time period. Variables are consistently coded to the value “Unknown” when jurisdictions specify in the case data submitted to CDC that the value is unknown, the value “Missing” when jurisdictions do not provide a value, and the value “NA” when the value is suppressed as part of privacy protections.
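The coding scheme described above can be turned into a per-field completeness tally. The following is a minimal sketch, not the method CDC uses: the function name `tally_completeness`, the treatment of empty cells as missing, and the sample rows are all assumptions made for illustration. Only the three sentinel strings come from the text.

```python
from collections import Counter

# Sentinel values described in the summary text.
SENTINELS = {"NA": "suppressed", "Missing": "missing", "Unknown": "unknown"}


def tally_completeness(rows, fields):
    """Count suppressed, missing, unknown, and non-blank values per field.

    `rows` is an iterable of dicts (for example from csv.DictReader) and
    `fields` lists the column names to inspect. Hypothetical helper.
    """
    counts = {field: Counter() for field in fields}
    for row in rows:
        for field in fields:
            value = (row.get(field) or "").strip()
            if not value:
                # Assumption: an empty cell is counted like "Missing".
                category = "missing"
            else:
                category = SENTINELS.get(value, "non_blank")
            counts[field][category] += 1
    return counts


# Made-up sample rows, not taken from the real dataset.
sample = [
    {"sex": "Female", "age_group": "NA"},
    {"sex": "Unknown", "age_group": "18 to 49 years"},
    {"sex": "Missing", "age_group": "18 to 49 years"},
]
result = tally_completeness(sample, ["sex", "age_group"])
```

Keeping the three sentinels separate, rather than collapsing them into a single null, preserves the distinction between values a jurisdiction never sent, values it reported as unknown, and values removed for privacy.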
Dataset version: 5/2/2024
Quick Summary
    summary            all_fields_counts  all_fields_pct  quasi_fields_counts  quasi_fields_pct
    String             Double             Double          Double               Double
1   total_rows         105,869,141        NaN%            105,869,141          NaN%
2   total_columns      12                 NaN%            3                    NaN%
3   total_cells        1,270,429,692      100.0%          317,607,423          100.0%
4   suppressed_fields  75                 0.0%            75                   0.0%
5   missing_fields     290,381,180        22.9%           4,317,834            1.4%
6   unknown_fields     86,247,511         6.8%            32,738,708           10.3%
7   non_blank_fields   893,800,926        70.4%           280,550,806          88.3%
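The Quick Summary figures can be checked for internal consistency: total cells should equal rows times columns, and the four categories should add up to the total cell count. The sketch below uses only the counts from the table. The quasi-identifier column count of 3 is inferred from total_cells divided by total_rows, and the names `SUMMARY` and `percentages` are illustrative.

```python
TOTAL_ROWS = 105_869_141

# Counts copied from the Quick Summary table.
SUMMARY = {
    "all_fields": {
        "columns": 12,
        "total_cells": 1_270_429_692,
        "suppressed": 75,
        "missing": 290_381_180,
        "unknown": 86_247_511,
        "non_blank": 893_800_926,
    },
    "quasi_fields": {
        "columns": 3,  # inferred: 317,607,423 / 105,869,141
        "total_cells": 317_607_423,
        "suppressed": 75,
        "missing": 4_317_834,
        "unknown": 32_738_708,
        "non_blank": 280_550_806,
    },
}

CATEGORIES = ("suppressed", "missing", "unknown", "non_blank")


def percentages(block):
    """Validate one column group and return each category as a percentage."""
    total = block["total_cells"]
    # Rows times columns must reproduce the reported cell count.
    assert TOTAL_ROWS * block["columns"] == total
    # The four categories partition every cell.
    assert sum(block[name] for name in CATEGORIES) == total
    return {name: round(100 * block[name] / total, 1) for name in CATEGORIES}


pct = {group: percentages(block) for group, block in SUMMARY.items()}
```

Both column groups pass the two checks, and the rounded percentages agree with the ones printed in the table.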
Field Level Utility Summary
    variable                          suppressed  suppressed_pct  missing    missing_pct  unknown     unknown_pct
    String                            Long        String          Long       String       Long        String
1   sex                               12          0.0%            496,836    0.5%         1,031,761   1.0%
2   age_group                         51          0.0%            1,128,733  1.1%         0           0.0%
3   race_ethnicity_combined           12          0.0%            2,692,265  2.5%         31,706,947  29.9%
4   records_with_any_quasi_identifier 51          0.0%            3,998,415  3.8%         32,025,979  30.3%
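The per-field counts can be reconciled against the quasi-identifier totals in the Quick Summary: summing the three fields should reproduce the suppressed, missing, and unknown totals. This is a small sketch using only numbers from the two tables. The names `FIELD_COUNTS` and `QUASI_TOTALS` are illustrative, and the percentage denominator is assumed to be the total row count.

```python
TOTAL_ROWS = 105_869_141

# Per-field counts copied from the Field Level Utility Summary.
FIELD_COUNTS = {
    "sex": {"suppressed": 12, "missing": 496_836, "unknown": 1_031_761},
    "age_group": {"suppressed": 51, "missing": 1_128_733, "unknown": 0},
    "race_ethnicity_combined": {
        "suppressed": 12,
        "missing": 2_692_265,
        "unknown": 31_706_947,
    },
}

# Quasi-identifier totals copied from the Quick Summary.
QUASI_TOTALS = {"suppressed": 75, "missing": 4_317_834, "unknown": 32_738_708}

# Sum each category across the three quasi-identifier fields.
summed = {
    category: sum(counts[category] for counts in FIELD_COUNTS.values())
    for category in QUASI_TOTALS
}

# Assumption: field-level percentages are computed against total rows.
unknown_pct = {
    field: round(100 * counts["unknown"] / TOTAL_ROWS, 1)
    for field, counts in FIELD_COUNTS.items()
}
```

Here `summed` equals `QUASI_TOTALS`, so the two tables agree. The unknown percentages show that nearly all unknown quasi-identifier values come from race_ethnicity_combined.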