  1. The data shows the causes of death in NYC between 2007 and 2021. The Sex column mixes two encodings of the same values: the full words Male and Female, and the abbreviations M and F.
  2. In the Race Ethnicity column, there are inconsistently named categories representing the same demographic groups, such as 'Non-Hispanic White' vs. 'White Non-Hispanic', and 'Non-Hispanic Black' vs. 'Black Non-Hispanic'.
  3. There are 343 missing values (NaN) in both the Death Rate and Age Adjusted Death Rate columns.
  4. The Deaths, Death Rate, and Age Adjusted Death Rate columns contain the value `.` (a single dot), likely used as a placeholder for missing data.
  5. Columns that represent numbers (Deaths, Death Rate, Age Adjusted Death Rate) are loaded as object/string data types instead of numeric types.
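
The findings above suggest a small cleaning pass. The following is a minimal sketch with pandas, not the method used for this dataset: the function name `clean_deaths`, the two mapping dictionaries, the choice of "Male"/"Female" and the "Non-Hispanic ..." form as canonical labels, and the sample rows are all assumptions made for illustration. Only the column names come from the notes above.

```python
import pandas as pd

# Hypothetical mappings: the canonical labels chosen here are an assumption,
# and the real file may contain further variants that need adding.
SEX_MAP = {"M": "Male", "F": "Female"}
RACE_MAP = {
    "White Non-Hispanic": "Non-Hispanic White",
    "Black Non-Hispanic": "Non-Hispanic Black",
}
NUMERIC_COLS = ["Deaths", "Death Rate", "Age Adjusted Death Rate"]


def clean_deaths(df: pd.DataFrame) -> pd.DataFrame:
    """Return a cleaned copy of df; the input frame is left unchanged."""
    out = df.copy()
    # Findings 1 and 2: collapse alternative spellings onto one label each.
    out["Sex"] = out["Sex"].replace(SEX_MAP)
    out["Race Ethnicity"] = out["Race Ethnicity"].replace(RACE_MAP)
    # Findings 4 and 5: convert the string columns to numbers.
    # errors="coerce" turns "." (and any other unparseable value) into NaN.
    for col in NUMERIC_COLS:
        out[col] = pd.to_numeric(out[col], errors="coerce")
    return out


# Made-up sample rows, only to exercise the function.
sample = pd.DataFrame(
    {
        "Sex": ["M", "Female", "F", "Male"],
        "Race Ethnicity": [
            "White Non-Hispanic",
            "Non-Hispanic White",
            "Black Non-Hispanic",
            "Non-Hispanic Black",
        ],
        "Deaths": ["12", ".", "7", "30"],
        "Death Rate": ["1.5", ".", None, "2.0"],
        "Age Adjusted Death Rate": ["1.4", ".", None, "1.9"],
    }
)
cleaned = clean_deaths(sample)
print(cleaned.dtypes)
print(cleaned.isna().sum())
```

Because `errors="coerce"` silently converts every unparseable string to NaN, it may be worth comparing the NaN counts before and after conversion to confirm that only the `.` placeholders were affected.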