Race and Ethnicity in New York City
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
I’m about to start work on a second edition of my Data Visualization book. As a result I continue to mess around with stuff I’m considering including in a new edition. The other day I pulled some block-level Census data and drew a map of the distribution of people of color in New York City, which is to say the share of the population that reports being something other than Non-Hispanic White. I’ve polished that map a little more and drawn some additional ones. As before, the main substantive issues to bear in mind are (a) how the Census measures and classifies race and ethnicity, and (b) the new and I am inclined to think unwise practice of “differential privacy”. The main thing to know about the former is that in the present US Census Bureau schema people of Hispanic or Latino origin may be of any race. (I follow the Census’s nomenclature here, by the way, as it’s their data.) This is the reason that “Non-Hispanic White” is a category for example. The main thing to know about the latter is that it deliberately introduces noise into counts within units where the observed N is small.
The other thing to remember is every choropleth maker’s oldest friend, the Modifiable Areal Unit Problem. Census Blocks are the smallest spatial unit we can make a choropleth map of, but they’re not “real”, so to speak.
As usual, the tools used to make these maps are R, ggplot, and the tidycensus and sf packages. Together, of course, with the really phenomenal range of data made available by the Census Bureau API.
For these new versions I decided to add a little more context by sketching in some of the coastline, particularly the outline of New Jersey to the west, along with a reminder that Long Island continues to exist past the Queens border, etc These coastal outlines come from the NOAA CUSP maps.
PDFs of these maps:
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.