Race and Ethnicity in New York City

[This article was first published on R on kieranhealy.org, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

I’m about to start work on a second edition of my Data Visualization book. As a result I continue to mess around with stuff I’m considering including in a new edition. The other day I pulled some block-level Census data and drew a map of the distribution of people of color in New York City, which is to say the share of the population that reports being something other than Non-Hispanic White. I’ve polished that map a little more and drawn some additional ones. As before, the main substantive issues to bear in mind are (a) how the Census measures and classifies race and ethnicity, and (b) the new and I am inclined to think unwise practice of “differential privacy”. The main thing to know about the former is that in the present US Census Bureau schema people of Hispanic or Latino origin may be of any race. (I follow the Census’s nomenclature here, by the way, as it’s their data.) This is the reason that “Non-Hispanic White” is a category for example. The main thing to know about the latter is that it deliberately introduces noise into counts within units where the observed N is small.

The other thing to remember is every choropleth maker’s oldest friend, the Modifiable Areal Unit Problem. Census Blocks are the smallest spatial unit we can make a choropleth map of, but they’re not “real”, so to speak.

As usual, the tools used to make these maps are R, ggplot, and the tidycensus and sf packages. Together, of course, with the really phenomenal range of data made available by the Census Bureau API.

For these new versions I decided to add a little more context by sketching in some of the coastline, particularly the outline of New Jersey to the west, along with a reminder that Long Island continues to exist past the Queens border, etc These coastal outlines come from the NOAA CUSP maps.

New York City’s POC population.

New York City’s POC population.

New York City population, percent reporting ‘Black Alone’ to race question.

New York City population, percent reporting ‘Black Alone’ to race question.

New York City population, percent Hispanic/Latino origin.

New York City population, percent Hispanic/Latino origin.

New York City population, precent reporting ‘Asian Alone’ to race question.

New York City population, precent reporting ‘Asian Alone’ to race question.

PDFs of these maps:

To leave a comment for the author, please follow the link and comment on their blog: R on kieranhealy.org.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)