Articles by R on kieranhealy.org

Burn Notice

February 16, 2025 | R on kieranhealy.org

Your Phone and Watch have a lot of data about you. I mean, like, a lot. Someone should really write a book all about the general issues for society that this raises. Yesterday I decided I wanted to take a look specifically at the health data on my iPho... [Read more...]

Kerning and Kerning in a Widening Gyre

February 6, 2025 | R on kieranhealy.org

This post summarizes an extended period of deep annoyance. I have tried to solve the problem it describes more than once before and not quite done it. This has, in fact, happened again. I have still not satisfactorily solved the problem. But this time I know why I can’t ... [Read more...]

Halloween Data Cleaning

October 12, 2024 | R on kieranhealy.org

This week in Modern Plain Text Computing we put together some of the things we’ve been learning about cleaning and tidying data. Here’s a somewhat sobering example using data from the Fatality Analysis Reporting System, which is how the NHTSA tracks information about road accidents in the United ... [Read more...]

Dr Drang and the Electoral College

September 6, 2024 | R on kieranhealy.org

The other week, the Internet’s most beloved creepy snowman wrote a blog post where he showed how to use a little Python to group states by their number of electoral college votes to make a table like this:

Electors  States                      PopPct  ECPct
3         AK, DE, DC, ND, SD, VT, WY  1.61%   3.90%
4 ... [Read more...]

New York City’s POC Population

May 16, 2024 | R on kieranhealy.org

I was messing around with some Census data this morning. I had two main thoughts. One was to show the utility of old-fashioned grayscale when it comes to mapping data (or displaying it in general). The goal of most carefully thought-through dataviz col... [Read more...]

Make Your Own NOAA Sea Temperature Graph

April 4, 2024 | R on kieranhealy.org

Sea-surface temperatures in the North Atlantic have been in the news recently as they continue to break records. While there are already a number of excellent summaries and graphs of the data, I thought I’d have a go at making some myself. The starting point is the detailed data ... [Read more...]

gssr Update

April 1, 2024 | R on kieranhealy.org

NORC released version 2a of the 1972-2022 General Social Survey cumulative file. I’ve updated {gssr}, an R package that makes it more convenient for R users to work with GSS Data. One handy feature of {gssr} is that it lets you see documentation for individual GSS variables as R ... [Read more...]

Pi Day Circles

March 14, 2024 | R on kieranhealy.org

Some Lissajous animations for Pi Day. Made with R, ggplot, and gganimate. And the really not very efficient code that made them: ... [Read more...]

Dorling Cartograms

December 6, 2023 | R on kieranhealy.org

I was writing some examples for next semester’s dataviz class and shared one of them—a Dorling Cartogram—on the socials medias. Some people don’t like cartograms, some people do like cartograms; in conclusion, we live in a world... [Read more...]

gssr Update

December 2, 2023 | R on kieranhealy.org

The General Social Survey, or GSS, is one of the cornerstones of US public opinion research and one of the most-analyzed datasets in Sociology. My colleague Steve Vaisey aptly describes it as the Hubble Space Telescope of American social science. It is... [Read more...]

Flipbookr for Quarto

August 10, 2023 | R on kieranhealy.org

{flipbookr} is an R package written by Gina Reynolds. It’s very useful for teaching. It was developed for use with .Rmd files and Xaringan and presently does not work with Quarto. I hacked up a version of Flipbookr that does work with Quarto. Using it with Xaringan should be exactly the ... [Read more...]

The Naming of Stats

June 19, 2023 | R on kieranhealy.org

The Naming of Stats is a difficult matter, It isn’t just one of your holiday games; You may think at first I’m as mad as a hatter When I tell you, a stat must have THREE DIFFERENT NAMES. First of all are the names where usage is informal, ... [Read more...]

Reading Remote Data Files

March 25, 2023 | R on kieranhealy.org

Sometimes data arrives as a series of individual files each of which is organized in the same way—which is to say, each of which has the same variables, features, or columns. Imagine a series of tables reporting mandated information about every s... [Read more...]

Escaping the Malthusian Trap

January 8, 2023 | R on kieranhealy.org

The Broadberry et al GDP series has estimates of England’s real GDP and population from the year 1270 onwards. It’s available, along with a lot of other long-run data, from The Bank of England. Here’s an animation of the series. I som... [Read more...]

Unhappy in its Own Way

July 22, 2022 | R on kieranhealy.org

“Happy families are all alike; every unhappy family is unhappy in its own way” runs the opening sentence of Anna Karenina. Hadley Wickham echoes the sentiment in a somewhat different context: “Tidy datasets are all alike, but every messy dataset is messy in its own way”. Data analysis is mostly ... [Read more...]