Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
In the graphic you can see that by far most tweets in our set are tweets – not retweets, and that of the 12,159,856 tweeters in our set, only about 13% get retweeted (the decimal is in the wrong place for both retweets and retweeted users. Sorry
Further, some folks still use the old style meaning we still need to parse the text of the tweet to get an accurate count of retweets. And this number can be significant.
In the chart to the right I have shown the ratio of tweets, new style retweets and old style retweets for a subset of our collection (only tweets with the hashtag #OccupyOakland from 10/12/2011 to 11/20/2011). I this set 8% are old-style retweets. But in this 8% I am including modified tweets (MT) retweets (RT) and via.
I think as researchers start to code text from tweets one thing we need to think about is the different meanings that apply when people use these different mechanisms of attribution. Comments?
In the meantime, here is a bit of network eye candy.
Nodes are sized by how many times those users’ tweets were retweeted.
The ring around the outside represents people who tweeted with the OccupyOakland hashtag and were retweeted but not by anyone in the code of the network.
The core is densely connected, which makes sense for a few reasons. First, the is a collection of retweets over 30 days, so it represents many information flows and connections between people. Second, the OccupyOakland data set has a surprisingly high rate of retweeting. In the first graph we can see the rate of retweeting across the whole 65 million tweets in our set is low – about 7% (again, the decimal is the wrong place on the plot). For the Oakland subset it is 64% for new style and 72% for both new and old!
All of these visualizations were created using R. I’m happy to post example code if anyone is interested.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.