Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
There’s a lot to analyze with the Texas high school academic UIL data set. Maybe I find it more interesting than others due to my personal experiences with these competitions.
Now, after examining some of the biggest topics associated with this data–including competitions, individuals, and schools–in a broad manner, there are some other things that don’t necessarily fall into these categories that I think are worth investigating.
Siblings
Let’s look at the performance of siblings. Maybe this topic only came to mind for me because I have brothers, on of who is my twin, but I think anyone can find something interesting on this matter.
Sibling Participation
So, let’s start with something easy–which siblings competed together the most?
rnk | name_last | name_first_pair | n |
---|---|---|---|
1 | Zhang | Jim & Mark | 24 |
2 | Ballard | Chance & Rance | 22 |
3 | Garcia | Javier & Juan | 20 |
4 | Walter | Collin & Lowell | 20 |
5 | Bass | Brian & Michael | 19 |
6 | Fabre | Guadalupe & Maria | 19 |
7 | Priest | Alex & Chandler | 18 |
8 | Vicuna | Bianca & Daniel | 17 |
9 | Gee | Grace & John | 16 |
10 | Morris | Jason & Ty | 16 |
18 | Elhabr | Andrew & Anthony | 13 |
Note: 1 # of total rows: 2,289
Admittedly, I am a bit disappointed to find that my twin brother and I are not at the very top of this list. Nonetheless, we are fairly near the top, so I can take some satisfaction in that. 1
I should note that the scraped data does not distinguish siblings, so I had to define criteria to do so. To be specific, the table above enforces the criteria that two people have the same last name, school, and city, and that they compete in the exact same competition–that is, a competition occurring in a given year and being of a same competition type and same competition level (as well as the same conference and competition area, if applicable). The numbers are inflated when not enforcing the criteria that the two people must have competed in the same competition type and level (nor conference and competition area), and even more so when throwing out the criteria for same year.
Sibling Performance
Participation in competitions is one thing, but what about sibling
performance? Let’s use the same metric used elsewhere for ranking
performance–percent rank of scores summed across all records
(prnk_sum
)–and see which sibling pairs show up among the top.
rnk | name_last | name_first_pair | n_bycomp | n_defeat | n_state | prnk | rnk_max |
---|---|---|---|---|---|---|---|
1 | Priest | Alex & Chandler | 1,222 | 1,022 | 11 | 31.73 | 72 |
2 | Fabre | Guadalupe & Maria | 1,348 | 1,078 | 14 | 30.31 | 76 |
3 | Walter | Collin & Lowell | 1,074 | 768 | 16 | 29.99 | 80 |
4 | Bass | Brian & Michael | 1,138 | 889 | 12 | 29.62 | 76 |
5 | Gee | Grace & John | 896 | 711 | 10 | 26.28 | 64 |
6 | Morris | Jason & Ty | 852 | 625 | 11 | 24.13 | 64 |
7 | Patterson | Ben & Jeremy | 994 | 708 | 9 | 22.30 | 62 |
8 | Alsup | Jon & Mason | 886 | 667 | 9 | 22.18 | 56 |
9 | Vicuna | Bianca & Daniel | 1,056 | 653 | 3 | 21.39 | 68 |
10 | Beavers | Clay & Cody | 902 | 696 | 8 | 20.71 | 52 |
17 | Elhabr | Andrew & Anthony | 788 | 481 | 5 | 16.89 | 52 |
Note: 1 # of total rows: 1,787
It looks like the pairs at the top of these rankings based on score are fairly similar to the list of pairs competing most frequently. (This is not too surprising given that my choice of metric of ranking is based on a summed value that “rewards” volume of participation rather than per-competition performance.) Again, my twin brother and I appear near the top.
My High School
Even though I highlighted my high school (“CLEMENS”) in my examination of schools and looked at individual scores elsewhere, I did not look at other individuals that have gone to my school. Perhaps it is a bit egotistical, but I am interested in knowing how I compare with others that have attended my school (either before, with, or after me).
rnk | name | n | prnk_sum | prnk_mean | n_defeat_sum | n_defeat_mean | n_advanced_sum |
---|---|---|---|---|---|---|---|
1 | Land, Noah | 17 | 13.67 | 0.80 | 447 | 26.29 | 14 |
2 | Fulton, Chris | 18 | 12.66 | 0.70 | 351 | 19.50 | 16 |
3 | Gonzales, Gavyn | 17 | 12.26 | 0.72 | 371 | 21.82 | 15 |
4 | Elhabr, Andrew | 15 | 9.87 | 0.66 | 296 | 19.73 | 11 |
5 | Perry, Robert | 15 | 8.75 | 0.58 | 249 | 16.60 | 10 |
6 | Garcia, Jon | 9 | 7.94 | 0.88 | 259 | 28.78 | 6 |
7 | Nesser, Austin | 17 | 7.93 | 0.47 | 231 | 13.59 | 15 |
8 | Elhabr, Anthony | 13 | 7.76 | 0.60 | 216 | 16.62 | 10 |
9 | Guyott, David | 9 | 5.37 | 0.60 | 157 | 17.44 | 7 |
10 | Baker, Ian | 10 | 5.32 | 0.53 | 185 | 18.50 | 8 |
Note: 1 # of total rows: 95
Alas, although my twin brother and I did not rank among the very top of the siblings by participation and performance, we do appear among the top when evaluating only people from my high school. In my opinion, the sample size isn’t so small that this achievement is trivial.
Wrap-Up
I think all I’ve done here is more investigation of my personal performance, so I’ll spare the reader any more of my egotistical exporation. And, with that said, I think this is a good point to bring an end to my investigation of Texas high school academic UIL competitions.
- I don’t explicitly try to filter for twins only, but it’s reasonable to believe that many, if not most, are twins. ^
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.