An Analysis of Texas High School Academic Competition Results, Part 5 – Miscellaneous

[This article was first published on r on Tony ElHabr, and kindly contributed to R-bloggers].

There’s a lot to analyze with the Texas high school academic UIL data set. Maybe I find it more interesting than others due to my personal experiences with these competitions.

Having examined some of the biggest topics associated with this data (competitions, individuals, and schools) in a broad manner, I want to look at a few other things that don’t fall neatly into these categories but that I think are worth investigating.


Let’s look at the performance of siblings. Maybe this topic only came to mind because I have brothers, one of whom is my twin, but I think anyone can find something interesting here.

Sibling Participation

So, let’s start with something easy: which siblings competed together the most?

rnk  name_last  name_first_pair    n
1    Zhang      Jim & Mark         24
2    Ballard    Chance & Rance     22
3    Garcia     Javier & Juan      20
4    Walter     Collin & Lowell    20
5    Bass       Brian & Michael    19
6    Fabre      Guadalupe & Maria  19
7    Priest     Alex & Chandler    18
8    Vicuna     Bianca & Daniel    17
9    Gee        Grace & John       16
10   Morris     Jason & Ty         16
18   Elhabr     Andrew & Anthony   13

Note: total number of rows: 2,289

Admittedly, I am a bit disappointed to find that my twin brother and I are not at the very top of this list. Nonetheless, we are fairly near the top, so I can take some satisfaction in that. [1]

I should note that the scraped data does not identify siblings, so I had to define criteria to do so. Specifically, the table above requires that two people share the same last name, school, and city, and that they competed in the exact same competition: one occurring in a given year, of the same competition type and level (and the same conference and competition area, if applicable). The counts are inflated when the requirement of the same competition type and level (and conference and competition area) is dropped, and even more so when the same-year requirement is dropped as well.
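The matching rule described above can be sketched in a few lines of Python. This is only an illustration of the criteria, not the code behind the table: the field names (`name_last`, `name_first`, `school`, `city`, `year`, `comp_type`, `comp_level`, `conf`, `area`) and the sample records are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical field names; the scraped data's actual columns may differ.
FAMILY_KEYS = ("name_last", "school", "city")
COMP_KEYS = ("year", "comp_type", "comp_level", "conf", "area")


def count_sibling_pairs(records):
    """Count shared competitions for each presumed sibling pair."""
    # Collect first names per (family, competition) combination.
    groups = defaultdict(set)
    for rec in records:
        family = tuple(rec[k] for k in FAMILY_KEYS)
        # conf/area may be absent when not applicable; .get() yields None.
        comp = tuple(rec.get(k) for k in COMP_KEYS)
        groups[(family, comp)].add(rec["name_first"])

    # Every pair of first names within a group shared that competition.
    counts = Counter()
    for (family, _comp), firsts in groups.items():
        for a, b in combinations(sorted(firsts), 2):
            counts[(family, f"{a} & {b}")] += 1
    return counts


# Made-up records: Ann and Bob share two competitions; Ann enters a third alone.
base = {"name_last": "Doe", "school": "EXAMPLE HS", "city": "Springfield",
        "comp_type": "Math", "comp_level": "District", "conf": 4, "area": None}
records = [
    {**base, "name_first": "Ann", "year": 2015},
    {**base, "name_first": "Bob", "year": 2015},
    {**base, "name_first": "Ann", "year": 2016},
    {**base, "name_first": "Bob", "year": 2016},
    {**base, "name_first": "Ann", "year": 2017},
]
counts = count_sibling_pairs(records)
```

Removing entries from `COMP_KEYS` loosens the rule in the way described above, since people who never sat the same competition would then fall into the same group.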

Sibling Performance

Participation in competitions is one thing, but what about sibling performance? Let’s use the same metric used elsewhere for ranking performance, the percent rank of scores summed across all records (prnk_sum), and see which sibling pairs show up at the top.
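For readers unfamiliar with the metric, here is a rough Python sketch of a summed percent rank. It assumes the common definition of percent rank, (minimum rank − 1) / (n − 1), computed within each competition; the exact definition used for prnk_sum may differ, and the names and scores below are made up.

```python
from collections import defaultdict


def percent_rank(scores):
    """Share of the other competitors that each score strictly beats."""
    n = len(scores)
    if n < 2:
        # Undefined for a single competitor; 0.0 is an arbitrary choice here.
        return [0.0] * n
    return [sum(other < s for other in scores) / (n - 1) for s in scores]


def summed_percent_rank(competitions):
    """Sum each person's percent rank over all competitions they entered."""
    totals = defaultdict(float)
    for results in competitions:  # results maps name -> score
        names = list(results)
        ranks = percent_rank([results[name] for name in names])
        for name, pr in zip(names, ranks):
            totals[name] += pr
    return dict(totals)


# Made-up scores for two competitions.
competitions = [
    {"Ann": 90, "Bob": 70, "Cat": 50},
    {"Ann": 60, "Bob": 80},
]
totals = summed_percent_rank(competitions)
```

With these inputs Bob finishes with the highest total, because he entered both competitions and won the second, while Cat scores zero for finishing last in her only entry.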

rnk  name_last  name_first_pair    n_bycomp  n_defeat  n_state  prnk   rnk_max
1    Priest     Alex & Chandler    1,222     1,022     11       31.73  72
2    Fabre      Guadalupe & Maria  1,348     1,078     14       30.31  76
3    Walter     Collin & Lowell    1,074     768       16       29.99  80
4    Bass       Brian & Michael    1,138     889       12       29.62  76
5    Gee        Grace & John       896       711       10       26.28  64
6    Morris     Jason & Ty         852       625       11       24.13  64
7    Patterson  Ben & Jeremy       994       708       9        22.30  62
8    Alsup      Jon & Mason        886       667       9        22.18  56
9    Vicuna     Bianca & Daniel    1,056     653       3        21.39  68
10   Beavers    Clay & Cody        902       696       8        20.71  52
17   Elhabr     Andrew & Anthony   788       481       5        16.89  52

Note: total number of rows: 1,787

It looks like the pairs at the top of these score-based rankings are fairly similar to the pairs that competed most frequently. (This is not too surprising, given that my ranking metric is a summed value that “rewards” volume of participation rather than per-competition performance.) Again, my twin brother and I appear near the top.
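A toy Python example shows why a summed metric favors volume. The percent ranks below are invented for illustration and do not come from the data set.

```python
from statistics import mean

# Made-up percent ranks, one value per competition entered.
percent_ranks = {
    "frequent_pair": [0.6] * 20,  # many entries, solid results
    "selective_pair": [0.9] * 5,  # few entries, excellent results
}

# Order the pairs from best to worst under each metric.
by_sum = sorted(percent_ranks, key=lambda k: sum(percent_ranks[k]), reverse=True)
by_mean = sorted(percent_ranks, key=lambda k: mean(percent_ranks[k]), reverse=True)
```

Here `by_sum` puts the frequent pair first and `by_mean` puts the selective pair first, so the choice of metric decides which kind of competitor rises to the top.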

My High School

Even though I highlighted my high school (“CLEMENS”) in my examination of schools and looked at individual scores elsewhere, I did not look at other individuals that have gone to my school. Perhaps it is a bit egotistical, but I am interested in knowing how I compare with others that have attended my school (either before, with, or after me).

rnk  name             n   prnk_sum  prnk_mean  n_defeat_sum  n_defeat_mean  n_advanced_sum
1    Land, Noah       17  13.67     0.80       447           26.29          14
2    Fulton, Chris    18  12.66     0.70       351           19.50          16
3    Gonzales, Gavyn  17  12.26     0.72       371           21.82          15
4    Elhabr, Andrew   15  9.87      0.66       296           19.73          11
5    Perry, Robert    15  8.75      0.58       249           16.60          10
6    Garcia, Jon      9   7.94      0.88       259           28.78          6
7    Nesser, Austin   17  7.93      0.47       231           13.59          15
8    Elhabr, Anthony  13  7.76      0.60       216           16.62          10
9    Guyott, David    9   5.37      0.60       157           17.44          7
10   Baker, Ian       10  5.32      0.53       185           18.50          8

Note: total number of rows: 95

Although my twin brother and I did not rank at the very top of the siblings by participation or performance, we do appear near the top when evaluating only people from my high school. In my opinion, the sample size isn’t so small that this achievement is trivial.


I think all I’ve done here is more investigation of my personal performance, so I’ll spare the reader any more of my egotistical exploration. And, with that said, I think this is a good point to bring an end to my investigation of Texas high school academic UIL competitions.

  1. I don’t explicitly try to filter for twins only, but it’s reasonable to believe that many, if not most, of these pairs are twins.
