An Analysis of Texas High School Academic Competition Results, Part 5 – Miscellaneous

[This article was first published on r on Tony ElHabr, and kindly contributed to R-bloggers.]

There’s a lot to analyze with the Texas high school academic UIL data
set. Maybe I find it more interesting than others due to my personal
experiences with these competitions.

Having examined the biggest topics in this data set (competitions,
individuals, and schools) in a broad manner, I think a few other things
that don’t neatly fall into these categories are worth investigating.


Let’s look at the performance of siblings. Maybe this topic only came to
mind for me because I have brothers, one of whom is my twin, but I think
anyone can find something interesting on this matter.

Sibling Participation

So, let’s start with something easy–which siblings competed together
the most?

rnk  name_last  name_first_pair    n
1    Zhang      Jim & Mark         24
2    Ballard    Chance & Rance     22
3    Garcia     Javier & Juan      20
4    Walter     Collin & Lowell    20
5    Bass       Brian & Michael    19
6    Fabre      Guadalupe & Maria  19
7    Priest     Alex & Chandler    18
8    Vicuna     Bianca & Daniel    17
9    Gee        Grace & John       16
10   Morris     Jason & Ty         16
18   Elhabr     Andrew & Anthony   13

Note: # of total rows: 2,289

Admittedly, I am a bit disappointed to find that my twin brother and I
are not at the very top of this list. Nonetheless, we are fairly near the
top, so I can take some satisfaction in that. [1]

I should note that the scraped data does not identify siblings, so I
had to define criteria to do so. Specifically, the table above requires
that two people have the same last name, school, and city, and that they
compete in the exact same competition: one occurring in the same year,
of the same competition type and level, and (if applicable) in the same
conference and competition area. The counts are inflated if the
requirement for the same competition type and level (and conference and
competition area) is dropped, and even more so if the requirement for
the same year is dropped as well.
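
The matching criteria above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code behind the table: the field names (`comp_type`, `comp_level`, `conf`, `area`, and so on), the helper `rec`, and all of the records are made up for the example.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical column names; the real scraped data may be labeled differently.
# Both keys start with surname, school, city, which identify a "family".
STRICT_KEY = ("name_last", "school", "city", "year",
              "comp_type", "comp_level", "conf", "area")
LOOSE_KEY = ("name_last", "school", "city", "year")


def count_pairs(records, key_fields=STRICT_KEY):
    """Count, per candidate sibling pair, the groups in which both appear."""
    groups = defaultdict(set)
    for record in records:
        key = tuple(record[field] for field in key_fields)
        groups[key].add(record["name_first"])

    counts = Counter()
    for key, first_names in groups.items():
        family = key[:3]  # (name_last, school, city)
        for a, b in combinations(sorted(first_names), 2):
            counts[family + (f"{a} & {b}",)] += 1
    return counts


def rec(first, last, comp_type, year=2015):
    """Build one made-up competition record."""
    return {"name_first": first, "name_last": last, "school": "ALPHA",
            "city": "Austin", "year": year, "comp_type": comp_type,
            "comp_level": "District", "conf": 5, "area": None}


records = [
    rec("Ann", "Doe", "Math"), rec("Bob", "Doe", "Math"),
    rec("Ann", "Doe", "Science"), rec("Bob", "Doe", "Science"),
    rec("Cal", "Roe", "Math"), rec("Dee", "Roe", "Science"),
]

strict = count_pairs(records)            # same competition required
loose = count_pairs(records, LOOSE_KEY)  # only same year required
```

With the strict key, only the made-up Doe pair is found, once for each competition they shared. With the loose key, the Roe pair is also matched even though they never entered the same contest, which is one way that relaxing the criteria can admit more pairs.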

Sibling Performance

Participation in competitions is one thing, but what about sibling
performance? Let’s use the same metric used elsewhere for ranking
performance–percent rank of scores summed across all records
(prnk_sum)–and see which sibling pairs show up among the top.

rnk  name_last  name_first_pair    n_bycomp  n_defeat  n_state  prnk   rnk_max
1    Priest     Alex & Chandler    1,222     1,022     11       31.73  72
2    Fabre      Guadalupe & Maria  1,348     1,078     14       30.31  76
3    Walter     Collin & Lowell    1,074     768       16       29.99  80
4    Bass       Brian & Michael    1,138     889       12       29.62  76
5    Gee        Grace & John       896       711       10       26.28  64
6    Morris     Jason & Ty         852       625       11       24.13  64
7    Patterson  Ben & Jeremy       994       708       9        22.30  62
8    Alsup      Jon & Mason        886       667       9        22.18  56
9    Vicuna     Bianca & Daniel    1,056     653       3        21.39  68
10   Beavers    Clay & Cody        902       696       8        20.71  52
17   Elhabr     Andrew & Anthony   788       481       5        16.89  52

Note: # of total rows: 1,787

It looks like the pairs at the top of these score-based rankings are
fairly similar to the pairs that competed together most frequently.
(This is not too surprising, given that my ranking metric is a summed
value that “rewards” volume of participation rather than
per-competition performance.) Again, my twin brother and I appear near
the top.
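
To make the point about volume concrete, here is a small Python sketch that compares a summed percent rank with a mean percent rank. It assumes percent rank means (number of strictly lower scores) / (n - 1) within a competition, which may differ in detail from the calculation behind `prnk_sum`. The competitors and scores are invented, and the sketch ranks individuals rather than pairs, but the mechanics are the same.

```python
from collections import defaultdict


def percent_rank(scores):
    """Map each competitor to (number of strictly lower scores) / (n - 1)."""
    n = len(scores)
    if n < 2:
        return {name: 0.0 for name in scores}
    return {
        name: sum(other < score for other in scores.values()) / (n - 1)
        for name, score in scores.items()
    }


# Made-up scores for three competitions; names and values are illustrative.
competitions = [
    {"A": 60, "B": 100, "C": 40},
    {"A": 70, "C": 50, "D": 30},
    {"A": 55, "C": 45, "D": 65},
]

# Collect each competitor's percent rank from every competition entered.
per_competitor = defaultdict(list)
for scores in competitions:
    for name, prnk in percent_rank(scores).items():
        per_competitor[name].append(prnk)

prnk_sum = {name: sum(v) for name, v in per_competitor.items()}
prnk_mean = {name: sum(v) / len(v) for name, v in per_competitor.items()}

best_by_sum = max(prnk_sum, key=prnk_sum.get)
best_by_mean = max(prnk_mean, key=prnk_mean.get)
```

In this toy data, competitor A enters all three competitions and leads on the summed metric, while competitor B enters once, wins, and leads on the mean. A summed metric therefore favors frequent participants, which is consistent with the overlap between the two sibling tables.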

My High School

Even though I highlighted my high school (“CLEMENS”) in my examination
of schools and looked at individual scores elsewhere, I did not look at
other individuals who have gone to my school. Perhaps it is a bit
egotistical, but I am interested in knowing how I compare with others
who have attended my school (either before, with, or after me).

rnk  name             n   prnk_sum  prnk_mean  n_defeat_sum  n_defeat_mean  n_advanced_sum
1    Land, Noah       17  13.67     0.80       447           26.29          14
2    Fulton, Chris    18  12.66     0.70       351           19.50          16
3    Gonzales, Gavyn  17  12.26     0.72       371           21.82          15
4    Elhabr, Andrew   15  9.87      0.66       296           19.73          11
5    Perry, Robert    15  8.75      0.58       249           16.60          10
6    Garcia, Jon      9   7.94      0.88       259           28.78          6
7    Nesser, Austin   17  7.93      0.47       231           13.59          15
8    Elhabr, Anthony  13  7.76      0.60       216           16.62          10
9    Guyott, David    9   5.37      0.60       157           17.44          7
10   Baker, Ian       10  5.32      0.53       185           18.50          8

Note: # of total rows: 95

Although my twin brother and I did not rank at the very top of the
sibling lists for participation and performance, we do appear near the
top when evaluating only people from my high school. In my opinion, the
sample size (95 rows) isn’t so small that this achievement is trivial.


I think all I’ve done here is more investigation of my personal
performance, so I’ll spare the reader any more of my egotistical
exploration. And, with that said, I think this is a good point to bring
an end to my investigation of Texas high school academic UIL
competitions.

  1. I don’t explicitly try to filter for twins only, but it’s reasonable to believe that many, if not most, are twins.
