Welcome to the Hardcore Husky Forums. Folks who are well-known in Cyberland and not that dumb.

Geeky Recruiting Data Shit

I am learning how to use some data visualization software, so of course I am playing around with TBS data.

The data is Rivals recruiting database. I'd prefer to use the 24/7 Composite rankings, but scraping that shit from the web is a major PITA.

Pac-12 North
image

Pac-12 South
image
«1

Comments

  • AIRWOLF
    AIRWOLF Member Posts: 1,840
    AEB said:

    Based on this analysis, Ty just needed more time.

    It was his least terrible class, on paper. In reality it was pretty awful, though Polk, Kearse, and Ta'amu were special.
  • GrundleStiltzkin
    GrundleStiltzkin Member Posts: 61,516 Standard Supporter
    Charts not chinteractive. Pass.
  • Baseman
    Baseman Member Posts: 12,369
    edited November 2017
    image
    I prefer DDY’s recent outlook of positive developments in our DL recruiting.
  • RaceBannon
    RaceBannon Member, Moderator, Swaye's Wigwam Posts: 113,976 Founders Club
    Need @Dardanus to confirm

    Stanford appears to be way more talented than us. I'm not sure but we're third now behind USC and Stanford

    We sure sucked for a long tim
  • Ice_Holmvik
    Ice_Holmvik Member Posts: 2,912
    AIRWOLF said:

    I am learning how to use some data visualization software, so of course I am playing around with TBS data.

    The data is Rivals recruiting database. I'd prefer to use the 24/7 Composite rankings, but scraping that shit from the web is a major PITA.

    Pac-12 North
    image

    Pac-12 South
    image

    The funny part is the dip beore UW's rise was Pete's first class which ended up re-ranked as the best in the Pac that year.
  • FremontTroll
    FremontTroll Member Posts: 4,744
    You can clearly see the inflation in the grading system. Everyone other than Cal and the Arizona schools trending up.
  • AIRWOLF
    AIRWOLF Member Posts: 1,840
    Dardanus said:

    Pretty good start @AIRWOLF. Couple things I would consider:

    1. Sort/order by star ranking (best to worst), not alphabetical. The idea is you want to show the important information in order.

    2. The message you send with a line chart is greatly affected by aspect ratio. There is no perfect ratio, but generally 2:1 (w:h) is about right. In this case, a 0.2 star ranking change is a steep increase or decrease. Columns or bars might be a better choice.

    3. Are you trying to emphasize the trend over time, or compare average ranking of each school? Comparing rank of each school can be done much simpler (11 bars ordered by avg rank over this time period, Cal excluded of course).
    Comparing the trend of multiple items can be difficult in a limited space. A good method is to put all lines on a single chart, with a selected school using their color, all other schools being gray/semi-transparent in the background. Your visualization tool might not do this, though.

    4. Be careful web-scraping TBS stuff. My cousin had the FBI knock on his door for similar activity. Not good.

    Do you even Tableau, bro?

    Seriously though, thanks. I have tons more I need to do with this, but I am a retard with this software.

    A lot of it is stuff I have done in Excel before, just not over as long a time period.

    I really am doing this first and foremost to learn how to use the stupid software. Simple things like changing the colors of the lines and formatting things are easy once you know how to do them, but they are not easy when you don't.
  • AIRWOLF
    AIRWOLF Member Posts: 1,840

    You can clearly see the inflation in the grading system. Everyone other than Cal and the Arizona schools trending up.

    Yeah. There is certainly grade inflation going on. The Sum total of the average Stars for the Top 100 teams has risen steadily over time.

    I am going to work in a viz (look at me using serious geek lingo) on that too.
  • AEB
    AEB Member Posts: 2,994
    AIRWOLF said:

    Dardanus said:

    Pretty good start @AIRWOLF. Couple things I would consider:

    1. Sort/order by star ranking (best to worst), not alphabetical. The idea is you want to show the important information in order.

    2. The message you send with a line chart is greatly affected by aspect ratio. There is no perfect ratio, but generally 2:1 (w:h) is about right. In this case, a 0.2 star ranking change is a steep increase or decrease. Columns or bars might be a better choice.

    3. Are you trying to emphasize the trend over time, or compare average ranking of each school? Comparing rank of each school can be done much simpler (11 bars ordered by avg rank over this time period, Cal excluded of course).
    Comparing the trend of multiple items can be difficult in a limited space. A good method is to put all lines on a single chart, with a selected school using their color, all other schools being gray/semi-transparent in the background. Your visualization tool might not do this, though.

    4. Be careful web-scraping TBS stuff. My cousin had the FBI knock on his door for similar activity. Not good.

    Do you even Tableau, bro?

    Seriously though, thanks. I have tons more I need to do with this, but I am a retard with this software.

    A lot of it is stuff I have done in Excel before, just not over as long a time period.

    I really am doing this first and foremost to learn how to use the stupid software. Simple things like changing the colors of the lines and formatting things are easy once you know how to do them, but they are not easy when you don't.
    We understand better than most... It’s hard
  • Muttzen
    Muttzen Member Posts: 1,015
    Chinspiring. Time to gogo data science.
  • Muttzen
    Muttzen Member Posts: 1,015
    Created a shitty heatmap with dendrograms, interpolating the missing games so we get to play EVERYONE.

    Rows are offense, columns are defense.

    Spoilers: we lose to USC and are a tossup with dickrod thanks to our shitty offense. Our defense is clustered similarly to USC and Stanford, our offense is similar to that of ASU.


    image
  • Dardanus
    Dardanus Member Posts: 2,623
    AIRWOLF said:

    Dardanus said:

    Pretty good start @AIRWOLF. Couple things I would consider:

    1. Sort/order by star ranking (best to worst), not alphabetical. The idea is you want to show the important information in order.

    2. The message you send with a line chart is greatly affected by aspect ratio. There is no perfect ratio, but generally 2:1 (w:h) is about right. In this case, a 0.2 star ranking change is a steep increase or decrease. Columns or bars might be a better choice.

    3. Are you trying to emphasize the trend over time, or compare average ranking of each school? Comparing rank of each school can be done much simpler (11 bars ordered by avg rank over this time period, Cal excluded of course).
    Comparing the trend of multiple items can be difficult in a limited space. A good method is to put all lines on a single chart, with a selected school using their color, all other schools being gray/semi-transparent in the background. Your visualization tool might not do this, though.

    4. Be careful web-scraping TBS stuff. My cousin had the FBI knock on his door for similar activity. Not good.

    Do you even Tableau, bro?

    Seriously though, thanks. I have tons more I need to do with this, but I am a retard with this software.

    A lot of it is stuff I have done in Excel before, just not over as long a time period.

    I really am doing this first and foremost to learn how to use the stupid software. Simple things like changing the colors of the lines and formatting things are easy once you know how to do them, but they are not easy when you don't.
    Yep, Tableau is great. It can definitely create the dynamic line chart. This poast may help (the "data-ink" concept is what I was trying to get at).
  • Swaye
    Swaye Moderator, Swaye's Wigwam Posts: 41,739 Founders Club
    Damn. Last 3 classes are the best since 02, and getting better every year. Pete has this shit rolling.
  • AIRWOLF
    AIRWOLF Member Posts: 1,840
    Swaye said:

    Damn. Last 3 classes are the best since 02, and getting better every year. Pete has this shit rolling.

    Yep. The other thing to realize is that even if you throw out Sarkisian's 2009 transition class, his recruiting was basically just in-line with the Power 5 average over the 2010-2013 time frame.

    Petersen's '16 class was equivalent to Sarkisian's '13 class (Sark's best) on paper. Of course, we all realize that "on paper" is ridiculously flattering to Sark's classes, while it is actually a bit punitive to Petersen's classes.
  • UWhuskytskeet
    UWhuskytskeet Member Posts: 7,113
    I see you've given up on Tableau.
  • theknowledge
    theknowledge Member, Swaye's Wigwam Posts: 5,636 Founders Club
    Pete's classes are a stairway to heaven. I love 2018's class. Best I've ever seen at UW since I've cared to pay attention (1996). The raw data clearly shows that my boner will also be raw come this time next year. 2019 offseason natty will be truly speshul.
  • AIRWOLF
    AIRWOLF Member Posts: 1,840
    edited February 2018

    I see you've given up on Tableau.

    Not given up, but I have the raw data in Excel and I am just not very good at Tableau yet(?).
  • UW_Doog_Bot
    UW_Doog_Bot Member, Swaye's Wigwam Posts: 18,128 Founders Club
    With grade inflation you are probably also seeing some distortion of the data since you have an upward limit to the grade. I would think this would mean that especially in the case of media bias it means our classes would be essentially under rated vs top recruiting classes full of kids with "5 star" ratings would be overstated.