~300k OCTGN game stats

db0 · August 25, 2014, 1:35pm

I thought y’all number crunchers might appreciate this. Here’s the latest export of OCTGN stats, around 310k of them, up to today.

https://drive.google.com/file/d/0B-gMiPlH3rBATm9yV2QyaUExOWM/edit?usp=sharing

Unfortunately none of you thought to contact me to use a definite stimhack league tag (like [BGG-L05] for the BGG leagues) so I can’t provide a filter to crunch those data. Nevertheless, should give you some interesting data to play with.

Ajar · August 25, 2014, 2:03pm

Awesome. I’m at a machine learning conference for work this week, but I’ll try to find some time to grab the data and finish updating my matchup code.

Ajar · August 25, 2014, 2:26pm

First observation: the H&P Criminal IDs are stored as Criminal | subtitle, not Criminal | name. So Iain is Criminal | Retired Spook, not Criminal | Iain Stirling. Same for Ken and Silhouette.

Pruning unreleased IDs only removes 1,254 of 306,310 games - not bad. There aren’t a ton of people playing Fisk / Collective / Mind-Mapping.

Korrigan · August 25, 2014, 4:58pm

Can we get a detailed NEH breakdown please?

Ajar · August 25, 2014, 5:39pm

On that note, NEH is listed as NBN | Broadcast Center rather than NBN | Near-Earth Hub.

Ajar · August 25, 2014, 6:09pm

NEH winrates for the full dataset:

                      Runner         Pack  CorpWins  RunWins    Games
               Anarch | Noise Future Proof 0.6741996 0.3258004  1062
          Anarch | Reina Roja Future Proof 0.7247706 0.2752294   436
            Anarch | Whizzard Future Proof 0.6457490 0.3542510   494
         Criminal | Andromeda Future Proof 0.5900131 0.4099869   761
 Criminal | Disappeared Clone Future Proof 0.6392857 0.3607143   280
  Criminal | Gabriel Santiago Future Proof 0.5697674 0.4302326   344
     Criminal | Retired Spook Future Proof 0.7968750 0.2031250   192
 Criminal | Stealth Operative Future Proof 0.7650602 0.2349398   166
        Shaper | Chaos Theory Future Proof 0.7734375 0.2265625   512
               Shaper | Exile Future Proof 0.8695652 0.1304348    69
      Shaper | Kate McCaffrey Future Proof 0.6923767 0.3076233  1115
        Shaper | Nasir Meidan Future Proof 0.7940199 0.2059801   301
Shaper | Rielle "Kit" Peddler Future Proof 0.7416974 0.2583026   271
       Shaper | The Professor Future Proof 0.8000000 0.2000000    80

NEH winrates for the competitive cut (rating > 1 sd above mean and > 5 games played, just like in my articles):

               Runner         Pack         CorpWins  RunWins    Games
               Anarch | Noise Future Proof 0.6778711 0.3221289   357
          Anarch | Reina Roja Future Proof 0.7135678 0.2864322   199
            Anarch | Whizzard Future Proof 0.5975610 0.4024390   164
         Criminal | Andromeda Future Proof 0.5844156 0.4155844   154
 Criminal | Disappeared Clone Future Proof 0.5462963 0.4537037   108
  Criminal | Gabriel Santiago Future Proof 0.5619048 0.4380952   105
     Criminal | Retired Spook Future Proof 0.7058824 0.2941176    85
 Criminal | Stealth Operative Future Proof 0.7500000 0.2500000    68
        Shaper | Chaos Theory Future Proof 0.7941176 0.2058824   170
               Shaper | Exile Future Proof 0.8571429 0.1428571    42
      Shaper | Kate McCaffrey Future Proof 0.6730769 0.3269231   312
        Shaper | Nasir Meidan Future Proof 0.7731092 0.2268908   119
Shaper | Rielle "Kit" Peddler Future Proof 0.7009346 0.2990654   107
       Shaper | The Professor Future Proof 0.7142857 0.2857143    42

Korrigan · August 25, 2014, 6:22pm

I hope everyone who was arguing that NEH is being talked up will finally accept this “proof”.

Although I do agree that these winrates will drop somewhat over the next weeks.

Ajar · August 25, 2014, 7:30pm

I decided to orphan the old code repository in the course of rolling my code into a package that any R user can install with devtools::install_github(). After parsing the latest data, I pushed my latest code to a new repository at github.com/AjarKeen/netrunner.

So if you’re an R user, you can use devtools to install my package like so:

library(devtools)
install_github('AjarKeen/netrunner')

The package has standard documentation, with the caveat that download.octgn() shows up in the documentation but isn’t actually user-accessible (because it doesn’t work). You need to download the data and put it in your working directory yourself.

After that, you can read the data file, prune it in various ways, rate the players using Glicko, and do winrate and matchup calculations.

If I get rigorous enough with package development, I may eventually submit it to CRAN so you can just use install.packages(), but I think I’d need to rewrite all of my dplyr code to use standard evaluation.

Chill84 · August 25, 2014, 9:04pm

Thanks for parsing all of the data, but I almost threw up in my mouth a little bit.

MadmanMSU · August 25, 2014, 10:21pm

Out of curiosity (and boredom), I took a look at the data as well. Unless I made a mistake:

All results are for players who have at least 20 games played. Both Skewness and Kurtosis were very close to 0, suggesting a normal distribution.

Win %

Mean - 45.5
Median - 45.8
Std - 15.0
Q1 - 35.1
Q3 - 55.9

If you reduce the population to only those players who have played at least 20 games, the Corp wins 52.4% of games.

If you reduce the population to only those players who have played at least 20 games AND at least one person involved in the game has a win percentage of 60% or greater (1 sigma), the corp wins 50.6% of games.

SamRS · August 25, 2014, 11:55pm

criminals and whizzard are doing better than I expected, really surprised by the tenma numbers actually (not just the winrate, but the games played as well).

Kingsley · August 25, 2014, 11:56pm

I’m not sure what your post is saying. If you select for players that have high win percentages, they have high win percentages?

Shango · August 26, 2014, 12:08am

As far as I can tell, we are looking at win percentages of competitive players against the entire field. What about win percentages for NEH versus runners who are also high percentage win rate players? In other words, how does NEH fare against experienced runners, rather than just the entire field, in the hands of capable players.

To elaborate, the statistics we got out of our local regional (San Rafael) showed that overall, Corp win rates were above that of Runners, but when you looked at just the top 16 players, Runner winrates were still above that of Corp. That was before Upstalk of course, but the point is, high level players vs high level players is what we are trying to look at here.

MadmanMSU · August 26, 2014, 12:35am

Yeah, I thought after I left work that I should have been more clear.

If you reduce the population to only those players who have played at least 20 games, the Corp wins 52.4% of games.

If you reduce the population to only those players who have played at least 20 games AND at least one person involved in the game has a win percentage of 60% or greater (1 sigma), the corp wins 50.6% of games.

I didn’t limit the dataset by date, so its all inclusive. Something I will look at tomorrow.

Ajar · August 26, 2014, 2:22am

No, my competitive cut data is only games played by the players who made the cut – players whose Glicko rating is more than one standard deviation above average.

I can easily change that threshold to two standard deviations (or some other arbitrary threshold) if you only care about the best of the best, but bear in mind that the sample size will be small. For example, with the one sd cutoff, we keep 1588 players and 118,466 games. With a two sd cutoff, we keep 257 players and 4,049 games. The two sd cutoff leaves a total of 21 NEH games in the dataset, which is nowhere near enough to generalize from.

Just how good do you want the players to be?

mediohxcore · August 26, 2014, 5:42am

Did you use ratings from old data, (were the users the same number as last time), and/or did you calculate/recalculate only with the new data? And can we see it?

Remorhaz · August 26, 2014, 6:55am

i dont see what more data can tell us. Astroscript aside the NEH’s ability equates to extra clicks. the balance between runner/corp revolves around 4 clicks vs 3 clicks + 1 draw…except with NEH its 3 clicks + draw + draw if the deck is built correctly. of course NEH is very good.

all i know is im very angry at all the people who voted for laramy fisk because you thought you were saving netrunner…that extra click the collective gets seems right in line with IDs like NEH and blue sun.

db0 · August 26, 2014, 7:37am

@mediohxcore The anonymized user numbers always change between exports.

(Are you trying to figure out your rating, you narcissist? :P)

Ajar · August 26, 2014, 12:31pm

I redid the ratings with the new data. Now that all of the code is written, it actually runs very fast – each step takes at most couple of seconds, and usually much less.

I could post a ratings file, but you’d also need to figure out your player ID from @db0’s original data.

There’s a lot more stuff I want to look at, in addition to redoing / updating my matchup plots. The matchups are still just simple winrate calculations on subsets of games. I’m curious about more complex questions, like whether loyalty to an ID is correlated with rating. Do people who play the same ID for a long time tend to get better?

MadmanMSU · August 26, 2014, 2:29pm

So I’m continuing to look at the data, mostly for practice/curiosity. If anyone can double check my results, that would be awesome. I constantly check for errors I’ve made and attempt to correct them when I find them.

First, I wanted to check and see if Kingsley was right and that I had made some kind of mistake. He was. I fixed my code and reran. I then also wanted to see if there was a difference between corp win rates before and after Upstalk. I’m not sure when Upstalk was released, but I think it was in early July. I’m also not sure when cards start getting play on OCTGN, so I set my time periods as “Pre” upstalk (dates before July 1) and “Post” upstalk (dates on and after July 1). Games played had to be >=20. Frequencies showed that:

Pre July 1
Corp win % for all games: 51.8
Post July 1
Corp win % for all games: 57.1

Small jump. Is it significant?

I did a logistic regression analysis to see if there was a difference in corp win rates by time period for all games.

Here are the results. Analysis shows significance, with a point estimate of 0.2166 in favor of the Post period.

I then did it again, this time limiting my dataset to games >=20 and where BOTH players had a win % >= 60%. Results also show significance, and point estimate went up to 0.3688.

Interpretation: So basically what that’s saying is that the difference in the Pre time period and Post time period was significant, and was more pronounced when players of “high ability” (win % mean plus sigma 1) were playing. The Least Squares Mean estimate is the log odds ratio. My take away from this is that something happened during the Post time period to change the rate at which Corp wins games.