GRADING ANOMALIES

Roger de Coverly · Post by **Roger de Coverly** » Thu May 21, 2009 7:47 pm

The question is can one find a better rule (or formulae) which would result in a grading system which would need less or no adjustment as seasons go by?

It's my opinion that the ECF are completely in the wrong to break the historic series of grades. By all means review such ad-hoc factors as the the 30 game rule, the 40 point rule, the junior increments, the estimation of new players and the treatment of rapidly improving players, but do it by parallel running against the established system so that you know the magnitude and effect of each proposed change.

If you want to abandon the long standing principle of equal grade for equal play, then just move sideways onto an Elo system rather than perpetuate another national system.

Robert Jurjevic · Post by **Robert Jurjevic** » Fri May 22, 2009 10:25 am

Ã‰GS2 vs GS

(In each example Ã‰GS2 calculation was performed as if all the grades were equally trusted.)

1) Let us assume that there are two player pools, one with 365 players graded 125 and other with 365 players graded 150. Let us assume that each player from 125 pool plays each player from 150 pool (there are 133,225 games in total) and that all games are drawn. According to GS after calculating new grades 125 pool will become a 150 pool and a 150 pool will become a 125 pool. According to Ã‰GS2 after calculating new grades both pools will become 138 pools.

2) Let us assume that there are two player pools, one with 365 players graded 120 and other with 365 players graded 100. Let us assume that each season each player from 120 pool plays each player from 100 pool (there are 133,225 games in total) and that all games are drawn. According to GS after calculating new grades the 120 pool will become a 100 pool and a 100 pool will become a 120 pool, and the grades will keep alternating between the pools as the seasons pass. According to Ã‰GS2 after calculating new grades both pools will become 110 pools, which would then stay as such in each subsequent season.

3) Let us assume that there are two player pools, both with 365 players graded 100. Let us assume that each player from one pool plays each player from other pool (there are 133,225 games in total) and that all games are won by players of one pool. According to GS after calculating new grades the winning pool will become a 150 pool and the losing pool will become a 50 pool. According to Ã‰GS2 after calculating new grades the winning pool will become a 125 pool and a losing pool will become a 75 pool.

My thoughts on why in my opinion Ã‰GS2 results make more sense than GS results in each example:

1) When you look at the example from a perspective of a single player from the 125 pool one may think, the player scored 50% against a pool of 150 players, therefore he must be a 150 player, but if you look at the example from the pool perspective it looks to me more plausible to assume that after calculating new grades the pool grades should become equal, as both pools scored 50%. This implies that when looking at the example from a perspective of a single 125 player one should not assume that the grade of the 150 pool is fixed and that therefore if the 125 player scored 50% against it he must be a 150 player, you have seen that from pool perspective in this example the 125 player was in fact playing a degrading pool of players whose average grade dropped from 150 to 138, therefore in my opinion it would be fair to assume that if the 125 player scored 50% against the pool he is a 138 rather than a 150 player.

2) This is a version of the "lighthouse keeper" example which most of you would agree demonstrates a known theoretical disadvantage of GS.

3) If we neglect the 40 point rule in GS a minimum grade difference required for a performance of 100% is 50, so it would seem that new pool grades calculated by GS are too far apart, i.e., the grades were unnecessarily stretched.

Roger de Coverly · Post by **Roger de Coverly** » Fri May 22, 2009 11:35 am

Let us assume that each player from 125 pool plays each player from 150 pool (there are 133,225 games in total) and that all games are drawn.

I don't think you should evaluate grading systems based on 1 in a billion events.

The everyday point is that players improve from 125 to 150 or 150 to 175 every season and a grading system should have a means of revaluing them. Here's a closer to home example. If you play in the top section of weekend tournaments and score around 50%, then you usually get a grading performance of 175 ish. So if a player steps up from playing in the rated restricted tournaments and plays in the opens and scores 50% then for a suitably large number of games he should get a grade of 175. That's the same grade has someone who perpetually scores around that mark and the same grade as last season's 200 player who had a relatively bad year. It's also the same grade as a new player scoring 50% who hasn't previously played in the English system

I would rather assume that one player from the 125 pool plays 30 players from the 150 pool and scores 50%. Also one player from the 150 pool plays 30 games against the 125 pool and scores 50%. On the ECF system we have one transfer from the 125 pool to the 150 pool and one from the 150 pool to the 125 pool. I see nothing wrong with this and no reason to slow down the transition unless it was done as part of a package of changes including adopting of an Elo based system.

Roger de Coverly · Post by **Roger de Coverly** » Fri May 22, 2009 12:43 pm

Just another thought on averaging.

If in the ECF system you increased the minimum games cutoff from 30 to 60 and if you assumed everybody played exactly 30 games, then the example of a 125 player who upgrades to playing 150 opposition and scoring 50% would get a first season new grade of 138 exactly as in this EGS2 thing. So the ECF system already has some of the averaging. With the current 30 game cutoff, exact averaging applies at 15 games.

Robert Jurjevic · Post by **Robert Jurjevic** » Fri May 22, 2009 1:18 pm

Roger de Coverly wrote:I don't think you should evaluate grading systems based on 1 in a billion events.

Extreme cases may help in showing the differences between the systems. Say, you would not compare Eninsten's to Newton's theory on experiments in which both theories give virtually the same results, but would rather search for the extreme cases in which theories predict measurably different results.

Roger de Coverly wrote:The everyday point is that players improve from 125 to 150 or 150 to 175 every season and a grading system should have a means of revaluing them.

If a 125 player scores 75% against a field of players of 150, according to Ã‰GS2 (assuming that all grades are equally trusted), the 125 player would improve to 150 player, so Ã‰GS2 would allow for it, but it would require a performance of 75% rather than 50%.

Roger de Coverly wrote:I would rather assume that one player from the 125 pool plays 30 players from the 150 pool and scores 50%. Also one player from the 150 pool plays 30 games against the 125 pool and scores 50%. On the ECF system we have one transfer from the 125 pool to the 150 pool and one from the 150 pool to the 125 pool. I see nothing wrong with this and no reason to slow down the transition unless it was done as part of a package of changes including adopting of an Elo based system.

If we assume that the new grades of all other players in the pools remain unchanged (i.e., all other 150 players remain 150 and all other 125 players remain 125) then indeed your argument holds and the transition should be fair. Ã‰GS2 would require from the 125 player to score 75% and from the 150 player to score 25% in order for the transition to take place, when it looks like 50% for both players should be enough.

Actually, even if the 150 pool grade drops and the 125 pool grade raises the 50% should be a fair performance for the transition to take place if we do not wish to correct for grade change during a course of the season.

Maybe Ã‰GS3 system, where 'ka' and 'kb' are doubled in comparison to 'ka' and 'kb' in Ã‰GS2, should be considered. Ã‰GS3 is similar to GS except it uses logistic curve for 'p = f(d)' and fine-tunes grade change based on how much the grades are trusted (the more grade is trusted the less it changes and vice versa, that is a simple emulation of the difference between Glicko and Ã‰lo system).