EIDRaS Ratings
Forum rules
This forum is limited to topics relating to the game Diplomacy only. Other posts or topics will be relocated to the correct forum category or deleted. Please be respectful and follow our normal site rules at http://www.webdiplomacy.net/rules.php.
This forum is limited to topics relating to the game Diplomacy only. Other posts or topics will be relocated to the correct forum category or deleted. Please be respectful and follow our normal site rules at http://www.webdiplomacy.net/rules.php.
Re: EIDRaS Ratings
First of all, Elo is a name - not an acronym so you it doesn't need to be all caps.
Like others have said, GR is quite simple to understand. At the start of the game, everybody 'bids' a fraction of their rating and that makes up the pot. So, the pot is bigger if you play against better people but your bid is always the same regardless.
@Octavius, regarding new players:
The system mitigates these issues in two ways:
1) Games with more provisional players are weighted less. Right now, that weighting is 2*p/7 where p is the number of ppl in the game who have played more than 7 games.
2) player ratings are more volatile for new players. The weighting for a game is inversely proportional to the number of games you've played. This allows players to rise (or drop) to their 'true' rating quicker.
Like others have said, GR is quite simple to understand. At the start of the game, everybody 'bids' a fraction of their rating and that makes up the pot. So, the pot is bigger if you play against better people but your bid is always the same regardless.
@Octavius, regarding new players:
The system mitigates these issues in two ways:
1) Games with more provisional players are weighted less. Right now, that weighting is 2*p/7 where p is the number of ppl in the game who have played more than 7 games.
2) player ratings are more volatile for new players. The weighting for a game is inversely proportional to the number of games you've played. This allows players to rise (or drop) to their 'true' rating quicker.
Re: EIDRaS Ratings
Not to bog things down but they aren't exactly zero sum. Expected scores are based on an 'exponential mean' rather than the arithmetic mean so the rating changes don't always add to 0.
Re: EIDRaS Ratings
Really? Thanks for letting me know.A_Tin_Can wrote: ↑Sun Jan 21, 2018 11:22 amYes, they are. This is because GR doesn't have a constant k-factor - the ratings of all players in this game is an input into the k-factor, which cancels out the earlier weightings. Have a play with some numbers in GR, you'll find it behaves very differently to typical Elo.in Ghost Rating, losses against expert opponents and beginners are not rated equally.
Re: EIDRaS Ratings
My bad, I missed that part.
So, the system has a built in inflation due to the exponential function I guess?
Re: EIDRaS Ratings
@RJ
Yeah, but I suppose the average rating could shrink as well in each game. At any rate, I imagine the rating inflation is dwarfed by the inflation caused by low-rated players leaving the site. We could curve everybody's rating to keep the site wide average at 1000 but I'm not sure if that's necessary.
Yeah, but I suppose the average rating could shrink as well in each game. At any rate, I imagine the rating inflation is dwarfed by the inflation caused by low-rated players leaving the site. We could curve everybody's rating to keep the site wide average at 1000 but I'm not sure if that's necessary.
Re: EIDRaS Ratings
@Yonni
Since the exponential function is convex the exponential mean is greater than the arithmetic mean, unless all players in the game have the same rating.
But, yes, there are probably bigger problems in terms of inflation.
Since the exponential function is convex the exponential mean is greater than the arithmetic mean, unless all players in the game have the same rating.
But, yes, there are probably bigger problems in terms of inflation.
-
- Posts: 137
- Joined: Sun Dec 31, 2017 12:32 am
- Location: Belgrade, Serbia
- Contact:
Re: EIDRaS Ratings
I dont understand how those ratings work, but I definitely like what you did there. Cheers, Yonni. 

Re: EIDRaS Ratings
Someone should work on a rating system that only factors in your last years worth of games
Re: EIDRaS Ratings
I can do this with GR. Maybe I will later.
Yonni, can I suggest that, at least at first, you make the weightings for all games identical to those for GR? It'll make it really easy to compare the two.
Re: EIDRaS Ratings
But as elo-based system, gr is not meant for short term evaluations - it is a long term rating which aims to your real rating ("in the limit").
Some months ago, a bug due to crashed games forced the vdip ranking to be limited to the games of the last couple of years only, and it was a real mess!
Some months ago, a bug due to crashed games forced the vdip ranking to be limited to the games of the last couple of years only, and it was a real mess!
Re: EIDRaS Ratings
Yes, GR over the past year will mostly just demonstrate who played the most. Year over year GR improvement might be a more sane metric, but even then, all increases aren't created equal.
Re: EIDRaS Ratings
A ratings system that only factors in your last year of games is going to be very unstable.
Re: EIDRaS Ratings
It's really better to call GR "Elo inspired" rather than "Elo based". Many of the features that make Elo effective are not present (for better or worse) in GR.
Re: EIDRaS Ratings
All of the weightings are in Variantfile.csv in the git repo I posted. Second to last column, I think they're the in verses of the actual numbers you want (gunboat is .25 rather than 4).
Re: EIDRaS Ratings
1v1 is weighted 0. It's really a separate game, they're all unranked, and it's two-part, so I just implemented Elo for it.
Who is online
Users browsing this forum: No registered users