Coder Social home page Coder Social logo

1's Introduction

WEBVTT

1 00:00:00.012 --> 00:00:01.758 Hi again. It's Matt,

2 00:00:01.758 --> 00:00:05.403 and now we're talking more about strategic reasoning.

3 00:00:05.403 --> 00:00:10.896 And in particular let's go through and and analyze the Keynes beauty contest

4 00:00:10.896 --> 00:00:14.651 game now and talk about the Nash equilibria of this game.

5 00:00:14.651 --> 00:00:17.541 So, remember what the structure of this game was.

6 00:00:17.541 --> 00:00:22.099 Each player named an integer between 1 and 100, so you've got a population of

7 00:00:22.099 --> 00:00:24.452 players, they're all naming integers.

8 00:00:24.452 --> 00:00:30.125 the person who names the integer closest to 2/3 of the average integer named by

9 00:00:30.125 --> 00:00:33.634 people wins, other people don't get anything.

10 00:00:33.634 --> 00:00:36.758 ties are broken uniformly at random. Okay.

11 00:00:36.758 --> 00:00:42.681 So again, what are other players going to do? You have to reason through that and

12 00:00:42.681 --> 00:00:47.012 then what should I do in response? So these are the key ingredients of a Nash

13 00:00:47.012 --> 00:00:51.445 equilibrium and the Nash equilibrium is everybody's choosing their optimal

14 00:00:51.445 --> 00:00:55.789 response, the one that's going to give them the maximum chance of winning in

15 00:00:55.789 --> 00:00:58.521 this game to what the other players are doing,

16 00:00:58.521 --> 00:01:02.323 that's going to be a Nash equilibrium. Okay. So let's take a look.

17 00:01:02.323 --> 00:01:06.880 so, how are we going to reason about this? Suppose that I think that the

18 00:01:06.880 --> 00:01:12.020 average play the averaged integer named in this game is going to be some number

19 00:01:12.020 --> 00:01:14.740 X. so, I, you know, including my own

20 00:01:14.740 --> 00:01:17.561 integer, I think this is going to be the average.

21 00:01:17.561 --> 00:01:22.081 Well, what has to be true about my reply to that, my reply should be 2/3 of X,

22 00:01:22.081 --> 00:01:26.769 right? I should be naming the integer closest to 2/3 of whatever I believe the

23 00:01:26.769 --> 00:01:30.578 average is going to be. So my optimal strategy should be naming

24 00:01:30.578 --> 00:01:34.546 an integer closest to 2/3 of X. So here, we're just working through

25 00:01:34.546 --> 00:01:39.344 heuristically, we'll, we'll get to formal definitions and analysis in a little bit,

26 00:01:39.344 --> 00:01:42.282 but let's just go through the basic reasoning now.

27 00:01:42.282 --> 00:01:47.134 Okay, so I should be trying to name 2/3 of what I think the average is going to

28 00:01:47.134 --> 00:01:49.596 be. Well, X has to be less than a 100, right?

29 00:01:49.596 --> 00:01:53.096 There's no way that the average guess can be more then 100.

30 00:01:53.096 --> 00:01:57.578 So the optimal strategy for any player should be no more then 67 right? So if I

31 00:01:57.578 --> 00:02:03.299 think that everybody's rational I, so, if I believe that's true, then I think that

32 00:02:03.299 --> 00:02:06.867 nobody should be naming an integer bigger than 67.

33 00:02:06.867 --> 00:02:12.477 Okay, so what does that mean? Well, that means that I can't think the average is

34 00:02:12.477 --> 00:02:17.590 any higher than 67, right? So, if, if the average X is no bigger than

35 00:02:17.590 --> 00:02:20.980 67, then I should be naming no more than 2/3 of 67.

36 00:02:20.980 --> 00:02:26.495 Right? Now, you can begin to see where this is going, so that means that if I

37 00:02:26.495 --> 00:02:31.917 think everybody else understands the game and understands that nobody should be

38 00:02:31.917 --> 00:02:36.757 naming a number bigger than 67 and nobody should be naming numbers bigger than 2/3

39 00:02:36.757 --> 00:02:39.507 of 67. we keep going on this, so nobody should

40 00:02:39.507 --> 00:02:42.352 be naming anything more than 2/3 of this, of 2/3 of 67.

41 00:02:42.352 --> 00:02:46.677 Now, obviously, when you, if you just keep looking, everybody's going to want

42 00:02:46.677 --> 00:02:49.592 to be a little bit lower than everybody else's guess.

43 00:02:49.592 --> 00:02:53.172 So wherever the average is you should be lower than that.

44 00:02:53.172 --> 00:02:58.108 What's the only number which, everybody can be naming, and consistently choosing

45 00:02:58.108 --> 00:03:01.369 the best response they have to what the average guess is.

46 00:03:01.369 --> 00:03:05.889 the unique Nash equilibrium of this game is for every player to announce one.

47 00:03:05.889 --> 00:03:09.678 Okay? Well that's, yeah, so, so we're driven all the way down to,

48 00:03:09.678 --> 00:03:14.236 to announcing one and that's a unique Nash equilibrium, and what happens now,

49 00:03:14.236 --> 00:03:17.978 we all announce one we all tie, and somebody wins at random.

50 00:03:17.978 --> 00:03:22.429 If, if I try to deviate form that, if I try to announce a higher integer, I'd

51 00:03:22.429 --> 00:03:26.971 just be higher than the average guess, so I wouldn't be at 2/3 of the mean.

52 00:03:26.971 --> 00:03:29.581 So this is going to be a stable point. Okay?

53 00:03:29.581 --> 00:03:33.532 So, let's see what, what actually happens when people play this.

54 00:03:33.532 --> 00:03:38.392 So part of this reasoning is you're trying to form expectations of what other

55 00:03:38.392 --> 00:03:43.227 players are doing and you need to make sure that those expectations actually

56 00:03:43.227 --> 00:03:46.425 match reality. So let's have a peek at some plays of

57 00:03:46.425 --> 00:03:50.024 this game. So this, this is a plot here where we're

58 00:03:50.024 --> 00:03:55.553 actually giving you the results of the online course of when it was taught last

59 00:03:55.553 --> 00:03:59.879 year, we had players play this game, and so these are the results.

60 00:03:59.879 --> 00:04:04.445 And here from 2012, we had more than 10,000 people actually participate in

61 00:04:04.445 --> 00:04:09.085 this particular game. What do we see? So, down here on this, we

62 00:04:09.085 --> 00:04:15.142 have integers going from 0 to 100 and then over here, we have the frequency.

63 00:04:15.142 --> 00:04:21.088 So, how many people named the given integer? So the, the 50 right here is

64 00:04:21.088 --> 00:04:23.693 the, is the mode, so we get the mode of 50.

65 00:04:23.693 --> 00:04:27.742 The most often named integer was 50, 1,600 people named 50.

66 00:04:27.742 --> 00:04:33.310 Well, obviously, they hadn't gone through all the reasoning and it takes a while to

67 00:04:33.310 --> 00:04:37.106 sort of figure out what the equilibrium of this game is.

68 00:04:37.106 --> 00:04:42.172 what's the mean here? So the mean was 34, so actually there's some interesting

69 00:04:42.172 --> 00:04:44.930 things. Some people naming 100, a number that

70 00:04:44.930 --> 00:04:49.704 could never really win, right? So it's not clear exactly what what, it

71 00:04:49.704 --> 00:04:54.130 could, it could end up winning if everybody named 100 then you could end up

72 00:04:54.130 --> 00:04:58.465 in a tie there, but then you would be better off naming 67 instead.

73 00:04:58.465 --> 00:05:03.227 So so when we, when we end up looking through this, what we end up with is some

74 00:05:03.227 --> 00:05:06.634 people naming high numbers, but very few people,

75 00:05:06.634 --> 00:05:12.432 then we end up with some interesting spikes a bunch of people just named 50.

76 00:05:12.432 --> 00:05:16.012 Not clear exactly what the reasoning is on, on 50.

77 00:05:16.012 --> 00:05:22.192 interestingly if you think that a bunch of people are going to do that you might

78 00:05:22.192 --> 00:05:26.512 want to name 2/3 of 50. Okay, well, there's a big spike here at

79 00:05:26.512 --> 00:05:32.162 33 where a bunch of people believed that other people were going to name 50.

80 00:05:32.162 --> 00:05:37.070 if we keep going, so down here. If we keep going and looking at this,

81 00:05:37.070 --> 00:05:40.413 what we see, then we see another spike at 2/3 of 33.

82 00:05:40.413 --> 00:05:44.657 So some people said, okay, well, maybe a bunch of people are going to think that

83 00:05:44.657 --> 00:05:48.317 the average is going to be 50, they are going to name 33.

84 00:05:48.317 --> 00:05:52.830 I'm going to go one better than that. I am going to name something around 22,

85 00:05:52.830 --> 00:05:55.980 23. you know what the winner in this game

86 00:05:55.980 --> 00:06:01.168 was? The winner was actually 23. So 2/3 of the average guess here was

87 00:06:01.168 --> 00:06:06.532 about 23 because the mean was, was 34 and so one of these people randomly would end

88 00:06:06.532 --> 00:06:12.099 up being the winner of this game. Okay? there's actually a spike of people who

89 00:06:12.099 --> 00:06:18.104 went all the way to the Nash equilibrium and it's interesting here, because the

90 00:06:18.104 --> 00:06:22.230 Nash equilibrium works if you believe that everyone else is going to name the

91 00:06:22.230 --> 00:06:24.579 integer one, then that's your best response.

92 00:06:24.579 --> 00:06:28.907 But, in situations where a bunch of people don't necessarily understand the

93 00:06:28.907 --> 00:06:32.974 game and haven't reasoned through it, then you actually would be better off

94 00:06:32.974 --> 00:06:36.510 naming a higher number. So Nash equilibrium is a stable point if

95 00:06:36.510 --> 00:06:41.007 everybody figures it out and everybody abides by it, then it's the best thing

96 00:06:41.007 --> 00:06:45.367 you can do but it might be that some of the players aren't necessarily figuring

97 00:06:45.367 --> 00:06:48.452 out exactly what goes on. Okay. Now suppose you, you start with

98 00:06:48.452 --> 00:06:52.847 this game and they're not necessarily playing the Nash equilibrium, but now we

99 00:06:52.847 --> 00:06:56.217 have them play it again. Right? So, they get to do this, play it

100 00:06:56.217 --> 00:06:59.812 again, and then see what happens. Well, now, these people should realize

101 00:06:59.812 --> 00:07:03.817 that they overestimated, right? There's a bunch of people here who are naming

102 00:07:03.817 --> 00:07:08.012 numbers too high, they should be moving their announcements to, to lower numbers,

103 00:07:08.012 --> 00:07:12.054 right? They should be moving down. And if, if, if I anticipate that

104 00:07:12.054 --> 00:07:16.709 everybody's going to adjust and move downwards I should move my announcement

105 00:07:16.709 --> 00:07:20.208 downwards as well. So let's have a peek at what happens.

106 00:07:20.208 --> 00:07:25.063 So here is, is a subset of players actually from, from one of the classes I,

107 00:07:25.063 --> 00:07:28.736 I did on campus, where they got, this is the second play

108 00:07:28.736 --> 00:07:31.301 of the game. So after the first play, then we have

109 00:07:31.301 --> 00:07:34.283 them play again. Now you can begin to see that things, you

110 00:07:34.283 --> 00:07:38.105 know, the, the 50s have disappeared, all the numbers up here have disappeared,

111 00:07:38.105 --> 00:07:41.504 people have moved down, and in fact, a lot more people have are

112 00:07:41.504 --> 00:07:45.677 moving towards the equilibrium once you get to the second part, the second

113 00:07:45.677 --> 00:07:48.306 chance. So if you've played this game, you begin

114 00:07:48.306 --> 00:07:51.842 to see the logic of it. You played again and now we get closer to

115 00:07:51.842 --> 00:07:54.715 Nash equilibrium. So, Nash equilibrium does is a better

116 00:07:54.715 --> 00:07:58.254 predictor here. if from experienced players who have

117 00:07:58.254 --> 00:08:03.219 played this game understood it and, and interacting with the same population, you

118 00:08:03.219 --> 00:08:06.948 can begin to see things unraveling and moving back towards

119 00:08:06.948 --> 00:08:10.418 all announcing one. Okay. So Nash equilibrium, basic ideas, a

120 00:08:10.418 --> 00:08:16.040 consistent list of actions, so each player is maximizing his or her payouts

121 00:08:16.040 --> 00:08:21.680 given the actions of the other players. Should be self-consistent and stable.

122 00:08:21.680 --> 00:08:27.613 the nice parts about this, each players action is maximizing what they can get

123 00:08:27.613 --> 00:08:32.867 given the other players. nobody has an incentive to deviate from

124 00:08:32.867 --> 00:08:36.242 their action if an equilibrium profile is, is played.

125 00:08:36.242 --> 00:08:41.492 someone does have an incentive to deviate from a profile of actions that do not

126 00:08:41.492 --> 00:08:45.742 form an equilibrium. So these are the basic ideas and we'll be

127 00:08:45.742 --> 00:08:49.112 looking at, at Nash equilibrium in much more detail.

128 00:08:49.112 --> 00:08:52.975 So, in terms of of, of making predictions, you know, why, should we

129 00:08:52.975 --> 00:08:57.615 expect Nash equilibrium to be played? Well, I, I think there is sort of

130 00:08:57.615 --> 00:09:01.272 interesting logic here. in this logic, actually goes back to, to

131 00:09:01.272 --> 00:09:06.380 some of the original discussion by Nash. when we want to make a prediction of

132 00:09:06.380 --> 00:09:11.488 what's going on a game we want something which if players really understood

133 00:09:11.488 --> 00:09:16.812 things, it would be consistent. And the interesting thing is we should

134 00:09:16.812 --> 00:09:21.607 expect non-equilibria not to be stable, in the sense that, if players understood

135 00:09:21.607 --> 00:09:25.572 it and see what happens in a non-equilibrium, they should move away

136 00:09:25.572 --> 00:09:28.522 from that. And we saw exactly that in the, in the,

137 00:09:28.522 --> 00:09:32.927 the second round of the, the beauty contest game, then people start moving

138 00:09:32.927 --> 00:09:36.937 down toward the Nash equilibrium. So it's not necessarily true that we

139 00:09:36.937 --> 00:09:40.272 always expect equilibrium to be played, [COUGH] but we should expect

140 00:09:40.272 --> 00:09:44.292 non-equilibrium to vanish over time. And the, there'll be various dynamics and

141 00:09:44.292 --> 00:09:48.817 other kinds of settings where there will be strong pushes towards equilibrium over

142 00:09:48.817 --> 00:09:52.897 time, but they might have to be learned and they might have to evolve and, and so

143 00:09:52.897 --> 00:09:55.332 forth. So, as this course goes on, we'll talk

144 00:09:55.332 --> 00:09:59.222 more and more about some of the dynamics and, and things to push towards Nash

145 00:09:59.222 --> 00:09:59.895 equilibrium.

1's People

Contributors

revaraver avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.