Let's learn a little bit about the law of large numbers, which is, on many levels, one of the most intuitive laws in mathematics and in probability theory. But because it's applicable to so many things, it's often misused, or slightly misunderstood. So, just to be a little bit formal in our mathematics, let me define it for you first, and then we'll talk a little bit about the intuition.
So let's say I have a random variable, X, and we know its expected value, or population mean. The law of large numbers says that if we take a sample of n observations of our random variable and average all of those observations, we get something new. Let me define another variable for it: call it x sub n with a line on top of it. This is the mean of n observations of our random variable. So, literally, this is my first observation: you could say I run the experiment once and I get this observation, then I run it again and get that observation, and I keep running it n times. Then I divide by my number of observations. So this is my sample mean; it's the mean of all the observations I've made.
The law of large numbers tells us that my sample mean will approach the expected value of the random variable. Or I could also write it as: my sample mean will approach my population mean as n approaches infinity. I'll be a little informal about what "approach" or "convergence" means, but I think you have the general intuitive sense that if I take a large enough sample, I'm going to end up getting the expected value of the population as a whole.
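To make that concrete, here is a minimal sketch (my own illustration, not from the video) that watches the sample mean of a fair six-sided die, whose population mean is 3.5, as the sample grows:

```python
import random

random.seed(0)  # fixed seed so the runs are repeatable

def sample_mean(n):
    """Average of n rolls of a fair six-sided die (population mean 3.5)."""
    rolls = [random.randint(1, 6) for _ in range(n)]
    return sum(rolls) / n

# As n grows, the sample mean settles toward the population mean.
for n in [10, 100, 10_000]:
    print(n, sample_mean(n))
```

For small n the printed averages wander, but by the time n is in the thousands they sit close to 3.5, which is exactly the informal "approach" described above.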
And I think to a lot of us that's kind of intuitive: if I do enough trials, then over large samples the trials would give me roughly the numbers I would expect, given the expected value and the probability and all that. But I think it's often a little bit misunderstood in terms of why that happens. Before I go into that, let me give you a particular example. Let's say I have a random variable X equal to the number of heads after 100 tosses, or flips, of a fair coin. First of all, we know the expected value of this random variable. It's the number of tosses, the number of trials, times the probability of success on any trial: 100 times 0.5, so that's equal to 50.
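That expected-value calculation, and one simulated trial of it, can be sketched as follows (a hypothetical snippet of mine, not part of the lesson):

```python
import random

random.seed(1)

n_tosses, p_heads = 100, 0.5

# Expected value of a binomial random variable: E[X] = n * p.
expected_heads = n_tosses * p_heads
print(expected_heads)  # 50.0

def count_heads():
    """One trial: toss a fair coin 100 times and count the heads."""
    return sum(random.random() < p_heads for _ in range(n_tosses))

print(count_heads())  # some number near 50, e.g. 55 or 45
```

Any single trial will usually land near 50 but almost never exactly on it, which is what sets up the averaging in the next step.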
So the law of large numbers just says: if I were to average a sample of a bunch of these trials. So, you know, the first time I run this trial I flip 100 coins, or I have 100 coins in a shoe box, and I shake the shoe box and count the number of heads, and I get 55. That would be X1. Then I shake the box again and I get 65. Then I shake the box again and I get 45. And I do this n times, and then I divide by the number of times I did it. The law of large numbers tells us that this average, the average of all of my observations, is going to converge to 50 as n approaches infinity.
And I want to talk a little bit about why this happens, or intuitively why this is. A lot of people feel that, oh, this means that if after 100 trials I'm above the average, then somehow the laws of probability are going to give me fewer heads (or more) to make up the difference. That's not quite what's going to happen. That's often called the gambler's fallacy. Let me differentiate, and I'll use this example.
So let me make a graph, and I'll switch colors. My x-axis is n, the number of trials I take, and my y-axis is the sample mean. And we know the expected value of this random variable is 50, so let me draw that here: this is 50. So, going back to the example I did. On my first trial I got 55, so that was my average; I only had one data point. Then after two trials I got a 65, so my average is going to be 65 plus 55 divided by 2, which is 60. So my average went up a little bit. Then I got a 45, which will bring my average back down a little bit, so let me plot where the average lands after the 45. Now I have to average all of these out: 55 plus 65 is 120, plus 45 is 165, divided by 3. And 3 goes into 165 fifty-five times. So the average goes back down to 55.
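Those running averages can be checked with a short sketch (the function name is mine, just for illustration):

```python
def running_means(observations):
    """Cumulative averages: mean of the first 1, 2, ..., n observations."""
    means, total = [], 0
    for i, x in enumerate(observations, start=1):
        total += x
        means.append(total / i)
    return means

# The three shoe-box trials from the example: 55, then 65, then 45.
print(running_means([55, 65, 45]))  # [55.0, 60.0, 55.0]
```

The printed sequence 55, 60, 55 matches the points plotted on the graph: the average rises after the 65 and falls back after the 45.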
And we could keep doing these trials. So you might say the law of large numbers tells us this: OK, after we've done 3 trials, our average is there, and a lot of people think that somehow the gods of probability are going to make it more likely that we get fewer heads in the future, that somehow the next couple of trials are going to have to be down here in order to bring our average down. And that's not necessarily the case. Going forward, the probabilities are always the same: the probability is always 50% that I'm going to get heads. It's not like, if I had a bunch of heads to start off with, or more than I would have expected to start off with, that all of a sudden things would be made up and I would get more tails. That would be the gambler's fallacy: the idea that if you have a long streak of heads, or a disproportionate number of heads, then at some point you have a higher likelihood of getting a disproportionate number of tails. And that's not quite true.
What the law of large numbers tells us is that it doesn't care what happened over some finite number of trials. Let's say that after some finite number of trials your average is actually up here, at 70. It's a low probability of this happening, but let's say it did. You'd think, wow, we really diverged a good bit from the expected value. But what the law of large numbers says is: well, I don't care how many trials this is; we have an infinite number of trials left, and the expected value over that infinite number of trials is going to be this. So when you average a finite number of trials that averages out to some high number together with an infinite number of trials that's going to converge to this, over time you're going to converge back to the expected value.
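This "dilution, not compensation" point can be simulated with a small sketch (my own, with hypothetical names): start with an unusually high early run averaging 70, continue with perfectly fair trials, and watch the overall average drift back to 50 even though nothing ever pushes below 50 to compensate.

```python
import random

random.seed(2)

def diluted_mean(early_mean, n_early, n_later):
    """Combine a high early run with fair later trials.

    Each later trial is the number of heads in 100 fair coin flips
    (expected value 50). The later flips are still 50/50; the early
    run is simply outweighed, never "made up for".
    """
    later = [sum(random.random() < 0.5 for _ in range(100))
             for _ in range(n_later)]
    total = early_mean * n_early + sum(later)
    return total / (n_early + n_later)

# A 10-trial streak averaging 70 barely moves a long-run average.
print(diluted_mean(70, 10, 10_000))
```

The printed value sits very close to 50: the early streak contributes less and less weight as fair trials accumulate, which is the informal argument above made concrete.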
And that was a very informal way of describing it, but that's what the law of large numbers tells you. And it's an important thing. It's not telling you that if you get a bunch of heads, the probability of getting tails is somehow going to increase to make up for the heads. What it's telling you is that no matter what happened over a finite number of trials, no matter what the average is over a finite number of trials, you have an infinite number of trials left, and if you do enough of them, it's going to converge back to your expected value.
And this is an important thing to think about. This is used in practice every day by lotteries and casinos, because they know that with large enough samples (and we could even calculate, for large enough samples, the probability that things deviate significantly) results stay close to the expected value. Lotteries and casinos operate every day on the principle that if you take enough people, then sure, in the short term, or over a few samples, a couple of people might beat the house, but over the long term the house is always going to win because of the parameters of the games they're making you play. Anyway, this is an important thing in probability, and I think it's fairly intuitive.
Although, sometimes when you see it formally explained like this, with the random variables, it's a little bit confusing. All it's saying is that as you take more and more samples, the average of that sample is going to approximate the true average. Or, I should be a little bit more particular: the mean of your sample is going to converge to the true mean of the population, or to the expected value of the random variable. Anyway, see you in the next video.