hide💡July 26 marks the anniversary of the Americans with Disabilities Act.
Accessibility and Inclusion is at the heart of what we do, learn with Amara.org about the role of captions in ADA compliance!

< Return to Video

Statistics intro: mean, median and mode

  • 0:01 - 0:07
    We will now begin our journey
    into the world of statistics,
  • 0:07 - 0:10
    which is really a way
    to understand or get
  • 0:10 - 0:12
    our head around data.
  • 0:12 - 0:15
    So statistics is all about data.
  • 0:15 - 0:19
    And as we begin our journey
    into the world of statistics,
  • 0:19 - 0:21
    we will be doing
    a lot of what we
  • 0:21 - 0:23
    can call descriptive statistics.
  • 0:23 - 0:25
    So if we have a bunch
    of data, and if we
  • 0:25 - 0:28
    want to tell something
    about all of that data
  • 0:28 - 0:30
    without giving them
    all of the data,
  • 0:30 - 0:34
    can we somehow describe it
    with a smaller set of numbers?
  • 0:34 - 0:36
    So that's what we're
    going to focus on.
  • 0:36 - 0:37
    And then once we
    build our toolkit
  • 0:37 - 0:39
    on the descriptive
    statistics, then we
  • 0:39 - 0:42
    can start to make
    inferences about that data,
  • 0:42 - 0:44
    start to make conclusions,
    start to make judgments.
  • 0:44 - 0:49
    And we'll start to do a lot
    of inferential statistics,
  • 0:49 - 0:51
    make inferences.
  • 0:51 - 0:53
    So with that out of
    the way, let's think
  • 0:53 - 0:56
    about how we can describe data.
  • 0:56 - 1:01
    So let's say we have
    a set of numbers.
  • 1:01 - 1:02
    We can consider this to be data.
  • 1:02 - 1:05
    Maybe we're measuring
    the heights of our plants
  • 1:05 - 1:06
    in our garden.
  • 1:06 - 1:07
    And let's say we
    have six plants.
  • 1:07 - 1:14
    And the heights are 4 inches,
    3 inches, 1 inch, 6 inches,
  • 1:14 - 1:18
    and another one's 1 inch,
    and another one is 7 inches.
  • 1:18 - 1:21
    And let's say someone just
    said-- in another room, not
  • 1:21 - 1:22
    looking at your
    plants, just said,
  • 1:22 - 1:25
    well, you know, how
    tall are your plants?
  • 1:25 - 1:26
    And they only want
    to hear one number.
  • 1:26 - 1:31
    They want to somehow
    have one number that
  • 1:31 - 1:33
    represents all of these
    different heights of plants.
  • 1:33 - 1:37
    How would you do that?
  • 1:37 - 1:39
    Well, you'd say, well,
    how can I find something
  • 1:39 - 1:41
    that-- maybe I want
    a typical number.
  • 1:41 - 1:44
    Maybe I want some number that
    somehow represents the middle.
  • 1:44 - 1:46
    Maybe I want the
    most frequent number.
  • 1:46 - 1:49
    Maybe I want the number
    that somehow represents
  • 1:49 - 1:51
    the center of all
    of these numbers.
  • 1:51 - 1:53
    And if you said any
    of those things,
  • 1:53 - 1:55
    you would actually have
    done the same things
  • 1:55 - 1:58
    that the people who first came
    up with descriptive statistics
  • 1:58 - 1:58
    said.
  • 1:58 - 2:00
    They said, well,
    how can we do it?
  • 2:00 - 2:05
    And we'll start by thinking
    of the idea of average.
  • 2:05 - 2:08
    And in every day
    terminology, average
  • 2:08 - 2:10
    has a very particular
    meaning, as we'll see.
  • 2:10 - 2:12
    When many people
    talk about average,
  • 2:12 - 2:13
    they're talking
    about the arithmetic
  • 2:13 - 2:15
    mean, which we'll see shortly.
  • 2:15 - 2:18
    But in statistics, average
    means something more general.
  • 2:18 - 2:23
    It really means
    give me a typical,
  • 2:23 - 2:30
    or give me a middle number,
    or-- and these are or's.
  • 2:30 - 2:32
    And really it's
    an attempt to find
  • 2:32 - 2:33
    a measure of central tendency.
  • 2:39 - 2:41
    So once again, you have
    a bunch of numbers.
  • 2:41 - 2:43
    You're somehow trying
    to represent these
  • 2:43 - 2:46
    with one number we'll call
    the average, that's somehow
  • 2:46 - 2:49
    typical, or middle,
    or the center somehow
  • 2:49 - 2:50
    of these numbers.
  • 2:50 - 2:54
    And as we'll see, there's
    many types of averages.
  • 2:54 - 2:57
    The first is the one that you're
    probably most familiar with.
  • 2:57 - 2:58
    It's the one-- and
    people talk about hey,
  • 2:58 - 3:01
    the average on this exam
    or the average height.
  • 3:01 - 3:03
    And that's the arithmetic mean.
  • 3:03 - 3:05
    Just let me write it in.
  • 3:05 - 3:13
    I'll write in yellow,
    arithmetic mean.
  • 3:13 - 3:16
    When arithmetic is a noun,
    we call it arithmetic.
  • 3:16 - 3:20
    When it's an adjective like
    this, we call it arithmetic,
  • 3:20 - 3:22
    arithmetic mean.
  • 3:22 - 3:25
    And this is really just the
    sum of all the numbers divided
  • 3:25 - 3:28
    by-- this is a human-constructed
    definition that we've
  • 3:28 - 3:32
    found useful-- the sum of
    all these numbers divided
  • 3:32 - 3:34
    by the number of
    numbers we have.
  • 3:34 - 3:37
    So given that, what
    is the arithmetic mean
  • 3:37 - 3:39
    of this data set?
  • 3:39 - 3:40
    Well, let's just compute it.
  • 3:40 - 3:46
    It's going to be 4 plus
    3 plus 1 plus 6 plus 1
  • 3:46 - 3:51
    plus 7 over the number
    of data points we have.
  • 3:51 - 3:53
    So we have six data points.
  • 3:53 - 3:55
    So we're going to divide by 6.
  • 3:55 - 4:02
    And we get 4 plus 3 is 7,
    plus 1 is 8, plus 6 is 14,
  • 4:02 - 4:05
    plus 1 is 15, plus 7.
  • 4:05 - 4:08
    15 plus 7 is 22.
  • 4:08 - 4:09
    Let me do that one more time.
  • 4:09 - 4:15
    You have 7, 8, 14, 15,
    22, all of that over 6.
  • 4:15 - 4:17
    And we could write
    this as a mixed number.
  • 4:17 - 4:21
    6 goes into 22 three times
    with a remainder of 4.
  • 4:21 - 4:25
    So it's 3 and 4/6, which is
    the same thing as 3 and 2/3.
  • 4:25 - 4:29
    We could write this as a
    decimal with 3.6 repeating.
  • 4:29 - 4:32
    So this is also 3.6 repeating.
  • 4:32 - 4:34
    We could write it any
    one of those ways.
  • 4:34 - 4:37
    But this is kind of a
    representative number.
  • 4:37 - 4:40
    This is trying to get
    at a central tendency.
  • 4:40 - 4:42
    Once again, these are
    human-constructed.
  • 4:42 - 4:44
    No one ever-- it's
    not like someone just
  • 4:44 - 4:46
    found some religious
    document that said,
  • 4:46 - 4:48
    this is the way that
    the arithmetic mean
  • 4:48 - 4:49
    must be defined.
  • 4:49 - 4:53
    It's not as pure
    of a computation
  • 4:53 - 4:55
    as, say, finding the
    circumference of the circle,
  • 4:55 - 4:57
    which there really is--
    that was kind of-- we
  • 4:57 - 4:58
    studied the universe.
  • 4:58 - 5:01
    And that just fell out of
    our study of the universe.
  • 5:01 - 5:02
    It's a human-constructed
    definition
  • 5:02 - 5:04
    that we found useful.
  • 5:04 - 5:07
    Now there are other ways
    to measure the average
  • 5:07 - 5:10
    or find a typical
    or middle value.
  • 5:10 - 5:14
    The other very typical
    way is the median.
  • 5:14 - 5:16
    And I will write median.
  • 5:16 - 5:17
    I'm running out of colors.
  • 5:17 - 5:19
    I will write median in pink.
  • 5:19 - 5:21
    So there is the median.
  • 5:21 - 5:25
    And the median is literally
    looking for the middle number.
  • 5:25 - 5:27
    So if you were to order
    all the numbers in your set
  • 5:27 - 5:31
    and find the middle one,
    then that is your median.
  • 5:31 - 5:34
    So given that, what's the
    median of this set of numbers
  • 5:34 - 5:36
    going to be?
  • 5:36 - 5:37
    Let's try to figure it out.
  • 5:37 - 5:38
    Let's try to order it.
  • 5:38 - 5:40
    So we have 1.
  • 5:40 - 5:41
    Then we have another 1.
  • 5:41 - 5:43
    Then we have a 3.
  • 5:43 - 5:47
    Then we have a 4, a 6, and a 7.
  • 5:47 - 5:49
    So all I did is
    I reordered this.
  • 5:49 - 5:51
    And so what's the middle number?
  • 5:51 - 5:52
    Well, you look here.
  • 5:52 - 5:55
    Since we have an even number of
    numbers, we have six numbers,
  • 5:55 - 5:57
    there's not one middle number.
  • 5:57 - 6:00
    You actually have two
    middle numbers here.
  • 6:00 - 6:02
    You have two middle
    numbers right over here.
  • 6:02 - 6:03
    You have the 3 and the 4.
  • 6:03 - 6:06
    And in this case, when you
    have two middle numbers,
  • 6:06 - 6:10
    you actually go halfway
    between these two numbers.
  • 6:10 - 6:12
    You're essentially taking the
    arithmetic mean of these two
  • 6:12 - 6:14
    numbers to find the median.
  • 6:14 - 6:16
    So the median is going
    to be halfway in-between
  • 6:16 - 6:19
    3 and 4, which is
    going to be 3.5.
  • 6:19 - 6:24
    So the median in
    this case is 3.5.
  • 6:24 - 6:27
    So if you have an even
    number of numbers, the median
  • 6:27 - 6:29
    or the middle two, the--
    essentially the arithmetic
  • 6:29 - 6:31
    mean of the middle two, or
    halfway between the middle two.
  • 6:31 - 6:33
    If you have an odd
    number of numbers,
  • 6:33 - 6:34
    it's a little bit
    easier to compute.
  • 6:34 - 6:36
    And just so that
    we see that, let
  • 6:36 - 6:37
    me give you another data set.
  • 6:37 - 6:39
    Let's say our data
    set-- and I'll
  • 6:39 - 6:42
    order it for us--
    let's say our data set
  • 6:42 - 6:56
    was 0, 7, 50, I don't know,
    10,000, and 1 million.
  • 6:56 - 6:57
    Let's say that is our data set.
  • 6:57 - 6:58
    Kind of a crazy data set.
  • 6:58 - 7:02
    But in this situation,
    what is our median?
  • 7:02 - 7:04
    Well, here we have five numbers.
  • 7:04 - 7:05
    We have an odd
    number of numbers.
  • 7:05 - 7:07
    So it's easier to
    pick out a middle.
  • 7:07 - 7:12
    The middle is the number that is
    greater than two of the numbers
  • 7:12 - 7:14
    and is less than
    two of the numbers.
  • 7:14 - 7:15
    It's exactly in the middle.
  • 7:15 - 7:19
    So in this case,
    our median is 50.
  • 7:19 - 7:21
    Now, the third measure
    of central tendency,
  • 7:21 - 7:22
    and this is the
    one that's probably
  • 7:22 - 7:26
    used least often in
    life, is the mode.
  • 7:26 - 7:28
    And people often
    forget about it.
  • 7:28 - 7:30
    It sounds like
    something very complex.
  • 7:30 - 7:31
    But what we'll see
    is it's actually
  • 7:31 - 7:33
    a very straightforward idea.
  • 7:33 - 7:36
    And in some ways, it
    is the most basic idea.
  • 7:36 - 7:41
    So the mode is actually the most
    common number in a data set,
  • 7:41 - 7:42
    if there is a most
    common number.
  • 7:42 - 7:44
    If all of the numbers
    are represented equally,
  • 7:44 - 7:46
    if there's no one single
    most common number,
  • 7:46 - 7:47
    then you have no mode.
  • 7:47 - 7:50
    But given that
    definition of the mode,
  • 7:50 - 7:54
    what is the single most common
    number in our original data
  • 7:54 - 7:58
    set, in this data
    set right over here?
  • 7:58 - 8:00
    Well, we only have one 4.
  • 8:00 - 8:01
    We only have one 3.
  • 8:01 - 8:03
    But we have two 1's.
  • 8:03 - 8:05
    We have one 6 and one 7.
  • 8:05 - 8:09
    So the number that shows up
    the most number of times here
  • 8:09 - 8:11
    is our 1.
  • 8:11 - 8:14
    So the mode, the most typical
    number, the most common number
  • 8:14 - 8:18
    here is a 1.
  • 8:18 - 8:20
    So, you see, these
    are all different ways
  • 8:20 - 8:23
    of trying to get at a typical,
    or middle, or central tendency.
  • 8:23 - 8:26
    But they do it in very,
    very different ways.
  • 8:26 - 8:27
    And as we study more
    and more statistics,
  • 8:27 - 8:30
    we'll see that they're
    good for different things.
  • 8:30 - 8:32
    This is used very frequently.
  • 8:32 - 8:35
    The median is really good if you
    have some kind of crazy number
  • 8:35 - 8:36
    out here that could
    have otherwise
  • 8:36 - 8:38
    skewed the arithmetic mean.
  • 8:38 - 8:41
    The mode could also be useful
    in situations like that,
  • 8:41 - 8:43
    especially if you do
    have one number that's
  • 8:43 - 8:46
    showing up a lot
    more frequently.
  • 8:46 - 8:48
    Anyway, I'll leave you there.
  • 8:48 - 8:52
    And we'll-- the next few videos,
    we will explore statistics even
  • 8:52 - 8:53
    deeper.
Title:
Statistics intro: mean, median and mode
Description:

more » « less
Video Language:
English
Team:
Khan Academy
Duration:
08:54
Fran Ontanaya edited English subtitles for Statistics intro: mean, median and mode Jul 13, 2020, 5:59 PM
Amara Bot edited English subtitles for Statistics intro: mean, median and mode Jul 13, 2020, 3:25 PM
Amara Bot edited English subtitles for Statistics intro: mean, median and mode Jul 13, 2020, 3:25 PM

English subtitles

Revisions Compare revisions