Introduction to Statistics

Introduction

What is Statistics?

Statistics: Methods to collect, analyze, present, interpret data, and make decisions.

Two Types of Statistics:

  1. Descriptive: Methods to organize, display, and describe data.
  2. Inferential: Methods that use sample results to help make decisions or predictions about a population.

Note: Probability lies in-between descriptive and inferential statistics.

Course Progression: Descriptive Statistics \to Probability \to Inferential Statistics

Terms in Statistics

Population: The entire group of individuals that we want information about.

Sample: The subset of the population we examine to gather information.

Random Sample: Sample where each element in the population has a chance of being selected.

Sample Size (n): The number of observations in a sample.

Variable: Characteristic of a person or thing.

Observation (x_i): Value of a variable for an element.

Data Set (X): Collection of observations on \ge 1 variables.

Example: Finding the average number of plushies students have in their bedrooms

Types of Variables

Two Types of Variables:

  1. Numerical (Quantitative): Variables whose values are numeric.
  2. Categorical (Qualitative): Variables whose values are not numeric.

Important: Variable type is determined by how we use it.

Note: Different types require different analyses.

Random Sampling

Simple R.S. Select n objects at random from the population.

Stratified R.S. Divide population into strata11. Non-overlapping groups, then select simple random sample for each stratum.

Cluster R.S. Divide population into clusters22. Groups, select clusters at random, then select simple random sample for each cluster.

Note: Clusters v.s. Strata (some v.s. all)

Systematic R.S. Select every kth element in the population.

Multi-Stage R.S.: A mixture or combination of at least two methods above (except simple R.S.)

Note: In this course, we’ll assume the samples we’re given are good samples and gotten through simple R.S., unless told otherwise.


  1. Non-overlapping groups↩︎

  2. Groups↩︎