note: for a basic course in probability, you may want to skip some notes on measure theory.

basics and combinatorics

definitionsample space

the sample space is the set of all possible outcomes of an experiment. it can be finite or infinite.

definitionevent

an event is a subset of the sample space , or an element of the power set of the sample space .

definitionobservable event set

the set of all observable events is denoted by , where .

remark

usually, if is countable, . however, sometimes many events are excluded from since it is not possible for them to happen.

definitionσ-algebra

the set is called a -algebra if:

  1. ;
  2. , , then ; and
  3. , , then .
definitionprobability measure

is a probability measure if it satisfies the following three axioms:

  1. ,
  2. , and
  3. ,

where are disjunct.

remark
  • ,
  • ,
  • if , then , and
  • .
propositionde morgan's laws

let be a set of events.

theoreminclusion-exclusion principle

let be a set of events, then

where

definitionlaplace space

if with where all have the same probability , is called laplace space and has a discrete uniform distribution. for some event , we have

remark

the discrete uniform distribution exists only if is finite.

definitionconditional probability

given two events and with , the probability of given is defined as

theoremtotal probability

let be a set of disjunct events where , then, for any event ,

definitionbayes' rule

let be the set of disjunct event where with for all , then, for an event with , we have

definitionindependence

a set of events are independent if, for all with , we have

definitionfactorial

the factorial function is defined by the product

for integer .

definitiongamma function

let with , the gamma function is defined via the following convergent improper integral:

remarkgamma function properties
  • .
  • .
  • .
  • , .
definitionpermutation

let be the number of total objects and be the number of objects we want to select. a permutation is an arrangement of elements where we care about ordering.

  1. repetition not allowed:
  1. repetition allowed:
definitioncombination

let be the number of total objects and be the number of objects we want to select. a combination is an arrangement of elements where we do not care about ordering.

  1. repetition not allowed:
  1. repetition allowed:
remark

repetition is the same as replacement.

remarkbinomial coefficient properties
  • ,
  • ,
  • ,
  • ,
  • , and
  • .
remarksum properties

let , and .

  • .
  • .
  • .
  • .
  • .
theorembinomial expansion

random variables

definitionrandom variable

let be a probability space. a random variable on is a function

if the image is countable, is called a discrete random variable, otherwise it is called a continuous random variable.

definitionprobability density function / probability mass function

the probability density function (pdf) of a random variable is a function defined as

with discrete, it is called probability mass function.

remark
  • if is a discrete random variable, then .
  • if is a continuous random variable, then .
definitioncumulative distribution

the cumulative distribution function (cdf) of a random variable is a function defined as

if the pdf is given, the cdf can be expressed with

remarkcumulative distribution properties
  • if , then (monotonicity).
  • if , then .
  • and .
  • .
  • .
  • .
definitionexpected value

let be a random variable. the expected value is defined as

remarkexpected value properties
  • if ,
  • ,
  • if ,
  • ,
  • ,
  • , and
  • for independent .
remark

the expected value is a linear operator.

definitionraw moment / central moment

let . the th (raw) moment is defined as

the th central moment is defined as

definitionexpected value of functions

let be a random variable and with , then

definitionmoment-generating function

let be a random variable. the moment-generating function of is defined as

definitioncharacteristic function

let be a random variable. the characteristic function of is defined as

where .

definitionvariance

let be a random variable with . the variance of is defined as

remarkvariance properties
  • ,
  • ,
  • ,
  • ,
  • , and
  • if , .
definitionstandard deviation

let be a random variable with . the standard deviation of is defined as

definitioncovariance

let and be random variables with finite expected value. the covariance of and is defined as

remark

the covariance is a measure of correlation between two random variables. if tends to increase as increases. if tends to decrease as increases. if , then and are uncorrelated.

remarkcovariance properties
  • ,
  • , and
  • .
definitioncorrelation

let and be random variables with finite expected value. the correlation of and is defined as

remark

correlation is the same as covariance but normalized with values between and .

definitioncoefficient of variation

the coefficient of variation is defined as the ratio of the standard deviation to the mean.

definitionindicator function

the indicator function for a set (event) is defined as

definitionsurvival function

let be a continuous random variable with cumulative distribution function on the interval . its survival function or reliability function is defined as

definitionmemorylessness

suppose is a non-negative random variable. the probability distribution of is memoryless if for any , we have

theoremmarkov's inequality

let be a random variable and be an increasing function, then, for all with , we have

remark

for practical uses usually .

theoremchebyshev's inequality

let be a random variable with , then, if ,

theoremjensen's inequality

if is a random variable and is a convex function, then

multivariate distributions

definitionjoint probability density function

the joint probability density function with is a function defined as

definitionjoint cumulative distribution function

the joint cumulative distribution function with is a function defined as

if the joint pdf is given, it can be expressed with

where and .

remark
definitionmarginal probability density function

the marginal probability density function of given a joint pdf is defined as

where , and in the discrete case .

remark

the idea of the marginal probability is to ignore all other random variables and consider only the one we're interested in.

definitionmarginal cumulative distribution function

the marginal cumulative distribution function of given a joint cdf is defined as

definitionconditional distribution

the conditional distribution is defined as

definitionindependence

the random variables are independent if

similarly, if their pdf is absolutely continuous, they are independent if

theoremfunction independence

if the random variables are independent and is a function with , then also are independent.

theorem

the random variables are independent if and only if, , we have

definitionjoint expected value

the joint expected value of a random variable is defined as

where , and in the discrete case .

definitionconditional expected value

the conditional expected value of random variables and is defined as

remark
  • ,
  • if and are independent, and
  • .
definition

let .

where and .

theoremtransformation

when is a strictly increasing function, then

when is a strictly decreasing function, then

equivalently, .

in higher dimensions, the derivative generalizes to the determinant of the jacobian matrix --- the matrix with . thus, for real-valued vector and ,

theoremtransformation

let have a continuous cdf and define . then , i.e., for .

theorem

for and , if and exist, and for all in some neighbourhood of zero, then for all .

remarkmonte carlo integration

let be the integral of a function that is hard to evaluate, then

where is uniformly distributed. then, by the law of large numbers, we know that we can approximate by randomly sampling from .

remarksum

let be independent random variables, then the sum has a pdf evaluated with a convolution between all pdfs

in the special case where , we have

remark

often it is much easier to use properties of the random variables to find the sum instead of evaluating the convolution.

remarkproduct

let and be independent random variables. to evaluate the pdf and the cdf of , we proceed as

where the pdf is

remarkquotient

let and be independent random variables. to evaluate the pdf and the cdf of , we proceed as

where the pdf is

definitioncovariance matrix / correlation matrix

let be random variables. let and for every . the covariance and correlation matrices as defined respectively as

theorem

let and be random variables. if and only if there exist such that .