ΘρϵηΠατπ

Probability Mass Function
Suppose that X is a discrete random varaible, then the probability mass function of X is defined by the function p:[0,1] pX(x)=P(X=x)
Bernoulli Random Variable
We say that a random variable X is a Bernoulli random varaible iff there is some p[0,1] P(X=1)=p and P(X=0)=1p we denote this symbolically as Xbern(p)
Expected Value of a Bernoulli Random Variable
Suppose Xbern(p) then E(X)=p
E(X)=P(X=1)1+P(X=0)0=p1
Binomial Distribution
The random varialbe X is said to have a binomial distribution on n trials, with probability of success per trial p iff X=dZ1+Z2++Zn where Z1,,Zn IID Zibern(p)
Expected Value of the Binomial Distribution
Suppose that Xbinom(n,p) then E(X)=np
Probability Mass Function of the Binomial Distribution
Suppose Xbinom(n,p) then pX(k)=(nk)pk(1p)nk

Note that pX(k) represents the probability of getting exactly k successes in n independent Bernouilli trials each with the same probability.

Coin Counting
A coin is tossed 12 times and 7 heads occur
  • Determine the probability that the 7th head occurs on the 12th toss given that there are exactly 7 heads in these 12 tosses
  • Now determine the probability that the 6th head occurs on the 9th toss given that there are exactly 7 heads in the first 12 tosses
  • Finally determine the probability that the 2nd head toss occurs on the 4th toss and the 6th head occurs on the 9th toss given that there are exactly 7 heads in the first 12 tosses

Let A be the event that the 7th head is on the 12th toss, and B the event that there are exactly 7 heads in these 12 tosses. In this case our goal is to determine P(A|B)=P(AB)P(B), note that the event A is equal to X=k where Xbinom(12,12), and thus P(X=7)=(127)(12)7(12)5.

Let's now try to compute P(AB) which is the probability of getting the 7th head on the 12th toss and exactly 7 heads in those 12 tosses. Note that by getting the 7th head on the 12th toss means that in the first 11 tosses exactly 6 heads occurred in any order, and then independently of that you do your last toss so that P(AB)=((116)(12)6(12)5)12

Now that we've computed the values of P(AB) along with P(B) we divide them to get the answer.


Let's take a similar approach where this time C,D are the events that the 6th head is on 9th toss and there exactly 7 heads in these 12 tosses respectively, so that our goal is to compute P(C|D). Recall that we can compute that via P(C|D)=P(CD)P(D), but we know what P(D) is from our previous answer, as that was P(B), therefore we just need a way to compute P(CD).

P(CD) is the probability that the 6th head is on the 9th toss and there are exactly 7 heads in 12 tosses. We can break this up into two parts, the first being that you have to get exactly 5 heads in the first 8 tosses, then land another head on the 9th toss and then get exactly 1 head in the last 3 tosses, therefore this probability is given by ((85)(12)5(12)3)12((31)(12)1(12)2)


Similarly to the previous two questions, we use the definition of conditional probability to write it as a fraction, we will only need to focus on the numerator which is the probability that the 2nd head occurs on the 4th toss and the 6th head occurs on the 9th toss and that there are exactly 7 heads in the first 12 tosses. It can be handled similarly to the above by splitting it into separate binomial computations.

Consider flipping a coin that has probability θ of coming up heads and 1θ of coming up tails, then the random variable which evalutes to 1 if the coin is heads, and 0 if the coin is tails has the bernouilli distribution

Poisson Distribution
The random variable N is said to have a poisson distribution with average number of successes λ>0 iff P(N=k)=eλλkk! where k0. We write Npois(λ)
Poisson as a Limit of Binomial
For any i suppose we have some pi[0,1], then define λi=ipi. Moreover suppose we have some Xibinom(i,pi). If limkλk exists denote it by λ and we have: P(N=k)=limnP(Xi=k) where Npois(λ)
P(Xn=k)amp;=(nk)pnk(1pn)nkamp;=(nk)(λnn)k(1λnn)nkamp;=n(n1)(nk+1)k(k1)1(λnn)k(1λnn)nkamp;=(1λnn)nλnkk!(n(n1)(nk1)nk(1λnn)k)amp;=(1λnn)nλnkk!((11n)(1k1n)(1λnn)(1λnn)) Note that as n we see that 1λnn1 and 1xn1 where x so that ((11n)(1k1n)(1λnn)(1λnn))1 therefore as n we have that P(Xn=k)eλλkk!

This also allows us to compute a binomial with a large n by approximating it with a poisson.

Number of Trials until Success as a Random Variable
Suppose we have (Zn),n1 such that Zibern(p), then we may recursively define the function Ti:11 as T1=min({n:Zn=1}) and for any k2 Tk=min({n1:Zn=1}{T1,Tk1}) where Tk is said to count the number of trials required until and include the k-th success.

To best understand the above definition consider the sequence (Zn=1),n1 which may be of the form (0,0,1,0,1,0,0,1,) and note that T1=min({3,5,8})=3, to determine T2 we have min({3,5,8}{3})=5 and so on. In this way the formula removes the previous k1 instances to find the kth instance.

Tk=n Equivalence
For any n1 we have: Tk=ni=1n1Zi=k1Zn=1
Negative Binomial
The random variable Y is said to have a negative binomial distribution on k successes each with probability p iff Y=dTk and we denote this as Ynegbin(k,p)
Probability Mass Function of the Negative Binomial Distribution
Suppose that Ynegbin(k,p) for k1, p(0,1] then for any n{k,k+1,k+2,} P(Y=n)=(n1k1)pk(1p)nk
Geometric Distribution
We say that the random variable W has a geometric distribution denoted by Wgeo(p) where p(0,1] when W=dT1
Large Number of Tossed Coins
A coin is tossed 1000 statistically independent times and exactly 3 heads occurs.
  • If we toss it 1000 times more, and let N denote the number of heads we might get this time, estimate the probability that N is anywhere between 1 and 5. [hint: e320.1]
  • Suppose now we toss the coin 1000x times and let Nx denote the number of heads in 1000x tosses and T1 the random number of trials until we obtain the first head. How large does x have to be so that P(T1>x)=1/2 ?
  • Now determine the probability that the 2nd head occurs between 750 and 1250 tosses.

This situation is perfectly modelled by the binomial distribution, (Nbinom(1000,12) ) it's just that it's hard to compute the binomial distribution for large values. Therefore we can employ an approximation using the poisson distribution as discussed previously, we want to know P(N{1,2,3,4,5})=P(N=1)+P(N=2)+P(N=3)+P(N=4)+P(N=5) and we can approximate each such value, using λ=100012=500

  • P(N=1)e50050011
  • P(N=2)e50050022
  • ...
  • P(N=5)e50050055

Therefore by adding all these numbers together we obtain an estimation on the probability of N being anywhere between 1 and 5.


We want to determine a value for x such that P(T1>x)=12, to understand this concretely if x=5 then P(T1>5) asks "what is the probability of getting our first head after 5 tosses of the coin")

Poisson Process
Suppose that (Tn,n) is a poisson process with TnG(n,λ1) and let U=T3T8&V=T3T8T3. Determine the following.
  • EU and σ(U).
  • EV and σ(V).
  • P(U>1/2).
Recall that E(Tn)=nλ as this is the formula for the expected value for the gamma distribution. Also if a random varaible Xgamma(p,θ) then we have E(X1)=1θ(p1) if p>1. Thus we have E(T8T3)=E(T81T3) Since the sequence Tn is IID (why is that idk) then we can say E(T81T3)=E(T8)E(1T3)=8λ11λ(31)=4
Fundamental Properties of Covariance
Let X,Y,Z be random variables and c then we have
  • Var(X)=Cov(X,X)
  • Cov(cX,Y)=cCov(X,Y)=Cov(X,cY)
  • Cov(X+Y,Z)=Cov(X,Z)+Cov(Y,Z)
  • Cov(X,Y+Z)=Cov(X,Y)+Cov(X,Z)
  • Cov(X,Y)=Cov(Y,X)
  • Cov(X,c)=0
Covariance Equations
For any -valued X and Y and any scalars s and t consider the simple difference W=Y(s+tX). But, if EW=0=cov(X,W) this will determine s and t.
  • Assuming that EX=EY=varX=varX=1 and cov(X,Y)=1/2 determine two simultaneous equations thus to obtain the unique α and β such that Y=α+βX+W and E(W)=0=cov(X,W).
  • Now supposing that Xgamma(p=3,θ=2) and Y=X1, determine α and β
  • For the assumptions in b), determine the correlation coefficient ρ(α+βX,Y).
Before doing anything else we consider the following Cov(Y,W)amp;=Cov(Y,Y(s+tX))amp;=Cov(Y,Y)Cov(Y,s+tX)amp;=Var(Y)tCov(Y,X)amp;=1t12 We did this because we knew that the properties of co-variance would result in some expression where some of the sub-expressions are the same as the ones we know information about. One thing that we require is that E(W)=0, we see that E(W)=E(Y(s+tX))=E(Y)stE(X)=0 So we conclude that s=E(Y)tE(X). We also need that Cov(X,W)=0 that is to say Cov(X,W)amp;=Cov(X,Y(s+tX))amp;=Cov(X,Y)Cov(X,s+tX)amp;=Cov(X,Y)(0+tCov(X,X))amp;=Cov(X,Y)tVar(X)amp;=0 in that we have t=Cov(X,Y)Var(X)=12 by our assumptions. This also allows us to deduce that s=E(Y)12E(X) therefore we can now observe that Yamp;=W+(s+tX)amp;=W+(E(Y)12E(X)+(12)X)amp;=(E(Y)12E(X))+12X+W so that α=(E(Y)12E(X)) and β=12.
Sum of Bernoulli
Suppose that Zi IID bern(p=14) for i1, determine the following
  • P(Z1+Z2=0Z3+Z4=1Z5+Z6=2)
  • P(Z1=0Z2+Z4=1|i=66Zi=3)

Recall that if X,Ybern(p) then we know that X+Ybin(2,14). Therefore we can use the binomial formula when trying to compute P(X+Y=k) for some integer k. In our specific question we note that the given events are independent (todo explain why), and thus we have P(Z1+Z2=0Z3+Z4=1Z5+Z6=2)amp;=P(Z1+Z2=0)P(Z3+Z4=1)P(Z5+Z6=2)amp;=(20)(34)2(21)(14)(34)(22)(14)2amp;=916616116amp;=33211


To determine this quantity we first use the definition of conditional probability and then employ the same strategy as above.

Geometric Expectation
Suppose that X|Nbin(N,35) and that Ngeo(23)
  • For the varaible N obtain E(N) and E(N(N1))

Recall that for Ngeo(23) we know that P(N=n)=pqn1 and an easy way to remember that is that geo is the distribution of how many trials are required before the first success. Note that in this case p=23 and q=1p=13. Now let's try to compute the expected value: E(N)amp;=n=1nP(N=n)amp;=n=1npqn1amp;=pn=1nqn1amp;=p(ddq(11q))amp;=p1(1q)2amp;=p1p2amp;=1pamp;=32

Taking a look at E(N(N1)) then we have E(N(N1))amp;=n=2n(n1)pqn1amp;=pqn=2n(n1)qn2amp;=pq(d2dq2(11q))amp;=pq2(1q)3amp;=2qp2amp;=2/3(2/3)2=32

Poisson Quotient
Suppose that Tn,n1 is a poisson process where we know that Tngamma(n,15) and let U=T2T5 and V=T2T5T2

Observe that the random variable U satisfies U=T2T5=T2T2+T3 meaning that Ubeta(2,3), and then we have E(U)=E(T2T5)=E(T2)E(T5)=2/λ5/λ=25