2. Randomized Monte Carlo Algorithm for
approximate median
This lecture was delivered at slow pace and its flavor was that of a
tutorial.
Reason: To show that designing and analyzing a randomized
algorithm demands right insight and just elementary probability.
2
3. A simple probability exercise
âą
There is a coin which gives HEADS with probability Œ and TAILS with
probability Ÿ. The coin is tossed times. What is the probability that we get at
least HEADS ?
[Stirlingâs approximation for Factorial: ]
3
4. Probability of getting
âat least HEADS in tossesâ
Probability of getting at least heads:
âą
Using Stirlingâs approximation
Since , so âŠ
Inverse exponential in .
4
5. Approximate median
Definition: Given an array A[] storing n numbers and Ï” > 0, compute an
element whose rank is in the range [(1- Ï”)n/2, (1+ Ï”)n/2].
Best Deterministic Algorithm:
âą âMedian of Mediansâ algorithm for finding exact median
âą Running time: O(n)
âą No faster algorithm possible for approximate median
Can you give a short proof ?
5
6. œ - Approximate median
A Randomized Algorithm
Rand-Approx-Median(A)
1. Let k ï c log n;
2. S ï â ;
3. For i=1 to k
4.
x ï an element selected randomly uniformly from A;
5.
S ï S U {x};
6. Sort S.
7. Report the median of S.
Running time: O(log n loglog n)
6
7. Analyzing the error probability of
Rand-approx-median
n/4
Left Quarter
Elements of A arranged in
Increasing order of values
3n/4
Right Quarter
When does the algorithm err ?
To answer this question, try to characterize what
will be a bad sample S ?
7
8. Analyzing the error probability of
Rand-approx-median
n/4
Elements of A arranged in
Increasing order of values
Left Quarter
Median of S
3n/4
Right Quarter
Observation: Algorithm makes an error only if k/2 or more elements
sampled from the Right Quarter (or Left Quarter).
8
9. Analyzing the error probability of
Rand-approx-median
âą
n/4
Elements of A arranged in
Increasing order of values
3n/4
Right Quarter
Left Quarter
Pr[ An element selected randomly from A is from Right quarter] = Œ
??
Pr[ Out of k elements sampled from A, at least k/2 are from Right quarter] = ??
for
Exactly the same as the coin
tossing exercise we did !
9
10. Main result we discussed
âą
Theorem: The Rand-approx-median algorithm fails to report œ
-approximate median from array A[1.. ] with probability at most.
Homework: Design a randomized Monte Carlo algorithm for
computing Ï”-approximate median of array A[1.. ] with running
time O(log n loglog n) and error probability for any given
constants Ï” and .
[Do this homework sincerely without any friendâs help.]
10
12. Elementary probability theory
(Relevant for CS648)
âą
âą
We shall mainly deal with discrete probability theory in this course.
We shall take the set theoretic approach to explain probability theory.
Consider any random experiment :
o Tossing a coin 5 times.
o Throwing a dice 2 times.
o Selecting a number randomly uniformly from [1..n].
How to capture the following facts in the theory of probability ?
1. Outcome will always be from a specified set.
2. Likelihood of each possible outcome is non-negative.
3. We may be interested in a collection of outcomes.
12
15. Exercises
A randomized algorithm can also be viewed as a random experiment.
1. What is the sample space associated with Randomized Quick sort ?
2. What is the sample space associated with Rand-approx-median
algorithm ?
15
16. An Important Advice
In the following slides, we shall state well known equations
(highlighted in yellow boxes) from probability theory.
âą You should internalize them fully.
âą We shall use them crucially in this course.
âą Make sincere attempts to solve exercises that follow.
16
19. Exercises
âą
âą
For events ,âŠ, defined over a probability space (,P), prove that P() =
âŠ
)
âą
There are letters envelopes. For each letter, there is a unique envelope in
which it should be placed. A careless postman places the letters randomly
into envelopes (one letter in each envelope). What is the probability that
no letter is placed correctly (into the envelope meant for it) ?
19
20. Conditional Probability
Happening of some event influences the likelihood of happening of other events. This
notion is formally captured by conditional probability as follows.
âą
Probability of event A conditioned on event B, compactly represented as P[A|B],
means the following.
Given that event B has happened, what is the probability that event A has also
happened ?
You might have seen and used the following equation for conditional probability.
P[A|B] =
Can you give suitable reason to justify the validity of the above equation ?
In particular, give justification for ] in numerator and ] in denominator in this
equation.
20
21. Exercises
âą
A man possesses five coins, two of which are double-headed, one is
double-tailed, and two are normal. He shuts his eyes, picks a coin at
random, and tosses it. What is the probability that the lower face of the
coin is a head ? He opens his eyes and sees that the coin is showing heads;
what it the probability that the lower face is a head ? He shuts his eyes
again, and tosses the coin again. What is the probability that the lower
face is a head ? He opens his eyes and sees that the coin is showing heads;
what is the probability that the lower face is a head ? He discards this
coin, picks another at random, and tosses it. What is the probability that it
shows heads ?
21
23. Exercises
âą
âą
There are sticks each of different heights. There are vacant slots arranged
along a line and numbered from 1 to as we move from left to right. The
sticks are placed into the slots according to a uniformly random
permutation. A stick placed at th slot is said to be a dominating stick if its
height is largest among all sticks placed in slots 1 to . Find the probability
that th slot contains a dominating stick.
23
25. Exercises
âą
1.
Two fair dice are rolled. Show that the event that their sum is 7 is
independent of the score shown by the first die.
2.
Let (,P) be a probability space where = {1,2,âŠ,p} for a given prime
number p, and each elementary event has probability 1/p. Show that if
two events A and B defined over (,P) are independent, then at least one
of A and B is either â or .
25