Who Solved the Secretary Problem?

Transcription

Who Solved the Secretary Problem?
Who Solved the Secretary Problem?
Author(s): Thomas S. Ferguson
Source: Statistical Science, Vol. 4, No. 3 (Aug., 1989), pp. 282-289
Published by: Institute of Mathematical Statistics
Stable URL: http://www.jstor.org/stable/2245639
Accessed: 21/11/2010 11:01
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/action/showPublisher?publisherCode=ims.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact [email protected].
Institute of Mathematical Statistics is collaborating with JSTOR to digitize, preserve and extend access to
Statistical Science.
http://www.jstor.org
StatisticalScience
1989, Vol. 4, No. 3, 282-296
WhoSolved the SecretaryProblem?
Thomas S. Ferguson
Abstract.In MartinGardner'sMathematical
GamescolumnintheFebruary
1960issueofScientific
American,
thereappeareda simpleproblemthathas
cometobe knowntodayas theSecretary
ortheMarriageProblem.
Problem,
It has sincebeentakenup and developedbymanyeminent
and
probabilists
statisticiansand has been extendedand generalizedin manydifferent
directionsso that now one can say that it constitutesa "field"within
The objectof this articleis partly
mathematics-probability-optimization.
historical(to givea freshviewoftheoriginsoftheproblem,
touching
upon
Cayleyand Kepler),partlyreviewofthefield(listingthesubfields
ofrecent
interest),partlyserious(to answerthe questionposed in the title),and
The contentsof thispaperwerefirstgivenas the
partlyentertainment.
AllenT. Craiglectureat theUniversity
ofIowa,1988.
searchprobKeywordsandphrases:Secretary
problem,
marriage
problem,
lem,relativeranks,stoppingtimes,minimaxrules.
and the
thinkyou will findthe journeyinteresting,
conclusionsurprising.
1. INTRODUCTION
In thelate 1950'sand early1960'sthereappeareda
2. STATEMENTOF THE PROBLEM
simple,partlyrecreational,
problemknownas the
secretaryproblem,or the marriageproblem,or the
The reader'sfirstreactionto thetitlemightwellbe
dowryproblem,
thatmadeitswayaroundthematheto ask, "Whichsecretaryproblem?".Afterall, as I
maticalcommunity.
The problemhas a certainappeal.
havejust implied,thereare manyvariationson the
It is easy to state and has a strikingsolution.It
problemin its simplestform
The secretary
problem.
was immediately
takenup and developedby certain
features.
following
has
the
eminentprobabilistsand statisticians,
amongthem
Lindley(1961),Dynkin(1963),Chow,Moriguti,
Robpositionavailable.
1. Thereis one secretarial
bins and Samuels'(1964),and Gilbertand Mosteller
The numbern ofapplicantsis known.
2.
(1966). Since that time,the problemhas been exsequentiallyin
3. The applicantsare interviewed
tendedand generalizedin manydifferent
directions
randomorder,each orderbeingequallylikely.
so thatnowone can saythatit constitutes
a "field"of
4. It is assumedthatyoucan rankall theapplicants
studywithinmathematics-probability-optimization.
frombestto worstwithoutties.The decisionto
One can see fromthereviewpaperbyFreeman(1983)
acceptor rejectan applicantmustbe basedonly
howextensiveand vast the fieldhas become;moreon the relativeranksof thoseapplicantsinterin
itsexponential
over,thefieldhas continued
growth
viewedso far.
theyearssincethatpaperappeared.
5. An applicantonce rejectedcannotlaterbe reOne mainobjectiveofthepresentarticleis historicalled.
oftheproblemwiththeaim
cal,to reviewthehistory
andwillbe satisfiedwith
6. You areveryparticular
ofdetermining
whowasthefirstto solvethesecretary
nothingbut theverybest.(That is, yourpayoff
problem.The historicalreviewmaytakeus far,but I
is 1 ifyouchoosethebestofthen applicantsand
Ootherwise.)
Thomas S. Ferguson is Professorof Mathematics at
Universityof California,Los Angeles.His mailingaddress is: Department of Mathematics, Universityof
California,405 HilgardAvenue,Los Angeles,California
90024.
282
simplesoluThis basic problemhas a remarkably
can be restricted
tion.First,one showsthatattention
to theclass ofrulesthatforsomeintegerr > 1 rejects
the firstr - 1 applicants,and thenchoosesthe next
applicantwho is best in the relativerankingof the
observedapplicants.For sucha rule,theprobability,
WHO SOLVED THE SECRETARY PROBLEM?
283
/0(r),ofselectingthebestapplicantis 1/nforr = 1,
and,forr > 1,
on mathematical
probfianceproblemat a conference
Univerlemsin logisticsheldat GeorgeWashington
1988).
sityin January1950(personalcommunication,
n=E
I personallyremember
Jthapplicantis best
workingon extensionsof the
andyouselectit
(2.1)
j=nr)J=r
problemin the summerof 1959. In any case, many
peopleknewof the problemby the timeit appeared
in print.
j=r \fl/11 ( n j=rJ
Lindley(1961) seemsto be the firstto solve the
The optimalr is the one thatmaximizesthisproba- problemin a scientificjournal. He extends the
bility.For smallvaluesof n, the optimalr can easily
utilitybased on the rank
problemto an arbitrary
be computed.Of interestare the approximate
ofthe applicantselected,and considersin particular
values
oftheoptimalr forlargen. Ifwe let n tendto infinity the problemof minimizing
the expectedrankof the
and writex as the limitof r/n,thenusingt forj/n
thedifapplicantselected,rank1 beingbest.However,
and dt for1/n,the sumbecomesa Riemannapproxi- ficultproblemof finding
forlargen,
the asymptotic,
mationto an integral,
optimalstrategiesand expectedrankwas leftopen,
Robbins
and finallysolvedneatlybyChow,Moriguti,
and Samuels (1964). Dynkin (1963) considersthe
problemas an applicationof the theoryof Markov
(
i)j(
(2.2)
1)(n)
interpreted,
stoppingtimes,and showsthat,properly
x
(-) dt= -x log(x).
theproblemis monotoneso thatthe one-stagelookaheadruleis optimal.
The value ofx thatmaximizesthisquantityis easily
Then,in 1966,camethebasic paperofGilbertand
foundbysettingthederivative
withrespectto x equal
withelegantderivations
and extensionsin
Mosteller,
to zeroand thensolvingforx. Whenthisis donewe
a numberofimportant
In particular,
they
directions.
findthat
allowr choicesto obtainthe best;theyconsiderthe
problemof obtainingthe best or the nextbest;they
optimal x = lle = .367879 *.. , and
(2.3)
treatthe "full-information"
case, in whichone is al= l/e.
optimalprobability
lowedto observethe actualvalues of the applicants
Thus forlargen, it is approximately
optimalto wait
froma given
(presumedto be chosenindependently
untilabout 37% of the applicantshave been inter- distribution),
forboththebest-choice
problemand for
viewedand thento selectthenextrelatively
bestone.
theminimum-rank
problem;theyanalyzesomegame
The probability
ofsuccessis also about37%.
theoreticversionsof the problem;etc. This paper,
This derivation
mayseema littleloose,butit gives
morethanthe others,foreshadowed
the explosionof
therightanswer.Fora lucidpresentation
oftheseand
and effort
that wouldimpact
ideas, generalizations,
relatedresults,see the basic paper of Gilbertand
in 1972and thatcontinuesstrongly
thisarea starting
Mosteller(1966).
willbe mentioned
today.Someofthesecontributions
in Section5.
3. HISTORICALBACKGROUND
This, briefly,is the officialearly historyof the
to discover
secretary
problem.Sincewe areto attempt
There is some obscurityas to the originsof this
who
we
as
is
first
solved
this
shall,
customary
problem,
problem.It seemsto be generally
agreedamongworkin
in
time,
looking
historical
papers,
proceed
backward
ersin thefieldthatthefirststatement
oftheproblem
in
for
of
the
idea
hidden
the
literature.
forgotten
germ
to appear in printoccurredin the February1960
The firstpossibilityoccursin the workof Arthur
,column of Martin Gardner in ScientificAmerican,
whereit is attributed
to Fox and Marnie.A solution Cayley.
to theproblemis outlinedin theMarch1960issueof
4. CAYLEY'S PROBLEM
Scientific American and
\
-
f
attributed to Moser and
Pounder.In 1963,it appearedas a problemin The
AmericanMathematicalMonthlycontributedby Bis-
singerand Siegel(1963); in 1964,a solutionappeared
dueto Bosch.But Mostellerlearnedoftheproblemin
1955 fromAndrewGleason,"who claimedto have
heardit fromanother,"(Gilbertand Mosteller,1966).
HerbRobbinsrecallsdiscussing
theproblemin 195354 at ColumbiaUniversity,
and MerrillFlood recalls
presenting
a versionoftheproblemthathe calledthe
The distinguished
Arthur
Englishmathematician,
Cayley(1821-1895),is perhapsbest knownforhis
seminalworkin thetheoryofalgebraicinvariants.
He
was also one ofthe mostprolificmathematicians
the
worldhas ever known.His collectedworkscontain
some966 paperstouchingon manysubjectsin mathPaper
ematics,theoretical
dynamicsand astronomy.
andsolutions
#705containssome50pagesofproblems
thatCayleysubmitted
to theEducationalTimesfrom
284
T. S. FERGUSON
1871to 1894.One oftheseproblemsCayley(1875),is
as follows:
4528.(ProposedbyProfessor
Cayley)A lottery
is
arrangedas follows:There are n ticketsrepresentinga, b, c, ... pounds respectively.A person
drawsonce;looksat his ticket;and ifhe pleases,
drawsagain(outoftheremaining
n - 1 tickets);
and so on,drawingin all notmorethank times;
and he receivesthevalueofthelastticketdrawn.
Supposingthathe regulateshis drawingsin the
mannermostadvantageousto himaccordingto
the theoryof probabilities,
whatis the value of
hisexpectation?
Fromthe "Solutionby the Proposer,"we see that
thatk, n, and the
Cayleybelievesit to be understood
a, b, c, ... are known numbers. (Note that here k,
ratherthan n, represents
the totalnumberof drawings.)He solvesthe problemby whatis now known
as the methodof backwardinductionof dynamic
programming.
As an example,he takes n = 4, and
a, b, c, d = 1, 2, 3, 4, and findsfork = 1, 2, 3, 4, the
values of the most advantageousexpectationto be
10/4 38/12, 85/24, 4 resp.
fromoblivionby
Cayley'sproblemwas resurrected
Moser(1956),whoalso reformulated
theproblemin a
neaterguisewhichmaybe viewedas an approximation
to the Cayleyproblemwhenn is verylargeand a, b,
n: You observe, sequentially,
c, ... are 1, 2, ...,
randomvariables X1, ... , Xk known to be iid froma
on theinterval(0, 1); ifyoustop
uniform
distribution
then
afterobserving
youreceiveXj as yourreward.
Xj,
The optimalrule,as foundby Moser,is to stopwhen
thereare m observationsleftto be observedif the
is greaterthanEm,
value ofthe presentobservation
wherethe Emare definedrecursively
by Eo = 0 and
and Chowand Robbins(1963):RandomvariablesX1,
are observedsequentially
at a costofc > 0 per
observation;
ifyoustopafterobserving
X,, thenyou
receiveXn - nc. If recall of past observationsis
allowed,thepayoffforstoppingafterobserving
Xnis
max(X1,..., XX) - nc; such problemsweretreatedby
MacQueen and Miller (1960), Derman and Sacks
(1960) and Chow and Robbins (1961). In Karlin
(1962),the problemis solvedwitha discountrather
than a cost.See DeGroot(1970) fora fulleraccount
ofthesedevelopments.
This class ofproblemsformsa ratherdistinctsetof
problems
thatis stillbeingconfused
withthesecretary
problems.Since thereare so manyvariationsof the
basicsecretary
problem(eachofthe6 conditions
listed
in Section2 has beenmodified
byat leastone author),
I thinkit is worthwhile
to tryto definewhata secretaryproblemis. My definition
is: A secretary
problem
...
X2 *
is a sequential observationand selectionproblem in
which the payoffdepends on the observationsonly
throughtheirrelativeranksand nototherwiseon their
actual values.
Withthisdefinition
then,Cayley'sproblemis not
evena secretary
problem.We mustlookelsewhereto
see whosolvedthesecretary
problem.Proceeding
farther back in time,we come to the firstpractical
I couldfindofthesesequentialobservation
application
and selectiontechniques:the selectionof a wifeby
JohannesKepler.
5. KEPLER'S PROBLEM
Germanastronomer,
Johannes
Whenthecelebrated
Kepler (1571-1630),lost his firstwifeto cholerain
a newwifeusingthe same
1611,he set aboutfinding
of
methodical
thoroughness
and carefulconsideration
the data thathe used in finding
the orbitofMars to
be an ellipse.His first,
notaltogether
happy,marriage
Em+1= (1 - E2)/2. The correspondingequations are
discussedin Guttman(1960) forthenormaldistribu- had been arrangedfor him,and this time he was
to makehis owndecision.In a longletter
tion,in Karlin(1962) fortheexponential
distribution determined
on October23, 1613,written
and in Gilbertand Mosteller(1966) forthe inverse to a BaronStrahlendorf
afterhe had madehis selection,he describesin great
powerdistribution.
detailthe problemshe facedand the reasonsbehind
beAlthoughthereare strongpointsof similarity
tweenCayley'sproblemand the secretaryproblem, each ofthe decisionshe made.He arrangedto interviewand to choosefromamongno fewerthaneleven
thereis one important
difference.
The payoffis not
candidatesforhis hand.The processconsumedmuch
one or zerodependingon whether
select
the
best
you
or not; it is a numericalquantitydependingon the
of his attentionand energyfornearly2 years,what
intothevirtuesanddrawbacks
of
the
difference
with
theinvestigations
intrinsic
value
objectselected.This
withher
her
of
each
playsa big rolein the feelingoftheproblemand has
candidate, dowry,negotiations
of
etc.
the
advice
led to a class ofproblems,
calledvariouslythe house
parents,naturalhesitations,
friends,
an
contains
of
Koestler
The book
Arthur
huntingproblem,the problemof sellingan asset,
(1960)
of
the
and
or the searchproblem,witha literatureas largeas
entertaining insightful
exposition
process.
The bookofCarolaBaumgardt(1951) containsmuch
forthe secretary
problems.The basic problemis the
information.
supplementary
Cayley-Moser
problemwithan infinite
horizon,with
it
Suffice
to
one
will
wish
thepayoff
modified
to makecertainthat
saythatoftheelevencandidatesinterviewed,Keplereventually
decidedon thefifth.
It may
to stopin a finitetime.Forexample,Sakaguchi(1961)
WHO SOLVED THE SECRETARY PROBLEM?
be notedthat when n = 11, the function0n(r) of
valuewhenr = 5. Perhaps,
(2.1) takeson itsmaximum
ifKeplerhadbeenawareofthetheoryofthesecretary
a lotoftimeand
he couldhavesavedhimself
problem,
trouble.
oftheoretical
Ofcourse,in all practicalapplications
results,the assumptionsare neverexactlysatisfied,
and in the presentinstancethisis especiallytrueas
we can see fromKepler'sletter.For example,after
candidatenumber5 and beingstrongly
interviewing
attractedto her, Kepler listenedto the advice of
friendswho were concernedwith her lack of high
rank,wealth,parentageand dowry(she was an orphan),and whopersuadedhimto proposeto number
4. Thus, clearlyKeplerthoughthe could recallpast
applicantsin violationof assumption5 of Section2.
in the
Certainly,
Keplerwouldhave been interested
papersofYang (1974),Petruccelli(1981,1984),Rose
(1984),Ferensteinand Enns (1988) and others,who
allow backwardsolicitationwith a cost or with a
q is
q of beingaccepted.The probability
probability
not1 in Kepler'scase sincecandidatenumber4 turned
himdown.He had waitedtoo long.
That Keplerwenton afterhis failurewithnumber
in getting
the
4 showsthathe was notjust interested
best (assumption6 of Section 2). Perhaps he was
the expectedrankor some otherutility
minimizing
in whichcase thepapersofChow,Moriguti,
function,
Robbinsand Samuels(1964),Mucci(1973),Lorenzen
(1979), Frankand Samuels (1980), etc.,wouldhave
him.Since he had been marriedbefore,it
interested
is unrealisticto assumethathe knewnothingabout
women(assumption4 of Section2); he wouldhave
problemof Gilbertand
enjoyedthe full-information
Mosteller(1966) or Tamaki (1986). But it is also
unreasonableto assume that he knew everything
aboutwomen(whodoes?),so themodelswithpartial
information
and learningof Stewart(1978),Samuels
(1981), Campbelland Samuels (1981) or Campbell
(1982) are moreto thepoint.
all 11
On the otherhand,he actuallyinterviewed
candidatesand could have goneon. Perhapshe was
expectinga randomnumberof availablecandidates
(violatingcondition2 ofSection2), in whichcase he
wouldhave enjoyedreadingthe papers of Presman
and Sonin (1972), Gianini-Pettitt(1979), AbdelHamid,Batherand Trustrum(1982),and Brussand
Samuels(1987).Or perhapstherewas a costofobserand Govindarajulu
vationas in Bartoszynski
(1978),
Lorenzen (1981) or Samuels (1985) or a discount
factoras in Rasmussenand Pliska (1976). He would
in the randomarrivalmodels
be interested
certainly
ofSakaguchi(1976,1986),Cowanand Zabczyk(1978)
and Bruss (1987), in the game theoreticmodelsof
Presmanand Sonin(1975),Fushimi(1981) and Enns
and Ferenstein(1987), and in the multiplecriteria
285
ofStadje(1980),Gnedin(1983),Berezovformulation
and Gnedin(1986) or Samuelsand
skiy,Baryshnikov
Chotlos(1986). Possibly,Keplerwas concernedwith
the actual value of his brideand not just withher
amongtheothercandidates.Thiswouldmake
ranking
problemat all, but a stopping-rule
it not a secretary
problemmore like Cayley'sproblemor the search
problemdiscussedin theprevioussection.
It is clearthatmuchmoreresearchneedstobe done
to clarify
whichoftheseproblemsKeplerwas actually
one it was,therecan be no doubt
solving.Whichever
forhim.His newwife,
thattheoutcomewas favorable
whoseeducation,as he says in his letter,musttake
ranhis
borehimsevenchildren,
theplace ofa dowry,
and seemsto haveprovidedthe
householdefficiently,
necessarytranquilhomelifeforhis erraticgeniusto
flourish.
6. THE GAME OF GOOGOL
Let us returnto thequestion:Ofwhichofthemany
problemam I trying
different
versionsofthesecretary
we shouldtake as
to findthe solver?As historians,
theproblemas itfirstappeared
problem,
thesecretary
in print,in MartinGardner'sFebruary1960column
in ScientificAmerican,where it was called the game
ofgoogoland describedas follows.
Asksomeoneto takeas manyslipsofpaperas
he pleases, and on each slip writea different
positivenumber.The numbersmayrangefrom
small fractionsof 1 to a numberthe size of a
googol (1 followedby a hundred0's) or even
larger.These slipsareturnedfacedownand shuffledoverthe top of a table. One at a timeyou
turntheslipsfaceup. The aim is to stopturning
whenyoucometo the numberthatyouguessto
be the largestof the series.You cannotgo back
turnedslip.If youturn
and pickup a previously
overall slips,thenof courseyou mustpick the
last one turned.
The astutereadermaynoticethatthis is not the
simpleformof the secretaryproblemdescribedin
Section2. The actualvalues of the numbersare revealedto thedecisionmakerin violationofcondition
4. Also, there is this "someone"who chooses the
to makeyourproblemofselectnumbers,
presumably
as possible.The gameof
ing the largestas difficult
game.
googolis reallya two-person
This raisestwoquestions.First,can youguarantee
ofselectingthelargestnumberif
a higherprobability
youallowyourdecisionruleto dependon the actual
In otherwords,doesthestated
valuesofthenumbers?
solutiongivethe lowervalue ofthe game?Second,if
youaretoldhowthis"someone"is choosingthenumbersto place on the slips,can you now guaranteea
T. S. FERGUSON
286
higherprobabilityof selectingthe largestnumber?In
otherwords,does the value of the game exist,and can
we find optimal or c-optimal strategies for the sequence chooser? These questions were not addressed
in the solution presented in ScientificAmerican in
March 1960.
The statementof the problemas it appeared in The
AmericanMathematicalMonthlyin 1963 is much the
same, but somewhatmore nebulous since you are not
told where the numbers come from.Perhaps it is a
game against nature, so the second question above
does not arise. In any case, the solutionpresentedfor
the problemin 1964 contains the same oversight.
Those distinguishedstatisticians who worked on
the secretaryproblemin the 1960's were morecareful
in theirstatementsof the problemin specifyingwhat
informationcould be used in the decision rule, but
none of them attacked the above problem.Therefore,
to see who firstsolved the problem,we must proceed
into the 1970's and beyond.
Suppose you were this "someone" who mustchoose
the numbersto writeon the slips. How would you go
about choosingthe numbersto make it as difficultas
possible forme to obtain the largest? You could not
just choose the numbers 1, 2, *.., n, because then I
could wait until the number n appeared and thus
obtain the largest number with probability 1. You
could not just choose them iid fromsome fixeddistribution,because this would lead to the full-information
case solved by Gilbert and Mosteller (1966), who
showed that I can obtain the largest number with
probabilityat least 0.58016 ... , which is the limiting
value forlarge n. So, you mustchoose the numbersin
some dependentfashion.But you mightas well choose
themto be an exchangeableprocess since the numbers
are put in random order anywaybeforebeing shown
to me. Thus, you are drawnto the partial information
models of Stewart (1978), Petruccelli (1980) and
Samuels (1981).
7. PARTIALINFORMATION
MODELS
...
Let X1, *
Xn denote the values of the numbers
on the slips. These may be consideredas the parametersof our statisticalproblem.We want to finda prior
distributionforX1, ... , Xn, withrespectto whichthe
Bayes rule is the usual optimal rule based on the
relativeranks of the Xj.
In the paper of Stewart (1978), the Xj are chosen
iid froma uniformdistributionon the interval(a, f),
denoted by U(a, f), and (a, d) is chosen fromthe
three-parameterPareto distribution.Specifically,
(7.1)
(a, O) is Pa(k, loguo), and
X1,..,
Xn, given (ae, O3, are iid U(ae, f3,
wherethe three-parameterPareto distribution,Pa(k,
lo, uo) with k > 0 and 1l < uo, is the distributionwith
density
g(ae,: kgloguo)
(7.2)
= k(k + l)(uo
-
lo)"
I(<
(fa)k?+2
,u<
)
where I representsthe indicator function.This is a
conjugate familyof distributionsforthe uniformdistributions,and the posterior distributionof (a, )
givenX1, ..*, Xj is also Pareto,
(a , 3), given X1,
(7.3)
*..,
Xj,
is Pa(k + j, 1j, uj), where
= minJ1o,X1,...,Xj
and
Uj= maxfuo,X1,
*..,
Xjl.
Thus, one can interpretthe priorinformationas being
equivalentto a sample of size k froma uniformdistribution with a minimumof loand a maximumof uo.
Stewart obtained a rather strikingresult for this
priordistributionofthe Xj, as follows.First,thepayoff
is changed so that you win only if you stop on the
largestXj and ifthat Xj is greaterthan uo. Then, one
shows that
(7.4)
P (Xi
Un~I Xis*
Xi
k+ j +1
k + n + 1(24 =uj)
This implies that attentioncan be restrictedto rules
that depend only on the relative ranks of the observations including uo. In fact, the problem becomes
equivalent to the secretaryproblemof Section 2 with
n + k + 1 applicants, in which you start with k + 1
applicants already rejected,the largest having value
uO.In particular,the Bayes rule among rules that use
all the informationis the rule that rejects the first
r'- 1 applicants, and selects the next applicant who
is relativelybest (and better than uo), where r' =
max(1, r - k - 1) and r is the optimal value of r for
the secretaryproblem of Section 2 with n + k + 1
applicants. In addition,forall values of k, the probability of win under an optimal rule tends to l/e as
n -* oo!Since forsufficiently
large n, the maximumXj
will be greaterthan uo withprobabilityclose to 1, this
means that given any e > 0, there is an N and a
distributionof the form(7.1) such that for n > N if
(a, O) is chosen fromthis distributionand then the Xj
are chosen as froma uniformdistributionon (a, d),
the probabilityof win in the game of googol is less
than l/e + e.
Thus Stewart has solved googol asymptoticallyfor
n large. Unfortunately,forfixedfiniten, one cannot
find,forall e > 0, e-optimaldistributionsforchoosing
WHO SOLVED THE SECRETARY PROBLEM?
theXj amongthedistributions
he suggests.We must
lookfurther.
The paperofPetruccelli
(1980)considers
some otherdistributions
forthe Xj. If the Xj are
uniformon the interval(0 - 0.5, 0 + 0.5), thenthe
bestinvariantstoppingrulegivesa probability
ofwin
asymptoticto 0.43517... as n -* oo. If the distributions are normalwithmean g and variance1, the
situationis even worse (fromour point of view).
The best invariantrule gives a probabilityof win
asymptoticto the full-information
case, namely,
0.58016.--. Asymptotically,
you mightas well tell
youropponentwhatg youare using.
Finally,we come to the paper of Samuels (1981),
who extendsthe resultsof Stewart.Samuels shows
thatin the modelwherethe Xj are chosenfromthe
uniformdistribution
on (a, ,B),the usual rule (the
optimalrulebased on relativeranks)is minimaxfor
each n. Since the usual rule is an equalizerrule (it
givesthe same probability
of successno matterhow
theXj are chosen),we see thatit is minimaxagainst
alternatives
as well.In other
general,nonparametric
words,the usual solutionachievesthe lowervalue of
thegame.Thus,I believethatSteveSamuelsdeserves
halfof) the
creditforhavingsolved(themoredifficult
secretaryproblemas it appearedin The American
Mathematical
Monthly.
However,thisexhaustsmysearchthrough
the relevant literature.
What can be said about the game
of googol?I can finallygiveyou my answerto the
questionin the titleof this article.Who solvedthe
secretary
problem?Nobody.
Let me hastento apologizeforthisanticlimax,
and
to venturethe opinionthat the reason no one has
solvedthisproblemis thatpossiblyno one was interestedin googolas a game,or perhapsrealizedthere
was a problemyetto be solved.To remedythesituation,letus tryto findthesolutionnow.Withthehint
givenby the paperof Stewart,it turnsout notto be
hard.
I In fact,sinceweareconsidering
onlythebestchoice
we mayconsidera simplerclass ofdistribuproblem,
tionsthan that of Stewart-theone-parameter
uniformand the two-parameter
Pareto distributions.
Thus,we take
0 is Pa(a, 1), and
(8.1
(8.1) X1, *.., Xn, given0, are iid U(0, 0),
wherethe two-parameter
Pareto distribution,
Pa(a,
MO),is the distributionwith density
g(0I a,
mo)
=
aiMa/0a+lI(0
conjugatepriorforU(O,0),and containstheposterior
distributionof 0 givenX1, * **,Xj:
0, given X1, ***, Xj,
(8.3)
is Pa(a + j, mj),
where mj = max(mo,X1, *.., Xj).
Let us pretendthatwe areplayinga gameofgoogol,
I choosingtheXj and youchoosingthestoppingrule.
For a givene > 0, I will choosethe Xj accordingto
(8.1),and findan a (sufficiently
closeto zero)so that
you will win witha probabilityless than O5n+ E, where
O5nis the maximum probabilityyou can guarantee
usingstrategies
thatdependonlyontherelativeranks,
maxrqn(r).
On=
In fact,I willgiveyouan additionalslightadvantage
and stillkeepyourprobability
ofsuccessbelow'On +
e. I willsaythatyouwinifyoustopat thelargestXj,
or ifall the Xj are no greaterthan 1. This willallow
you to restrictattentionto stoppingrulesthat stop
onlyat a relatively
largestXj thatis greaterthan 1.
The probability
youwinis
P(win) = P(all Xj c 1) + P(win*),
(8.4)
wherewin*represents
theeventImaxX > 1 and you
chooseit}. The firsttermdoes not dependon your
and is easilycomputed:
strategy
00
P(all Xj < 1 1O)g(OI a, 1) do
P(all Xj < 1) =
00
(8.5)
=a
J
(1/0)n(1/o)a+1
dO
= a/(n + a).
8. A RESOLUTION
(8.2)
287
Yourproblemis to maximizethesecondtermof(8.4).
First,we findtheprobability,
conditional
at stagej,
thatmj> 1 is alreadyas largeas it willget.
P(Mi
=
Mn IXi,
=E(P(Mi
(8.6)
...
,Xi)
= Mn I a, Xi,
* * Xj)J X1, .. *
Xj)
(mi/M)n-ig(OI a + j, mj) dO
=
Jmj
=
(a + j)/(a + n)
independent of X1,
*..,
Xj. This is an analog of
Stewart'sresult(7.4). If youhave a newcandidateat
stagej, thatis ifXj = mji,it is optimalto selectit if
and onlyif
a +j
ae + n
> P(win*withbeststrategy
fromstagej + 1 on).
>
MO),
wherea > 0 and mo> 0. For simplicity,
we take
formsa
mO=1. This class of Pareto distributions
The rightside of this inequalityis a nonincreasing
ofj, sinceanystrategy
function
availableat stagej +
2 is also availableat stagej + 1. Sincetheleftsideof
T. S. FERGUSON
288
the inequalityis an increasingfunctionofj, an optimal
rule may be foundamong rules of the formfor some
r - 1: rejectthe first-r- 1 applicants and accept the
next applicant forwhichXj = mj, if any.
Using such a strategy,the probabilityof a win*may
be computedas
n
is best I select j).
P(win*) = E P(select j)P(j
j=r
The probabilitythatj is best givenyou select it is just
(8.6). The probabilitythat you select j can be found
using (8.6):
P(select j)
= P(mr1
= mjn1and mj > mjr1)
_a + j -1
= a + r-1{
a +j- i-a \
+j!
a + r-1
(a + j - 1)(a + j)
Combiningthese into (8.4) and letting0n(r,a) denote
the probabilityof a win,we find
On(r, a)
(a:
+
n) +(
1)jzr (a
+-J1)
Now, note that On(r,a) is continuousin a fora > 0,
and thatas a -O,
n(r, a) -X-n(r) forall r= 1, ** *,
0,
n, whereOn(r) is givenby (2.1). Hence, as a
maxr qn(r, a) -- maxr qn(r) =
.n-
an c-optimalmethodofchoosingthe Xj is
Therefore,
given by (8.1), wherea = a(n, c) is chosenso that
Imaxrqn(r, ao)
On
I < e.
This derivation may be considered an alternate
proofof the minimaxityresultof Samuels mentioned
in Section 7. It is interestingto note that this result
cannot be obtained using the distributionsof (7.1).
The optimal rule based on relative ranks is exactly
Stewart'srule with k = -1; but k must be positivefor
(7.1) to be a distribution;hence, we cannot approximate the case k = -1 withdistributions.(In addition,
the term correspondingto (8.5) does not go to zero.)
If it seems strange that the distributions(7.1) were
consideredbeforethe simplerdistributions(8.1), the
reason is that Stewart and Samuels treated in their
papers problems with more general payoffs,not just
the best choice problem,and needed a broader class
of distributions.Unfortunately,
(7.1) does not contain
(8.1). If there is a moral to this, maybe it is that the
simplercases should always be examined first.
NoteAddedin Proof
Steve Samuels has sent me a copy of the book, ProblemsofBest
Choice (in Russian) by B. A. Berezovskiyand A. V. Gnedin, 1984,
Akademia Nauk, USSR, Moscow. This book is devotedsolelyto the
secretaryproblemand its variations.It contains not only a review
of the field but also a careful exposition and new informationas
well. In theirdiscussionofthe partial informationmodel of Stewart,
theyuse the priordistribution(8.1) and derive (8.6). However,this
is onlyused to prove the asymptoticminimaxityresultof Stewart.
REFERENCES
ABDEL-HAMID, A. R., BATHER, J. A. and TRUSTRUM, G. B. (1982).
The secretaryproblemwithan unknownnumberof candidates.
J. Appl. Probab. 19 619-630.
BARTOSZYNSKI, R. and GOVINDARAJULU, Z. (1978). The secretary
problemwithinterviewcost. Sankhya Ser. B 40 11-28.
BAUMGARDT, CAROLA (1951). Johannes Kepler: Life and Letters.
Philosophical Library,Inc., New York.
BEREZOVSKIY, B. A., BARYSHNIKOV, Yu. M. and GNEDIN, A. V.
(1986). On a class of best choice problems. Inform.Sci. 39
111-127.
BISSENGER, B. H. and SIEGEL, C. (1963). Advanced Problem 5086.
Amer. Math. Monthly70 336. (Solution by A. J. Bosch. 71
(1964) 329-330.)
BRUSS, F. T. (1987). On an optimal selection problem of Cowan
and Zabczyk.J. AppL Probab. 24 918-928.
BRUSS, F. T. and SAMUELS,S. M. (1987). A unifiedapproach to a
class of optimal selection problemswith a random numberof
options.Ann. Probab. 15 824-830.
CAMPBELL, G. (1982). The maximum of a sequence with prior
information.Sequential Anal. 1 177-191.
CAMPBELL, G. and SAMUELS, S. (1981). Choosing the best of the
currentcrop.Adv. in Appl. Probab. 13 510-532.
CAYLEY, A. (1875). Mathematical questions with their solutions.
The Educational Times 23 18-19. See The CollectedMathematicalPapers ofArthurCayley10 587-588 (1896). Cambridge
Univ. Press, Cambridge.
CHOW, Y. S., MORIGUTI,S., ROBBINS, H. and SAMUELS,S. M.
(1964). Optimal selection based on relative rank (The "secretary"problem).Israel J. Math. 2 81-90.
CHOW,Y. S. and ROBBINS, H. (1961). A martingalesystemtheorem
and applications. Proc. Fourth BerkeleySymp. Math. Statist.
Probab. 1 93-104. Univ. CaliforniaPress.
CHOW,Y. S. and ROBBINS, H. (1963). On optimal stoppingrules.
Z. Wahrsch.verw.Gebiete2 33-49.
COWAN, R. and ZABCZYK, J. (1978). An optimal selection problem
associated with the Poisson process. TheoryProbab. Appl. 23
584-592.
DEGROOT, M. H. (1970). Optimal Statistical Decisions. McGrawHill, New York.
DERMAN, C. and SACKS, J. (1960). Replacement of periodically
inspectedequipment.Naval Res. Logist. Quart. 7 597-607.
DYNKIN, E. B. (1963). The optimum choice of the instant for
stoppinga Markov process. Soviet Math. Dokl. 4 627-629.
ENNS, E. G. and FERENSTEIN, E. Z. (1987). On a multipersontimesequential game withpriorities.Sequential Anal. 6 239-256.
FERENSTEIN, E. Z. and ENNS, E. G. (1988). Optimal sequential
selectionfroma knowndistributionwithholdingcosts.J.Amer.
Statist.Assoc. 83 382-386.
FRANK, A. Q. and SAMUELS,S. M. (1980). On an optimal stopping
problemof Gusein-Zade. StochasticProcess.Appl. 10 299-311.
FREEMAN, P. R. (1983). The secretaryproblemand its extensionsA review.Internat.Statist.Rev. 51 189-206.
FUSHIMI, M. (1981). The secretaryproblemin a competitivesituation. J. Oper. Res. Soc. Japan 24 350-359.
GARDNER, M. (1960). ScientificAmerican Feb., Mar. (See also
Martin Gardner's New Mathematical Diversions (1966) 35.
Simon and Schuster,New York.)
GIANINI-PErrIrr,J. (1979). Optimal selection based on relative
ranks with a random number of individuals. Adv. in Appl.
Probab. 11 720-736.
WHO SOLVED THE SECRETARY PROBLEM?
GILBERT, J. and MOSTELLER, F. (1966). Recognizingthe maximum
of a sequence. J. Amer. Statist.Assoc. 61 35-73.
GNEDIN, A. V. (1983). Effective stopping on a Pareto-optimal
option.Avtomat.i Telemekh.44 (3) 166-170. (In Russian).
GUTTMAN, I. (1960). On a problemof L. Moser. Canad. Math. Bull.
3 35-39.
KARLIN, S. (1962). Stochastic models and optimalpolicyforselling
an asset. In Studies in Applied Probabilityand Management
Science (K. J. Arrow,S. Karlin and H. Scarf, eds.) 148-158.
StanfordUniv. Press, Stanford,Calif.
KOESTLER, A. (1960). The Watershed.A Biographyof Johannes
Kepler. AnchorBooks, Garden City,NY.
LINDLEY, D. V. (1961). Dynamic programming
and decisiontheory.
Appl. Statist. 10 39-51.
LORENZEN, T. J. (1979). Generalizingthe secretaryproblem.Adv.
in Appl. Probab. 1 1 384-396.
LORENZEN, T. J. (1981). Optimal stoppingwithsamplingcost.Ann.
Probab. 9 167-172.
MACQUEEN, J. and MILLER, R. P., JR. (1960). Optimal persistence
policies. Oper. Res. 8 362-380.
MOSER, L. (1956). On a problem of Cayley. Scripta Math. 22
289-292.
MuccI, A. G. (1973). On a class of secretaryproblems.Ann. Probab.
1 417-427.
PETRUCCELLI, J. D. (1980). On a best choice problemwith partial
information.
Ann. Statist. 8 1171-1174.
PETRUCCELLI, J. D. (1981). Best-choice problemsinvolvinguncertainty of selection and recall of observation.J. Appl. Probab.
18 415-425.
PETRUCCELLI, J. D. (1984). Best-choice problems involvingrecall
withuncertaintyof selectionwhen the numberof observations
is random.Adv. in Appl. Probab. 16 111-130.
PRESMAN, E. L. and SONIN, I. M. (1972). The best-choiceproblem
for a random number of objects. Theory Probab. Appl. 17
657-668.
289
PRESMAN, E. L. and SONIN, I. M. (1975). Equilibriumpoints in a
game relatedto the best choice problem.TheoryProbab. Appl.
20 770-781.
RASMUSSEN, W. T. and PLISKA, S. R. (1976). Choosing the maximum froma sequlencewith a discount function.Appl. Math.
Optim.2 279-289.
ROSE, J. (1984). Optimal sequential selection based on relative
ranks with renewablecall options.J. Amer. Statist.Assoc. 79
430-435.
SAKAGUCHI, M. (1961). Dynamic programmingof some sequential
samplingdesign.J. Math. Anal. Appl. 2 446-466.
SAKAGUCHI, M. (1976). Optimal stoppingproblems for randomly
arrivingoffers.Math. Japon. 21 201-217.
SAKAGUCHI, M. (1986). Best choice problemsforrandomlyarriving
offersduringa randomlifetime.Math. Japon. 31 107-117.
SAMUELS,S. M. (1981). Minimax stoppingrules when the underlying distributionis uniform.J. Amer. Statist. Assoc. 76
188-197.
SAMUELS,S. M. (1985). A best-choiceproblem with linear travel
cost. J. Amer. Statist.Assoc. 80 461-464.
SAMUELS, S. M. and CHOTLOS, B. (1986). A multiple criteria
optimal selection problem. In Adaptive StatisticalProcedures
and Related Topics (J. Van Ryzin,ed.) 62-78. IMS, Hayward,
Calif.
STADJE, W. (1980). Efficientstoppingofa randomseriesofpartially
orderedpoints. MultipleCriteriaDecision Making. Theoryand
ApplicationLecture Notes in Econ. Math. Syst. 177 430-447.
Springer,New York.
STEWART, T. J. (1978). Optimal selection froma randomsequence
with learningof the underlyingdistribution.J. Amer. Statist.
Assoc. 73 775-780.
TAMAKI, M. (1986). A full-information
best-choiceproblem with
finitememory.J. Appl. Probab. 23 718-735.
YANG,M. C. K. (1974). Recognizingthe maximumof a sequence
withbackwardsolicitation.J. Appl. Probab. 1 1 504-512.
Comment:WhoWillSolve the Secretary
Problem?
StephenM. Samuels
JustlikeJohannesKepler,whothrewa newcurve
at the solar system,Tom Fergusonhas givena differentslant to the SecretaryProblem.To its many
beginbysaying"all thatwe
practitioners
whoritually
can observeare the relativeranks,"Ferguson(citing
ineffect,
responds"let'snottake
historical
precedent),
thatassumptionforgranted."The heartofhispaper,
as I see it,is the followingFergusonSecretaryProblem:
Stephen M. Samuels is Professor of Statistics and
Mathematics at Purdue University.His mailing address is: Departmentof Statistics,Mathematical Sciences Building, Purdue University,West Lafayette,
Indiana 47907.
Given n, eitherfindan exchangeable
sequenceof
continuous random variables, X1, X2, *---, Xn, for
which,amongall stoppingrules,r, basedon theX's,
sup P{XT=
max(Xl,
X2,
XXn),
is achievedbya rulebased onlyon the relativeranks
oftheX's-or provethatno suchsequenceexists.
Fergusonhas come withinepsilonof solvingthis
problem.He has exhibitedexchangeablesequences,
foreach n and e > 0, such thatthe best rulebased
onlyon relativerankshas successprobability
within
e ofthesupremum.
But he has leftopenthequestion
ofwhether
thissupremum
can actuallybe attained.
For n = 2, the answeris easy; thereis no such
which
sequence.The following
elementary
argument,