Research - David M. Pennock
Transcription
Research - David M. Pennock
Research Prediction Market Science & Technology at Yahoo! David M. Pennock Mike Dooley, Tej Kasturi, Bernard Mangold, Havi Hoffman Yiling Chen, Chao-Hsien Chu, Sandip Debnath, Rael Dornfest, Joan Fiegenbaum, Gary Flake, Lance Fortnow, Brian Galebach, Lee Giles, Joe Kilian, Steve Lawrence, Tracy Mullen, Rahul Sami, Emile Servan-Schreiber, Michael Wellman, Justin Wolfers Research Prediction markets • Futures market designed to elicit a forecast about some future event • Leverages “wisdom of crowds”, extracting and combining information from distributed sources • Have been used successfully to • Predict election outcomes [IEM, 1988-] • Predict corporate metrics (sales, product release times, …) [HP, 2001-] [MSFT, Eli Lily, Intel, Siemens] [GOOG, 2005-] • Predict movie box office returns [HSX], news [NewsFutures], scientific conjectures [FX], sporting events, judicial nominations, economic numbers, … [inTrade], real estate [HedgeStreet], many others Research Prediction markets research @ Y! 2002-2005 • Computational aspects & mechanism design n 4 • n events, 2n combinations, 22 poss. bets! • Algorithms & computational complexity • Leverage independence (“compact markets”) • “Betting boolean-style”: Generic bidding language 3 • New exchange mechanism: dynamic pari-mutuel market; Cross btw stock market and horse race betting; Ideal for huge numbers of futures and low liquidity common in derivatives trading and gambling 2 • Empirical analyses of real-$/play-$ markets; Does money matter? • Academic: 6 pubs; 4 patents; 2 workshops 1 • Practical: Search futures & Tech Buzz Game Research Search Futures & Tech Buzz Game Research Search futures • • • • Search data: What people worldwide are thinking about today Search futures: What people will be thinking about tomorrow Billions of advertising/business dollars ride on answers Research questions • Is search buzz predictable? If so, what factors promote accuracy? (trading mechanism, currency [real/fantasy], subsidies, dividend rates, noise bots, #traders, competence of traders, ... ) • What types of search concepts? What types of trends? (sudden, cyclical, growth, burst) • How to handle huge numbers of (combinatorial/conditional) markets and low liquidity • Which traders do well? (buzz traders, buy/hold traders, day traders, noise traders, affinity traders, cheaters) • Can machine learning post-processing boost accuracy? • How to combat search spam, manipulation • How to hedge marketing/business risks with real-$ search/PPC futures Research http://buzz.research.yahoo.com • • • Yahoo!,O’Reilly launched Buzz Game 3/05 @ETech Research testbed for investigating search futures Buy “stock” in hundreds of technologies • Earn dividends based on actual search “buzz” • • API interface Exchange mechanism is new Yahoo!R invention Cross btw stock market and horse race betting Research Exchange Interface Dynamic Parimutuel “Market Maker” Research Technology forecasts • iPod phone • What’s next? Google Calendar? price search buzz 8/28: buzz gamers begin bidding up iPod phone 8/29: Apple invites press to “secret” unveiling 9/7: Apple announces Rokr 9/8-9/18: searches for iPod phone soar; early buyers profit • Another Apple unveiling 10/12; iPod Video? 9am 10/5 Research Forecast accuracy Early lessons learned • Average forecast error across 352 stocks • Market closing deadline focuses traders • Dividend levels matter • Intelligent strategies work forecast error rapidly declines as traders zero in on correct predictions end of phase 1 contest period • Randomized bots lost money to real traders • Contest winner followed optimal buzz trading strategy (prices ∝ √buzz); Went from 4th to 1st place in final days • Forecast error does decrease over time Research Prediction markets research @ Y! 2002-2005 • Computational aspects & mechanism design n 4 • n events, 2n combinations, 22 poss. bets! • Algorithms & computational complexity • Leverage independence (“compact markets”) • “Betting boolean-style”: Generic bidding language 3 • New exchange mechanism: dynamic pari-mutuel market; Cross btw stock market and horse race betting; Ideal for huge numbers of futures and low liquidity common in derivatives trading and gambling 2 • Empirical analyses of real-$/play-$ markets; Does money matter? • Academic: 6 pubs; 4 patents; 2 workshops 1 • Practical: Search futures & Tech Buzz Game Research Does Money Matter? Research Real markets vs. market games HSX average log score arbitrage closure IEM Research Real markets vs. market games HSX FX, F1P6 probabilistic forecasts actual forecast source F1P6 linear scoring F1P6 F1-style scoring betting odds F1P6 flat scoring F1P6 winner scoring 100 expected value forecasts 50 20 10 5 489 movies 2 1 1 2 5 10 20 50 100 estimate avg log score -1.84 -1.82 -1.86 -2.03 -2.32 Research Does money matter? Play vs real, head to head Experiment • 2003 NFL Season • ProbabilitySports.com Online football forecasting competition • Contestants assess probabilities for each game • Quadratic scoring rule • ~2,000 “experts”, plus: • NewsFutures (play $) • Tradesports (real $) • Results: • Play money and real money performed similarly • 6th and 8th respectively • Markets beat most of the ~2,000 contestants • Average of experts came 39th (caveat) Used “last trade” prices Electronic Markets, Emile ServanSchreiber, Justin Wolfers, David Pennock and Brian Galebach TradeSports: Correlation=0.96 NewsFutures: Correlation=0.94 90 Prices: TradeSports and NewsFutures 100 TradeSports Prices 100 80 70 60 Fitted Value: Linear regression 45 degree line 75 50 50 40 25 30 20 10 0 0 0 10 20 30 40 50 60 70 Trading Price Prior to Game 80 90 0 100 20 40 60 NewsFutures Prices n=416 over 208 NFL games. Correlation between TradeSports and NewsFutures prices = 0.97 Data are grouped so that prices are rounded to the nearest ten percentage points; n=416 teams in 208 games Prediction Performance of Markets Relative to Individual Experts Rank Observed Frequency of Victory Research Prediction Accuracy Market Forecast Winning Probability and Actual Winning Probability 0 20 40 60 80 100 120 140 160 180 200 220 240 260 280 300 NewsFutures Tradesports 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Week into the NFL season 80 100 Research Does money matter? Play vs real, head to head ProbabilityFootball Avg TradeSports (real-money) NewsFutures (play-money) Difference TS - NF 0.443 0.439 0.436 0.003 (0.012) (0.011) (0.012) (0.016) 0.476 0.468 0.467 0.001 (0.025) (0.023) (0.024) (0.033) Average Quadratic Score 9.323 12.410 12.427 -0.017 = 100 - 400*( lose_price2 ) (4.75) (4.37) (4.57) (6.32) Average Logarithmic Score -0.649 -0.631 -0.631 0.000 = Log(win_price) (0.027) (0.024) (0.025) (0.035) Mean Absolute Error = lose_price [lower is better] Root Mean Squared Error = ?Average( lose_price2 ) [lower is better] [higher is better] [higher (less negative) is better] Statistically: TS ~ NF NF >> Avg TS > Avg Research Prediction markets research @ Y! 2002-2005 • Computational aspects & mechanism design n 4 • n events, 2n combinations, 22 poss. bets! • Algorithms & computational complexity • Leverage independence (“compact markets”) • “Betting boolean-style”: Generic bidding language 3 • New exchange mechanism: dynamic pari-mutuel market; Cross btw stock market and horse race betting; Ideal for huge numbers of futures and low liquidity common in derivatives trading and gambling 2 • Empirical analyses of real-$/play-$ markets; Does money matter? • Academic: 6 pubs; 4 patents; 2 workshops 1 • Practical: Search futures & Tech Buzz Game Research Dynamic Parimutuel Market Research What is a pari-mutuel market? A B • E.g. horse racetrack style wagering • Two outcomes: A B • Wagers: Research What is a pari-mutuel market? A B • E.g. horse racetrack style wagering √A • Two outcomes: B • Wagers: Research What is a pari-mutuel market? A B • E.g. horse racetrack style wagering √A • Two outcomes: B • Wagers: Research What is a pari-mutuel market? • Before outcome is revealed, “odds” are reported, or the amount you would win per dollar if the betting ended now • Horse A: $1.2 for $1; Horse B: $25 for $1; … etc. • Strong incentive to wait • • • • payoff determined by final odds; every $ is same Should wait for best info on outcome, odds ⇒ No continuous information aggregation ⇒ No notion of “buy low, sell high” ; no cash-out Dynamic pari-mutuel market Standard PM: Every $1 bet is the same DPM: Value of each $1 bet varies depending on the status of wagering at the time of the bet Encode dynamic value with a price – price is $ to buy 1 share of payoff – price of A is lower when less is bet on A – as shares are bought, price rises; price is for an infinitesimal share; cost is integral Research Pari-mutuel market Basic idea 1 1 1 1 1 1 1 1 1 1 1 1 Research Dynamic pari-mutuel market Basic idea 1 1 0.2 0.4 1.1 1.3 1.6 0.9 2 5 2. 3 Research How are prices set? • A price function pi(n) gives the instantaneous price of an infinitesimal additional share beyond the nth n • Cost of buying n shares: ∫0 pi(n) dn • Different reasonable assumptions lead to different price functions Research Price functions Share type Constraint/ Assumption Result Losing money p1 = P 2 p2 = P 1 Closed form cost() & shares() Losing money pi/pj = Mi/Mj Closed form cost() & shares() All money pi/pj = Mi/Mj Closed form shares() ; Numeric cost() All money pi/pj = Si/Sj Closed form cost() & shares() Research Prediction markets research @ Y! 2002-2005 • Computational aspects & mechanism design n 4 • n events, 2n combinations, 22 poss. bets! • Algorithms & computational complexity • Leverage independence (“compact markets”) • “Betting boolean-style”: Generic bidding language 3 • New exchange mechanism: dynamic pari-mutuel market; Cross btw stock market and horse race betting; Ideal for huge numbers of futures and low liquidity common in derivatives trading and gambling 2 • Empirical analyses of real-$/play-$ markets; Does money matter? • Academic: 6 pubs; 4 patents; 2 workshops 1 • Practical: Search futures & Tech Buzz Game Research Computational Aspects: Complex Betting [Thanks: Wolfers, Fortnow] Market combinatorics Probability Bush wins each state 90 to 80 to 70 to 60 to 50 to 40 to 30 to 20 to 10 to 0 to 100 (14) 90 (6) 80 (3) 70 (6) 60 (2) 50 (5) 40 (2) 30 (2) 20 (7) 10 (4) Source: www.Tradesports.com; 3/26/2004. [Thanks: Wolfers, Fortnow] Market combinatorics Combinatorial markets Singly exponential Buy/sell multiple securities simultaneously Buy 2 TX & sell 4 CT, pay $10 Exactly analogous to combinatorial auctions 250 possible “bundles” of securities Compound markets Doubly exponential Pr(CA ^ AZ) ? Pr(Elec | FL) ? Pr((IL^NJ)∨(¬IL^¬NJ)) ? Not derivable as a Probability linear combinations of base securities 50 2 2 possible functions “Only” 250 securities neededSource: to www.Tradesports.com; span space 3/26/2004. Bush wins each state 90 to 80 to 70 to 60 to 50 to 40 to 30 to 20 to 10 to 0 to 100 (14) 90 (6) 80 (3) 70 (6) 60 (2) 50 (5) 40 (2) 30 (2) 20 (7) 10 (4) Compound markets I: Brute force I am entitled to: $1 if A1&A2&…&An I am entitled to: $1 if A1&A2&…&An I am entitled to: $1 if A1&A2&…&An I am entitled to: $1 if A1&A2&…&An I am entitled to: $1 if A1&A2&…&An I am entitled to: $1 if A1&A2&…&An I am entitled to: $1 if A1&A2&…&An I am entitled to: $1 if A1&A2&…&An In principle, markets in all possible combinations will get you everything you want In practice, this is infeasible It’s also unnatural Compound markets II: Leverage independence Structure market according to unanimously agreed-upon independencies E4 $1 if E6|E3E5 E5 E1 E6 E2 E3 $1 if E6|E3Ê5 $1 if E6|Ê3E5 $1 if E6|Ê3Ê5 [Pennock & Wellman 2000] Compound markets II: Leverage independence CARA & Markov indep ⇒ risk-neutral indep If all agents have CARA, then market structured as n TRIANGULATE[∪ i=1 MORALIZE(Di)] is op complete E4 E5 E1 E6 E2 E3 $1 if E6|E3E5 $1 if E6|Ê3E5 $1 if E6|E3Ê5 $1 if E6|Ê3Ê5 Can still yield exponential savings (“compact sec. markets”) This example: 19 vs. 63 [Pennock & Wellman 2000] Compound markets III: High-level bidding language A bidding language: write your own security I am entitled to: $1 if Boolean_fn | Boolean_fn For example I am entitled to: $1 if A1 | A2 I am entitled to: $1 if (A1&A7)||A13 | (A2||A5)&A9 I am entitled to: $1 if A1&A7 Offer to buy/sell q units of it at price p Let everyone else do the same Auctioneer must decide who trades with whom at what price… How? (next) More concise/expressive; more natural The matching problem There are many possible matching rules for the auctioneer A natural one: maximize trade subject to no-risk constraint trader gets $$ in state: A1A2 A1A2 A1A2 A1A2 Example: – buy 1 of $1 if A1 for $0.40 – sell 1 of $1 if A1&A2 for $0.10 – sell 1 of $1 if A1&A2 for $0.20 0.60 0.60 -0.40 -0.40 -0.90 0.10 0.10 0.10 0.20 -0.80 0.20 0.20 No matter what happens, auctioneer cannot lose money -0.10 -0.10 -0.10 -0.10 The matching problem | Another way to look at it: Logical reduction | Example: – buy 1 of $1 if A1 for $0.40 – sell 1 of $1 if A1&A2 for $0.10 – sell 1 of $1 if A1&A2 for $0.20 = sell $1 if A1 || Clear match btw buy and sell| for $0.3 The matching problem Divisible orders: will accept any q* ≤ q Indivisible: will accept all or nothing Let Ω=all possible combinations; |Ω|=2n Let αi be fraction of order i filled Let Υiω be payoff for order i in state ω Div. MP: Does ∃αi∈[0,1], ∀ω∈Ω, -∑αiΥiω≥0 Indiv. MP: Does ∃αi∈{0,1}, ∀ω∈Ω, -∑αiΥiω≥0 Optimizations – max trade; max percent orders filled (at least 1 αi > 0) – max auctioneer utility subject to no-risk – max auctioneer utility -- with risk (“market maker”) Divisible vs. indivisible Sell 1 of A1 at $0.50 Buy 1 of (A1&A2) | (A1 || A2) at $0.50 Buy 1 of A1|A2 at $0.40 trader gets $$ in state: A1A2 A1A2 A1A2 A1A2 -0.50 -0.50 0.50 0.50 0.50 -0.50 -0.50 0 0 0.60 0 -0.40 0 -0.40 0 0.10 Divisible vs. indivisible Sell 1 of A1 at $0.50 Buy 1 of (A1&A2) | (A1 || A2) at $0.50 Buy 1 of A1|A2 at $0.40 trader gets $$ in state: A1A2 A1A2 A1A2 A1A2 -0.50 -0.50 0.50 0.50 0.50 -0.50 -0.50 0 0 0.60 0 -0.40 0 -1 0 0.50 Divisible vs. indivisible Sell 1 of A1 at $0.50 Buy 1 of (A1&A2) | (A1 || A2) at $0.50 Buy 1 of A1|A2 at $0.40 trader gets $$ in state: A1A2 A1A2 A1A2 A1A2 -0.50 -0.50 0.50 0.50 0.50 -0.50 -0.50 0 0 0.60 0 -0.40 -0.50 0.10 0.50 0.10 Divisible vs. indivisible Sell 1 of A1 at $0.50 Buy 1 of (A1&A2) | (A1 || A2) at $0.50 Buy 1 of A1|A2 at $0.40 trader gets $$ in state: A1A2 A1A2 A1A2 A1A2 -0.50 -0.50 0.50 0.50 0.50 -0.50 -0.50 0 0 0.60 0 -0.40 0.50 0.10 -0.50 -0.40 Divisible vs. indivisible Sell 1 of A1 at $0.50 Buy 1 of (A1&A2) | (A1 || A2) at $0.50 Buy 1 of A1|A2 at $0.40 trader gets $$ in state: A1A2 A1A2 A1A2 A1A2 3/5 x -0.50 -0.50 0.50 0.50 3/5 x 0.50 -0.50 -0.50 0 0.60 0 -0.40 1x 0 divisible 0 0 0 -0.10 match! Complexity results Divisible orders: will accept any q* ≤ q Indivisible: will accept all or nothing LP # events O(log n) O(n) divisible polynomial co-NP-complete reduction from SAT Natural algorithms reduction from X3C indivisible NP-complete Σ2p complete reduction from T∃∀BF – divisible: linear programming – indivisible: integer programming; logical reduction? Fortnow; Kilian; Sami Open questions Other matching rules – maximize utility subject to no-risk – maximize utility (market maker) What to do with the surplus – can be in cash and “leftover” securities – auctioneer keeps surplus – surplus is shared back among traders, auctioneer; how? Trader optimization problem – how to choose securities, p’s, q’s, subject to limits/penalties for number, complexity of bids – ultimately a game-theoretic question Approximate algorithms, heuristics Incentive properties Research Prediction markets research @ Y! 2002-2005 • Computational aspects & mechanism design n 4 • n events, 2n combinations, 22 poss. bets! • Algorithms & computational complexity • Leverage independence (“compact markets”) • “Betting boolean-style”: Generic bidding language 3 • New exchange mechanism: dynamic pari-mutuel market; Cross btw stock market and horse race betting; Ideal for huge numbers of futures and low liquidity common in derivatives trading and gambling 2 • Empirical analyses of real-$/play-$ markets; Does money matter? • Academic: 6 pubs; 4 patents; 2 workshops 1 • Practical: Search futures & Tech Buzz Game