here - MIT Sloan Sports Analytics Conference

HP Vertica at MIT Sloan Sports Analytics Conference
March 1, 2013
Will Cairns, Senior Data Scientist, HP Vertica
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
So… What’s the market’s definition of Big Data?
Datasets whose volume, velocity, variety and complexity are beyond the ability of
commonly used tools to capture, process, store, manage and analyze them.
Big Data: Volume, Velocity, Variety, Complexity
Information Sources: CRM, SCM, ERP; Transactional Data; IT Ops; Email; Texts; Social Media; Mobile; Video; Audio; Images; Search
Big Data is no longer just a buzzword…
*Gartner, Inc., “‘Big Data’ Is Only the Beginning of Extreme Information Management”, Mark A. Beyer, Anne Lapkin, Nicholas Gall, Donald Feinberg, Valentin T. Sribar, published 7 April 2011
Information from the Internet of Things: We have gone beyond the decimal system
Brontobyte (10^27): This will be our digital universe tomorrow… In the near future, the brontobyte will be the measurement used to describe the sensor data generated by the IoT (Internet of Things).
Yottabyte (10^24): This is our digital universe today = 250 trillion DVDs. Today data scientists use yottabytes to describe how much government data the NSA or FBI have on people altogether.
Zettabyte (10^21): 1.3 ZB of network traffic by 2016.
Exabyte (10^18): 1 EB of data is created on the internet each day = 250 million DVDs' worth of information. The proposed Square Kilometer Array telescope will generate an EB of data per day.
Petabyte (10^15): The CERN Large Hadron Collider generates 1 PB per second.
Terabyte (10^12): 500 TB of new data per day are ingested into Facebook databases.
Gigabyte (10^9)
Megabyte (10^6)
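The DVD comparisons on this slide can be sanity-checked with a little arithmetic. The sketch below assumes decimal units (powers of ten, as on the slide) and treats "brontobyte" as the informal name the slide uses; the function name is illustrative only. It computes the DVD capacity implied by each comparison so the two can be checked against each other.

```python
# Decimal byte units as listed on the slide. "brontobyte" is an informal
# name, not an official SI prefix.
UNITS = {
    "megabyte": 10**6,
    "gigabyte": 10**9,
    "terabyte": 10**12,
    "petabyte": 10**15,
    "exabyte": 10**18,
    "zettabyte": 10**21,
    "yottabyte": 10**24,
    "brontobyte": 10**27,
}


def implied_bytes_per_dvd(unit: str, dvd_count: int) -> int:
    """Bytes per DVD implied by equating one `unit` with `dvd_count` DVDs."""
    return UNITS[unit] // dvd_count


# "1 EB = 250 million DVDs" and "1 YB = 250 trillion DVDs"
eb_dvd = implied_bytes_per_dvd("exabyte", 250 * 10**6)
yb_dvd = implied_bytes_per_dvd("yottabyte", 250 * 10**12)

print(eb_dvd // UNITS["gigabyte"], "GB per DVD (from the exabyte claim)")
print(yb_dvd // UNITS["gigabyte"], "GB per DVD (from the yottabyte claim)")
```

Both comparisons imply the same capacity of about 4 GB per disc, so the two DVD figures are at least consistent with each other.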
The Challenges of Big Data
Variety, Velocity, Volume, Time to Value
• Variety: 90% of digital content created by 2015 will be mixed data types¹
• Velocity: 75% of currently deployed data warehouses will not scale sufficiently to meet new information velocity and complexity of demands by 2016²
• Volume: 48% worldwide information volume growth of digital content¹
• Time to Value: 86% of corporations cannot deliver the right information at the right time to support enterprise outcomes all of the time³
¹Source: IDC Predictions 2012: Competing for 2020
²Source: Gartner - The State of Data Warehousing in 2012
³Source: Coleman Parkes Survey Nov 2012
Today, Data Analysis is Slow, Painful, and Costly
Legacy Architectures Were Built for a Different World
Yesterday’s data warehouse and analytic infrastructure
• Proprietary
• Expensive
• Centralized, monolithic
• Process laden
• Batch
• Summary
• Slow
Imagine a world where a conversation with your information drives business decisions
Information sources: Social Media, Video, Audio, Images, Texts, Messages, Email, Transactional Data, Word, Excel, Logs, Clickstream Data
“Do 40% more of this…”
Intelligent Information
• Insight across structured, semistructured & unstructured data
• Right time, intelligent decision making
• Positive ROI
Vertica, Purpose Built for Answers in Real Time
50x-1000x faster performance at 30% of the cost, proven by hundreds of customers
Volume, Variety, Velocity, Value
Designed for Answers from the Very First Line of Code, Vertica Technology Makes the Difference
• Columnar Storage and Execution: Achieve best data query performance with unique Vertica column store
• Clustering: Add resources on the fly with linear scaling on the grid, commodity hardware
• Compression: Store more data, provide more views, 90% less storage required
• Continuous Performance: Query and load 24x7 with zero administration
• Database Design: Automated performance tuning
• Advanced Analytics: Time-series, geospatial, click-stream and SDK for more
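To make the columnar storage and compression points concrete, here is a minimal sketch of the general idea, not Vertica's actual implementation. It pivots row-oriented records into columns and applies run-length encoding, one common column-store compression technique, to a sorted column. The sample records and all function names are hypothetical.

```python
from itertools import groupby

# Hypothetical row-oriented records: (team, quarter, points)
rows = [
    ("BOS", 1, 28),
    ("BOS", 2, 31),
    ("BOS", 3, 22),
    ("NYK", 1, 25),
    ("NYK", 2, 25),
]


def to_columns(records):
    """Pivot a list of row tuples into one list per column."""
    return [list(col) for col in zip(*records)]


def rle_encode(values):
    """Run-length encode a column as (value, run_length) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


def rle_decode(runs):
    """Expand (value, run_length) pairs back into the original column."""
    return [value for value, count in runs for _ in range(count)]


team_col, quarter_col, points_col = to_columns(rows)
encoded_team = rle_encode(team_col)

print(encoded_team)
print(rle_decode(encoded_team) == team_col)
```

Because values of the same column sit next to each other, a sorted, low-cardinality column such as the team collapses into a few runs. A query that touches only one column also avoids reading the others.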
Hundreds of Customers Across Industries Finding Answers
► Promotional Testing
► Behavior Analytics
► Claims Analyses
► Click Stream Analyses
► Patient Records Analyses
► Network Analyses
► Customer Analytics
► Compliance Testing
► Loyalty Analysis
► Campaign Management
► Clinical Data Analyses
► Fraud Monitoring
► Financial Tracking
► Tick Data Back-testing
Learn More about HP Vertica
Enterprise Edition
• Free 30-day evaluation
Community Edition
• Free Download 1TB, 3 nodes
my.vertica.com\evaluate
White paper: The Disruptive Power of Big Data
• Now available on our Web site
HP Vertica in Action
Separating the Signal from Noise
2012 Honda Classic
Force directed network