ParStream - NIK Nürnberg
Transcription
ParStream - NIK Nürnberg
. PARSTREAM OVERVIEW CHRISTIAN WERLING DIRECTOR SALES D/A/CH [email protected] Confidential Big Data is a Hype! Confidential 2 Confidential 3 „Die Erfindung hat so viele Mängel, dass es nicht ernsthaft als Kommunikationsmittel taugt. Das Ding hat für uns an sich keinen Wert.“ Memo der Western Union Financial Services zur Erfindung des Telefons, 1876 „Die weltweite Nachfrage nach Kraftfahrzeugen wird eine Million nicht überschreiten - allein schon aus Mangel an verfügbaren Chauffeuren.“ Gottlieb Daimler, Erfinder, 1901 „Das Pferd wird es immer geben, Automobile hingegen sind lediglich eine vorübergehende Modeerscheinung.“ Der Präsident der Michigan Savings Bank, 1903 Confidential 4 „Ich denke, dass es einen Weltmarkt für vielleicht fünf Computer gibt.“ Thomas Watson, CEO von IBM, 1943 „Der Fernseher wird sich auf dem Markt nicht durchsetzen. Die Menschen werden sehr bald müde sein, jeden Abend auf eine Sperrholzkiste zu starren.“ Darryl F. Zanuck, Chef der Filmgesellschaft 20th CenturyFox, 1946 „BIG DATA ist ein reiner Hype. Niemand braucht das.“ Max Mustermann, „Daten- Experte" Nürnberg, 27. Februar 2013 Confidential 5 What BigData NOT is! Confidential 6 Big Data is NOT Storage of large datasets! 7 Big Data is NOT BI + 20% ! Confidential 8 What is ParStream? Confidential 9 ParStream is THE Big Data Analytics Platform ParStream enables Enterprises to exploit Big Data opportunities and beat the competition by speed of implementation and operation REAL-TIME – LOW-LATENCY – HIGH THROUGHPUT Confidential 10 PARSTREAM BIG DATA ANALYTICS PLATFORM 2008 13.000.000.000 * x100 * 500 = Challenge Confidential 11 PARSTREAM BIG DATA ANALYTICS PLATFORM ParStream Empowers People in All Industries to Capture New Business Opportunities Evolving with Big Data • Analyze and Filter Billions of Records • Query Data Structures with 1000’s of columns • Get Answers in Milliseconds without Cubes • Continuous Import Data with Low Latency • Execute 1000’s of Concurrent Queries Confidential 12 ROADBLOCKS Established Databases Vendors can’t Deliver Technical Solutions MapReduce Can’t Deliver Results in Real-Time • Established Database Architectures were not Designed for Big Data • NoSQL approaches cannot deliver in real-time Real-Time Complex Event Processing In-Memory DB • • Extreme Performance can only be Achieved through Parallelization Supporting both Volume and Speed has been Unachievable Operational Data Volumes < 1..10 milli sec 10..100 milli Interactive sec Analytics 1 sec Gigabyte Terabyte OLTP Reporting Petabyte Big Data Volume 1..10 sec Batch Analytics 1..10 min (MapReduce) >10 min Lag Time Confidential 13 HUGE MARKET OPPORTUNITY Big Data Analytics is a game changer in every industry and is a huge market opportunity Many Applications All Industries eCommerce Services Social Networks Telco Facetted Search Web analytics SEOanalytics OnlineAdvertising Ad serving Profiling Targeting Customer attrition prevention Network monitoring Targeting Prepaid account mgmt Finance Trend analysis Fraud detection Automatic trading Risk analysis Energy Oil and Gas Smart metering Smart grids Wind parks Mining Solar Panels Many More Production Mining M2M Sensors Genetics Intelligence Weather Confidential 14 ENABLING KEY BUSINESS SCENARIOS Initial Focus on Scenarios Requiring a Unique Combination of REAL-TIME • LOW LATENCY • HIGH THROUGHPUT Search and Selection Real-Time Analytics OnlineProcessing ParStream enables new levels of search and online shopping satisfaction ParStream drives interactive analytics processes to gain insights faster ParStream responds automatically to large data streams Confidential 15 KEY SCENARIO: SEARCH AND SELECTION ParStream Enables New Levels of Search & Online Shopping Satisfaction Coface Services Changed from Oracle to ParStream Customer Success Story • • • Travel Search Platform Information marketplace Coface Services stopped development with Oracle after 6 years with partial solution ParStream built the intended solution within 4 month running on a single small server Coface Services: “very impressive results, we did not believe that ParStream will be able to deliver such a great solution” Value Proposition • • Build great product search sites with stickiness and better conversion Fast ROI through increased sales revenue Target Market • • • Large Online Shops Information marketplaces Social communities Confidential 16 CUSTOMER SUCCESS STORY: ETRACKER ParStream provides real-time campaign control and web analytics 1,000 to 12,000 times faster than MySQL-cluster Excellent Customers success stories Campaign Control & Web Analytics Continuous data import of new web- clicks every few seconds 10 billion web-clicks of 100 days Continuous data import with maximum latency of 30 seconds Complex analytics for lifesegmentation of customer groups < 2 sec query response time for > 100 concurrent interactive user 20 server cluster, shared nothing Website clicks 50,000 domains <2 sec response time Application Server 100 million rows continuous import per day ParStream Large aggregation multi-stage SQL-queries of many concurrent user 100.000.000.000 rows Confidential 17 KEY SCENARIO: REAL-TIME ANALYTICS ParStream Drives Interactive Analytics Processes to Gain Insights Faster Customers Replace Existing Solutions to Profit from ParStream’s Speed Customer Success Story • • • Etracker discarded MySQL-cluster because ParStream is up to 12.000 times faster Searchmetrics chose ParStream because of efficiency that enabled international roll-out Rio-Tinto changed to ParStream because of speed to sustain competitive advantage Web-Analytics SEO-Analytics Value Proposition • • Driving an interactive analytics process delivers insights quicker and more accurately High ROI through process innovation and greatly reduced infrastructure cost Geo-Spatial Analytics Target Market • • • Ad-spending and Web-Analytics Profiling and targeting Advanced analytics Confidential 18 ARCHITECTURE BUILDING BLOCKS ParStream is a Big Data Analytics Platform Based on a Unique High Performance Compressed Index • Hybrid Columnar/Row Storage • In Memory Technology C++ UDF - API SQL API / JDBC / ODBC Real-Time Analytics Engine • • • Shared Nothing Architecture Standard Interfaces Unique High Performance Compressed Index In-Memory and Disc Technology Multi-Dimensional Partitioning High Performance Compressed Index (HPCI)v Massively Parallel Processing (MPP) Shared Nothing Architecture Fast Hybrid Storage (Columnar/ Row) High Speed Loader with Low Latency Confidential 19 HIGH PERFORMANCE COMPRESSED INDEX The Key to ParStream’s Unmatched Performance STANDARD DB INDEX ARCHITECTURE − High Memory Requirements − High Load on CPUs − Time for Decompression Not Suitable for Big Data Analytics PARSTREAM INDEX ARCHITECTURE + Low Memory Requirements + No Need for Decompression + Patent filing in process Engineered for Big Data Analytics Confidential 20 ORDERS OF MAGNITUDE FASTER ParStream Outperforms PostgreSQL by a Factor of 1000 Delivering Results in Sub-Seconds on Large Data Volumes PostgreSQL (Scale 100 sec) Seconds Slow ParStream (Scale 1 sec) seconds Query 1 1, 0 PostGreSQ L 0, 9 Query 3 exponential ParStream Query 2 Fast Linear 0, 8 1000 times faster 0, 7 Query 1 0, 6 0, 5 Query 2 0, 4 0, 3 0, 2 Query 3 0, 1 0, 0 0 2 0 4 0 6 0 8 0 10 0 12 0 14 0 Number of Rows [Millions] Confidential 16 0 18 0 21 ONE LICENSE – FOUR WAYS TO DELIVER Customer Choice Software Cloud Partner Appliance OEM, ISV, SI v from software that can be configured and run on customer specific infrastructure to cloud and appliance Confidential 22 ParStream @ Mittelstand? Confidential 23 PARSTREAM @ MITTELSTAND • Vorhandene Standard Hardware oder Betrieb in der Cloud • Prototyp in max. 3-5 Tagen • Geringer Schulungsaufwand • „Hello World“ in 2 Minuten • Preisliche Skalierung ausschließlich anhand vom Datenvolumen Confidential 24 Data is the New Oil ParStream Provides the Platform Christian Werling, Director Sales ParStream GmbH Große Sandkaul 2 50667 Köln [email protected]