Presentation
Transcription
Presentation
IBM Change Data Capture and Replication Solutions © 2008 IBM Corporation Information Management Software Unlocking the Business Value of Information Financial Risk Insight Customer & Product Profitability Workforce Dynamic Optimization Supply Chain Multi-channel Marketing Flexible Architecture 2 Information Management Software The IBM InfoSphere Vision An Industry Unique Information Platform • Simplify the delivery of Trusted Information • Accelerate client value • Promote collaboration • Mitigate risk • Modular but Integrated • Scalable – Project to Enterprise 3 Information Management Software InfoSphere Warehouse – Top Warehouse Solution Best-of-breed ETL Tools Best-of-breed Warehouse Engine Best-of-breed Reporting/Ad-hoc Query/Dashboard Data Assets Data Mart Structures Relational Data Management Aggregation InfoSphere Warehouse Report/Ad-hoc Query Transformation ERP Data Integration DashBoard Flat-file Data 4 Datastage InfoSphere Cognos Information Management Software InfoSphere Information Server Information Services Director Publish SOA services for information integration and access Business Glossary Enterprise Data Dictionary QualityStage Data Quality: Standardize, Correct & Match Data DataStage Information Analyzer Data Source Profiling & Problem Diagnosis Global Name Recognition Extract, Transform, and Load in Batch or Real-time Recognize & Classify Multi-cultural names Metadata Server / Metadata Workbench / FastTrack Manage and track consistent metadata across information integration tasks and automate generation of data flow logic Parallel Processing Rich Connectivity to Applications, Data, and Content 5 Federation Server Virtualize access to disparate information CDC & Replication Deliver and replicate changed data Information Management Software GUI Interface 6 Information Management Software Data Quality platform •Foundation to any success business projects •Source System analysis •Auditable Data Quality •DC rules will be stored in MDR •DC rules will be implement in ETL •DC result will be record Data Quality management 7 •Set up Data source view by federation •Auto generate verify SP with MDR •Verify result will be store with MDR Metadata management Information Management Software Data Integration Platform: :Infosphere Warehouse Ease of Design and Manage via Tooling Eclipse Design Studio DBA BI Designer Web Administration Console Data Mining Unstructured Analysis In-Line Analytics Cubing Services Extend Value via Analytics Simplify Warehouse Infrastructure BI Specialist SQW: Design, Deploy, Refine Data Warehouse pSeries Server cpu 1 cpu n-1 pSeries Server cpu 1 cpu n-1 cpu 2 cpu 2 cpu n I/O Channels BCU 1 8 WLM: Workload Management DB2 ESE/DPF/QP/Compression cpu n I/O Channels BCU 2 … pSeries Server cpu 1 cpu n-1 cpu 2 cpu n I/O Channels BCU N DBA IT Specialist Information Management Software Introducing the “IBM Smart Analytics Solution” Overcoming The Obstacles To Business Transformation An integrated, high-performance analytics solution for accelerating delivery of insights for faster, smarter action Able to adjust and grow based on your company’s ever changing business needs broad analytic capabilities powerful warehouse capabilities scalable and fully-integrated IBM hardware set-up services and single point of premium support Delivering results in days instead of months 9 Information Management Software IBM Smart Analytics Solution What's in the system? Infosphere Warehouse and Cognos Deeply optimized by IBM experts Flexible growth to meet changing business needs Analytics Software Options Business Intelligence capabilities Cubing Services Text Analytics & Data Mining . . . more to come Powerful Data Warehouse Warehousing Platform Advanced Workload Management System Automation Hardware & Services Server Platform Storage capacity Build, Deploy, Health Check & Premium Support Services Delivering results in days instead of months! 10 Information Management Software IBM Smart Analytics Solution – Scalable Warehouse Seamless Growth Add capacity & analytic capabilities as requirements evolve Start right 11 Add more capacity Add new analytic capability 11 Information Management Software InfoSphere Information Server Information Services Director Publish SOA services for information integration and access Business Glossary Enterprise Data Dictionary QualityStage Data Quality: Standardize, Correct & Match Data DataStage Information Analyzer Data Source Profiling & Problem Diagnosis Global Name Recognition Extract, Transform, and Load in Batch or Real-time Recognize & Classify Multi-cultural names Metadata Server / Metadata Workbench / FastTrack Manage and track consistent metadata across information integration tasks and automate generation of data flow logic Parallel Processing Rich Connectivity to Applications, Data, and Content 12 Federation Server Virtualize access to disparate information CDC & Replication Deliver and replicate changed data Information Management Software Business Challenges Driving Real-Time Data Integration Dynamic Warehousing & Business Intelligence and Reporting • Yesterday’s data inadequate for inventory and purchasing decisions Data Synchronization and Replication • We need up to date information flowing between applications and to ensure an up-to-date version is always available Real-time Event Detection • Need to pro-actively monitor and respond to business changes ……Without Impacting the Performance of Production Systems 13 Information Management Software What is IBM InfoSphere Change Data Capture? • Provides real-time change data capture and delivery for • Dynamic change data integration • Synchronization Developers Deliver Architects IBM InfoSphere Change Data Capture and Replication Solutions Delivers real time changed data to Information Server, applications and targets or message queues • Replication • Minimal impact on production systems • High scalability and end-toend performance • Guaranteed data integrity 14 Without impacting performance of production systems Information Management Software Business Application EDW Production Systems Enterprise Data Warehouse ETL ERP Billing Finance POS CRM BI Appliance Real-Time CDC Event-Driven SOA ESB Web Applications Real-Time CDC MDR Real-Time CDC 15 Operational Business Intelligence Metadata Repository Up-to-Date Web Portals Master Data Management Information Management Software Expansive Source, Target, Platform Support DATABASES Source & Target DATABASES Source Only TARGETS MESSAGE QUEUE OPERATING SYSTEMS HARDWARE PLATFORMS DB2 z/OS IMS Teradata JMS i5/OS IBM System i Oracle ADABAS Netezza MQ Series z/OS IBM System z Sybase CA-IDMS Greenplum AIX IBM System p MS SQL Server CA-Datacom Informix WebMethods HP-UX HP-9000 MySQL BEA Solaris Intel Information Server MS Windows Sun Cognos Now! Linux Itanium DB2 UDB DB2 i DB2 UDB VSAM IAM Flat File 16 TIBCO Information Management Software 17 Information Management Software Replication Modes: Refresh “Replication Log” Continuous (Real Time) Refresh Push Engine Net Change (Periodic) Database Table Refresh (Full Copy) 18 Information Management Software Replication Modes: Net Change Replication Log Continuous (Real Time) Scrape Push Engine Net Change (Periodic) Database Table Refresh (Full Copy) 19 Information Management Software Replication Modes: Continuous Mirroring Replication Log Continuous (Real Time) Scrape Push Engine Net Change (Periodic) Database Table Refresh (Full Copy) 20 Information Management Software InfoSphere CDC General Architectural Overview Oracle DB2 SQL Server : InfoSphere CDC Binaries InfoSphere CDC Binaries InfoSphere CDC Binaries includes Source and Target replication engines and configuration agent. InfoSphere CDC Binaries GUI Connection GUI Connection GUI Connection Subscriptions ( Replication Threads) Subscriptions (Replication Threads) Win/ Unix/ Linux Subscriptions ( Replication Threads) Management Console Management Console configuration interface . Access Manager controls access to the product . Can be run on a separate server to allow for centralized access Access Manager GUI Connection GUI Connection GUI Connection InfoSphere CDC Binaries InfoSphere CDC Binaries InfoSphere CDC Binaries Interface Connections. Only connected when GUI being used Replication Connections. . Only connected when actively Mirroring or Refreshing Oracle 21 DB2 DB2 Information Management Software Building A Low Latency ODS – General Technical Architecture Source Server(1) Meta Linux/Unix/Windows Agent Administration GUI DB Log Access Server/Management Console Install Engine DB Target ODS InfoSphere CDC Install Agent Agent Source Server(2) Meta DB Log Engine DB Agent Meta InfoSphere CDC Install DB Log InfoSphere CDC Binaries Engine DB InfoSphere CDC Install 22 Meta InfoSphere CDC Metadata Can contain both source and target metadata depending on whether server is publisher, subscriber or both. There is no single unified metadata instance. Information Management Software Replicating “To” An ESB – General Technical Architecture (a) InfoSphere CDC Event Server On Different Server As InfoSphere CDC Source Server(1) Linux/Unix/Windows Agent Meta Administration GUI DB Log Access Server/Management Console Install Engine DB Message Queue Server InfoSphere CDC Install Agent Agent InfoSphere CDC Metadata Meta Can contain both source and target metadata depending on whether server is publisher, subscriber or both. There is no single unified metadata instance. InfoSphere Event Server Metadata Meta Target/Subscriber metadata only.. InfoSphere CDC/Event Server Binaries 23 Meta Engine JMS XML or User Defined InfoSphere CDC Event Server Install JMS XML or User Defined Message Queue Server Information Management Software Replicating “To” An ESB – General Technical Architecture (b) InfoSphere CDC Event Server On Same Server As InfoSphere CDC Source Server(1) Linux/Unix/Windows Agent Meta Administration GUI DB Log Engine DB Access Server/Management Console Install InfoSphere CDC Install JMS XML or User Defined Engine Message Queue Server JMS r XML o ed in f e D User Meta Meta Can contain both source and target metadata depending on whether server is publisher, subscriber or both. There is no single unified metadata instance. Agent Agent InfoSphere CDC Event Server Install Meta InfoSphere CDC/Event Server Binaries 24 InfoSphere CDC Metadata InfoSphere Event Server Metadata Target/subscriber metadata only.. Information Management Software Use Cases 25 Information Management Software Use Cases Real-time Integration Need up-to-date information delivered to a data warehouse Need up-to-date information delivered for eCommerce applications (ie. Downtown gift registry) Store Midtown Store Inventory Uptown Store 26 Information Management Software Use Cases Replication Need to report on operational systems but cannot impact production system – must synchronize data across 1 or more databases Less expensive DBs Need to maintain business continuity during application migrations, consolidations, upgrades 27 Information Management Software Live, Operational Reporting • Distribute reporting workload over existing environments • Real-time data for Business Intelligence Production Systems 28 Information Management Software Data Consolidation and ETL Integration HP Superdome New GSM Billing - Oracle Sun New CRM Public Website IBM System i Billing Data Warehouse IBM System i Billing DataStage Other Applications (a.o. CRM) ODS (Oracle) 29 Information Management Software Direct Connect with InfoSphere DataStage • Enabling real-time response to data changes and business events • Low impact log-based changed data capture • New palette stages on InfoSphere Information Server • Stream data changes into InfoSphere Information Server 30 Information Management Software 31 Information Management Software Features in IBM InfoSphere CDC 32 Information Management Software Filtering CUST_NO L_NAME F_NAME PHONE REP_NO 58699 Smith John 404-555-3874 45 37283 Duggan Ira 613-555-8367 25 89863 Quinn Fran 905-555-1296 11 89732 Muntz Muntz 704-555-2738 25 Integrate entire systems or only a subset of data ROW SELECT REP_NO = 25 Table/row/column-level filtering options available 33 CUST_NO L_NAME F_NAME REP_NO 37283 Duggan Ira 25 89732 Muntz Josie 25 Information Management Software Data Translations • Convert data representations on the fly to integrate disparate systems and database table structures EMP LAST FIRST HIRE_DATE STAT SALARY MAX 1234 Moreiro Nicole 01/05/97 A $55,000 $60,000 2345 Ellison Val 04/12/97 I $40,000 $50,000 34 Transform Fields Derived Fields Concatenation Century Dates EMP_ID FULL_NAME HIRE_DATE STATUS %SALARYMAX 001234 Nicole Moreiro 01/05/1997 Active 92% 002345 Val Ellison 04/12/1997 Inactive 80% Increase Field Size Information Management Software Table Mapping Methods • One-to-one • Source and target tables have similar table structures • Audit • Generates audit trail of data transactions from source • Adaptive Apply • Automatically synchronizes data for dissimilar sources and targets • Summarization • Keeps a running total of numerical values at the target • Consolidation: One-to-One • Merges data from several tables into a single row • Consolidation: One-to-Many • Used to apply a source lookup table change to all affected target rows 35 Information Management Software Guaranteed Data Integrity • Data transactions are applied at the target in the same order as it was generated at the source • Target acknowledges each apply operation to ensure delivery • Data capture continues even if communications link is unavailable • Data applied at target upon communication availability 36 Information Management Software Monitoring • Graphical visualization of replication processes • Event logs, alerts & alarms • Exportable throughput & latency statistics 37 Information Management Software Real-Time Change Data Capture and Delivery Sense and respond to critical Information in real time Provide real-time synchronization between applications Reduce burden on operational systems Maintain business continuity Capitalize on Opportunities Quickly Adapt to Competitive Pressures Gain Operational Efficiencies Minimize Risk During Migration/Consolidation/Upgrades Protect critical business information 38 Information Management Software 39 Information Management Software Backup Slides 40 Information Management Software Expansive Source, Target, Platform Support DATABASES Source & Target DATABASES Source Only TARGETS MESSAGE QUEUE OPERATING SYSTEMS HARDWARE PLATFORMS DB2 z/OS IMS Teradata JMS i5/OS IBM System i Oracle ADABAS Netezza MQ Series z/OS IBM System z Sybase CA-IDMS Greenplum AIX IBM System p MS SQL Server CA-Datacom Informix WebMethods HP-UX HP-9000 MySQL BEA Solaris Intel Information Server MS Windows Sun Cognos Now! Linux Itanium DB2 UDB DB2 i DB2 UDB VSAM IAM Flat File 41 TIBCO Information Management Software Data Consolidation and ETL Integration HP Superdome New GSM Billing - Oracle Sun New CRM Public Website IBM System i Billing Data Warehouse IBM System i Billing DataStage Other Applications (a.o. CRM) ODS (Oracle) 42 Information Management Software Business Application EDW Production Systems Enterprise Data Warehouse ETL ERP Billing Finance POS CRM BI Appliance Real-Time CDC Event-Driven SOA ESB Web Applications Real-Time CDC MDR Real-Time CDC 43 Operational Business Intelligence Metadata Repository Up-to-Date Web Portals Master Data Management Information Management Software Bostik Ltd. - Real-time Analysis Business Challenge: Business Value: • Corporate data consolidated on a nightly basis from two ERP systems for analysis and reporting to not impact daytime performance •On-time delivery of orders up 10% •Volume of data and reporting needs exceeded nightly batch load capacity and hindered business performance •Late orders decreased by 80% •Flexible reporting and analysis schedules •Single view of corporate ERP data •Batch window time slot open Solution: •IBM InfoSphere Change Data Capture consolidated data from ERP systems into a single data warehouse, transformed into a single format •Analyses and reporting can be conducted with the most current data at any time •Eliminated nightly batch window to free up system resources for other activities 44 “While building an effective data warehouse infrastructure and delivering good information to users is critically important, it is just the beginning. The real benefit comes from the ability to make better business decisions.” – Paul Hudson, IT Manager, Bostik Ltd. Information Management Software Tiffany & Company – Data Synchronization Business Challenge: Business Value: • Needed access to up-to-date inventory and product information for online purchasing • Increased customer satisfaction with more accurate Online Gift Registry information which is a significant portion of Tiffany’s online business •Purchases made online needed to be accurate reflected in the inventory system, particularly for online gift registry •Improved inventory tracking results in more effective inventory management Solution: • Implemented IBM InfoSphere Change Data Capture • Real-time data flows to e-Commerce application for online customer purchasing and back to the inventory systems after purchases are made. “DataMirror’s robust, real-time data integration provides us with visibility into inventory levels, sales reports and e-Business channels, allowing us to better manage sales cycles and deliver superior customer experiences worldwide." - Manager of Database Services, Tiffany & Co. 45 Information Management Software CitiStreet – Data Replication Business Challenge: Business Value: • CitiStreet is one of the largest and most experienced global benefits providers servicing over 9 million plan participants across all markets. CitiStreet was formed in partnership between subsidiaries of State Street Corporation and Citigroup • Ensure application availability for plan participants and sponsors. • Business continuity - Ensure 24x7 portal access for plan participants and sponsors • Improve data integrity with a reduced level of maintenance • Maintain p2p synchronization of profile updates (approx 175,000 updates daily) in real time Solution: • Use IBM InfoSphere Replication Server to support redundant, active single sign-on applications for failover processing, replicating profile changes between systems in real time. 46 “Since nearly 10 million of CitiStreet customers are offered 24-hour access to their retirement accounts, the company can't afford downtime and must be able to replicate data changes when they happen. We fully replicate our database over redundancy data lines, so to us the stability and speed of that asynchronous replication is strategic for us.” Barry Strasnick , CIO CitiStreet Information Management Software Large Telco Company - Event Detection Business Challenge: Business Value: •Wanted to give their CRM system access to current billing information. • Increase revenue through proactively selling more time when paid minutes gets low •The billing system is their most critical business system – no performance impact or increased risk could be tolerated • Increase customer satisfaction by anticipating customer need •No performance impact or increased risk to billing system Solution: • IBM InfoSphere Change Data Capture is used to detect customer transactions from billing system and push the information to a message queue • Able to detect specific business events, make that information available to their CRM system, allowing them to automatically respond when a customer runs low on minutes. 47 Billing System Customer Transactions CDC Customer Transactions Message Queue CRM Information Management Software 1 Box Scenario Box 1 CLIENT ACCESS SERVER source target Only one system required Only Intrasystem replication possible Only possible if box 1 is NT or Unix 48 Information Management Software 2 Box Scenario Box 1 CLIENT ACCESS SERVER source target Box 2 Does not require separate access server Set-up of replication via the client dependent on Box 1 being active Only Intrasystem replication possible 49 Information Management Software 3 Box Scenario A Box 1 CLIENT ACCESS SERVER Box 2 Box 3 source target Requires administration of Access Server on all installed systems. Good for locations with few administrators. Only possible if box 1 is Windows NT or UNIX 50 Information Management Software 3 Box Scenario B Box 1 Box 2 Box 3 CLIENT ACCESS SERVER source target Many clients can access CDC, if authorized Does not require separate box for Access Server Only possible if box 2 is Windows NT or UNIX 51 Information Management Software 3 Box Scenario C Box 1 Box 2 Box 3 CLIENT ACCESS SERVER source target Many clients can access CDC, if authorized Does not require separate box for Access Server Only possible if box 2 is Windows NT or UNIX 52 Information Management Software 4 Box Scenario Box 3 Box 1 CLIENT ACCESS SERVER Box 2 Box 4 source target • Does not require separate access server 53