DQ Developer 9x Specialist Skill Set Inventory

Transcription

DQ Developer 9x Specialist Skill Set Inventory
Skill Set Inventory
Data Quality 9.x: Developer, Specialist
Certification
About the Informatica Certified Professional (ICP) Program
Informatica certification is a two-stage structure that correlates professional certification with the
“what, why and how” of Informatica implementations:
•Specialist – To achieve Specialist recognition, a candidate must pass a written exam, proctored in
person or via webcam. The Specialist exam validates that the individual understands the product and
can apply the skills and capabilities required to contribute to a project as a full team member.
•Expert – To reach the Expert level an Informatica Certified Specialist must pass Informatica Velocity
Best Practices and Implementation Methodologies Certification. This certification validates that you are
able to lead a project implementation team following our best practices.
Our new exams have been developed to define product competency by job role, where demonstrated
performance and outcomes validate Informatica implementation skills.
About the ICS Data Quality Developer Exam and the Skill Set Inventory
This exam measures your competency as a member of a project implementation team. This involves
having an in-depth knowledge of each of the Data Quality processes from Profiling to Standardization,
Matching and Consolidation. You need to be able to select and configure the appropriate Data Quality
transformation for your requirements and build, debug and execute Data Quality mappings as well as
integrate mappings into Power Center.
The skill set inventory is used to guide your preparation before taking the exam. It is an outline of the
technical topics and subject areas that are covered in each exam. The skill set inventory includes test
domain weighting, test objectives and topical content. The topics and concepts are included to clarify
the test objectives.
Informatica University, Copyright ©Informatica Corporation, November 2014
Test takers will be tested on:
•
•
•
•
•
•
•
•
•
•
Be able to navigate through the Developer Tool
Use Project Collaboration techniques; shared Tags, Comments, Profiles, reference tables to
share information with Analysts
Use a variety of profiling methods to profile data
Use Analyst built profiles to develop mappings
Build mappings and mapplets to cleanse and standardize data
Perform Address Validation
Identify duplicate records in a dataset
Automatically and manually consolidate duplicate records to create a master record
Execute mappings/mapplets in PowerCenter
Use Data Quality Mappings in an Excel spreadsheet (connect to Web Services)
Training Prerequisites
The skills and knowledge areas measured by this exam are focused on product core functionality inside
the realm of a standard project implementation. Training materials, supporting documentation and
practical experience may become sources of question development.
The suggested training prerequisites for this certification level are the completion of the following
Informatica course(s):
onDemand Training
Data Quality 9.5: Analyst – onDemand w/ 40 lab hours
Informatica Developer Tool: Introduction - onDemand w/ 30 lab hours
Data Quality 9.x: Developer– onDemand w/ 40 lab hours
Data Quality 9.x: Developer Level 2 – onDemand w/ 40 lab hours
Exam Test Domains
The test domains and the extent to which they are represented as an estimated percentage of the exam
follows:
Title
% of Exam
Informatica Overview
Analyst Collaboration
Profiling
Standardization/Mapplets
Address Validation
Matching
Consolidation and the DQA
10%
10%
15%
10%
5%
10%
10%
Informatica University, Copyright ©Informatica Corporation, November 2014
Integration with PowerCenter
Object Import and Export
DQ for Excel
Parameters
Content
5%
5%
5%
5%
10%
Question Format
You may select from one or more response offerings to answer a question.
You will score the question correctly if your response accurately completes the statement or answers
the question. Incorrect distractors are given as possible correct answers so that those without the
required skills and experience may wrongly select that choice.
A passing grade of 70% is needed to achieve recognition as an Data Quality Developer 9.x: Specialist
certification.
You are given 90 minutes to complete the exam.
Test formats used in this exam are:
• Multiple Choice: Select one option that best answers the question or completes the statement
• Multiple Response: Select all that apply to best answer the question or complete the statement
• True/False: After reading the statement or questions select the best answer
Exam Policy
•
•
If you do not pass on your 1st attempt, you must wait 2 weeks after the exam to retake the exam
You may take the exam up to three times in one year from the date of your first exam attempt
Test Topics
The exam will contain 70 questions comprised of topics that span across the sections listed below. In
order to ensure that you are prepared for the test, review the subtopics associated with each section.
Informatica Overview
•
Be able to describe the Informatica 9 architecture and set up including the repositories
required for installation
•
Be able to provide information on the Data Quality process and dimensions of data quality
Analyst Collaboration
•
Be able to use and describe the functionality of the Analyst Tool including Scorecarding,
Reference Table Management, Tags, Filters, Profiles and Comments
•
Be able to describe how Analysts and Developers can collaborate on projects
Informatica University, Copyright ©Informatica Corporation, November 2014
•
Describe the benefits of project collaboration to a team
Profiling
•
Be able to perform and interpret column, rule, comparative and mid-stream profiling
Standardization/Mapplets
•
Be aware of where data standardization fits in the DQ process
•
Apply, configure and troubleshoot data standardization transformations
•
Be able to differentiate between the various parsing techniques available
•
Recognize how reference tables are used in the standardization process
Address Validation
•
Verify the importance of Address Validation
•
Configure the Address Validation transformation including explaining each mode that is
available and what custom templates can be used for
•
Interpret the AV outputs generated
Matching
•
Know how to develop and build match plans to identify duplicate or related data
•
Be able to differentiate between the matching algorithms available and explain how
matching scores are calculated
•
Know how to configure Identity Matching option for the match transformation
•
Explain populations, the identity match strategies that are available and how they are used
in Identity Matching
Consolidation and the DQA
•
Explain Automatic and Manual consolidation techniques and configure the Transformations
used for automatic consolidation
•
Use the Exception Transformation to generate and populate the Bad Records and Duplicate
Records tables
•
Troubleshoot any problems that may occur
Informatica University, Copyright ©Informatica Corporation, November 2014
•
Be able to generate a survivor record in the DQA
•
Remove duplicates using DQA
Integration with Power Center
•
Explain how to integrate IDQ mappings and mapplets into PowerCenter Workflows including
how content is handled
Object Import and Export
•
Describe the difference between the Basic and Advanced Import options available
•
Explain Dependency conflict resolution and how it is handled
•
Describe how to Export a Project
DQ for Excel
•
Be able to describe the requirements for integration for Excel
•
Explain the techniques for creating mappings for DQ for Excel
•
Explain the Web service capabilities required for DQ for Excel
•
Provide an explanation on how to use the Informatica ribbon in Excel
Parameters
•
Explain what parameters are and why they are used including what parameter functions are
supported
•
Identify which Transformations can use parameters
•
Describe the process of exporting mappings with parameters built in
Content
•
Explain what Content is, what is contained in the Core Accelerator and why it is used
Informatica University, Copyright ©Informatica Corporation, November 2014
Sample Test Questions
Scorecarding is performed on which of the following?
Option Text
A.
Per record basis
B.
Per column/attribute basis
C.
Per table basis
D.
Per database basis
Select all correct statements for IDQ grouping and matching:
Option Text
A.
IDQ field level matching does not utilize grouping
B.
When field level matching is performed, the records within each group will be compared against
each other
C.
When field level matching is performed, matching will be performed across multiple groups in a
single match transformation
D.
When field level matching is performed, matching will not be performed across groups, therefore it
is imperative grouping is performed on a complete and accurate field(s)
What types of profiling can be performed in the Developer Tool in Data Quality?
Option Text
A.
Column Profiling, Primary Key Inference, Dependency Inference
B.
Column and Join Profiling only
C.
Column, Join Analysis, Mid Stream, Comparative Profiling
D.
Column, Join Analysis, Mid Stream, Comparative, Primary and Foreign Key and Overlap Profiling
Data Consolidation manages the following processes (select all that apply):
Option Text
A.
Identifies duplicates within a data set
B.
Merging or linking duplicate or related records
C.
Removing duplicates from a dataset
D.
Replacing inaccurate data
Informatica University, Copyright ©Informatica Corporation, November 2014
Select the architecture description that best describes IDQ 9.1
Option Text
A.
The architecture is completely server based; there is no client. Both Developer and Analyst users
access IDQ through a web based interface
B.
The architecture is both client and server based. The Data Integration, Analyst, and Model
Repository services all run on the server. Developer is the client application
C.
IDQ has both a client and server repository. Production ready data quality mappings are
transferred from the client repository to the server repository
D.
IDQ does not have a repository. All rule and mapping work is stored as XML files in the server
directory structure
Informatica University, Copyright ©Informatica Corporation, November 2014