Friday, 24 October 2014

Info sphere Data Stage and Quality Stage 9.1 training content

Info sphere Data Stage and Quality Stage 9.1

Data Warehouse Fundamentals
An introduction to Data Warehousing – purpose of Data Warehouse – Data Warehouse Architecture – Operational Data Store – OLTP Vs Warehouse Applications – Data Marts- Data marts Vs Data Warehouses – Data Warehouse Life cycle .
 Data Modeling
Introduction to Data Modeling – Entity Relationship model (E-R model) – Data Modeling for Data Warehouse, Normalization process – Dimensions and fact tables – Star Schema and Snowflake Schemas.
ETL Design Process
Introduction to Extraction, Transformation & Loading- Types of ETL Tools – Key tools in the market.
Introduction to Data stage Version 9.1
Datastage introduction – IBM information Server architecture – DataStage components – DataStage main functions – Client components.
Data Stage Designer
Introduction to Data stage Designer – Importance of Parallelism – Pipeline Parallelism – Partition Parallelism – Partitioning and collecting – Symmetric Multi Processing (SMP) Massively Parallel Processing (MPP) – Partition techniques – Data stage Repository Palette – Passive and Active stages – Job design overview – Designer work area – Annotations – Creating jobs – Importing flat file definitions – Managing the Metadata environment – Dataset management – Deletion of Dataset – Routines – Arguments.
Working with Parallel Job Stages
Database Stages
Oracle connector –Teradata Connector – ODBC –DB2 Connector
File Stages
Sequential file – Dataset – File set – Lookup file set-Complex Flat File Stage
Processing Stages
Copy – Filter – Funnel – Sort- Remove duplicate – Aggregator – Modify – Compress – Expand – Decode – Encode – Switch – Pivot stage -Lookup – Join – Merge –look up, join and merge – change capture – Change apply – Compare – Difference – Surrogate key generator – Transformer
Debug Stages
Head – Tail – Peek – Column generator – Row generator –Write Range Map Stage.
Real Time Stages
XML input – XML output –XML Transformer, MQ plug in
Local and Shared containers
Routines creation
Advanced Stages in Parallel Jobs (Version 9.1)
Range Look process – Surrogate key generator stage – Slowly changing dimension stage – iway stage – FTP stage-Pivot Enterprise – Job performance analysis – Resource estimation- Performance Optimizer – Slowly Changing Dimensions implementation , Transformer stage looping condition, Transformer stage Last Row handling
Datastage Director
Introduction to Data stage Director – Validating Data stage Jobs – Executing Data stage jobs – Job execution status – Monitoring a job – Job log view – job scheduling – Creating Batches – Scheduling batches.
DATASTAGE Administrator
Data stage project Administration - Editing projects and Adding Projects – Deleting projects Cleansing up project files – Environmental Variables–Environment management – Auto purging – Rutime Column Propagation(RCP) – Add checkpoints for sequencer – NLS configuration – Generated OSH (Orchestra Engine) – System formats like data, timestamp – Project protect – Version details.

Job Sequencers
Arrange job activities in Sequencer – Triggers in Sequencer – Restablity – Recoverability – Notification activity – Terminator activity – Wait for file activity – Start Look activity – Execute Command activity – Nested Condition activity – Exception handling activity – User Variable activity – End Loop activity – Adding Checkpoints

Info sphere Quality Stage 9.1

ü  Why Data Quality
ü  Data Quality Challenges
ü  Types of  Data Quality Tools Provided by IBM
ü  Differences between IA and QS
ü  Quality stage Architecture
ü  Data stage Quality Stages
o   Investigate Stage
§  Default Class Descriptions
§  Word Investigation
§  Character Discrete Investigation
§  Character concatenate investigation

o   Standardize Stage
§  Standardize Process
§  Domain Specific Rule sets
§  Domain Preprocessing Rule sets
§  Creation of Custom Rule sets with Examples( SEPLIST/STRIPLIST,Classification file,Dictionary Files,Pattern Action File,Lookup Tables,Override Tables )
§  Introduction to Pattern Action language
§  Types Of Patterns ( Conditional and Unconditional )
§  Build customized Action statements using PAL with Examples
§  Standardize Quality Assessment Report ( SQA )

o   Match Stage
§  Match Process
§  Creation of Match Passes
§  Match Frequency Stage & Reports
§  Unduplicate Match Stage with Examples
§  Reference Match Stage with Examples

o   Survive Stage
§  Importance of Survive Stage
§  Build Survive Process
§  Implementation of Survive Rules

o   AVI Stage ( Newly Added in QS 9.1 )
§  AVI Process
§  Types of Processing ( Parse and Validation )
§  Types of Reports ( Suggestion and Correction Report ) with Examples
§  Detail Explanation of Each Report

ü  Data Rules Stage ( Newly Added in QS 9.1)
§  Importance of Data Rules Stage
§  How to build the rules in Data Rules Stage with Example
§  How to map the rules to columns
§  Explanation of output Reports


ü  Explanation about the entire Data Quality Life Cycle



IBM Information Server Administration
IBM Info sphere Data Stage administration – Opening the IBM Information Server Web console – setting up a project ion the console – Customizing the project dashboard – Setting up security – Creating users in the console – Assigning security roles to users and groups – Managing licenses – Managing active sessions – Managing logs – Managing schedules – Backing up and restoring IBM Information Server.



DATASTAGE  9.1 REAL TIME PROJECT

v  Project architecture and BRD discussion                           
v  Dimensional tables and fact tables with modeling           
v  Flow of subject area discussion                                         
v  Design of HLD’s and LLD’s for a project                          
v  Project flow-job design process with ETL Documents    
v  Complex jobs discussion-unit testing process                  
v  System & User Acceptance & Regression & End-to-End Testing   
v  Deployment Process of code to different phases                            
v  Creation of job design documents or overview docs, tech specs  
v  Production support process                                                              
v  UNIX scripting for automation of code                                             
v  Discussion of scheduling process with Control-M/Autosys           
v  Fixing of Defects and Problem Tickets and Incidents                     

Additional Features

v  Data stage project on Banking & Insurance & Health Care domain.
v  Data stage Certification Guidance.
v  Performance TuNing of Parallel Jobs.
v  Datastage Installation process and setup.
v  Well Versed Materials Which Covers Data warehousing Basics, Datastage Concepts Unix Commands, Shall Script, Databases.




3 comments:

  1. I really appreciate information shared above. It’s of great help. If someone want to learn Online (Virtual) instructor lead live training in Data stage administrator training, kindly contact us http://www.maxmunus.com/contact
    MaxMunus Offer World Class Virtual Instructor led training on TECHNOLOGY. We have industry expert trainer. We provide Training Material and Software Support. MaxMunus has successfully conducted 100000+ trainings in India, USA, UK, Australlia, Switzerland, Qatar, Saudi Arabia, Bangladesh, Bahrain and UAE etc.
    For Demo Contact us.
    Sangita Mohanty
    MaxMunus
    E-mail: sangita@maxmunus.com
    Skype id: training_maxmunus
    Ph:(0) 9738075708 / 080 - 41103383
    http://www.maxmunus.com/

    ReplyDelete
  2. I wish to show thanks to you just for bailing me out of this particular
    trouble.As a result of checking through the net and meeting
    techniques that were not productive, I thought my life was done.
    javascript training in Chennai
    mysql dba training in chennai

    ReplyDelete