SlideShare a Scribd company logo
1 of 3
Call Us: +91- 8885560202 (India)
+1-707-666-8949 (USA)
Mail Us: Info@VirtualNuggets.com
1.Explain the ETL process? How many steps ETL contains?
Explain with example.
ETL stands for Extraction, Transforming and Loading.
Data is extracted from the source(database servers), and applied for the generating business role
on it.
The following are the steps involved :
Define the source [ define the odbc connection to the database source ]
Define the target [ create the odbc connection to the target database ]
Create the mapping [ Apply business role here by adding transformations and define the data
flow from source to target ]
Create the session [ Mapping instructions ]
Create the work flow [ Instructions that run on the sessions ]
2.Explain the Full load & Incremental or Refresh load-ETL?
Initial Load : It is the process of populating all the data warehousing tables for the very first
time
Full Load : While loading the data for the first time, all the set records are loaded at a stretch
depending on the volume. It erases all the content of tables and reloads with fresh data
Incremental Load : Applying the dynamic changes as and when necessary in a specific period.
The schedule is predefined each period
3.What is the three tier data warehouse?-ETL
The data ware is thought of as a three tier system
The middle layer provides the data that is usable in a secure way to the end users.
The other two layers are on the other side of the middle tier. One from the end users and
the other from back end data storage.-ETL
 The 1st layer is known as source layer where the data lands
 The 2nd layer is known as integration layer where data is stored after transformation
 The 3rd layer is known as dimension layer where the actual presentation layer stands.
4. What are snapshots? What are materialized views &
where do we use them? What is a materialized view log?-
ETL
 Snapshots are copies of read-only data of a master table.
 They are located on a remote node that is refreshed periodically to reflect the changes
made to the master table.
 They are replica of tables
Views
Views are built by using attributes of one or more tables.
View with single table can be updated, whereas view with multiple tables cannot be updated
Materialized View log
A materialized view is a pre computed table that has aggregated or joined data from fact tables
and dimension tables.
To put it simple, a materialized view is an aggregate table.
5.What is the difference between Power Center and Power
mart.-ETL
Power Center
 Processes large volumes of the data
 ERP sources such as SAP,PeopleSoft,Oracle Apps. can be connected with the power
center
 Session partition is allowed to improving the performance of an ETL transaction
Power Mart
 Processes low volumes of data
 Does not providing connections to ERP sources
 Does not allow session partitions

More Related Content

What's hot

Hadoop MapReduce joins
Hadoop MapReduce joinsHadoop MapReduce joins
Hadoop MapReduce joinsShalish VJ
 
Adbms 27 parallel database distribution architecture
Adbms 27 parallel database distribution architectureAdbms 27 parallel database distribution architecture
Adbms 27 parallel database distribution architectureVaibhav Khanna
 
Sql server introduction
Sql server introductionSql server introduction
Sql server introductionRiteshkiit
 
Distributed design alternatives
Distributed design alternativesDistributed design alternatives
Distributed design alternativesPooja Dixit
 
Reduce Side Joins
Reduce Side Joins Reduce Side Joins
Reduce Side Joins Edureka!
 
Join Algorithms in MapReduce
Join Algorithms in MapReduceJoin Algorithms in MapReduce
Join Algorithms in MapReduceShrihari Rathod
 
Applying stratosphere for big data analytics
Applying stratosphere for big data analyticsApplying stratosphere for big data analytics
Applying stratosphere for big data analyticsAvinash Pandu
 
Stratosphere with big_data_analytics
Stratosphere with big_data_analyticsStratosphere with big_data_analytics
Stratosphere with big_data_analyticsAvinash Pandu
 
6.2 my sql queryoptimization_part1
6.2 my sql queryoptimization_part16.2 my sql queryoptimization_part1
6.2 my sql queryoptimization_part1Trần Thanh
 
MapReduce and parallel DBMSs: friends or foes?
MapReduce and parallel DBMSs: friends or foes?MapReduce and parallel DBMSs: friends or foes?
MapReduce and parallel DBMSs: friends or foes?Spyros Eleftheriadis
 
Map reduce advantages over parallel databases
Map reduce advantages over parallel databases Map reduce advantages over parallel databases
Map reduce advantages over parallel databases Ahmad El Tawil
 
Informatica perf points
Informatica perf pointsInformatica perf points
Informatica perf pointsdba3003
 
Spot db consistency checking and optimization in spatial database
Spot db  consistency checking and optimization in spatial databaseSpot db  consistency checking and optimization in spatial database
Spot db consistency checking and optimization in spatial databasePratik Udapure
 

What's hot (20)

Hadoop MapReduce joins
Hadoop MapReduce joinsHadoop MapReduce joins
Hadoop MapReduce joins
 
Adbms 27 parallel database distribution architecture
Adbms 27 parallel database distribution architectureAdbms 27 parallel database distribution architecture
Adbms 27 parallel database distribution architecture
 
Sql server introduction
Sql server introductionSql server introduction
Sql server introduction
 
Distributed design alternatives
Distributed design alternativesDistributed design alternatives
Distributed design alternatives
 
Reduce Side Joins
Reduce Side Joins Reduce Side Joins
Reduce Side Joins
 
Join Algorithms in MapReduce
Join Algorithms in MapReduceJoin Algorithms in MapReduce
Join Algorithms in MapReduce
 
Chapter16
Chapter16Chapter16
Chapter16
 
Applying stratosphere for big data analytics
Applying stratosphere for big data analyticsApplying stratosphere for big data analytics
Applying stratosphere for big data analytics
 
1 ddbms jan 2011_u
1 ddbms jan 2011_u1 ddbms jan 2011_u
1 ddbms jan 2011_u
 
Pregel - Paper Review
Pregel - Paper ReviewPregel - Paper Review
Pregel - Paper Review
 
basic concept of ddbms
basic concept of ddbmsbasic concept of ddbms
basic concept of ddbms
 
Stratosphere with big_data_analytics
Stratosphere with big_data_analyticsStratosphere with big_data_analytics
Stratosphere with big_data_analytics
 
6.2 my sql queryoptimization_part1
6.2 my sql queryoptimization_part16.2 my sql queryoptimization_part1
6.2 my sql queryoptimization_part1
 
Abstract.DOCX
Abstract.DOCXAbstract.DOCX
Abstract.DOCX
 
MapReduce and parallel DBMSs: friends or foes?
MapReduce and parallel DBMSs: friends or foes?MapReduce and parallel DBMSs: friends or foes?
MapReduce and parallel DBMSs: friends or foes?
 
Map reduce advantages over parallel databases
Map reduce advantages over parallel databases Map reduce advantages over parallel databases
Map reduce advantages over parallel databases
 
Informatica perf points
Informatica perf pointsInformatica perf points
Informatica perf points
 
Spot db consistency checking and optimization in spatial database
Spot db  consistency checking and optimization in spatial databaseSpot db  consistency checking and optimization in spatial database
Spot db consistency checking and optimization in spatial database
 
The strength of a spatial database
The strength of a spatial databaseThe strength of a spatial database
The strength of a spatial database
 
Hadoop map reduce v2
Hadoop map reduce v2Hadoop map reduce v2
Hadoop map reduce v2
 

Viewers also liked

Mount Carmel opens its postgrad courses to boys
Mount Carmel opens its postgrad courses to boysMount Carmel opens its postgrad courses to boys
Mount Carmel opens its postgrad courses to boysKiran Shaw
 
IIMB throws open gates to its alumni
IIMB throws open gates to its alumniIIMB throws open gates to its alumni
IIMB throws open gates to its alumniKiran Shaw
 
Science Behind The News: TAP(Triaminopyrimidine) discovered as a promising dr...
Science Behind The News: TAP(Triaminopyrimidine) discovered as a promising dr...Science Behind The News: TAP(Triaminopyrimidine) discovered as a promising dr...
Science Behind The News: TAP(Triaminopyrimidine) discovered as a promising dr...Kiran Shaw
 
Lesson 1 object-oriented programming overview
Lesson 1   object-oriented programming overviewLesson 1   object-oriented programming overview
Lesson 1 object-oriented programming overviewKỳ Tôn Thất
 
RS Components- Connecting the Voice of the Customer with ROI and Culture
RS Components- Connecting the Voice of the Customer with ROI and Culture   RS Components- Connecting the Voice of the Customer with ROI and Culture
RS Components- Connecting the Voice of the Customer with ROI and Culture Scott Jayes
 

Viewers also liked (8)

Mount Carmel opens its postgrad courses to boys
Mount Carmel opens its postgrad courses to boysMount Carmel opens its postgrad courses to boys
Mount Carmel opens its postgrad courses to boys
 
Or ch1
Or ch1Or ch1
Or ch1
 
IIMB throws open gates to its alumni
IIMB throws open gates to its alumniIIMB throws open gates to its alumni
IIMB throws open gates to its alumni
 
Science Behind The News: TAP(Triaminopyrimidine) discovered as a promising dr...
Science Behind The News: TAP(Triaminopyrimidine) discovered as a promising dr...Science Behind The News: TAP(Triaminopyrimidine) discovered as a promising dr...
Science Behind The News: TAP(Triaminopyrimidine) discovered as a promising dr...
 
Or ch2 (2)
Or ch2 (2)Or ch2 (2)
Or ch2 (2)
 
Stack & queue
Stack & queueStack & queue
Stack & queue
 
Lesson 1 object-oriented programming overview
Lesson 1   object-oriented programming overviewLesson 1   object-oriented programming overview
Lesson 1 object-oriented programming overview
 
RS Components- Connecting the Voice of the Customer with ROI and Culture
RS Components- Connecting the Voice of the Customer with ROI and Culture   RS Components- Connecting the Voice of the Customer with ROI and Culture
RS Components- Connecting the Voice of the Customer with ROI and Culture
 

Similar to Etl interview questions

Top answers to etl interview questions
Top answers to etl interview questionsTop answers to etl interview questions
Top answers to etl interview questionssrimaribeda
 
ELT Publishing Tool Overview V3_Jeff
ELT Publishing Tool Overview V3_JeffELT Publishing Tool Overview V3_Jeff
ELT Publishing Tool Overview V3_JeffJeff McQuigg
 
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdfabhaybansal43
 
127556030 bisp-informatica-question-collections
127556030 bisp-informatica-question-collections127556030 bisp-informatica-question-collections
127556030 bisp-informatica-question-collectionsAmit Sharma
 
Building the DW - ETL
Building the DW - ETLBuilding the DW - ETL
Building the DW - ETLganblues
 
123448572 all-in-one-informatica
123448572 all-in-one-informatica123448572 all-in-one-informatica
123448572 all-in-one-informaticahomeworkping9
 
ETL Process & Data Warehouse Fundamentals
ETL Process & Data Warehouse FundamentalsETL Process & Data Warehouse Fundamentals
ETL Process & Data Warehouse FundamentalsSOMASUNDARAM T
 
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...
To Study  E T L ( Extract, Transform, Load) Tools Specially  S Q L  Server  I...To Study  E T L ( Extract, Transform, Load) Tools Specially  S Q L  Server  I...
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...Shahzad
 
Extract, Transform and Load.pptx
Extract, Transform and Load.pptxExtract, Transform and Load.pptx
Extract, Transform and Load.pptxJesusaEspeleta
 
ETL Tools Ankita Dubey
ETL Tools Ankita DubeyETL Tools Ankita Dubey
ETL Tools Ankita DubeyAnkita Dubey
 
The best ETL questions in a nut shell
The best ETL questions in a nut shellThe best ETL questions in a nut shell
The best ETL questions in a nut shellSrinimf-Slides
 
An Overview on Data Quality Issues at Data Staging ETL
An Overview on Data Quality Issues at Data Staging ETLAn Overview on Data Quality Issues at Data Staging ETL
An Overview on Data Quality Issues at Data Staging ETLidescitation
 
Data Warehouse - What you know about etl process is wrong
Data Warehouse - What you know about etl process is wrongData Warehouse - What you know about etl process is wrong
Data Warehouse - What you know about etl process is wrongMassimo Cenci
 
Sqlserver interview questions
Sqlserver interview questionsSqlserver interview questions
Sqlserver interview questionsTaj Basha
 
Should ETL Become Obsolete
Should ETL Become ObsoleteShould ETL Become Obsolete
Should ETL Become ObsoleteJerald Burget
 

Similar to Etl interview questions (20)

Top answers to etl interview questions
Top answers to etl interview questionsTop answers to etl interview questions
Top answers to etl interview questions
 
ELT Publishing Tool Overview V3_Jeff
ELT Publishing Tool Overview V3_JeffELT Publishing Tool Overview V3_Jeff
ELT Publishing Tool Overview V3_Jeff
 
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
 
127556030 bisp-informatica-question-collections
127556030 bisp-informatica-question-collections127556030 bisp-informatica-question-collections
127556030 bisp-informatica-question-collections
 
Building the DW - ETL
Building the DW - ETLBuilding the DW - ETL
Building the DW - ETL
 
Data warehouse physical design
Data warehouse physical designData warehouse physical design
Data warehouse physical design
 
123448572 all-in-one-informatica
123448572 all-in-one-informatica123448572 all-in-one-informatica
123448572 all-in-one-informatica
 
ETL Process & Data Warehouse Fundamentals
ETL Process & Data Warehouse FundamentalsETL Process & Data Warehouse Fundamentals
ETL Process & Data Warehouse Fundamentals
 
Etl techniques
Etl techniquesEtl techniques
Etl techniques
 
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...
To Study  E T L ( Extract, Transform, Load) Tools Specially  S Q L  Server  I...To Study  E T L ( Extract, Transform, Load) Tools Specially  S Q L  Server  I...
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...
 
Dwh faqs
Dwh faqsDwh faqs
Dwh faqs
 
Extract, Transform and Load.pptx
Extract, Transform and Load.pptxExtract, Transform and Load.pptx
Extract, Transform and Load.pptx
 
ETL Tools Ankita Dubey
ETL Tools Ankita DubeyETL Tools Ankita Dubey
ETL Tools Ankita Dubey
 
The best ETL questions in a nut shell
The best ETL questions in a nut shellThe best ETL questions in a nut shell
The best ETL questions in a nut shell
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
An Overview on Data Quality Issues at Data Staging ETL
An Overview on Data Quality Issues at Data Staging ETLAn Overview on Data Quality Issues at Data Staging ETL
An Overview on Data Quality Issues at Data Staging ETL
 
Data Warehouse - What you know about etl process is wrong
Data Warehouse - What you know about etl process is wrongData Warehouse - What you know about etl process is wrong
Data Warehouse - What you know about etl process is wrong
 
ETL_Methodology.pptx
ETL_Methodology.pptxETL_Methodology.pptx
ETL_Methodology.pptx
 
Sqlserver interview questions
Sqlserver interview questionsSqlserver interview questions
Sqlserver interview questions
 
Should ETL Become Obsolete
Should ETL Become ObsoleteShould ETL Become Obsolete
Should ETL Become Obsolete
 

Etl interview questions

  • 1. Call Us: +91- 8885560202 (India) +1-707-666-8949 (USA) Mail Us: Info@VirtualNuggets.com 1.Explain the ETL process? How many steps ETL contains? Explain with example. ETL stands for Extraction, Transforming and Loading. Data is extracted from the source(database servers), and applied for the generating business role on it. The following are the steps involved : Define the source [ define the odbc connection to the database source ] Define the target [ create the odbc connection to the target database ] Create the mapping [ Apply business role here by adding transformations and define the data flow from source to target ] Create the session [ Mapping instructions ] Create the work flow [ Instructions that run on the sessions ] 2.Explain the Full load & Incremental or Refresh load-ETL? Initial Load : It is the process of populating all the data warehousing tables for the very first time Full Load : While loading the data for the first time, all the set records are loaded at a stretch depending on the volume. It erases all the content of tables and reloads with fresh data Incremental Load : Applying the dynamic changes as and when necessary in a specific period. The schedule is predefined each period
  • 2. 3.What is the three tier data warehouse?-ETL The data ware is thought of as a three tier system The middle layer provides the data that is usable in a secure way to the end users. The other two layers are on the other side of the middle tier. One from the end users and the other from back end data storage.-ETL  The 1st layer is known as source layer where the data lands  The 2nd layer is known as integration layer where data is stored after transformation  The 3rd layer is known as dimension layer where the actual presentation layer stands. 4. What are snapshots? What are materialized views & where do we use them? What is a materialized view log?- ETL  Snapshots are copies of read-only data of a master table.  They are located on a remote node that is refreshed periodically to reflect the changes made to the master table.  They are replica of tables Views Views are built by using attributes of one or more tables. View with single table can be updated, whereas view with multiple tables cannot be updated Materialized View log A materialized view is a pre computed table that has aggregated or joined data from fact tables and dimension tables. To put it simple, a materialized view is an aggregate table. 5.What is the difference between Power Center and Power mart.-ETL Power Center  Processes large volumes of the data  ERP sources such as SAP,PeopleSoft,Oracle Apps. can be connected with the power center
  • 3.  Session partition is allowed to improving the performance of an ETL transaction Power Mart  Processes low volumes of data  Does not providing connections to ERP sources  Does not allow session partitions