Training Curriculum for Talend For Big Data consists of how to integrate Talend with Hadoop and perform various ETL Operations, Learn Big Data in Minutes not in months, No Pre-requisite, No Java/Map-Reduce Knowledge Required,Integrate your data in best way in Hadoop
2. 2
TalendforBig Data Training- Bharat Khanna
Curriculum
Chapter 1: Initial Set Up
Installing Talend Open Studio for Big Data on Windows
Installing HortonworksSandbox on Virtual Box
ConfiguringTalend for Hadoop
Installing Oracle XE Database
Other Software Requirements
Chapter 2: Getting started with Talend Open Studio for Big Data
Starting the studio
Creating the project
WalkthroughStudio GUI
Configuringyour own Talend View
Creating an example job
Chapter 3: ConfiguringMetadata
Creating a Built-in Schema
Creating a Repository Schema
Creating a generic schema from existing metadata
Chapter 4: Workingwith Files
ReadingDelimited files
ReadingPositional Files
ReadingExcel Files
ReadingXMLFiles
3. 3
TalendforBig Data Training- Bharat Khanna
ReadingRegex Files
Chapter 5: Workingwith Databases
tOracleConnection
tOracleInput
Using SQL Builder in Talend
tOracleOutput
tOracleRow
Chapter 6: Using contexts
Creating Built-In Contexts
Using Contexts Prompts
Creating Contextsgroup in Repository
Implicit tContextLoad
tContextLoad and tContextDump
Chapter 7: Using tMap
Overview of tMap
Connectingmultipleinputsto tMap
Creating joinsin tMap
Catching Rejects in tMap
Using Expression Builder in tMap
Using Variables in tMap
Performancetuningof look upsin tMap
4. 4
TalendforBig Data Training- Bharat Khanna
Chapter 8: Usage of Processing Components
tJoin
tFilterRow
tSortRow
tAggregateRow
tAggregateSortedRow
tReplace
tNormalize
tDeNormalize
Chapter 9: Usage of Iterative Componentsin Talend
tFileList
tFlowtoIterate
tIteratetoFlow
Chapter 10: UsingJava in Talend
Creating Routinesin Talend
Using tJava component
Using tJavaRow component
Using tJavaFlex component
Chapter 11: Configuring Statistics and Logs in Talend
CapturingStatistics at Project Level
Using tWarn, tDie, tFlowMeter
Using tFlowMeterCatcher, tLogcatcher
Using Global Variables in Talend
5. 5
TalendforBig Data Training- Bharat Khanna
Chapter 12: Getting Started with Hadoop
Creating Built-In connection to HDFS
Creating Repository connection to HDFS
Creating your firstHDFS Job
Chapter 13: More on HDFS
Using tHDFSPutand tHDFSGet
Using tHDFSInput
Iterating usingtHDFSList
Using tHDFSPropertiesand tHDFSRowCount
Chapter 14: Working on HIVE
Creating Hiveconnection
Getting deep into HIVE Concepts
Workingon HIVE in command line
Workingwith External Tables of HIVE in Talend
Workingwith Managed Tables of HIVE in Talend
Using external jars/UDF in HIVE
PerformingELTin HIVE in Talend
Chapter 15: Working on PIG
Creating PIG connection
Getting deep into PIG Concepts
Workingon PIG in command line
Using tPigLoad and tPigStoreResult
Workingwith tPigMap
6. 6
TalendforBig Data Training- Bharat Khanna
Joiningvarioussourcesusing tPigJoin
Other processingcomponentsin PIG
Chapter 16: Linking RDBMS and HDFS using SQOOP
Using tSQOOPExport
Using tSQOOPImport
Chapter 17: Talend Data Integration Advance Concepts
Using tRunJob
Passing parametersto Sub Job
Passing context variables from Parent Job to Child Job
Passing context variables from Child Job to Parent Job
Talend for Big Data Project
*Note: - Chapter 1 – 11 deals with Data Integration perspective and Chapter 12- 16 deals with Hadoop
perspective as it is important to build foundation on Talend for Data Integration before moving to Talend
for Big Data.
For trainingrequirements,please emailme at bharat3khanna@gmail.com