The document provides an overview of tasks and skills to learn for a career in data science and analytics. It lists technologies like SQL Server, Linux, networking protocols, Python, TensorFlow, Kafka, Terraform, and tools like Tableau. It also mentions companies in Pakistan and Dubai to explore for work opportunities and lists top companies employing data scientists in Dubai. Finally, it provides some YouTube video links on related topics like Spark vs Hadoop, data center standards, and networking fundamentals.
2. Tasks To Do
• Install SQL Server and communicate via some client
• Cloud Deployments
• Understand Linux Architecture and basic commands
• Understand IP Addressing
• Understand Hypervisor
• Understand protocols: DNS, DHCP, HTTP, SSL, TLS, HTTPS, FTP, SMTP
• Master Python & TENSORFLOW
• What are microservices, and how do they differ from an API?
• HashiCorp’s TERRAFORM
• Study the Bahria research groups: https://bahria.edu.pk/oric/
3. Companies to work for in the future
• Ublox Lahore https://www.u-blox.com/en/job-openings
• NETSOL
• TERESOL
• CONTOUR SOFTWARE https://contour-software.com/careers/#Jobs
• TERADATA
http://nicat.pk/
4. DS Tools and Requirement
Tool            Requirement
KAFKA           Big Data messaging
HADOOP          Big Data online storage
APACHE SPARK    Big Data stream handling in real time
TABLEAU
TERRAFORM       Multi-cloud management through code
DOCKER
KUBERNETES
TENSORFLOW      Low-level software library created by Google to implement ML models and solve complex numerical problems
KERAS           High-level deep learning API in Python for easy implementation and computation of neural networks
PYTORCH         Low-level API developed by Facebook for NLP and computer vision; a more powerful version of NumPy
18. TOP COMPANIES WORKING IN DATA SCIENCE IN DUBAI
• Eurasian Resources Group - ERG
• Cobblestone
• Kognitiv Corporation
• Careem
• First Abu Dhabi Bank
• VISA
• DATABUZZ LTD
• Foodics
• nybl
• Constellation Software, Inc.
• The Emirates Group
• TRANSFERWISE
• TMC
• Binance.US
• Al Futtaim
• Agility
• ARTEFACT
• MARS
• Landmark Group
• Amazon Middle East and NA
• UHRS
• RAK BANK
• Millennium Plaza Hotel Dubai
• Standard Chartered Bank
• Affaan Technologies
• Siemens
• DataRobot
• Arthur Lawrence
• Parsons International
• Manipal Academy of Higher Education, Dubai
• GMG
• WOW AI LLC
• APCO Worldwide
• Accenture
• BlackSky
• Swvl
• Dataiku
• Emirates NBD
• Procter & Gamble
• Zayed University
• Mastercard
77. Cluster Computing / Programming
• A computer cluster is a set of computers that work together so that
they can be viewed as a single system. Unlike grid computers,
computer clusters have each node set to perform the same task,
controlled and scheduled by software
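As a rough illustration of "each node performs the same task", the sketch below splits a data set into partitions and runs one function on every partition. It is only a single-machine stand-in: worker threads play the role of cluster nodes, and the names `count_words` and `run_on_cluster` are made up for this example.

```python
from concurrent.futures import ThreadPoolExecutor


def count_words(partition):
    """The single task every node runs, each on its own slice of the data."""
    return sum(len(line.split()) for line in partition)


def run_on_cluster(lines, num_nodes):
    """Split the data into one partition per node and combine the partial results."""
    partitions = [lines[i::num_nodes] for i in range(num_nodes)]
    # Worker threads stand in for cluster nodes; a real cluster uses separate machines.
    with ThreadPoolExecutor(max_workers=num_nodes) as scheduler:
        partial_counts = list(scheduler.map(count_words, partitions))
    return sum(partial_counts)


lines = ["spark handles streams", "hadoop handles batches", "kafka moves messages"] * 100
total = run_on_cluster(lines, num_nodes=4)
print(total)
```

The scheduling software here is just the executor; the point is that the caller sees one result, as if a single system had done the work.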
85. The processing / computing requirement is either
- too large, or
- takes too long
on standard computers
86. If a task takes 3 months on an ON-PREMISE cluster of 16 PCs with 4-core processors
each (= 64 processing cores), then the same task can be done in just 16 hours
on 125,000 cores in the cloud at the same or no incremental cost.
>>> CLOUD computing benefits
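The reasoning behind this comparison can be checked with core-hour arithmetic. The sketch below assumes a 30-day month and ideal linear scaling (work divides evenly across cores with no overhead); both are assumptions for illustration and are not stated on the slide.

```python
HOURS_PER_DAY = 24
DAYS_PER_MONTH = 30  # assumption: a 30-day month


def core_hours(cores, hours):
    """Total work expressed as cores multiplied by wall-clock hours."""
    return cores * hours


on_prem_cores = 16 * 4                                  # 16 PCs x 4 cores each
on_prem_hours = 3 * DAYS_PER_MONTH * HOURS_PER_DAY      # 3 months of wall-clock time
work = core_hours(on_prem_cores, on_prem_hours)         # total work in core-hours

cloud_cores = 125_000
ideal_cloud_hours = work / cloud_cores                  # wall-clock time under ideal scaling

print(f"work: {work} core-hours")
print(f"ideal time on {cloud_cores} cores: {ideal_cloud_hours:.2f} hours")
```

Under these assumptions the job is 138,240 core-hours, which 125,000 cores would finish in about 1.1 hours. With per-core-hour pricing, the same number of core-hours costs the same whether it is spread over months or hours, which is the cost argument. The 16-hour figure on the slide is well above this ideal bound, so it is best read as an illustrative number rather than one derived from the 64-core case.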
146. The most widely-used engine for scalable computing
Thousands of companies, including 80% of the Fortune 500, use Apache Spark™.
Over 2,000 contributors to the open source project from industry and academia.
156. HADOOP (MapReduce) is batch processing only,
but SPARK also supports real-time (stream) processing.
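The difference between the two processing styles can be sketched in plain Python. This is not Hadoop or Spark code; `batch_word_count` and `streaming_word_count` are hypothetical names used only to contrast "process everything once the data is complete" with "update the result as each record arrives".

```python
from collections import Counter


def batch_word_count(records):
    """Batch style: the full data set is available before processing starts."""
    counts = Counter()
    for record in records:
        counts.update(record.split())
    return counts


def streaming_word_count(record_stream):
    """Streaming style: emit an updated result after every record that arrives."""
    counts = Counter()
    for record in record_stream:
        counts.update(record.split())
        yield dict(counts)


records = ["error disk", "ok", "error net"]

batch_result = batch_word_count(records)                    # one answer at the end
stream_results = list(streaming_word_count(iter(records)))  # one answer per record

print(batch_result["error"])
print(stream_results[0])
```

Both styles reach the same final counts; the streaming version simply makes intermediate results available while data is still arriving.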
186. What is KAFKA (explained again)
• A messaging system
• Simplifies management of data pipelines
• Retains messages even when a pipeline is disrupted by a network issue
• Any sink of the messaging system can subscribe to a data pipeline
• Supports both queue and publish-subscribe models
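The points above can be modelled with a small in-memory toy. This is not the Kafka client API: `ToyTopic`, `publish`, and `poll` are hypothetical names, and the toy keeps messages forever, whereas a real broker applies a retention policy. It only shows how a retained log plus per-group read positions gives both behaviours: every subscribing group sees every message, and within one group each message is read once.

```python
from collections import defaultdict


class ToyTopic:
    """In-memory stand-in for a topic: an append-only log plus per-group offsets."""

    def __init__(self):
        self.log = []                    # messages are retained, not deleted on read
        self.offsets = defaultdict(int)  # next position to read, per consumer group

    def publish(self, message):
        self.log.append(message)

    def poll(self, group):
        """Return the messages this group has not read yet and advance its offset."""
        start = self.offsets[group]
        messages = self.log[start:]
        self.offsets[group] = len(self.log)
        return messages


topic = ToyTopic()
topic.publish("order-1")
topic.publish("order-2")

billing_first = topic.poll("billing")    # billing reads what is there so far
topic.publish("order-3")                 # published while analytics is offline
analytics_all = topic.poll("analytics")  # a late subscriber still gets everything
billing_next = topic.poll("billing")     # billing only gets what it has not seen

print(billing_first, analytics_all, billing_next)
```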
203. What is Terraform in Hindi/Urdu | Lec-01 | Terraform tutorial for beginners | Infrastructure as Code - YouTube
216. ⭐Why Data Science is important?
Data Science is taking over each and every industry domain.
Machine Learning and especially Deep Learning are the most
important aspects of Data Science that are being deployed
everywhere from search engines to online movie
recommendations. Taking the Intellipaat Data Science training
& Data Science Course can help professionals to build a solid
career in a rising technology domain and get the best jobs in
top organizations.
⭐Why Artificial Intelligence is important?
Artificial Intelligence is taking over each and every industry
domain. Machine Learning and especially Deep Learning are
the most important aspects of Artificial Intelligence that are
being deployed everywhere from search engines to online
movie recommendations. Taking the Intellipaat deep learning
training & Artificial Intelligence Course can help professionals
to build a solid career in a rising technology domain and get the
best jobs in top organizations.
217. Data Science vs Artificial Intelligence | DS vs AI | Intellipaat - YouTube
240. What is Spring Boot? A Java framework commonly used to build and run microservices
245. === Networking Fundamentals - Module 1 ===
Lesson 1 - Network Devices
Part 1: https://youtu.be/bj-Yfakjllc
Part 2: https://youtu.be/H7-NR3Q3BeI
Lesson 2 - OSI Model
Part 1: https://youtu.be/LkolbURrtTs
Part 2: https://youtu.be/0aGqGKrRE0g
Lesson 3 - Everything Hosts do to speak on the Internet
Part 1: https://youtu.be/gYN2qN11-wE
Part 2: https://youtu.be/JI9Zm2tbUoE
Lesson 4 - Everything Switches do to facilitate communication
Part 1: https://youtu.be/AhOU2eOpmX0
Part 2: https://youtu.be/G7GyWjJtjNs
Lesson 5 - Everything Routers do to facilitate communication
Part 1: https://youtu.be/AzXys5kxpAM
Part 2: https://youtu.be/Ep-x_6kggKA
Part 3: https://youtu.be/zmxLg4jV0ts
Lesson 6 - Networking Protocols - https://youtu.be/E5bSumTAHZE
ARP, FTP, SMTP, HTTP, SSL, TLS, HTTPS, DNS, DHCP - Four items MUST be configured for Internet connectivity
Lesson 7 - How Data moves through the Internet - https://youtu.be/YJGGYKAV4pA
Interview question: What happens when you type "site.com" into a web browser?