SlideShare a Scribd company logo
1 of 17
Amity Institute of Information Technology
Introduction to Data Science
BSc.IT/BCA/ DUAL VI Semester
Faculty: Dr. Shambhu Kumar Jha
1
Amity Institute of Information Technology
Module I
Introduction to Big Data
Difference between Big Data and Data Science,
2
Amity Institute of Information Technology
Introduction to Big Data
let’s start by understanding what is Big Data?
 Big Data: It is large or voluminous data, information,
or the relevant statistics acquired by large
organizations and ventures from various sources.
 Many software and data storages is created and
prepared as it is difficult to compute the big data
manually.
 It is used to discover patterns and trends and make
decisions related to human behavior and interaction
technology
3
Amity Institute of Information Technology
Introduction to Big Data
Big data encompasses following wide variety of data types:
 Structured data, such as transactions and financial records;
 Unstructured data, such as email, text, documents and multimedia
files;
 Semi structured data, such as web server logs and streaming data
from sensors.
Big data is often characterized by the three V's:
• Large volume of data in many environments;
• Variety of data types frequently stored in big data systems; and
• Velocity at which much of the data is generated, collected and
processed. 4
Amity Institute of Information Technology
Why Big Data is Important
Companies use big data to :
 Improve operations,
 Provide better customer service,
 Create personalized marketing campaigns and take other actions to
increase revenue and profits.
 Competitive advantage over those that don't because they're able to
make faster and more informed business decisions.
 Example: Big data provides valuable insights into customers that
companies can use to refine their marketing, advertising and
promotions in order to increase customer engagement and conversion
rates.
 Both historical and real-time data can be analyzed to assess the
evolving preferences of consumers or corporate buyers, enabling
businesses to become more responsive to customer wants and needs.5
Amity Institute of Information Technology
Is big data part of data science?
• Big Data is essentially a special application of data science, in
which the data sets are enormous and require overcoming logistical
challenges to deal with them.
• The primary concern is efficiently capturing, storing, extracting,
processing, and analyzing information from these enormous data
sets.
• Big data is a combination of structured, semi structured and
unstructured data collected by organizations that can be mined for
information and used in machine learning projects, predictive
modeling and other advanced analytics applications.
6
Amity Institute of Information Technology
Differences between Big Data and Data
Science:
DATA SCIENCE BIG DATA
It is about the collection,
processing, analysing, and utilizing
of data in various operations.
It is about extracting vital and valuable
information from a huge amount of data.
It is a field of study just like
Computer Science, Applied
Statistics, or Applied Mathematics.
It is a technique for tracking and
discovering trends in complex data sets.
7
Amity Institute of Information Technology
Differences between Big Data and Data
Science:
The goal is to build data-dominant
products for a venture.
The goal is to make data more vital and
usable i.e. by extracting only important
information from the huge data within
existing traditional aspects.
Tools mainly used in Data Science
include SAS, R, Python, etc
Tools mostly used in Big Data include
Hadoop, Spark, Flink, etc.
It is a superset of Big Data as data
science consists of Data scrapping,
cleaning, visualization, statistics, and
many more techniques.
It is a sub-set of Data Science as mining
activities which is in a pipeline of Data
science.
8
Amity Institute of Information Technology
Differences between Big Data and Data
Science:
It is mainly used for scientific purposes.
It is mainly used for business purposes and
customer satisfaction.
It broadly focuses on the science of the data.
It is more involved with the processes of handling
voluminous data.
It is mainly used for scientific purposes.
It is mainly used for business purposes and
customer satisfaction.
9
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications Of Big Data Finance
o
o
o
o
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications of Big Data: Social Network
Social media in the current scenario is
considered as the largest data generator.
The stats have shown that around 500+
terabytes of new data get generated into the
databases of social media every day, particularly
in the case of Facebook.
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications of Big Data: Healthcare
Nowadays, doctors rely mostly on patients’
clinical records, which means that a lot of data
needs to be gathered, that too for different
patients.
Since there is a large amount of data coming
from different sources, in various formats, the
need to handle this large amount of data is
increased
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications of Big Data E-Commerce
Maintaining customer relationships is the most important in the e-
commerce industry.
E-commerce websites have different marketing ideas to retail their
merchandise to their customers, to manage transactions, and to
implement better tactics of using innovative ideas with Big Data to
improve businesses.
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications of Big Data: Education
The education sector holds a lot of information with regard to curriculum,
students, and faculty.
The information is analyzed to get insights that can enhance the operational
adequacy of the educational organization.
Collecting and analyzing information of a student such as attendance, test scores,
grades, and other issues take up a lot of data.
So, big data makes an approach for a progressive framework wherein this data
can be stored and analyzed making it easier for the institutes to work with.
Amity Institute of Information Technology
Application of Big data
Big Data in Communications
•Gaining new subscribers, retaining customers, and expanding
within current subscriber bases are top priorities for
telecommunication service providers.
•The solutions to these challenges lie in the ability to combine
and analyze the masses of customer-generated data and
machine-generated data that is being created every day.
15
Amity Institute of Information Technology
Skills Required Becoming a Data Scientist
• In-depth knowledge of SAS or R. For data science, R is generally
preferred.
• Python coding: Python is the most common coding language that is
used in data science, along with Java, Perl, and C/C++.
• Hadoop platform: Although not always a requirement, knowing the
Hadoop platform is still preferred for the field. Having some
experience in Hive or Pig is also beneficial.
• SQL database/coding: Although NoSQL and Hadoop have become a
significant part of data science, it is still preferred if you can write
and execute complex queries in SQL.
• Working with unstructured data: It is essential that a data
scientist can work with unstructured data, whether on social media,
video feeds, or audio.
16
Amity Institute of Information Technology
Skills Required Becoming a Big Data Specialist
• Analytical skills: These skills are essential for making sense of data,
and determining which data is relevant when creating reports and
looking for solutions.
• Creativity: You need to have the ability to create new methods to
gather, interpret, and analyze a data strategy. Mathematics and
statistical skills: Good, old-fashioned “number crunching” is also
necessary, be it in data science, data analytics, or big data.
• Computer science: Computers are the backbone of every data strategy.
Programmers will have a constant need to come up with algorithms to
process data into insights.
• Business skills: Big data professionals will need to have an
understanding of the business objectives that are in place, as well as
the underlying processes that drive the growth of the business and its
profits.
17

More Related Content

Similar to L3 Big Data and Application.pptx

data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptxNamrataBhatt8
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to knowV2Soft
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesBhanu Prakash
 
The Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate EnvironmentThe Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate EnvironmentIRJET Journal
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big DataSonovate
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Oomph! Recruitment
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)Shahbaz Anjam
 
Big Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperBig Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperVasu S
 
OVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdfOVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdfcareer tech
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A StudyIRJET Journal
 
Big Data why Now and where to?
Big Data why Now and where to?Big Data why Now and where to?
Big Data why Now and where to?Fady Sayah
 
06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyanIAESIJEECS
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Aditya205306
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big DataIRJET Journal
 
Learn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in KarnatakaLearn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in KarnatakaREVA University
 

Similar to L3 Big Data and Application.pptx (20)

data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptx
 
What is data science ?
What is data science ?What is data science ?
What is data science ?
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to know
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websites
 
The Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate EnvironmentThe Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate Environment
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big Data
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
Big Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperBig Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - Whitepaper
 
OVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdfOVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdf
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A Study
 
Big Data at a Glance
Big Data at a GlanceBig Data at a Glance
Big Data at a Glance
 
Big Data why Now and where to?
Big Data why Now and where to?Big Data why Now and where to?
Big Data why Now and where to?
 
Unit III.pdf
Unit III.pdfUnit III.pdf
Unit III.pdf
 
06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
 
Learn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in KarnatakaLearn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in Karnataka
 

More from Shambhavi Vats

L4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxL4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxShambhavi Vats
 
L2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxL2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxShambhavi Vats
 
L1 Introduction DS.pptx
L1 Introduction DS.pptxL1 Introduction DS.pptx
L1 Introduction DS.pptxShambhavi Vats
 
Vaastav talwar a010145020052 adbms psda vaastav talwar
Vaastav talwar a010145020052 adbms psda   vaastav talwarVaastav talwar a010145020052 adbms psda   vaastav talwar
Vaastav talwar a010145020052 adbms psda vaastav talwarShambhavi Vats
 
38 object-concepts (1)
38 object-concepts (1)38 object-concepts (1)
38 object-concepts (1)Shambhavi Vats
 
Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Shambhavi Vats
 
wool from fibre to fabric
wool from fibre to fabricwool from fibre to fabric
wool from fibre to fabricShambhavi Vats
 
road side rain water harvesting
road side rain water harvestingroad side rain water harvesting
road side rain water harvestingShambhavi Vats
 

More from Shambhavi Vats (11)

L4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxL4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptx
 
L2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxL2 DS Tools and Application.pptx
L2 DS Tools and Application.pptx
 
L1 Introduction DS.pptx
L1 Introduction DS.pptxL1 Introduction DS.pptx
L1 Introduction DS.pptx
 
Vaastav talwar a010145020052 adbms psda vaastav talwar
Vaastav talwar a010145020052 adbms psda   vaastav talwarVaastav talwar a010145020052 adbms psda   vaastav talwar
Vaastav talwar a010145020052 adbms psda vaastav talwar
 
38 object-concepts (1)
38 object-concepts (1)38 object-concepts (1)
38 object-concepts (1)
 
Bca 2b
Bca 2bBca 2b
Bca 2b
 
Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)
 
wool from fibre to fabric
wool from fibre to fabricwool from fibre to fabric
wool from fibre to fabric
 
urban livehood
urban livehoodurban livehood
urban livehood
 
road side rain water harvesting
road side rain water harvestingroad side rain water harvesting
road side rain water harvesting
 
Water
WaterWater
Water
 

Recently uploaded

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 

Recently uploaded (20)

Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 

L3 Big Data and Application.pptx

  • 1. Amity Institute of Information Technology Introduction to Data Science BSc.IT/BCA/ DUAL VI Semester Faculty: Dr. Shambhu Kumar Jha 1
  • 2. Amity Institute of Information Technology Module I Introduction to Big Data Difference between Big Data and Data Science, 2
  • 3. Amity Institute of Information Technology Introduction to Big Data let’s start by understanding what is Big Data?  Big Data: It is large or voluminous data, information, or the relevant statistics acquired by large organizations and ventures from various sources.  Many software and data storages is created and prepared as it is difficult to compute the big data manually.  It is used to discover patterns and trends and make decisions related to human behavior and interaction technology 3
  • 4. Amity Institute of Information Technology Introduction to Big Data Big data encompasses following wide variety of data types:  Structured data, such as transactions and financial records;  Unstructured data, such as email, text, documents and multimedia files;  Semi structured data, such as web server logs and streaming data from sensors. Big data is often characterized by the three V's: • Large volume of data in many environments; • Variety of data types frequently stored in big data systems; and • Velocity at which much of the data is generated, collected and processed. 4
  • 5. Amity Institute of Information Technology Why Big Data is Important Companies use big data to :  Improve operations,  Provide better customer service,  Create personalized marketing campaigns and take other actions to increase revenue and profits.  Competitive advantage over those that don't because they're able to make faster and more informed business decisions.  Example: Big data provides valuable insights into customers that companies can use to refine their marketing, advertising and promotions in order to increase customer engagement and conversion rates.  Both historical and real-time data can be analyzed to assess the evolving preferences of consumers or corporate buyers, enabling businesses to become more responsive to customer wants and needs.5
  • 6. Amity Institute of Information Technology Is big data part of data science? • Big Data is essentially a special application of data science, in which the data sets are enormous and require overcoming logistical challenges to deal with them. • The primary concern is efficiently capturing, storing, extracting, processing, and analyzing information from these enormous data sets. • Big data is a combination of structured, semi structured and unstructured data collected by organizations that can be mined for information and used in machine learning projects, predictive modeling and other advanced analytics applications. 6
  • 7. Amity Institute of Information Technology Differences between Big Data and Data Science: DATA SCIENCE BIG DATA It is about the collection, processing, analysing, and utilizing of data in various operations. It is about extracting vital and valuable information from a huge amount of data. It is a field of study just like Computer Science, Applied Statistics, or Applied Mathematics. It is a technique for tracking and discovering trends in complex data sets. 7
  • 8. Amity Institute of Information Technology Differences between Big Data and Data Science: The goal is to build data-dominant products for a venture. The goal is to make data more vital and usable i.e. by extracting only important information from the huge data within existing traditional aspects. Tools mainly used in Data Science include SAS, R, Python, etc Tools mostly used in Big Data include Hadoop, Spark, Flink, etc. It is a superset of Big Data as data science consists of Data scrapping, cleaning, visualization, statistics, and many more techniques. It is a sub-set of Data Science as mining activities which is in a pipeline of Data science. 8
  • 9. Amity Institute of Information Technology Differences between Big Data and Data Science: It is mainly used for scientific purposes. It is mainly used for business purposes and customer satisfaction. It broadly focuses on the science of the data. It is more involved with the processes of handling voluminous data. It is mainly used for scientific purposes. It is mainly used for business purposes and customer satisfaction. 9
  • 10. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications Of Big Data Finance o o o o
  • 11. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications of Big Data: Social Network Social media in the current scenario is considered as the largest data generator. The stats have shown that around 500+ terabytes of new data get generated into the databases of social media every day, particularly in the case of Facebook.
  • 12. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications of Big Data: Healthcare Nowadays, doctors rely mostly on patients’ clinical records, which means that a lot of data needs to be gathered, that too for different patients. Since there is a large amount of data coming from different sources, in various formats, the need to handle this large amount of data is increased
  • 13. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications of Big Data E-Commerce Maintaining customer relationships is the most important in the e- commerce industry. E-commerce websites have different marketing ideas to retail their merchandise to their customers, to manage transactions, and to implement better tactics of using innovative ideas with Big Data to improve businesses.
  • 14. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications of Big Data: Education The education sector holds a lot of information with regard to curriculum, students, and faculty. The information is analyzed to get insights that can enhance the operational adequacy of the educational organization. Collecting and analyzing information of a student such as attendance, test scores, grades, and other issues take up a lot of data. So, big data makes an approach for a progressive framework wherein this data can be stored and analyzed making it easier for the institutes to work with.
  • 15. Amity Institute of Information Technology Application of Big data Big Data in Communications •Gaining new subscribers, retaining customers, and expanding within current subscriber bases are top priorities for telecommunication service providers. •The solutions to these challenges lie in the ability to combine and analyze the masses of customer-generated data and machine-generated data that is being created every day. 15
  • 16. Amity Institute of Information Technology Skills Required Becoming a Data Scientist • In-depth knowledge of SAS or R. For data science, R is generally preferred. • Python coding: Python is the most common coding language that is used in data science, along with Java, Perl, and C/C++. • Hadoop platform: Although not always a requirement, knowing the Hadoop platform is still preferred for the field. Having some experience in Hive or Pig is also beneficial. • SQL database/coding: Although NoSQL and Hadoop have become a significant part of data science, it is still preferred if you can write and execute complex queries in SQL. • Working with unstructured data: It is essential that a data scientist can work with unstructured data, whether on social media, video feeds, or audio. 16
  • 17. Amity Institute of Information Technology Skills Required Becoming a Big Data Specialist • Analytical skills: These skills are essential for making sense of data, and determining which data is relevant when creating reports and looking for solutions. • Creativity: You need to have the ability to create new methods to gather, interpret, and analyze a data strategy. Mathematics and statistical skills: Good, old-fashioned “number crunching” is also necessary, be it in data science, data analytics, or big data. • Computer science: Computers are the backbone of every data strategy. Programmers will have a constant need to come up with algorithms to process data into insights. • Business skills: Big data professionals will need to have an understanding of the business objectives that are in place, as well as the underlying processes that drive the growth of the business and its profits. 17

Editor's Notes

  1. 1
  2. The data generated mainly consist of videos, photos, message exchanges, etc. A single activity on any social media site generates a lot of data which is again stored and gets processed whenever required. Since the data stored is in terabytes, it would take a lot of time for processing if it is done by our legacy systems. Big Data is a solution to this problem.