SlideShare ist ein Scribd-Unternehmen logo
1 von 24
THE FUTURE REVOLUTION ON
BIG DATA
SUBMITTED TO: SUBMITTED BY:
DR.BHUMIKA GUPTA ARPIT ARORA
CS-I(130214)
1
PRESENTATION ON:
CONTENTS:
 INTRODUCTION
 TYPES OF DATA
 WHAT DOES BIG DATA LOOKS LIKE ?
 THE FOUR V’s
 THE BIG DATA TREND AND GROWTH
 DATA FROM TRILLIONS OF SENSORS AND INTERNET
 BIG DATA AND BIGGER OPPORTUNITY
 BIG DATA SOURCES
 BIG DATA PROBLEMS OR BIG DATA OPPORTUNITY
 DATA PROCESSING AND ANALYTICS
 NEW APPROACHES FOR PROCESSING BIG DATA
 CONCLUSIONS
2
INTRODUCTION
 Big data is a popular, but poorly defined marketing buzzword.
 “Big data is a collection of data sets so large and complex that it becomes
difficult to process using on-hand database management tools or traditional
data processing applications.”
 Big data is data that exceeds the processing capacity of conventional
database systems.
 The data is too big, moves too fast, or doesn’t fit the structures of your
database architectures.
 To gain value from this data, you must choose an alternative way to process it.
3
TYPES OF DATA
Big Data is made of structured and unstructured information. The term structured
data and unstructured data refers to that is identifiable based on it is organized in
a structure or not.
4
TYPES OF DATA
STRUCTURED DATA
 The most common form of
Structured data or Structured data
records is a database where
specific information is stored
based on a methodology of
columns and rows.
 Structured data is also searchable
by data type within content.
 Structured data is understood by
computers and is also efficiently
organized for human readers.
 Relational databases and
spreadsheets are examples of
structured data.
 Structured information is the data
in databases and is about 10% of
the story.
UNSTRUCTURED DATA
 The term Unstructured Data refers
to information that either does not
have a pre-defined data model
and/or does not fit well into
relational tables.
 The term unstructured data refers
to any data that has no
identifiable structure.
 Unstructured Data is “human
information” like emails, videos,
tweets, Facebook posts, call
centre conversations, closed
circuit TV footage, mobile phone
calls, website clicks.
 Unstructured information is 90% of
Big Data today.
5
WHAT DOES BIG DATA LOOK LIKE ?
 In 2012, Gartner formalized their Big Data definition as a "3V" framework -
high Volume, high Velocity and high Variety information asset.
 The IBM adds a fourth "V" of Veracity to add trust and noise filtering to
the challenge of Big Data analysis.
 Input data to big data systems could be chatter from social networks,
web server logs, traffic flow sensors, satellite imagery, broadcast audio
streams, banking transactions, MP3s of rock music ,GPS trails etc.
6
THE FOUR V’S
VOLUME:
 There are many factors that contribute to the increase in data volume –
transaction-based data stored through the years, text data constantly
streaming in from social media, increasing amounts of sensor data being
collected, etc.
 In the past, excessive data volume created a storage issue.
 But with today's decreasing storage costs, other issues emerge, including how
to determine relevance among the large volumes of data.
VARIETY:
 Today data comes in all types of formats – from traditional databases to
hierarchical data stores created by end users to text documents, email, meter-
collected data, video, audio, stock ticker data and financial transactions.
 Different browsers send different data and they may be using different software
versions or vendors to communicate with you.
7
VELOCITY:
 Velocity "means both how fast data is being produced and how fast the data
must be processed to meet demand".
 The Retailers who are able to quickly utilize that information, by recommending
additional purchases, for instance, gain competitive advantage.
 The Smartphone era increases again the rate of data inflow, as consumers carry
with them a streaming source of reallocated images and audio data.
VERACITY:
 Veracity deals with uncertain or imprecise data.
 When we start talking about social media data like Tweets, Facebook posts,
etc. how much faith should we put in the data.
 Due to the velocity of data like stock trades, machine/sensor generated events,
you cannot spend the time to “cleanse” it and get rid of the uncertainty, so you
must process it as is understanding the uncertainty in the data.
8
THE BIG DATA TREND AND GROWTH
 Big Data is only getting bigger .90% of the data in the world today was
created within the last two years.
 Companies worldwide are already implementing their plans to store and
analyze big data.
 The majority of IT decision-makers surveyed work for companies in the
manufacturing, healthcare, financial services industries, and other
companies.(Graph shown below..)
 Only 13 percent of those have a big data solution that is fully implemented.
9
.
10
TRENDS OF BIG DATA
11
 It is projected that within a few years we will have to talk in Bronto
bytes (10^27) when we discuss data coming from sensors.
 The Internet of Things, or Machine to Machine(M2M) communication,
connects billions of devices with each other and thereby generates
an unfathomable amount of data.
 In 2020, 40% of all data in the world will be M2M data.
 Sensor data, or M2M data, is data from readings made by machine
sensors that measure pre-set conditions at regular intervals.
 Examples of the data that is collected include log data, geo
location data, CPU utilization, temperature, rules, etc.
DATA FROM TRILLIONS OF SENSORS AND
INTERNET
12
BIG DATA ON SENSORS
13
 Even though "Big Data" has now been around for a few years, the
opportunities for startups seem to keep growing, just as the amount of
data keeps growing.
 According to IBM, companies have captured more data in the last
two years than in the previous 2000 years.
 This data comes from sensors, social media posts, digital pictures and
videos, purchase transactions, everywhere.
 Every day, we create 2.5 quintillion bytes of data, much of it
unstructured and far beyond the capability of conventional
databases.
 Hence one segment of the opportunity is the need for new database
technologies, like Hadoop, a distributed file system originally designed
for indexing the Web.
BIG DATA AND BIGGER OPPORTUNITY
14
BIG DATA SOURCES
15
BIG DATA SOURCES INCLUDE:
 Social Networking and Media: There are currently over 900 million Facebook
users, 250 million Twitter users and 156 million public blogs. Each Facebook
update, Tweet, blog post and comment creates multiple new data points.
 Mobile Devices: There are over 5 billion mobile phones in use worldwide. Mobile
devices, particularly smart phones and tablets, also make it easier to use social
media and other data-generating applications.
 Internet Transactions: Billions of online purchases, stock trades and other
transactions happen every day. Each creates a number of data points collected
by retailers, banks, credit cards, credit agencies and others.
 Networked Devices and Sensors: Electronic devices including servers and
other IT hardware, temperature sensors etc., create semi-structured log data that
record every action.
 Big Data is the Future of Healthcare: With big data poised to change the
healthcare, organizations need to understand this phenomenon and realize the
envisioned benefits.
16
 The advent of the Web, mobile devices and other technologies has caused a
fundamental change to the nature of data.
 Big Data has qualities that differentiate it from traditional corporate data.
Centralized, highly structured and easily manageable.
 Now data is highly distributed, loosely structured, and increasingly large in
volume.
BIG DATA PROBLEMS OR BIG DATA OPPORTUNITY
DATA PROCESSING AND ANALYTICS
THE TRADITIONAL WAY
17
NEW APPROACHES FOR PROCESSING BIG DATA
HADOOP
 Hadoop is an open source framework for processing, storing and
analyzing massive amounts of distributed unstructured data.
Originally created by Doug Cutting at Yahoo!
 It was designed to handle petabytes and exabytes of data
distributed over multiple nodes in parallel.
 Hadoop is now a project of the Apache Software Foundation,
where hundreds of contributors continuously improve the core
technology.
 Hadoop breaks up Big Data into multiple parts so each part can
be processed and analyzed at the same time.
18
NoSQL
 A related new style of database called NoSQL (Not Only SQL)
has emerged to, like Hadoop, process large volumes of multi-
structured data.
 However, whereas Hadoop is adept at supporting large-scale,
batch-style historical analysis and NoSQL databases are aimed,
for the most part at serving up discrete data stored among large
volumes of multi-structured data to end-user and automated Big
Data applications.
19
HYBRID BIG DATA INTEGERATION
 So while some companies may specifically be looking for Hadoop
or NoSQL data integration, others are looking for a comprehensive
solution that will connect on the back end to multiple big data
sources as well as their enterprise data stores.
 An all-round big data integration offering will enable you to input,
output, manipulate and report on data using Hadoop and NoSQL
stores, including: Apache Cassandra, Hadoop, NoSQL databases
and traditional data stores etc.
20
CONCLUSIONS
 New systems using big data will extend, and possibly replace, our
traditional DBMS’s.
 There is no question that there is enough data available that traditional
database management systems will be defeated completely.
 Cloud computing may also prove valuable for big data.
 Currently available systems for health care domains and Social Media
and Retail limits the functionalities due to the ever increases in
demands.
 As a result the integration of new technologies is necessary to cope up
with on-demand.
21
THE END
22
?
ANY QUERIES
23
THANK YOU
24

Weitere ähnliche Inhalte

Was ist angesagt?

BBDO Proximity: Big-data May 2013
BBDO Proximity: Big-data May 2013BBDO Proximity: Big-data May 2013
BBDO Proximity: Big-data May 2013Brian Crotty
 
Big Data in the Fund Industry: From Descriptive to Prescriptive Data Analytics
Big Data in the Fund Industry: From Descriptive to Prescriptive Data AnalyticsBig Data in the Fund Industry: From Descriptive to Prescriptive Data Analytics
Big Data in the Fund Industry: From Descriptive to Prescriptive Data AnalyticsBroadridge
 
Age Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAge Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAgeFriendlyEconomy
 
Big data 2017 final
Big data 2017   finalBig data 2017   final
Big data 2017 finalAmjid Ali
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)Shahbaz Anjam
 
Notes from the Observation Deck // A Data Revolution
Notes from the Observation Deck // A Data Revolution Notes from the Observation Deck // A Data Revolution
Notes from the Observation Deck // A Data Revolution gngeorge
 
Implementation of application for huge data file transfer
Implementation of application for huge data file transferImplementation of application for huge data file transfer
Implementation of application for huge data file transferijwmn
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Yaman Hajja, Ph.D.
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaSkillspeed
 
Team 2 Big Data Presentation
Team 2 Big Data PresentationTeam 2 Big Data Presentation
Team 2 Big Data PresentationMatthew Urdan
 

Was ist angesagt? (19)

Big data ppt
Big data pptBig data ppt
Big data ppt
 
Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
BBDO Proximity: Big-data May 2013
BBDO Proximity: Big-data May 2013BBDO Proximity: Big-data May 2013
BBDO Proximity: Big-data May 2013
 
Big Data in the Fund Industry: From Descriptive to Prescriptive Data Analytics
Big Data in the Fund Industry: From Descriptive to Prescriptive Data AnalyticsBig Data in the Fund Industry: From Descriptive to Prescriptive Data Analytics
Big Data in the Fund Industry: From Descriptive to Prescriptive Data Analytics
 
Age Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAge Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big Data
 
Big data 2017 final
Big data 2017   finalBig data 2017   final
Big data 2017 final
 
Big Data Trends
Big Data TrendsBig Data Trends
Big Data Trends
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big data
Big dataBig data
Big data
 
Big data-analytics-ebook
Big data-analytics-ebookBig data-analytics-ebook
Big data-analytics-ebook
 
Notes from the Observation Deck // A Data Revolution
Notes from the Observation Deck // A Data Revolution Notes from the Observation Deck // A Data Revolution
Notes from the Observation Deck // A Data Revolution
 
Implementation of application for huge data file transfer
Implementation of application for huge data file transferImplementation of application for huge data file transfer
Implementation of application for huge data file transfer
 
Bigdata.pptx
Bigdata.pptxBigdata.pptx
Bigdata.pptx
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social Media
 
Team 2 Big Data Presentation
Team 2 Big Data PresentationTeam 2 Big Data Presentation
Team 2 Big Data Presentation
 

Andere mochten auch

RFID Application
RFID ApplicationRFID Application
RFID Applicationelenaleelee
 
FINAL MARKETING PLAN
FINAL MARKETING PLAN FINAL MARKETING PLAN
FINAL MARKETING PLAN Meghan Brieck
 
Zones Corporate Value
Zones Corporate ValueZones Corporate Value
Zones Corporate ValueRobert Downs
 
Research_Review_Smoking Cessation.docx
Research_Review_Smoking Cessation.docxResearch_Review_Smoking Cessation.docx
Research_Review_Smoking Cessation.docxKelsey Chen
 
B2b lead generation process by Vista lead generation
B2b lead generation process by Vista lead generationB2b lead generation process by Vista lead generation
B2b lead generation process by Vista lead generationVista Lead Generation
 
Creative journal
Creative journalCreative journal
Creative journalHuynh Le
 

Andere mochten auch (9)

Google search
Google searchGoogle search
Google search
 
RFID Application
RFID ApplicationRFID Application
RFID Application
 
FINAL MARKETING PLAN
FINAL MARKETING PLAN FINAL MARKETING PLAN
FINAL MARKETING PLAN
 
FinalPres
FinalPresFinalPres
FinalPres
 
Brew & Chew Final
Brew & Chew FinalBrew & Chew Final
Brew & Chew Final
 
Zones Corporate Value
Zones Corporate ValueZones Corporate Value
Zones Corporate Value
 
Research_Review_Smoking Cessation.docx
Research_Review_Smoking Cessation.docxResearch_Review_Smoking Cessation.docx
Research_Review_Smoking Cessation.docx
 
B2b lead generation process by Vista lead generation
B2b lead generation process by Vista lead generationB2b lead generation process by Vista lead generation
B2b lead generation process by Vista lead generation
 
Creative journal
Creative journalCreative journal
Creative journal
 

Ähnlich wie 130214 copy

Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.saranya270513
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notesMohit Saini
 
Idc big data whitepaper_final
Idc big data whitepaper_finalIdc big data whitepaper_final
Idc big data whitepaper_finalOsman Circi
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICSNAGARAJAGIDDE
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Hritika Raj
 
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Stuart Blair
 
Big data Seminar/Presentation
Big data Seminar/PresentationBig data Seminar/Presentation
Big data Seminar/PresentationKirtimaan Chhabra
 
Analysis on big data concepts and applications
Analysis on big data concepts and applicationsAnalysis on big data concepts and applications
Analysis on big data concepts and applicationsIJARIIT
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataAkshata Humbe
 
Big data seminor
Big data seminorBig data seminor
Big data seminorberasrujana
 

Ähnlich wie 130214 copy (20)

Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Idc big data whitepaper_final
Idc big data whitepaper_finalIdc big data whitepaper_final
Idc big data whitepaper_final
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Big data Analytics
Big data Analytics Big data Analytics
Big data Analytics
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
 
new.pptx
new.pptxnew.pptx
new.pptx
 
Big data
Big dataBig data
Big data
 
Business with Big data
Business with Big dataBusiness with Big data
Business with Big data
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
 
Big data Ppt
Big data PptBig data Ppt
Big data Ppt
 
Big data
Big data Big data
Big data
 
Big data-ppt-
Big data-ppt-Big data-ppt-
Big data-ppt-
 
Monetize Big Data
Monetize Big DataMonetize Big Data
Monetize Big Data
 
Big data Seminar/Presentation
Big data Seminar/PresentationBig data Seminar/Presentation
Big data Seminar/Presentation
 
Analysis on big data concepts and applications
Analysis on big data concepts and applicationsAnalysis on big data concepts and applications
Analysis on big data concepts and applications
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
 
Big Data - CRM's Promise Land
Big Data - CRM's Promise LandBig Data - CRM's Promise Land
Big Data - CRM's Promise Land
 

Kürzlich hochgeladen

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 

Kürzlich hochgeladen (20)

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 

130214 copy

  • 1. THE FUTURE REVOLUTION ON BIG DATA SUBMITTED TO: SUBMITTED BY: DR.BHUMIKA GUPTA ARPIT ARORA CS-I(130214) 1 PRESENTATION ON:
  • 2. CONTENTS:  INTRODUCTION  TYPES OF DATA  WHAT DOES BIG DATA LOOKS LIKE ?  THE FOUR V’s  THE BIG DATA TREND AND GROWTH  DATA FROM TRILLIONS OF SENSORS AND INTERNET  BIG DATA AND BIGGER OPPORTUNITY  BIG DATA SOURCES  BIG DATA PROBLEMS OR BIG DATA OPPORTUNITY  DATA PROCESSING AND ANALYTICS  NEW APPROACHES FOR PROCESSING BIG DATA  CONCLUSIONS 2
  • 3. INTRODUCTION  Big data is a popular, but poorly defined marketing buzzword.  “Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications.”  Big data is data that exceeds the processing capacity of conventional database systems.  The data is too big, moves too fast, or doesn’t fit the structures of your database architectures.  To gain value from this data, you must choose an alternative way to process it. 3
  • 4. TYPES OF DATA Big Data is made of structured and unstructured information. The term structured data and unstructured data refers to that is identifiable based on it is organized in a structure or not. 4
  • 5. TYPES OF DATA STRUCTURED DATA  The most common form of Structured data or Structured data records is a database where specific information is stored based on a methodology of columns and rows.  Structured data is also searchable by data type within content.  Structured data is understood by computers and is also efficiently organized for human readers.  Relational databases and spreadsheets are examples of structured data.  Structured information is the data in databases and is about 10% of the story. UNSTRUCTURED DATA  The term Unstructured Data refers to information that either does not have a pre-defined data model and/or does not fit well into relational tables.  The term unstructured data refers to any data that has no identifiable structure.  Unstructured Data is “human information” like emails, videos, tweets, Facebook posts, call centre conversations, closed circuit TV footage, mobile phone calls, website clicks.  Unstructured information is 90% of Big Data today. 5
  • 6. WHAT DOES BIG DATA LOOK LIKE ?  In 2012, Gartner formalized their Big Data definition as a "3V" framework - high Volume, high Velocity and high Variety information asset.  The IBM adds a fourth "V" of Veracity to add trust and noise filtering to the challenge of Big Data analysis.  Input data to big data systems could be chatter from social networks, web server logs, traffic flow sensors, satellite imagery, broadcast audio streams, banking transactions, MP3s of rock music ,GPS trails etc. 6
  • 7. THE FOUR V’S VOLUME:  There are many factors that contribute to the increase in data volume – transaction-based data stored through the years, text data constantly streaming in from social media, increasing amounts of sensor data being collected, etc.  In the past, excessive data volume created a storage issue.  But with today's decreasing storage costs, other issues emerge, including how to determine relevance among the large volumes of data. VARIETY:  Today data comes in all types of formats – from traditional databases to hierarchical data stores created by end users to text documents, email, meter- collected data, video, audio, stock ticker data and financial transactions.  Different browsers send different data and they may be using different software versions or vendors to communicate with you. 7
  • 8. VELOCITY:  Velocity "means both how fast data is being produced and how fast the data must be processed to meet demand".  The Retailers who are able to quickly utilize that information, by recommending additional purchases, for instance, gain competitive advantage.  The Smartphone era increases again the rate of data inflow, as consumers carry with them a streaming source of reallocated images and audio data. VERACITY:  Veracity deals with uncertain or imprecise data.  When we start talking about social media data like Tweets, Facebook posts, etc. how much faith should we put in the data.  Due to the velocity of data like stock trades, machine/sensor generated events, you cannot spend the time to “cleanse” it and get rid of the uncertainty, so you must process it as is understanding the uncertainty in the data. 8
  • 9. THE BIG DATA TREND AND GROWTH  Big Data is only getting bigger .90% of the data in the world today was created within the last two years.  Companies worldwide are already implementing their plans to store and analyze big data.  The majority of IT decision-makers surveyed work for companies in the manufacturing, healthcare, financial services industries, and other companies.(Graph shown below..)  Only 13 percent of those have a big data solution that is fully implemented. 9
  • 11. 11  It is projected that within a few years we will have to talk in Bronto bytes (10^27) when we discuss data coming from sensors.  The Internet of Things, or Machine to Machine(M2M) communication, connects billions of devices with each other and thereby generates an unfathomable amount of data.  In 2020, 40% of all data in the world will be M2M data.  Sensor data, or M2M data, is data from readings made by machine sensors that measure pre-set conditions at regular intervals.  Examples of the data that is collected include log data, geo location data, CPU utilization, temperature, rules, etc. DATA FROM TRILLIONS OF SENSORS AND INTERNET
  • 12. 12 BIG DATA ON SENSORS
  • 13. 13  Even though "Big Data" has now been around for a few years, the opportunities for startups seem to keep growing, just as the amount of data keeps growing.  According to IBM, companies have captured more data in the last two years than in the previous 2000 years.  This data comes from sensors, social media posts, digital pictures and videos, purchase transactions, everywhere.  Every day, we create 2.5 quintillion bytes of data, much of it unstructured and far beyond the capability of conventional databases.  Hence one segment of the opportunity is the need for new database technologies, like Hadoop, a distributed file system originally designed for indexing the Web. BIG DATA AND BIGGER OPPORTUNITY
  • 15. 15 BIG DATA SOURCES INCLUDE:  Social Networking and Media: There are currently over 900 million Facebook users, 250 million Twitter users and 156 million public blogs. Each Facebook update, Tweet, blog post and comment creates multiple new data points.  Mobile Devices: There are over 5 billion mobile phones in use worldwide. Mobile devices, particularly smart phones and tablets, also make it easier to use social media and other data-generating applications.  Internet Transactions: Billions of online purchases, stock trades and other transactions happen every day. Each creates a number of data points collected by retailers, banks, credit cards, credit agencies and others.  Networked Devices and Sensors: Electronic devices including servers and other IT hardware, temperature sensors etc., create semi-structured log data that record every action.  Big Data is the Future of Healthcare: With big data poised to change the healthcare, organizations need to understand this phenomenon and realize the envisioned benefits.
  • 16. 16  The advent of the Web, mobile devices and other technologies has caused a fundamental change to the nature of data.  Big Data has qualities that differentiate it from traditional corporate data. Centralized, highly structured and easily manageable.  Now data is highly distributed, loosely structured, and increasingly large in volume. BIG DATA PROBLEMS OR BIG DATA OPPORTUNITY
  • 17. DATA PROCESSING AND ANALYTICS THE TRADITIONAL WAY 17
  • 18. NEW APPROACHES FOR PROCESSING BIG DATA HADOOP  Hadoop is an open source framework for processing, storing and analyzing massive amounts of distributed unstructured data. Originally created by Doug Cutting at Yahoo!  It was designed to handle petabytes and exabytes of data distributed over multiple nodes in parallel.  Hadoop is now a project of the Apache Software Foundation, where hundreds of contributors continuously improve the core technology.  Hadoop breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time. 18
  • 19. NoSQL  A related new style of database called NoSQL (Not Only SQL) has emerged to, like Hadoop, process large volumes of multi- structured data.  However, whereas Hadoop is adept at supporting large-scale, batch-style historical analysis and NoSQL databases are aimed, for the most part at serving up discrete data stored among large volumes of multi-structured data to end-user and automated Big Data applications. 19
  • 20. HYBRID BIG DATA INTEGERATION  So while some companies may specifically be looking for Hadoop or NoSQL data integration, others are looking for a comprehensive solution that will connect on the back end to multiple big data sources as well as their enterprise data stores.  An all-round big data integration offering will enable you to input, output, manipulate and report on data using Hadoop and NoSQL stores, including: Apache Cassandra, Hadoop, NoSQL databases and traditional data stores etc. 20
  • 21. CONCLUSIONS  New systems using big data will extend, and possibly replace, our traditional DBMS’s.  There is no question that there is enough data available that traditional database management systems will be defeated completely.  Cloud computing may also prove valuable for big data.  Currently available systems for health care domains and Social Media and Retail limits the functionalities due to the ever increases in demands.  As a result the integration of new technologies is necessary to cope up with on-demand. 21