SlideShare ist ein Scribd-Unternehmen logo
1 von 14
3 pillars of big data :
Structured Data
Semi Structured Data
Unstructured Data
By Prowebscraper
What is big data ?
- Big data stands for large volumes of data.
- It could be both structured and unstructured.
- It’s not about the amount of data but what
businesses do with the given data that counts.
- Businesses leverage on big data for insights that
can propel their growth.
Example of Big Data
Following are some of the examples of 'Big Data' :
- Facebook users send roughly 31.25 million
messages and watch 2.77 million videos, every
minute!!
- Walmart customers’ transactions provide the
company with about 2.5 petabytes of data, every
hour.
Example of Big Data
- YouTube usage jumped three times from 2014-
2016 with users uploading 400 hours of new
video each minute of every day! Now, in 2017,
users are watching 4,146,600 videos every
minute.
- Instagram users upload 46,740 million posts
every minute!
- 5.2 BILLION daily Google Searches in 2017.
As you can see, there are 3 pillars of Big Data :
1. Structured data
2. Unstructured data
3. Semi structured data
1. Structured data
- Structured Data refer to the data which is already
stored in databases, in an ordered manner.
- It accounts for about 20% of the total existing data,
and is used the most in programming and computer-
related activities.
Example of Structured data :
- Meta-data (Time and date of creation, File size,
Author etc.)
- Library Catalogues (date, author, place, subject, etc)
- Census records (birth, income, employment, place
etc.)
- Economic data (GDP, PPI, ASX etc.)
2. Unstructured data :
- Any data with unknown form or the structure is
classified as unstructured data. In addition to the
size being huge, unstructured data poses multiple
challenges in terms of its processing for deriving
value out of it.
- The rest of the data created, about 80% of the total
account for unstructured big data.
Example of Unstructured data :
- Media ( MP3, digital photos, audio and video files )
- Text files (Word processing, spreadsheets,
presentations etc. )
- Social Media ( Data from Facebook, Twitter,
LinkedIn)
3. Semi-Structured data :
- Semi-structured data is a form of structured data
that does not conform with the formal structure of
data models associated with relational databases or
other forms of data tables, but nonetheless contains
tags or other markers to separate semantic elements
and enforce hierarchies of records and fields within
the data. Therefore, it is also known as self-
describing structure.
Example of Semi-Structured data :
- Personal data stored in a XML file-
<rec><name>Harry</name><sex>Male</sex><age>35</age></rec>
<rec><name>Justin</name><sex>Female</sex><age>41</age></rec>
<rec><name>Shawn</name><sex>Male</sex><age>29</age></rec>
<rec><name>Ed sheeran</name><sex>Male</sex><age>26</age></rec>
<rec><name>Drake</name><sex>Male</sex><age>35</age></rec>
- As you can notice, Internet is a maze of unbounded
data.
- But it can be understood, interpreted and used
under these three categories.
- You can benefit from each type of data if you
understand their characteristics and strengths.
- Businesses worldwide construct their empire on
these three pillars and capitalize on their limitless
potential.
The question is,
do you???
References :
http://www.prowebscraper.com/blog/structured-vs-
unstructured-data-best-thing-you-need-to-know/

Weitere ähnliche Inhalte

Was ist angesagt? (20)

DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
5 v of big data
5 v of big data5 v of big data
5 v of big data
 
Classification of data
Classification of dataClassification of data
Classification of data
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
 
Data analytics
Data analyticsData analytics
Data analytics
 
Business Analytics Overview
Business Analytics OverviewBusiness Analytics Overview
Business Analytics Overview
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Data Science
Data ScienceData Science
Data Science
 
Data Modeling PPT
Data Modeling PPTData Modeling PPT
Data Modeling PPT
 
Data analytics vs. Data analysis
Data analytics vs. Data analysisData analytics vs. Data analysis
Data analytics vs. Data analysis
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
7 Big Data Challenges and How to Overcome Them
7 Big Data Challenges and How to Overcome Them7 Big Data Challenges and How to Overcome Them
7 Big Data Challenges and How to Overcome Them
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
 
Data Analytics Life Cycle
Data Analytics Life CycleData Analytics Life Cycle
Data Analytics Life Cycle
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Big Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation SlidesBig Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation Slides
 
Data science
Data scienceData science
Data science
 

Ähnlich wie 3 pillars of big data : structured data, semi structured data and unstructured data

Types of Big Data.pptx
Types of Big Data.pptxTypes of Big Data.pptx
Types of Big Data.pptxvarun453331
 
Data and Information.docx
Data and Information.docxData and Information.docx
Data and Information.docxswarna627082
 
Topic guide big data as a technology
Topic guide  big data as a technologyTopic guide  big data as a technology
Topic guide big data as a technologyMrmmbs Vision
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataAkshata Humbe
 
Data set module 1
Data set   module 1Data set   module 1
Data set module 1Data-Set
 
Unit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdfUnit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdfRanjeet Bhalshankar
 
Data set Introduction to Big Data
Data set   Introduction to Big DataData set   Introduction to Big Data
Data set Introduction to Big DataData-Set
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Aditya205306
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1Parviz Vakili
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET Journal
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.saranya270513
 

Ähnlich wie 3 pillars of big data : structured data, semi structured data and unstructured data (20)

Types of Big Data.pptx
Types of Big Data.pptxTypes of Big Data.pptx
Types of Big Data.pptx
 
Data and Information.docx
Data and Information.docxData and Information.docx
Data and Information.docx
 
Bigdata Hadoop introduction
Bigdata Hadoop introductionBigdata Hadoop introduction
Bigdata Hadoop introduction
 
Overview of Big Data
Overview of Big DataOverview of Big Data
Overview of Big Data
 
Topic guide big data as a technology
Topic guide  big data as a technologyTopic guide  big data as a technology
Topic guide big data as a technology
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Unit III.pdf
Unit III.pdfUnit III.pdf
Unit III.pdf
 
Data set module 1
Data set   module 1Data set   module 1
Data set module 1
 
Unit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdfUnit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdf
 
Data set Introduction to Big Data
Data set   Introduction to Big DataData set   Introduction to Big Data
Data set Introduction to Big Data
 
big-data.pdf
big-data.pdfbig-data.pdf
big-data.pdf
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
Big Data.pptx
Big Data.pptxBig Data.pptx
Big Data.pptx
 
Lecture #03
Lecture #03Lecture #03
Lecture #03
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth Enhancement
 
Big data Paper
Big data PaperBig data Paper
Big data Paper
 
Chapter 1 big data
Chapter 1 big dataChapter 1 big data
Chapter 1 big data
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
 
Big data
Big dataBig data
Big data
 

Kürzlich hochgeladen

科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 

Kürzlich hochgeladen (20)

科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 

3 pillars of big data : structured data, semi structured data and unstructured data

  • 1. 3 pillars of big data : Structured Data Semi Structured Data Unstructured Data By Prowebscraper
  • 2. What is big data ? - Big data stands for large volumes of data. - It could be both structured and unstructured. - It’s not about the amount of data but what businesses do with the given data that counts. - Businesses leverage on big data for insights that can propel their growth.
  • 3. Example of Big Data Following are some of the examples of 'Big Data' : - Facebook users send roughly 31.25 million messages and watch 2.77 million videos, every minute!! - Walmart customers’ transactions provide the company with about 2.5 petabytes of data, every hour.
  • 4. Example of Big Data - YouTube usage jumped three times from 2014- 2016 with users uploading 400 hours of new video each minute of every day! Now, in 2017, users are watching 4,146,600 videos every minute. - Instagram users upload 46,740 million posts every minute! - 5.2 BILLION daily Google Searches in 2017.
  • 5. As you can see, there are 3 pillars of Big Data : 1. Structured data 2. Unstructured data 3. Semi structured data
  • 6. 1. Structured data - Structured Data refer to the data which is already stored in databases, in an ordered manner. - It accounts for about 20% of the total existing data, and is used the most in programming and computer- related activities.
  • 7. Example of Structured data : - Meta-data (Time and date of creation, File size, Author etc.) - Library Catalogues (date, author, place, subject, etc) - Census records (birth, income, employment, place etc.) - Economic data (GDP, PPI, ASX etc.)
  • 8. 2. Unstructured data : - Any data with unknown form or the structure is classified as unstructured data. In addition to the size being huge, unstructured data poses multiple challenges in terms of its processing for deriving value out of it. - The rest of the data created, about 80% of the total account for unstructured big data.
  • 9. Example of Unstructured data : - Media ( MP3, digital photos, audio and video files ) - Text files (Word processing, spreadsheets, presentations etc. ) - Social Media ( Data from Facebook, Twitter, LinkedIn)
  • 10. 3. Semi-Structured data : - Semi-structured data is a form of structured data that does not conform with the formal structure of data models associated with relational databases or other forms of data tables, but nonetheless contains tags or other markers to separate semantic elements and enforce hierarchies of records and fields within the data. Therefore, it is also known as self- describing structure.
  • 11. Example of Semi-Structured data : - Personal data stored in a XML file- <rec><name>Harry</name><sex>Male</sex><age>35</age></rec> <rec><name>Justin</name><sex>Female</sex><age>41</age></rec> <rec><name>Shawn</name><sex>Male</sex><age>29</age></rec> <rec><name>Ed sheeran</name><sex>Male</sex><age>26</age></rec> <rec><name>Drake</name><sex>Male</sex><age>35</age></rec>
  • 12. - As you can notice, Internet is a maze of unbounded data. - But it can be understood, interpreted and used under these three categories. - You can benefit from each type of data if you understand their characteristics and strengths.
  • 13. - Businesses worldwide construct their empire on these three pillars and capitalize on their limitless potential. The question is, do you???