Top Big Data Terms
Term: Definition
Hadoop: An open-source software framework that supports running applications on large clusters of commodity hardware. Hadoop is written in Java.
HDFS: Stands for Hadoop Distributed File System. HDFS is a distributed file system that stores large files across multiple machines. The system replicates data across multiple machines and understands what data is being processed, when, and by whom.
MapReduce: A programming model for processing large data sets with a parallel, distributed algorithm on a cluster. Its Map() procedure filters and sorts, and its Reduce() procedure performs summary operations.
Hive: A data warehouse infrastructure built on top of Hadoop that provides data summarization, query, and analysis.
HBase: An open-source, non-relational, distributed database that runs on top of HDFS.
Cassandra: Apache Cassandra is an open-source distributed database management system designed to handle very large amounts of data spread across many commodity servers.
Source: Wikipedia (mainly)
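The MapReduce entry above can be illustrated with a small single-machine sketch. This is not Hadoop's API: the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical, and the grouping step between Map and Reduce is simulated with an in-memory dictionary, whereas a real cluster would distribute it across machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key (simulates the step between Map and Reduce)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: summarize all values for one key (here, by summing them)."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over a list of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    # Sorting the keys mirrors the sorted order in which reducers see them.
    return dict(reduce_phase(key, grouped[key]) for key in sorted(grouped))


if __name__ == "__main__":
    print(word_count(["big data", "big cluster"]))
```

Each map_phase call depends only on its own line, which is what would let a framework run many of them in parallel on different machines.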
Sizes that Matter
Name = Value. Example
1 bit: The smallest unit of data that a computer uses. It can represent two states of information, such as Yes or No.
1 byte = 8 bits. A byte can represent 256 states of information. 1 byte could equal one character, 10 bytes a word, and 100 bytes an average sentence.
1 kilobyte (kB) = 1024 bytes. 1 kilobyte is roughly a paragraph.
1 megabyte (MB) = 1024 kB. A 3-1/2 inch floppy disk can hold 1.44 megabytes, or the equivalent of a small book. About 600 megabytes fit on a CD-ROM disc.
1 gigabyte (GB) = 1024 MB. 1 GB could hold the contents of about 10 yards of books.
1 terabyte (TB) = 1024 GB. 1 TB could hold 1,000 copies of the Encyclopedia Britannica.
1 petabyte (PB) = 1024 TB. 500 million floppy disks.
1 exabyte (EB) = 1024 PB. 5 exabytes could equal all of the words ever spoken by mankind.
1 zettabyte (ZB) = 1024 EB. Example: ?
Source: http://www.whatsabyte.com/
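The size ladder above is simple arithmetic: each named unit is 1024 times the previous one. A minimal sketch of that conversion follows; the helper names (to_bytes, humanize) and the UNITS list are illustrative assumptions, not part of the original slides.

```python
# Unit symbols in ascending order; each step is a factor of 1024.
UNITS = ["B", "kB", "MB", "GB", "TB", "PB", "EB", "ZB"]


def to_bytes(value, unit):
    """Convert a value in the given unit to bytes, using 1024 per step."""
    return value * 1024 ** UNITS.index(unit)


def humanize(num_bytes):
    """Express a byte count in the largest unit that keeps the value below 1024."""
    value = float(num_bytes)
    for unit in UNITS:
        if value < 1024 or unit == UNITS[-1]:
            return f"{value:.2f} {unit}"
        value /= 1024


if __name__ == "__main__":
    print(to_bytes(1, "MB"))   # bytes in one megabyte
    print(humanize(1536))      # a byte count expressed in kB
```

Under this 1024-per-step convention, one zettabyte works out to 1024 ** 7 bytes.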
TRY IT @ WWW.SISENSE.COM
Glossary of Big Data Terms
