SlideShare ist ein Scribd-Unternehmen logo
1 von 10
MongoDB
● What is it ?
● Features
● Tools
● Use with Hadoop
● Hadoop Tools
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
MongoDB – What is it ?
● Document oriented NoSql database
● BSON schema data format ( Binary JSON )
● Released as open source / free
● Can be used as a distributed database
● Has load balancing
● Has replication
● Written in C++
● Licensed via Apache
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
MongoDB – Features
● Queries
– By field
– By regular expression
– User defined java script functions
– By range
● Indexes
– Primary and secondary
– Any document field
● Replication
– Master can replicate to multiple slaves
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
MongoDB – Features
● Load balancing
– Data split across multple shards
– DB scales using shards
– New machines can be added to running database
● Map reduce can be used for aggregation
● File storage via GridFS
– Load balanced file system
– File system with replication
– Functions available for file manipulation
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
MongoDB – Tools
● Mongo – a db access shell and admin tool
● Mongostat – a status tool similar to vmstat
● Mongotop – top processes like Unix top command
● Mongosniff – low level traffic sniffing
● Mongoimport – import JSON, CSV, TSV plus others
● Mongoexport – export tool ( as import )
● Mongodump – dump database contents
● Mongostore – reload database dumps
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
MongoDB – With Hadoop
● Hadoop connector available from github
● Allows Hadoop I/O
● Compiles with SBT build tool
● Supports Hadoop
– 0.20/0.20.x
– 1.0/1.0.x
– 1.1/1.1.x
– 0.21/0.21.x
– CDH3
– CDH4
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
MongoDB – Attributes
The image on the left shows how Hadoop and its tools are used with
MongoDB via a connector. The image on the right shows MongoDB
attributes.
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
MongoDB – Hadoop Tools
● The Hadoop connector supports
– Map Reduce
– Pig
– Hadoop streaming
– Flume
– Hive
– Hive BSON file access
● MongoDB can use HDFS for storage
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
MongoDB – Architecture
● A db server
– has many databases
● A database
– Has many collections
● A collection
– Has many documents
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
Contact Us
● Feel free to contact us at
– www.semtech-solutions.co.nz
– info@semtech-solutions.co.nz
● We offer IT project consultancy
● We are happy to hear about your problems
● You can just pay for those hours that you need
● To solve your problems

Weitere ähnliche Inhalte

Mehr von Mike Frampton

An introduction to Apache Mesos
An introduction to Apache MesosAn introduction to Apache Mesos
An introduction to Apache Mesos
Mike Frampton
 
An introduction to Pentaho
An introduction to PentahoAn introduction to Pentaho
An introduction to Pentaho
Mike Frampton
 

Mehr von Mike Frampton (20)

Apache Airavata
Apache AiravataApache Airavata
Apache Airavata
 
Apache MADlib AI/ML
Apache MADlib AI/MLApache MADlib AI/ML
Apache MADlib AI/ML
 
Apache MXNet AI
Apache MXNet AIApache MXNet AI
Apache MXNet AI
 
Apache Gobblin
Apache GobblinApache Gobblin
Apache Gobblin
 
Apache Singa AI
Apache Singa AIApache Singa AI
Apache Singa AI
 
Apache Ranger
Apache RangerApache Ranger
Apache Ranger
 
OrientDB
OrientDBOrientDB
OrientDB
 
Prometheus
PrometheusPrometheus
Prometheus
 
Apache Tephra
Apache TephraApache Tephra
Apache Tephra
 
Apache Kudu
Apache KuduApache Kudu
Apache Kudu
 
Apache Bahir
Apache BahirApache Bahir
Apache Bahir
 
Apache Arrow
Apache ArrowApache Arrow
Apache Arrow
 
JanusGraph DB
JanusGraph DBJanusGraph DB
JanusGraph DB
 
Apache Ignite
Apache IgniteApache Ignite
Apache Ignite
 
Apache Samza
Apache SamzaApache Samza
Apache Samza
 
Apache Flink
Apache FlinkApache Flink
Apache Flink
 
Apache Edgent
Apache EdgentApache Edgent
Apache Edgent
 
Apache CouchDB
Apache CouchDBApache CouchDB
Apache CouchDB
 
An introduction to Apache Mesos
An introduction to Apache MesosAn introduction to Apache Mesos
An introduction to Apache Mesos
 
An introduction to Pentaho
An introduction to PentahoAn introduction to Pentaho
An introduction to Pentaho
 

Kürzlich hochgeladen

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

An introduction to Mongdb

  • 1. MongoDB ● What is it ? ● Features ● Tools ● Use with Hadoop ● Hadoop Tools www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 2. MongoDB – What is it ? ● Document oriented NoSql database ● BSON schema data format ( Binary JSON ) ● Released as open source / free ● Can be used as a distributed database ● Has load balancing ● Has replication ● Written in C++ ● Licensed via Apache www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 3. MongoDB – Features ● Queries – By field – By regular expression – User defined java script functions – By range ● Indexes – Primary and secondary – Any document field ● Replication – Master can replicate to multiple slaves www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 4. MongoDB – Features ● Load balancing – Data split across multple shards – DB scales using shards – New machines can be added to running database ● Map reduce can be used for aggregation ● File storage via GridFS – Load balanced file system – File system with replication – Functions available for file manipulation www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 5. MongoDB – Tools ● Mongo – a db access shell and admin tool ● Mongostat – a status tool similar to vmstat ● Mongotop – top processes like Unix top command ● Mongosniff – low level traffic sniffing ● Mongoimport – import JSON, CSV, TSV plus others ● Mongoexport – export tool ( as import ) ● Mongodump – dump database contents ● Mongostore – reload database dumps www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 6. MongoDB – With Hadoop ● Hadoop connector available from github ● Allows Hadoop I/O ● Compiles with SBT build tool ● Supports Hadoop – 0.20/0.20.x – 1.0/1.0.x – 1.1/1.1.x – 0.21/0.21.x – CDH3 – CDH4 www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 7. MongoDB – Attributes The image on the left shows how Hadoop and its tools are used with MongoDB via a connector. The image on the right shows MongoDB attributes. www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 8. MongoDB – Hadoop Tools ● The Hadoop connector supports – Map Reduce – Pig – Hadoop streaming – Flume – Hive – Hive BSON file access ● MongoDB can use HDFS for storage www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 9. MongoDB – Architecture ● A db server – has many databases ● A database – Has many collections ● A collection – Has many documents www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 10. Contact Us ● Feel free to contact us at – www.semtech-solutions.co.nz – info@semtech-solutions.co.nz ● We offer IT project consultancy ● We are happy to hear about your problems ● You can just pay for those hours that you need ● To solve your problems