With the release of MongoDB 3.0, the tools (mongodump, mongoimport, mongotop, etc.) have been completely redesigned and rewritten in Go to improve maintainability and performance. In this section, we'll give an architectural overview of the tools, describe how we used Go's native capabilities to improve their parallelism, and take a deep technical dive into their internals. We'll discuss performance, usability, and integration improvements, and share advanced techniques for power users. With a better understanding of how the tools work, you should feel comfortable using and contributing to them.
27. Write Concern Specificity
● New default w=majority on import/restore/files
o safer, and matches what our users assume is happening
● --writeConcern flag
e.g. ‘{w: 3, j: true, fsync: false, wtimeout: 400}’
33. What’s next?
• mongorestore and mongodump now support
compression and archiving
• No need for directories containing BSON files
mongodump -d weather -h localhost --archive --gzip |
mongorestore -h remotehost --archive --gzip
First release for MongoDB tools in Go
Propose a thesis: we rewrote it, it's much faster, and you'll be a more effective user because of the concurrency and control we've introduced
Mongofiles: usually used for files above our 16MB document limit (GridFS)
Stat: collects statistics from your mongod instances; similar to vmstat or iostat
Top: Tracks the amount of time spent per operation on different namespaces
aka, mon-goop-log :)
Why couldn't we multithread the old version of the tools?
12 core Intel(R) Core(TM) i7-3930K CPU @ 3.20GHz
The old tools didn't try to prevent you from shooting yourself in the foot
Unfindable imports
Binary size: 24MB -> 6MB
Just to parse queries, the old tools linked in a chunk of the query engine and all of Boost
We can iterate faster
easier for the community to contribute to the tools
easier to iterate separately
Emphasize concurrency and control
quick growth of the team, onboarding
Short anecdote?
Tool gains not server gains
batch size is # of docs
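Since batch size is counted in documents rather than bytes, chunking a stream of docs into insert batches is just a slice walk. A minimal sketch (the function name and types are illustrative, not the tools' actual code):

```go
package main

import "fmt"

// batchDocs splits a stream of documents into batches of at most
// `size` documents each; batch size is a document count, not a
// byte count.
func batchDocs(docs []string, size int) [][]string {
	var batches [][]string
	for len(docs) > 0 {
		n := size
		if len(docs) < n {
			n = len(docs) // final, possibly short, batch
		}
		batches = append(batches, docs[:n])
		docs = docs[n:]
	}
	return batches
}

func main() {
	got := batchDocs([]string{"a", "b", "c", "d", "e"}, 2)
	fmt.Println(len(got)) // 3
}
```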
One design principle: we didn't want to overwhelm a MongoDB server, since people are used to the tools being single-threaded
As much nesting as you want
Full JSON roundtripping
No concurrency controls in 2.6
23GB <-> 11 collections
Emphasize concurrency and control
cheap way of making asynchronous writes
No need to write to disk at all -> compression can run on a separate CPU
Compressed network archive
Archive support is not in 3.0, but first available in 3.1.x