The document discusses visualizing weather data stored in MongoDB. It describes extracting location and temperature data from MongoDB documents into NumPy arrays, then using that data to perform grid interpolation and contour mapping with SciPy and Matplotlib. It then compares the performance of this process using PyMongo versus a newer library called Monary, finding Monary more than 7 times faster for querying large datasets. It closes by crediting the Python libraries and community contributors that made this visualization and analysis of weather data from MongoDB possible.
33. import numpy
import pymongo

# Not terrifically fast.
db = pymongo.MongoClient().my_database
data = []
for doc in db.collection.find(query):  # query: a filter dict, e.g. one hour's documents
    data.append((
        doc['position']['coordinates'][0],
        doc['position']['coordinates'][1],
        doc['airTemperature']['value']))

arrays = numpy.array(data)
34. Analyzing large datasets
• Querying: 109k documents per second
• (On localhost)
• Can we go faster?
• Enter “Monary”
35. Monary
by David Beach
MongoDB → PyMongo → Python dicts → NumPy → Matplotlib
MongoDB → Monary → NumPy → Matplotlib
44. Thank you
A. Jesse Jiryu Davis
Senior Python Engineer, MongoDB
#MongoDBWorld
Editor's notes
This will not be a serious MongoDB talk.
Serious MongoDB talks show slides with lots of hairy data.
There's usually a cylinder; that means we've gotten very serious, because we're talking about databases.
And when things get really serious, there are multiple cylinders in boxes.
You’re not going to see this stuff because this is not a serious MongoDB talk.
This will be a talk about making pretty pictures.
Also, math.
We'll see open source Python packages that can analyze and visualize data from MongoDB,
and a specialized MongoDB driver that can parse almost a million documents per second.
But this isn’t a serious talk because there won’t be any cylinders.
If you came for cylinders, I don’t want you to be disappointed.
A little review, if you weren't at Randall's or André's talks in this series.
We downloaded 2.5 billion weather measurements from the US Government.
That teal logo is the NOAA logo, National Oceanic and Atmospheric Administration
The stations do have cylinders; does that mean they're databases?
Stations have various frequencies: once per day, twice, hourly, every 5 minutes, ….
Exponentially growing data set.
André showed how you can choose the price-performance tradeoff that’s right for you:
Single-server.
Massively sharded cluster.
I went with the single-server option.
Oops, a picture of a cylinder. Must’ve snuck in from another slide deck.
I used Python to generate this visualization.
Global air temperature each hour in December last year.
The remainder of this talk is going to discuss:
open source Python packages
algorithms
performance issues.
There are such powerful open source data analysis tools in Python that the code to do all this is quite simple.
The work’s all been done for me.
Explain this code: we use PyMongo to get data from MongoDB.
PyMongo represents BSON documents as Python dicts.
We take the values from each dict and append them to a Python list of tuples,
then we convert that list to a NumPy array,
and get views of the three columns in the NumPy array (NumPy slices reference the data rather than copying it).
now these latitudes, longitudes, and temperatures represent the stations that reported at the given hour
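As a sketch, that column extraction might look like this; the variable names are mine, and the column order follows the GeoJSON lon-lat convention used in the query above:

lons = arrays[:, 0]   # first column: longitude
lats = arrays[:, 1]   # second column: latitude
temps = arrays[:, 2]  # third column: air temperature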
How do we make the contour plot? We have to interpolate among these points to come up with a temperature map of the whole globe.
I’ll explain momentarily how SciPy and Matplotlib are able to do this.
But first notice all the white areas.
Step one: interpolation. We’re going to transform a messy distribution of points into a perfectly even grid.
We begin with a point somewhere on earth for each station that reported a temperature at the hour we’re plotting.
The arrangement is very uneven.
In order to interpolate them, we first perform a Delaunay triangulation.
This comes up with a set of non-overlapping triangles covering all the points.
Next, overlay a grid. We want to know the temperature at each grid intersection.
Temperatures at the corners are 48, 54, and 53. (Sorry one is cut off.)
Here’s the grid point we need to make a temperature for.
Measure the area of each of the three sub-triangles that the grid point forms with the triangle's corners.
Use those areas as weights in a weighted average of the three corner temperatures: each corner is weighted by the area of the sub-triangle opposite it, divided by the total area. In this case the result is 51.1 degrees.
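As a quick sketch of that weighted average in code (a hypothetical helper, not from the talk):

def weighted_temp(areas, corner_temps):
    # Each corner's weight is the area of the sub-triangle
    # opposite it, normalized by the total area of the triangle.
    total = sum(areas)
    return sum(a * t for a, t in zip(areas, corner_temps)) / total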
This is called Barycentric Interpolation, use that at the cocktail party later on.
So Barycentric Interpolation can be applied to any grid point.
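In SciPy, the whole triangulate-and-interpolate step is a single call: scipy.interpolate.griddata with method='linear' performs exactly this Delaunay-plus-barycentric scheme. A minimal sketch, reusing the station arrays from the earlier column sketch; the grid resolution here is arbitrary:

import numpy
from scipy.interpolate import griddata

# lons, lats, temps: the 1-D station arrays from the column sketch above.
# Build an evenly spaced lon/lat grid covering the globe.
grid_lons, grid_lats = numpy.meshgrid(
    numpy.linspace(-180, 180, 361),
    numpy.linspace(-90, 90, 181))

# method='linear' triangulates the stations (Delaunay) and
# interpolates barycentrically within each triangle; grid points
# outside the stations' convex hull come back as NaN.
grid_temps = griddata((lons, lats), temps, (grid_lons, grid_lats),
                      method='linear')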
Brings us from this…
To this!
So we can discard our original samples now and just use the grid.
Now that we've finished interpolating, the next stage is contouring.
Contouring is much too complicated for me to understand; Matplotlib just takes care of it somehow.
Finally, we fill in the colors.
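The contouring and color fill are likewise a couple of Matplotlib calls. A sketch, continuing from the interpolated grid above:

import matplotlib.pyplot as plt

# grid_lons, grid_lats, grid_temps: from the griddata sketch above.
# contourf() fills the regions between isotherms with color, and
# contour() draws the lines on top.  NaN cells (outside the
# stations' convex hull) are simply left blank.
plt.contourf(grid_lons, grid_lats, grid_temps)
plt.contour(grid_lons, grid_lats, grid_temps, colors='black')
plt.show()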
But notice we can only contour the spaces between stations. There’s no way to know about the edges.
So that gets us from this map of just the stations.
To this map with contours and colors.
But now you see why we have blank edges. We can only fill in the spaces between stations, and my program doesn’t understand that the North Pole is between Canada and Russia.
So I came up with a hack to fill in the rest of the space.
Here’s our flat projection of the Earth. Matplotlib doesn’t know that the left edge connects to the right edge.
It doesn’t know that if you keep heading North from the United States you end up in Russia.
So I just flipped and tiled the earth. Now there are 7 earths laid out on a super-earth-sized grid.
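Here's one guess at what the flip-and-tile hack looks like in NumPy. The talk doesn't show the exact seven-copy layout, so this sketch (five copies) only illustrates the idea of duplicating stations past the map's edges before triangulating:

import numpy

def flip_and_tile(lons, lats, temps):
    # East-west: longitude wraps at the date line without flipping,
    # so shift plain copies of every station 360 degrees left and right.
    all_lons = [lons - 360, lons, lons + 360]
    all_lats = [lats, lats, lats]
    all_temps = [temps, temps, temps]

    # North-south: crossing a pole lands you on the opposite meridian,
    # upside down.  On the flat map that mirrors a station at
    # (lon, lat) to (lon + 180, 180 - lat) over the North Pole and
    # to (lon + 180, -180 - lat) over the South Pole.
    all_lons += [lons + 180, lons + 180]
    all_lats += [180 - lats, -180 - lats]
    all_temps += [temps, temps]

    return (numpy.concatenate(all_lons),
            numpy.concatenate(all_lats),
            numpy.concatenate(all_temps))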
That allows us to go from this…
… to this!
So my program just re-executes the process once for each hour’s worth of data, for the whole month of December.
But it's a little slow: almost a second to generate each frame, so rendering a minute-long movie takes on the order of ten minutes.
This is one of the bottlenecks: creating and discarding Python dictionaries, plus all the time spent on hashtable lookups.
These are idealized circumstances of course: no network latency, data is already in memory.
It’s fast, but can we go faster?
Monary!
Directly from MongoDB to NumPy, all written in C.
No intermediate Python dictionaries.
Written by David Beach, a financial analyst. Just an open source project by a MongoDB community member.
Monary is statically typed.
You get NumPy arrays back directly, with no further processing.
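A sketch of the equivalent Monary query, assuming Monary's (database, collection, query, fields, types) signature and that dotted field paths can reach the nested coordinate values:

from monary import Monary

m = Monary('localhost')
# One NumPy array comes back per requested field; no Python dicts
# are created along the way.
lons, lats, temps = m.query(
    'my_database', 'collection',
    {},  # query spec, e.g. matching one hour's documents
    ['position.coordinates.0',
     'position.coordinates.1',
     'airTemperature.value'],
    ['float64', 'float64', 'float64'])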
6458 rows for 1991-06-02 12:00:00
PyMongo takes 0.0593 sec
Monary takes 0.0079 sec
Monary is about 7.5x faster
Now we can generate this in near-real time.
David Beach, original author
Me and Jason Carey: MongoDB driver engineers, overseers
Kyle Suarez and Matt Cotter: Interns, contributing this summer
Rutgers, Carleton College
So we can query data from MongoDB using Python and achieve very high throughput, using Monary and NumPy.
And we can do sophisticated processing and visualization of that data using SciPy and Matplotlib.