Making earth observation data available by using Amazon S3 is accelerating scientific discovery and enabling the creation of new products. Attend and learn how the scale and performance of Amazon S3 lets earth scientists, researchers, startups, and GIS professionals gather and analyze planetary-scale data without worrying about limitations of bandwidth, storage, memory, or processing power. Learn how AWS is being used to combine satellite imagery, social data, and telemetry data to produce new products and services. Learn also how Amazon S3 provides much more than storage, and how an open geospatial data lake on Amazon S3 can be used as the basis for planetary-scale applications built with Amazon EMR, Amazon API Gateway, and AWS Lambda. As part of this talk, AWS customer Digital Globe demonstrates how they use open data stored in S3 to distribute high-resolution satellite imagery to their customers around the world.
2. What to Expect from the Session
• About open data on AWS
• Advantages of Amazon S3 for sharing data
• How E&J Gallo and DigitalGlobe use AWS to work with
geospatial data
3. Why Does AWS Care About Open Data?
Sharing data on AWS makes it accessible
to a large and growing community of
researchers, entrepreneurs, and
enterprises who use the AWS cloud.
4. “…data must be organized, well-
documented, consistently formatted, and
error free. Cleaning the data is often the
most taxing part of data science, and is
frequently 80% of the work.”
- Data Driven by DJ Patil and Hilary Mason
Undifferentiated Heavy Lifting
16. AWS Cloud Credits for Research
provide promotional AWS cloud credits
for anyone to conduct research using
Earth Observation data.
aws.amazon.com/earth/research-credits
19. E&J Gallo Winery
Largest Winery in the world
• Established in 1933 and headquartered in Modesto,
California, E. & J. Gallo Winery remains a privately-held
and ever-growing company that employs 6,000 people
worldwide.
• Largest land owner in the State of California
Products and Distribution
• With products available in more than 90 countries, E. &
J. Gallo Winery is the largest exporter of California wine,
and imports wines from Argentina, Italy, New Zealand
and Spain
• WE hold 90 brands and include table and sparkling
wines, beverage products, dessert wines and distilled
spirits
22. Wine and Grape Supply
Grower Relations
• Manage external grape purchases
• Inform growers on best management practices
Gallo Vineyards Inc.
• Gallo owned vineyards
• Irrigation management
• Yield and Quality estimation
• Best management practices for vineyard management
• Operations planning and scheduling
Viticulture Chemistry and Enology
• Internal research division
• Quality and yield enhancement
• Variable rate irrigation
Winemaking
23. Complexities of Wine Growing
Not a Biomass Product
• Manage more than just nitrogen and water
• A number of activities go against a given vine
Perennial Crop
• Average vineyard lifecycle of 24 years
• Vine management is an on-going activity
• Canopy Management
• Shoot Thinning
• Vine Balancing
• Trellis Management
• Nutrients
• Pesticides
High Value Crop Management
• Direct correlation between ranch management plan and
quality
• Yield impact associated with canopy management
activities
25. Why Data Matters
We operate in an ‘Activity’ based agricultural environment
from which we can drive insights
• Events or Transactions help us manage our business
• These events are both structured and unstructured
Structured Events such as…
• Tons of grapes delivered to a specific winery
• Distribution of soil across a vineyard
• Ranch management transactions applied (i.e. shoot
thinning, leafing, dropping fruit etc.)
Unstructured Events such as…
• Imagery
• Soil Moisture Nodes
• Yield Monitors
• Mechanized Asset Control
26. Data Driven Value
Having a robust data and analytics platform allows us to
drive insights
• Grape to Bottle data management
• Improve Quality
• Increase Yield
• Detect and prevent early on-set disease
• Maintain a competitive supply position through predictive
yield estimation
27. Data Lake Model
EC2
Elastic Load Balancing
S3
Redshift
SQL on DB
Instance
EMR
Customer Facing
Product Layer
EDW
Reports
Data Processing,
Analysis, R&D Layer
Data Collection
Layer
Dashboard
Insights
D
a
t
a
L
a
y
e
r
44. We have common frameworks to accelerate prototyping
We need a common data set to evaluate performance
45. - 50cm WV-2 image w/8 spectral bands
- Each image 200m2
- Covering Rio de Janeiro, Brazil
- 7000 Geotiff images
Available for your computing pleasure on
46. Rio Olympics – Use Case
• Governments around the world were preparing for the
upcoming Olympics in Rio De Janeiro
• Thorough understanding of the security and safety of the
athletes is front of mind
• Knowledge of high risk areas, traffic patterns, line-of-sight
and overall structure distribution is critical to ensure a
secure games
47.
48.
49.
50.
51.
52.
53.
54.
55. What you can expect
1) Satellite imagery analytics will be way more common in your industry
2) Computer vision tasks are going to change the way we see the world
3) We are going to learn a lot