Amazon Web Services proporciona una amplia gama de servicios que le ayudarán a crear e implementar aplicaciones de análisis de big data de forma rápida y sencilla. AWS ofrece un acceso rápido a recursos de TI económicos y flexibles, algo que permitirá escalar prácticamente cualquier aplicación de big data con rapidez, incluidos almacenamiento de datos, análisis de clics, detección de elementos fraudulentos, motores de recomendación, proceso ETL impulsado por eventos, informática sin servidor y procesamiento del Internet de las cosas.
https://aws.amazon.com/es/big-data/
5. 95% of the 1.2 zettabytes
of data in the digital
universe is unstructured
70% of of this is user-
generated content
Unstructured data growth
explosive, with estimates
of compound annual
growth (CAGR) at 62%
from 2008 – 2012.
Source: IDC
GB TB
PB
ZB
EB
Big Data: Unconstrained data growth
13. Gartner: User Survey Analysis: Key Trends Shaping the Future of Data Center Infrastructure Through 2011
IDC: Worldwide Business Analytics Software 2012–2016 Forecast and 2011 Vendor Shares
Available for analysis
Generated data
Data volume - Gap
1990 2000 2010 2020
14. Elastic and highly scalable
No upfront capital expense
Only pay for what you use
+
+
Available on-demand
+
=
Remove constraints
17. Big data and AWS Cloud computing
Big data Cloud computing
Variety, volume, and velocity
requiring new tools
Variety of compute, storage,
and networking options
18. Big data and AWS Cloud computing
Potentially massive datasets Massive, virtually unlimited capacity
Big data Cloud computing
19. Big data and AWS Cloud computing
Iterative, experimental style of
data manipulation and analysis
Iterative, experimental style of
infrastructure deployment/usage
Big data Cloud computing
20. Big data and AWS Cloud computing
Frequently not steady-state
workload; peaks and valleys
At its most efficient with highly
variable workloads
Big data Cloud computing
21. Big data and AWS Cloud computing
Absolute performance not as
critical as “time to results”;
shared resources are a
bottleneck
Parallel compute projects allow
each workgroup to have more
autonomy, get faster results
Big data Cloud computing
31. Hadoop/HDFS clusters
Hive, Pig, Impala, HBase
Easy to use; fully managed
On-demand and spot pricing
Tight integration with S3,
DynamoDB, and Kinesis
Amazon
Elastic
MapReduce
32. Automatically Discovers Data
Catalog makes data searchable
Catalog contains tables & jobs
Automatically generates ETL
Scheduled or event-based jobs
Amazon
Glue
36. Media streaming
Free steak campaign
Disaster recovery
Web site & media sharing
Facebook app
Ground campaign
SAP & SharePoint
Marketing web site
Social Media Monitoring
Consumer social app
IT operations
Mars exploration ops
Interactive TV apps
Consumer social app
Facebook page
Securities Trading Data Archiving
Financial markets analytics
Web and mobile apps
Big data analytics
Digital media
Ticket pricing optimization
Streaming webcasts
Mobile analytics
Consumer social app
Core IT and media
37. 1.2TB/Day logs
30TB /Day data
250 Hadoop Jobs
75Billion transactions/Day
5 Petabytes of Data
A few AWS customer on Big Data / Data Lakes
25 PB Data Warehouse
on Amazon S3
> 1PB read each day
40. Breakthrough
advances
Optimization and
automation
AI and ML enable innovation at scale…
New features
for existing products
“After decades of false starts, artificial intelligence is on the verge of a
breakthrough, with the latest progress propelled by machine learning.”
McKinsey Global Institute, Artificial Intelligence The Next Digital Frontier? June 2017
41. The potential impact of AI/ML is enterprise-wide
Compliance,
Surveillance, and
Fraud Detection
Pricing and
Product
Recommendation
Document
Processing
Trading Customer
Experience
• Credit card/account fraud
detection
• Anti-money laundering/
Sanctions
• Investigations optimization
• Sales practices/
transaction surveillance
• Compliance processes
optimization
• Regulatory mapping
• Enhanced customer
service through voice
services and chatbots
• Call center
optimization
• Personal financial
management
• Loan/Insurance
underwriting
• Sales/recommendations of
financial products
• Credit assessments
• Contract ingestion and
analytics
• Financial information
extraction
• Common financial
instrument taxonomy
• Corporate actions
• Portfolio management/
robo-advising
• Algorithmic trading
• Sentiment/news analysis
• Geospatial image analysis
• Predictive grid computing
capacity management
AI/ML use cases are gaining traction in Financial Services
42. What is preventing the industry from moving ahead?
AI/ML
expertise is
rare
Building and
scaling AI/ML
technology is hard
Deploying and operating
models in production is
time-consuming and
expensive
A lack of cost-effective,
easy-to-use, and
scalable AI/ML services
43. AWS offers a range of solutions to make AI/ML more accessible
PollyLex Rekognition
Deep Learning FrameworksMachine Learning PlatformsAmazon AI/ML Services
Usability/simplicity:
leverages AWS AI/ML expertise
Greater control:
customer-specific models
Amazon ML
Spark & EMR
Kinesis
Batch
ECS
Customization of
offerings at scale
More personal and
efficient customer
interactions
Operational
efficiencies
Novel investment/
trading opportunities
Benefits for Financial Services Institutions
and others...
44. Our deep experience with AI/ML differentiates our services
Product
recommendation
engine
Robot-enabled
fulfillment
centers
New
product
categories
Amazon has invested in AI/ML since our
inception, and we share our knowledge and
capabilities with our customers
20171995
Natural language
processing-supported
contact centers
ML-driven supply
chain and
capacity planning
Checkout-free
shopping
using deep learning
62. Fraud.net is running AI/ML on AWS to predict financial crime
“Amazon Machine Learning helps us reduce complexity and make sense of emerging fraud patterns.
We can see correlations we wouldn’t have been able to see otherwise and answer questions it would
have taken us way too long to answer ourselves.
”
Fraud.net is the world’s leading
crowdsourced fraud prevention
platform, aggregating and
analyzing large amounts of
fraud data from thousands of
online merchants in real time.
The platform protects more
than 2 percent of all U.S. e-
commerce.
- Oliver Clark, CTO, Fraud.net
• To address its scalability needs, Fraud.net chose AWS to host its customer
platform, relying on services including DynamoDB, Lambda, S3, and Redshift
• Recently, Fraud.net started using Amazon Machine Learning, which helps its
developers build models and enables the use of APIs to get predictions for
applications without having to deploy prediction generation code
• Fraud.net can now easily launch and train new machine-learning models to target
evolving forms of fraud
• Using AWS, Fraud.net can maintain its fast application response times of under
200 milliseconds and save its customers about $1 million a week through fraud
detection and prevention
63. BuildFax uses Amazon ML to help insurers avoid losses
“Amazon Machine Learning democratizes the process of building predictive models. It’s easy and fast
to use and has machine-learning best practices encapsulated in the product, which lets us deliver
results significantly faster than in the past.
”
BuildFax aggregates dispersed
building permit data from across
the United States and provides it
to other businesses, especially
insurance companies, and
economic analysts. The
company also tracks trends like
housing remodels and new
commercial construction.
- Joe Emison, Founder & Chief Technology Officer, BuildFax
• BuildFax’s core customer base is insurance companies, which spend billions of
dollars annually on roof losses
• The company initially built predictive models based on ZIP codes and other general
data, but building the models was complex and the results did not provide enough
differentiators
• BuildFax now uses Amazon Machine Learning to provide roof-age and job-cost
estimations for insurers and builders, with property-specific values that don’t need
to rely on broad, ZIP code-level estimate
• Models that previously took six months or longer to create are now complete in four
weeks or fewer
64. Do you want to talk more about it?
Let´s schedule a meeting.