SlideShare a Scribd company logo
1 of 18
What are AWS Analytics
Services?
It is a service offered by Amazon that helps in processing and
providing insights to data by analyzing it. There are many
services in the analytics section which have been discussed
below:
https://forms.office.com/r/DKwT0JCEUZ
Amazon Athena
• An interactive query engine which helps in easy analysis of data stored in S3 with the help of
SQL queries. It is server-less (start query instantly), hence having no infrastructure that needs
to be managed. The user only pays for the queries they run ($5 per terabyte scanned by your
query).
• Athena is easy to use. Simply point to your data in Amazon S3, define the schema, and start
querying using standard SQL. Most results are delivered within seconds. With Athena, there’s
no need for complex ETL jobs to prepare your data for analysis. This makes it easy for anyone
with SQL skills to quickly analyze large-scale datasets.
• Athena is out-of-the-box integrated with AWS Glue Data Catalog, allowing you to create a
unified metadata repository across various services, crawl data sources to discover schemas
and populate your Catalog with new and modified table and partition definitions, and
maintain schema versioning.
Amazon EMR
• An Industry-leading cloud big-data platform for processing huge amount of data using open
source tools such as Apache Hadoop, Spark, Hive, Hbase, and Presto etc.
• It makes it easy to setup, operate, and scale your big data environments by automating time
consuming tasks like provisioning capacity and tuning clusters.
• It provides a Hadoop managed framework, which helps in quick and cost-effective method of
processing large amounts of data across on-the-g- scalable EC2 instances. EMR notebooks are
similar to Jupiter notebooks, and provide an environment to develop and collaborate for
dynamic querying and exploratory analysis if data.
• It is highly-secure and offers reliability to handle a wide range of uses cases, which include,
but are not limited to machine learning, scientific simulation, log analysis, web indexing, data
transformations (ETL), and bioinformatics.
Amazon CloudSearch
• It is a service managed by AWS Cloud,
• It makes the process of setting-up, managing and scaling a search solution for a website or an
application an easy task.
• It supports 34 languages and comes with features such as highlighting, auto-complete, and
geospatial searchability.
• With ease of operation feature: features like auto-scaling, self-healing clusters, high
availability and data durability
• Easy administration: (security) built in integration with IAM and (auditing) built-in integration
with Cloudtrail.
• You don’t have to become a search expert or worry about h/w provisioning, setup and
maintenance.
• Offers pay-as-you-go pricing with no upfront expenses or long-term commitments.
• Search endpoint, document endpoint, Document ARN, Searchable documents
Amazon ElasticSearch service
• It helps ElasticSearch in the process of deploying, securing, operating and scaling data by
searching, analysing and visualizing this data in real-time.
• It comes with easy-to-use APIs which can be used for log analysis, full-text search, scaling
requirements, monitoring applications, clickstream analytics, and security.
• It can also be integrated with open-source tools such as Kibana, Logstash which help in
ingesting data and visualization of data respectively.
• It’s a open source search engine, writen in java language and is based on Apache Leucin.
• It is real-time distributed & analytics engine
• It supports full-text search instead of table/scheme.
• It is document oriented
Amazon Kinesis
• It makes the process of collecting, processing, and
analysing real-time data an easy task. This helps in
achieving real-time insights to data so that this data
can be analysed and actions can be taken based on the
insights quickly. Real-time data including video, audio,
application logs, website clickstreams, IoT telemetry
data (meant for machine learning) can be ingested and
this data can be responded to immediately in contrast
to waiting for all the data to arrive before beginning
pre-processing on it. It currently offers services like-
Kinesis Data Firehose, Kinesis Data Analytics, Kinesis
Data Streams, and Kinesis Video Streams.
Amazon redshift
• It is a quick, and scalable data warehouse which
helps in analysing user data in a cost-efficient and
simple manner. It delivers 10 times quicker
performance in comparison to other data
warehousing services, since it uses machine
learning, massively parallel query execution and
columnar storage on high-performance disks.
Petabytes of data can be queried upon, and a
new data warehouse can be setup and deployed
in a matter of minutes.
Amazon Quicksight
• It is a cloud powered business intelligence service, which is fast, and helps in delivery insights
to people in an organization. It allows users to create and publish interactive visual
dashboards which can be accessed via mobile devices and browsers. These visual dashboards
can be integrated with other applications thereby providing customers with powerful self-
serving analysis service.
• Functions of Quicksight (Business Intelligence tool)
• Performance management: -
• Data wrangling – Converting the data into a structured format (acquising, combining,
cleansing)
• Data mining – ETL
• Data analytics
• Data warehouse -
• Predictive modelling
AWS data pipeline
• It helps process and move data between different
AWS resources (compute and storage devices).
Data can be regularly accessed from the place it is
stored, it can be transformed and processed at
scale. The result of this data processing can be
transferred to other AWS services such as S3,
RDS, DynamoDB, and EMR. It helps in the
creation of complex data processing workloads
which provide facilities such as high fault
tolerance, high availability and repeatability.
AWS Glue
• It is a completely managed ETL service (Extract,
Transform, Load) which helps users prepare and
load their data for the purpose of analysis. An ETL
job can be set up and run with a few mouse clicks
from the AWS Management Console itself. Glue
can be pointed to the location of data stored, and
it discovers the data and its metadata and stores
it in Glue Catalog. Once the data is in the catalog,
it can be searched, queried, and made available
for ETL process.
Amazon Lake Formation
• It is a service that helps in securing data lake. Data Lake
can be visualised as a centralized, customized and
secured data repository which stores this data in the
original form as well as a processed form meant for
data analysis. It helps combine various types of
analytics that help in gaining deeper insights to data
thereby helping make better business decisions. But
the process of setting up and managing a data lake
requires a lot of manual efforts. But Lake Formation
makes this process an easy one by collecting data and
cataloguing it. This data is then classified using ML
algorithms as well as providing security for sensitive
data.
Amazon Managed streaming
for Kafka (MSK)
• It is a service that helps in building and
running applications that use Apache Kafka. It
is fully managed and helps process streaming
data. Apache Kafka is open-source and is used
to build real-time streaming data pipelines
and application. With the help of MSK, Kafka
API can be used to populate data lakes, reflect
changes in the database and use machine
learning to power other applications.
Application Services
• Application Services (often
used instead of application
management services or
application services
management) are a pool of
services such as load
balancing, application
performance monitoring,
application acceleration,
autoscaling,
micro-segmentation, service
proxy and service discovery
needed to optimally deploy,
run and improve
applications.
What is Application Services
Management?
• The process of configuring, monitoring, optimizing and
orchestrating different app services is known as application
services management.
• Today, organizations with their own data centers or which
use the public cloud, handle applications services
management. In the early days of online
adoption, application service providers (or ASPs) were
companies which would deliver applications to end users
for a fixed cost. This single tenant, hosted model was
largely replaced by the advent of the Software-as-a-Service
(SaaS) delivery model which was multi-tenant and on-
demand.
What is Cloud Application Services?
• Cloud App Services are a wide range of
specific application services for applications
deployed in cloud-based resources. Services
such as load balancing, application firewalling
and service discovery can be achieved for
applications running in private, public, hybrid
or multi-cloud environments.
What are App Modernization
Services?
• Traditional applications were built as monolithic blocks of software. These
monolithic applications have long life cycles because any changes or updates to
one function, usually requires reconfiguring the entire application. This costly and
time consuming process delays advancements and updates in application
development.
• Application Modernization Services enable the migration of monolithic, legacy
application architectures to new application architectures that more closely match
the business needs of modern enterprises’ application portfolio. Application
modernization is often part of an organization’s digital transformation.
• An example of this is the use of a microservices architecture where all app services
are created individually and deployed separately from one another. This allows for
scaling services based on specific business needs. Services can also be rapidly
changed without affecting other parts of the application. Application-centric
enterprises are choosing microservices architectures to take advantage of flexible
container-based infrastructure models.
Thank You
Any Suggestions

More Related Content

What's hot

AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...
AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...
AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...Amazon Web Services
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftAmazon Web Services
 
AWS Data Transfer Services: Data Ingest Strategies Into the AWS Cloud
AWS Data Transfer Services: Data Ingest Strategies Into the AWS CloudAWS Data Transfer Services: Data Ingest Strategies Into the AWS Cloud
AWS Data Transfer Services: Data Ingest Strategies Into the AWS CloudAmazon Web Services
 
Full Stack Analytics on AWS - AWS Summit Cape Town 2017
Full Stack Analytics on AWS - AWS Summit Cape Town 2017 Full Stack Analytics on AWS - AWS Summit Cape Town 2017
Full Stack Analytics on AWS - AWS Summit Cape Town 2017 Amazon Web Services
 
Big Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSBig Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSAmazon Web Services
 
Scalable Data Analytics - DevDay Austin 2017 Day 2
Scalable Data Analytics - DevDay Austin 2017 Day 2Scalable Data Analytics - DevDay Austin 2017 Day 2
Scalable Data Analytics - DevDay Austin 2017 Day 2Amazon Web Services
 
(BDT317) Building A Data Lake On AWS
(BDT317) Building A Data Lake On AWS(BDT317) Building A Data Lake On AWS
(BDT317) Building A Data Lake On AWSAmazon Web Services
 
Getting Started with Amazon QuickSight
Getting Started with Amazon QuickSightGetting Started with Amazon QuickSight
Getting Started with Amazon QuickSightAmazon Web Services
 
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...Amazon Web Services
 
AWS Kinesis - Streams, Firehose, Analytics
AWS Kinesis - Streams, Firehose, AnalyticsAWS Kinesis - Streams, Firehose, Analytics
AWS Kinesis - Streams, Firehose, AnalyticsSerhat Can
 
Real-Time Streaming: Intro to Amazon Kinesis
Real-Time Streaming: Intro to Amazon KinesisReal-Time Streaming: Intro to Amazon Kinesis
Real-Time Streaming: Intro to Amazon KinesisAmazon Web Services
 
(BDT308) Using Amazon Elastic MapReduce as Your Scalable Data Warehouse | AWS...
(BDT308) Using Amazon Elastic MapReduce as Your Scalable Data Warehouse | AWS...(BDT308) Using Amazon Elastic MapReduce as Your Scalable Data Warehouse | AWS...
(BDT308) Using Amazon Elastic MapReduce as Your Scalable Data Warehouse | AWS...Amazon Web Services
 
SRV420 Analyzing Streaming Data in Real-time with Amazon Kinesis
SRV420 Analyzing Streaming Data in Real-time with Amazon KinesisSRV420 Analyzing Streaming Data in Real-time with Amazon Kinesis
SRV420 Analyzing Streaming Data in Real-time with Amazon KinesisAmazon Web Services
 
Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2Amazon Web Services
 
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...Amazon Web Services
 
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)Amazon Web Services
 
Streaming data analytics (Kinesis, EMR/Spark) - Pop-up Loft Tel Aviv
Streaming data analytics (Kinesis, EMR/Spark) - Pop-up Loft Tel Aviv Streaming data analytics (Kinesis, EMR/Spark) - Pop-up Loft Tel Aviv
Streaming data analytics (Kinesis, EMR/Spark) - Pop-up Loft Tel Aviv Amazon Web Services
 
Best Practices for Building a Data Lake on AWS
Best Practices for Building a Data Lake on AWSBest Practices for Building a Data Lake on AWS
Best Practices for Building a Data Lake on AWSAmazon Web Services
 

What's hot (20)

AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...
AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...
AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
 
AWS Data Transfer Services: Data Ingest Strategies Into the AWS Cloud
AWS Data Transfer Services: Data Ingest Strategies Into the AWS CloudAWS Data Transfer Services: Data Ingest Strategies Into the AWS Cloud
AWS Data Transfer Services: Data Ingest Strategies Into the AWS Cloud
 
Full Stack Analytics on AWS - AWS Summit Cape Town 2017
Full Stack Analytics on AWS - AWS Summit Cape Town 2017 Full Stack Analytics on AWS - AWS Summit Cape Town 2017
Full Stack Analytics on AWS - AWS Summit Cape Town 2017
 
Big Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSBig Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWS
 
Scalable Data Analytics - DevDay Austin 2017 Day 2
Scalable Data Analytics - DevDay Austin 2017 Day 2Scalable Data Analytics - DevDay Austin 2017 Day 2
Scalable Data Analytics - DevDay Austin 2017 Day 2
 
(BDT317) Building A Data Lake On AWS
(BDT317) Building A Data Lake On AWS(BDT317) Building A Data Lake On AWS
(BDT317) Building A Data Lake On AWS
 
Getting Started with Amazon QuickSight
Getting Started with Amazon QuickSightGetting Started with Amazon QuickSight
Getting Started with Amazon QuickSight
 
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
 
AWS Kinesis - Streams, Firehose, Analytics
AWS Kinesis - Streams, Firehose, AnalyticsAWS Kinesis - Streams, Firehose, Analytics
AWS Kinesis - Streams, Firehose, Analytics
 
Real-Time Streaming: Intro to Amazon Kinesis
Real-Time Streaming: Intro to Amazon KinesisReal-Time Streaming: Intro to Amazon Kinesis
Real-Time Streaming: Intro to Amazon Kinesis
 
Introduction to Amazon Athena
Introduction to Amazon AthenaIntroduction to Amazon Athena
Introduction to Amazon Athena
 
(BDT308) Using Amazon Elastic MapReduce as Your Scalable Data Warehouse | AWS...
(BDT308) Using Amazon Elastic MapReduce as Your Scalable Data Warehouse | AWS...(BDT308) Using Amazon Elastic MapReduce as Your Scalable Data Warehouse | AWS...
(BDT308) Using Amazon Elastic MapReduce as Your Scalable Data Warehouse | AWS...
 
SRV420 Analyzing Streaming Data in Real-time with Amazon Kinesis
SRV420 Analyzing Streaming Data in Real-time with Amazon KinesisSRV420 Analyzing Streaming Data in Real-time with Amazon Kinesis
SRV420 Analyzing Streaming Data in Real-time with Amazon Kinesis
 
Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2
 
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
 
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)
AWS re:Invent 2016: Automating Workflows for Analytics Pipelines (DEV401)
 
Streaming data analytics (Kinesis, EMR/Spark) - Pop-up Loft Tel Aviv
Streaming data analytics (Kinesis, EMR/Spark) - Pop-up Loft Tel Aviv Streaming data analytics (Kinesis, EMR/Spark) - Pop-up Loft Tel Aviv
Streaming data analytics (Kinesis, EMR/Spark) - Pop-up Loft Tel Aviv
 
Best Practices for Building a Data Lake on AWS
Best Practices for Building a Data Lake on AWSBest Practices for Building a Data Lake on AWS
Best Practices for Building a Data Lake on AWS
 
Introduction to AWS Glue
Introduction to AWS GlueIntroduction to AWS Glue
Introduction to AWS Glue
 

Similar to AWS Analytics Services Guide

Aws re invent 2018 recap
Aws re invent 2018 recapAws re invent 2018 recap
Aws re invent 2018 recapCloudHesive
 
Data Modernization_Harinath Susairaj.pptx
Data Modernization_Harinath Susairaj.pptxData Modernization_Harinath Susairaj.pptx
Data Modernization_Harinath Susairaj.pptxArunPandiyan890855
 
Data Engineering
Data EngineeringData Engineering
Data Engineeringkiansahafi
 
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014Amazon Web Services
 
Qubole on AWS - White paper
Qubole on AWS - White paper Qubole on AWS - White paper
Qubole on AWS - White paper Vasu S
 
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)Amazon Web Services
 
Introduction to Azure fundamentals of cloud.pptx
Introduction to Azure fundamentals of cloud.pptxIntroduction to Azure fundamentals of cloud.pptx
Introduction to Azure fundamentals of cloud.pptxNadir Arain
 
Unit 2 part 1.pptx
Unit 2 part 1.pptxUnit 2 part 1.pptx
Unit 2 part 1.pptxSargamKuntal
 
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAmazon Web Services
 
Third party cloud services cloud computing
Third party cloud services cloud computingThird party cloud services cloud computing
Third party cloud services cloud computingSohailAliMalik
 
Welcome & AWS Big Data Solution Overview
Welcome & AWS Big Data Solution OverviewWelcome & AWS Big Data Solution Overview
Welcome & AWS Big Data Solution OverviewAmazon Web Services
 
AWS Summit Singapore Webinar Edition | Architecting a Serverless Data Lake on...
AWS Summit Singapore Webinar Edition | Architecting a Serverless Data Lake on...AWS Summit Singapore Webinar Edition | Architecting a Serverless Data Lake on...
AWS Summit Singapore Webinar Edition | Architecting a Serverless Data Lake on...Amazon Web Services
 
Amazon AWS vs Azure Cloud vs Kubernetes
Amazon AWS vs Azure Cloud vs KubernetesAmazon AWS vs Azure Cloud vs Kubernetes
Amazon AWS vs Azure Cloud vs KubernetesStridely Solutions
 
Enabling SQL Access to Data Lakes
Enabling SQL Access to Data LakesEnabling SQL Access to Data Lakes
Enabling SQL Access to Data LakesVasu S
 
Database project edi
Database project ediDatabase project edi
Database project ediRey Jefferson
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformDATAVERSITY
 
利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料Amazon Web Services
 

Similar to AWS Analytics Services Guide (20)

Aws re invent 2018 recap
Aws re invent 2018 recapAws re invent 2018 recap
Aws re invent 2018 recap
 
Data Modernization_Harinath Susairaj.pptx
Data Modernization_Harinath Susairaj.pptxData Modernization_Harinath Susairaj.pptx
Data Modernization_Harinath Susairaj.pptx
 
Data Engineering
Data EngineeringData Engineering
Data Engineering
 
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
 
Qubole on AWS - White paper
Qubole on AWS - White paper Qubole on AWS - White paper
Qubole on AWS - White paper
 
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
 
Introduction to Azure fundamentals of cloud.pptx
Introduction to Azure fundamentals of cloud.pptxIntroduction to Azure fundamentals of cloud.pptx
Introduction to Azure fundamentals of cloud.pptx
 
Unit 2 part 1.pptx
Unit 2 part 1.pptxUnit 2 part 1.pptx
Unit 2 part 1.pptx
 
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
 
Third party cloud services cloud computing
Third party cloud services cloud computingThird party cloud services cloud computing
Third party cloud services cloud computing
 
Demistifying serverless on aws
Demistifying serverless on awsDemistifying serverless on aws
Demistifying serverless on aws
 
Welcome & AWS Big Data Solution Overview
Welcome & AWS Big Data Solution OverviewWelcome & AWS Big Data Solution Overview
Welcome & AWS Big Data Solution Overview
 
AWS Summit Singapore Webinar Edition | Architecting a Serverless Data Lake on...
AWS Summit Singapore Webinar Edition | Architecting a Serverless Data Lake on...AWS Summit Singapore Webinar Edition | Architecting a Serverless Data Lake on...
AWS Summit Singapore Webinar Edition | Architecting a Serverless Data Lake on...
 
Amazon AWS vs Azure Cloud vs Kubernetes
Amazon AWS vs Azure Cloud vs KubernetesAmazon AWS vs Azure Cloud vs Kubernetes
Amazon AWS vs Azure Cloud vs Kubernetes
 
Enabling SQL Access to Data Lakes
Enabling SQL Access to Data LakesEnabling SQL Access to Data Lakes
Enabling SQL Access to Data Lakes
 
Database project edi
Database project ediDatabase project edi
Database project edi
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
Aws centralized logs
Aws centralized logsAws centralized logs
Aws centralized logs
 

Recently uploaded

Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptMadan Karki
 
Solving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptSolving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptJasonTagapanGulla
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHC Sai Kiran
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...Chandu841456
 
Internet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxInternet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxVelmuruganTECE
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating SystemRashmi Bhat
 
Research Methodology for Engineering pdf
Research Methodology for Engineering pdfResearch Methodology for Engineering pdf
Research Methodology for Engineering pdfCaalaaAbdulkerim
 
Industrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptIndustrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptNarmatha D
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionMebane Rash
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONjhunlian
 
home automation using Arduino by Aditya Prasad
home automation using Arduino by Aditya Prasadhome automation using Arduino by Aditya Prasad
home automation using Arduino by Aditya Prasadaditya806802
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the weldingMuhammadUzairLiaqat
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxsiddharthjain2303
 
Earthing details of Electrical Substation
Earthing details of Electrical SubstationEarthing details of Electrical Substation
Earthing details of Electrical Substationstephanwindworld
 

Recently uploaded (20)

Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.ppt
 
Solving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptSolving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.ppt
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECH
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...
 
Internet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxInternet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptx
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating System
 
Research Methodology for Engineering pdf
Research Methodology for Engineering pdfResearch Methodology for Engineering pdf
Research Methodology for Engineering pdf
 
Industrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptIndustrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.ppt
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of Action
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
 
home automation using Arduino by Aditya Prasad
home automation using Arduino by Aditya Prasadhome automation using Arduino by Aditya Prasad
home automation using Arduino by Aditya Prasad
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the welding
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptx
 
Earthing details of Electrical Substation
Earthing details of Electrical SubstationEarthing details of Electrical Substation
Earthing details of Electrical Substation
 

AWS Analytics Services Guide

  • 1. What are AWS Analytics Services? It is a service offered by Amazon that helps in processing and providing insights to data by analyzing it. There are many services in the analytics section which have been discussed below: https://forms.office.com/r/DKwT0JCEUZ
  • 2. Amazon Athena • An interactive query engine which helps in easy analysis of data stored in S3 with the help of SQL queries. It is server-less (start query instantly), hence having no infrastructure that needs to be managed. The user only pays for the queries they run ($5 per terabyte scanned by your query). • Athena is easy to use. Simply point to your data in Amazon S3, define the schema, and start querying using standard SQL. Most results are delivered within seconds. With Athena, there’s no need for complex ETL jobs to prepare your data for analysis. This makes it easy for anyone with SQL skills to quickly analyze large-scale datasets. • Athena is out-of-the-box integrated with AWS Glue Data Catalog, allowing you to create a unified metadata repository across various services, crawl data sources to discover schemas and populate your Catalog with new and modified table and partition definitions, and maintain schema versioning.
  • 3. Amazon EMR • An Industry-leading cloud big-data platform for processing huge amount of data using open source tools such as Apache Hadoop, Spark, Hive, Hbase, and Presto etc. • It makes it easy to setup, operate, and scale your big data environments by automating time consuming tasks like provisioning capacity and tuning clusters. • It provides a Hadoop managed framework, which helps in quick and cost-effective method of processing large amounts of data across on-the-g- scalable EC2 instances. EMR notebooks are similar to Jupiter notebooks, and provide an environment to develop and collaborate for dynamic querying and exploratory analysis if data. • It is highly-secure and offers reliability to handle a wide range of uses cases, which include, but are not limited to machine learning, scientific simulation, log analysis, web indexing, data transformations (ETL), and bioinformatics.
  • 4. Amazon CloudSearch • It is a service managed by AWS Cloud, • It makes the process of setting-up, managing and scaling a search solution for a website or an application an easy task. • It supports 34 languages and comes with features such as highlighting, auto-complete, and geospatial searchability. • With ease of operation feature: features like auto-scaling, self-healing clusters, high availability and data durability • Easy administration: (security) built in integration with IAM and (auditing) built-in integration with Cloudtrail. • You don’t have to become a search expert or worry about h/w provisioning, setup and maintenance. • Offers pay-as-you-go pricing with no upfront expenses or long-term commitments. • Search endpoint, document endpoint, Document ARN, Searchable documents
  • 5. Amazon ElasticSearch service • It helps ElasticSearch in the process of deploying, securing, operating and scaling data by searching, analysing and visualizing this data in real-time. • It comes with easy-to-use APIs which can be used for log analysis, full-text search, scaling requirements, monitoring applications, clickstream analytics, and security. • It can also be integrated with open-source tools such as Kibana, Logstash which help in ingesting data and visualization of data respectively. • It’s a open source search engine, writen in java language and is based on Apache Leucin. • It is real-time distributed & analytics engine • It supports full-text search instead of table/scheme. • It is document oriented
  • 6. Amazon Kinesis • It makes the process of collecting, processing, and analysing real-time data an easy task. This helps in achieving real-time insights to data so that this data can be analysed and actions can be taken based on the insights quickly. Real-time data including video, audio, application logs, website clickstreams, IoT telemetry data (meant for machine learning) can be ingested and this data can be responded to immediately in contrast to waiting for all the data to arrive before beginning pre-processing on it. It currently offers services like- Kinesis Data Firehose, Kinesis Data Analytics, Kinesis Data Streams, and Kinesis Video Streams.
  • 7. Amazon redshift • It is a quick, and scalable data warehouse which helps in analysing user data in a cost-efficient and simple manner. It delivers 10 times quicker performance in comparison to other data warehousing services, since it uses machine learning, massively parallel query execution and columnar storage on high-performance disks. Petabytes of data can be queried upon, and a new data warehouse can be setup and deployed in a matter of minutes.
  • 8. Amazon Quicksight • It is a cloud powered business intelligence service, which is fast, and helps in delivery insights to people in an organization. It allows users to create and publish interactive visual dashboards which can be accessed via mobile devices and browsers. These visual dashboards can be integrated with other applications thereby providing customers with powerful self- serving analysis service. • Functions of Quicksight (Business Intelligence tool) • Performance management: - • Data wrangling – Converting the data into a structured format (acquising, combining, cleansing) • Data mining – ETL • Data analytics • Data warehouse - • Predictive modelling
  • 9. AWS data pipeline • It helps process and move data between different AWS resources (compute and storage devices). Data can be regularly accessed from the place it is stored, it can be transformed and processed at scale. The result of this data processing can be transferred to other AWS services such as S3, RDS, DynamoDB, and EMR. It helps in the creation of complex data processing workloads which provide facilities such as high fault tolerance, high availability and repeatability.
  • 10. AWS Glue • It is a completely managed ETL service (Extract, Transform, Load) which helps users prepare and load their data for the purpose of analysis. An ETL job can be set up and run with a few mouse clicks from the AWS Management Console itself. Glue can be pointed to the location of data stored, and it discovers the data and its metadata and stores it in Glue Catalog. Once the data is in the catalog, it can be searched, queried, and made available for ETL process.
  • 11. Amazon Lake Formation • It is a service that helps in securing data lake. Data Lake can be visualised as a centralized, customized and secured data repository which stores this data in the original form as well as a processed form meant for data analysis. It helps combine various types of analytics that help in gaining deeper insights to data thereby helping make better business decisions. But the process of setting up and managing a data lake requires a lot of manual efforts. But Lake Formation makes this process an easy one by collecting data and cataloguing it. This data is then classified using ML algorithms as well as providing security for sensitive data.
  • 12. Amazon Managed streaming for Kafka (MSK) • It is a service that helps in building and running applications that use Apache Kafka. It is fully managed and helps process streaming data. Apache Kafka is open-source and is used to build real-time streaming data pipelines and application. With the help of MSK, Kafka API can be used to populate data lakes, reflect changes in the database and use machine learning to power other applications.
  • 14. • Application Services (often used instead of application management services or application services management) are a pool of services such as load balancing, application performance monitoring, application acceleration, autoscaling, micro-segmentation, service proxy and service discovery needed to optimally deploy, run and improve applications.
  • 15. What is Application Services Management? • The process of configuring, monitoring, optimizing and orchestrating different app services is known as application services management. • Today, organizations with their own data centers or which use the public cloud, handle applications services management. In the early days of online adoption, application service providers (or ASPs) were companies which would deliver applications to end users for a fixed cost. This single tenant, hosted model was largely replaced by the advent of the Software-as-a-Service (SaaS) delivery model which was multi-tenant and on- demand.
  • 16. What is Cloud Application Services? • Cloud App Services are a wide range of specific application services for applications deployed in cloud-based resources. Services such as load balancing, application firewalling and service discovery can be achieved for applications running in private, public, hybrid or multi-cloud environments.
  • 17. What are App Modernization Services? • Traditional applications were built as monolithic blocks of software. These monolithic applications have long life cycles because any changes or updates to one function, usually requires reconfiguring the entire application. This costly and time consuming process delays advancements and updates in application development. • Application Modernization Services enable the migration of monolithic, legacy application architectures to new application architectures that more closely match the business needs of modern enterprises’ application portfolio. Application modernization is often part of an organization’s digital transformation. • An example of this is the use of a microservices architecture where all app services are created individually and deployed separately from one another. This allows for scaling services based on specific business needs. Services can also be rapidly changed without affecting other parts of the application. Application-centric enterprises are choosing microservices architectures to take advantage of flexible container-based infrastructure models.