2. AMAZON CONFIDENTIAL
§ 1:00-1:10pm Imagine Your Future for Video: Our Innovation Strategy
§ 1:10-1:45pm Cloud-based Video On Demand
§ 1:45-2:30pm Cloud-enabled Live and Linear Video
§ 2:30-3:15pm Personalizing Video Experiences with AWS Media Services
§ 3:15-3:30pm BREAK
§ 3:30-4:15pm The Role of Machine Learning in Media Workflows
§ 4:15-5:00pm MAM/DAM and Media Analytics
§ 5:00-6:00pm NETWORKING HAPPY HOUR
2
AGENDA
3. AMAZON CONFIDENTIAL 3
IMAGINE YOUR
FUTURE FOR VIDEO:
Our Innovation Strategy
Aslam Khader
Chief Product Officer
AWS Elemental
13. AMAZON CONFIDENTIAL 13
THE INDUSTRY IS REORIENTING AROUND THE VIEWERCLOUD RECONNECTS YOU WITH YOUR AUDIENCE
Shift
Mobility Personal
ReliableQuality
Quality
Reliability Scalability
Agility Resiliency
16. AMAZON CONFIDENTIAL 16
BROADCAST QUALITY WORKFLOWS IN MINUTES
Business & Control Systems: CMS – MAM – OVP – MSP, etc
C A P T URE I N G E S T P R O C E SS & S T O RE D I S P L A YD E L I V E RY
M e d i a C o n v e r t M e d i a L i v e M e d i a P a c k a g e M e d i a S t o r e M e d i T a i l o rMediaTailorMediaStoreMediaPackageMediaLiveMediaConvert
17. AMAZON CONFIDENTIAL 23
INTEGRATE WITH COMPLEMENTARY AWS SERVICES
Amazon Rekognition Video
Image recognition, Celebrity, Actions
Amazon Translate
Natural and fluent language translation
Amazon Transcribe
Automatic speech recognition
ML/AI services
Amazon
CloudFront
Amazon
Route 53
Amazon
Direct Connect
AWS
Lambda
AWS
CloudFormation
Amazon
CloudWatch
20. CONTINUE TO BUILD AWS MEDIA SERVICES PARTNER ECOSYSTEM
CDN
Amazon
CloudFront
3RD Party CDN
DIGITAL RIGHTS
MANAGEMENT
AWS
Media Services
and Appliances
INTEGRATED VIDEO
PROCESSING AND DELIVERY
VIDEO QC
MEDIA LOGISTICSMAM / DAM
AD TECH
20
24. AMAZON CONFIDENTIAL 24
CHALLENGES OF ON-PREM TRANSCODING
§ Set up and manage
physical infrastructure
§ Scaling is challenging;
build for peak
§ Large investments
upfront
25. AMAZON CONFIDENTIAL
§ Serverless
§ Any file-to-file use case
§ All video markets
§ PAYU library conversion
§ Speed and Scale
25
ON-DEMAND VIDEO WORKFLOWS – CLOUD ENABLES:
“With our ability to provide
programmers with live and
on-demand channel playout
using AWS Media Services,
Qello makes it easy for
content programmers to get
new services running.”
- Brian Lisi, CEO of Qello
26. AMAZON CONFIDENTIAL 26
AWS ELEMENTAL MEDIACONVERT
§ Access to professional grade video features and quality
§ No set up or management of underlying infrastructure
§ Automatically scales in response to variations in incoming video
volume
§ Ability to manage capacity and jobs processed in parallel allows
scaling up to thousands of assets
AWS Elemental MediaConvert is a file-based video processing service
that allows anyone, with any size content library, to easily and reliably
transcode on-demand content for broadcast and multiscreen delivery
27. AMAZON CONFIDENTIAL 27
MEDIACONVERT CONTRUCTS
JOB TEMPLATE
METADATA
OTHER
JOB
CONFIG.
INPUT 1
INPUT
CONFIGURATION
HLS OUTPUT GROUP
PRESET 1
PRESET 2
PRESET n
INPUT MEDIAASSET(S)
JOB 1
INPUT 1
INPUT FILE 1
INPUT
CONFIGURATION
HLS OUTPUT GROUP
OUTPUT 1
OUTPUT 2
OUTPUT n
METADATA
OTHER
JOB
CONFIG.
QUEUE(S)
Manage resources for parallel
processing of jobs
NOTIFICATIONSAND METRICS
DOWNSTREAM
PROCESSING
TRANSCODED MEDIA IN S3
Amazon
CLOUDWATCH
HTTP/HTTPS
Email
SMS
Application
LAMBDA
SQS
28. AMAZON CONFIDENTIAL
AWS ELEMENTAL MEDIACONVERT ROADMAP
Reserved
Transcoding Slots
Support
CMAF Presets
and templates
QVBR, PQAvailability in 14
Regions
Split and Stitch
Support
IMF Ingest
Please note that this document is intended to outline our general product direction and should not be relied on in making a purchasing decision and does not
represent a commitment, promise or legal obligation to deliver any material, code or functionality. It is intended for information purposes only, and may not be
incorporated into any contract. Any forward-looking indication of plans for products including the development, release, and timing of any features or
functionality is preliminary and all future release dates are tentative and are subject to change.
Security
Enhancements
Encrypted Inputs, SPEKE
certificate encryption
Currently available
Roadmap Features
Production
Formats
Bill Tagging
29. AMAZON CONFIDENTIAL 29
GLOBAL DEPLOYMENT
§ AWS MediaConvert available in
14 regions
§ US East (Virginia)
§ US East (Ohio)
§ US West (Oregon)
§ US West (Northern California)
§ Canada (Central)
§ EU (Frankfurt)
§ EU (Ireland)
§ EU (London)
§ Asia Pacific (Mumbai)
§ Asia Pacific (Seoul)
§ Asia Pacific (Singapore)
§ Asia Pacific (Sydney)
§ Asia Pacific (Tokyo)
§ South America (São Paulo)
30. AMAZON CONFIDENTIAL
VIDEO ON DEMAND CLOUDFORMATION TEMPLATE
VOD Starter Kit deployed in any region
AWS Lambda adds videos to the source S3
bucket to start the conversion workflow
Web page displays the data stored in the
VOD starter kit Amazon DynamoDB table,
along with a preview of the converted video
assets
1
2
3
1
2
3
31. AMAZON CONFIDENTIAL 31
WHAT’S NEW
§ AWS MediaConvert - new basic pricing tier
§ AVC codec and single-pass encoding
§ Limited features such as trimming, stitching, and static overlays
§ Best option for processing user-generated content or other short-form content
that is primarily meant for distribution over the web
§ Easily identify features that take you to the professional pricing tier
32. AMAZON CONFIDENTIAL
§ New Pricing Tier:
§ Customers asked for PAAS mode
§ Pay once for “all you can eat transcoding”
§ Reserved Transcoding Slots:
§ Can be used with Professional Tier or Basic Tier Jobs and Templates
• Prices starting at $400 a month (depending on region)
• With a 12 Month Commitment
• Can be used in conjunction with on-demand slots for additional capacity
RTS PRICING
36. AMAZON CONFIDENTIAL
CBR DISADVANTAGES
0
5
10
15
20
25
Time 1 2 3 4 5 6 7 8 9
Video Complexity (Constant Quality)
CBR Bitrate
Wasted bits
Wasted
bits
Bad VQ
Bitrate
However, video complexity varies too dramatically for practical constant bitrate
Confidential
37. AMAZON CONFIDENTIAL
0
5
10
15
20
25
Time 1 2 3 4 5 6 7 8 9
WHAT WE WANT IS THIS
Bitrate
Video Complexity (Constant Quality)
Max Bitrate (Network Bandwidth)
QVBR
No Wasted Bits – Bits Savings!
Quality-defined VBR (QVBR) uses the right amount of bits to reach quality target
Confidential
41. AMAZON CONFIDENTIAL 41
CUSTOMER TESTIMONIAL
§ We have been able to meet the needs of Magnolia and other companies
by…
§ Delivering complex "broadcast-grade" advanced encoding and audio/video
processing features
§ Providing an elastic approach to scale
§ Having a predictable cost model, using CloudWatch metrics for modeling
§ As an AWS Service, we get a lot of AWS "stuff" for free!
§ Job tracking via SNS/CloudWatch events – no need to constantly poll AWS
Elemental for status or have a separate callback listener
§ Easier cost tracking and allocation per customer via CloudWatch metrics
§ AWS Security Integration (IAM Roles) - Grant Customer Accounts Rights
§ Deployments and Ops time savers
§ Versioning and upgrading to new releases is now as simple as changing API
endpoints
§ We think of it as a "job now," not a group of customer servers
§ Moved to JSON API, code friendly with our developers
44. AMAZON CONFIDENTIAL
§ More dynamic events
§ Flexible Ultra HD & HDR
§ Converging IP Headend
§ PAYG disaster recovery
§ Integrated live-to-VOD
44
LIVE & LINEAR VIDEO WORKFLOWS – CLOUD ENABLES:
“Working with these new
media services has enabled
us to free up time to build up
redundant workflows quickly
and easily as well as to focus
on innovating new services
and enhancing our existing
offers.”
- Marco de la Cruz, Sub-
Director, Cinépolis KLIC
45. AMAZON CONFIDENTIAL
AWS ELEMENTAL MEDIALIVE
§ Built for broadcast workload requirements
§ Streamlined setup and operation
§ Channel or event-based pricing
AWS Elemental
MediaLive
LIVE VIDEO
PROCESSING
AWS Elemental
MediaPackage
JUST-IN-TIME
PACKAGING & ORIGIN
Amazon
CloudFront
CDN
Live TV
INPUT
Amazon
Direct Connect
DATA TRANSFER DEVICES
46. AMAZON CONFIDENTIAL
§ Distribute content to multiple delivery protocols &
content protection standards
§ Add live-to-VOD capabilities
§ Multi-AZ redundancy, elastic scale
§ Pricing on video ingested and originated
AWS ELEMENTAL MEDIAPACKAGE
AWS Elemental
MediaLive
LIVE VIDEO PROCESSING
AWS Elemental
MediaPackage
ORIGINATION AND JUST-IN-TIME
PACKAGING CDN
Amazon
CLOUDFRONTLive Channel
SOURCE DISPLAY DEVICES
Multiple OTT devices
AWS Elemental
MediaLive
LIVE VIDEO
PROCESSING
AWS Elemental
MediaPackage
JUST-IN-TIME
PACKAGING & ORIGIN
LIVE
47. AMAZON CONFIDENTIAL
§ Access a durable high performance storage tier
§ Transitioned to S3 in minutes for durability
§ Integrates with any CDN
§ Simple integration
§ Priced on data GBs ingested
AWS ELEMENTAL MEDIASTORE
SOURCE
FILE
LIVE CDN
Amazon
CLOUDFRONTAmazon
CLOUDFRONT
DISPLAY DEVICES
Multiple OTT devices
AWS Elemental
MediaLive
LIVE VIDEO
PROCESSING
AWS Elemental
MediaStore
MEDIA-OPTIMZED
STORAGE & ORIGIN
48. AMAZON CONFIDENTIAL
MEDIALIVE 1H2018 RELEASES
Resource editing
RTMP/S outputs
& Sydney
Channel logging
& Frankfurt
Console alerts &
Seoul
JSON
import/export
Reserved
inputs/outputs
Sao Paulo
Please note that this document is intended to outline our general product direction and should not be relied on in making a purchasing decision and does not
represent a commitment, promise or legal obligation to deliver any material, code or functionality. It is intended for information purposes only, and may not be
incorporated into any contract. Any forward-looking indication of plans for products including the development, release, and timing of any features or
functionality is preliminary and all future release dates are tentative and are subject to change.
49. AMAZON CONFIDENTIAL
MEDIALIVE 2H2018 RELEASES
SCTE insertion
Input switching Enhanced VQ
QVBR
Pipeline locking
Static image
overlay
HLS External
manifests
Pause outputs
Mumbai
GovCloud
Please note that this document is intended to outline our general product direction and should not be relied on in making a purchasing decision and does not
represent a commitment, promise or legal obligation to deliver any material, code or functionality. It is intended for information purposes only, and may not be
incorporated into any contract. Any forward-looking indication of plans for products including the development, release, and timing of any features or
functionality is preliminary and all future release dates are tentative and are subject to change.
2018 Roadmap
50. AMAZON CONFIDENTIAL
AWS ELEMENTAL MEDIAPACKAGE ROADMAP
Input Redundancy
DASH updates and
enhancements
??
VOD JITP
Multi-period
DASH
CDN
authentication
Please note that this document is intended to outline our general product direction and should not be relied on in making a purchasing decision and does not represent a commitment, promise or legal obligation to deliver
any material, code or functionality. It is intended for information purposes only, and may not be incorporated into any contract. Any forward-looking indication of plans for products including the development, release, and
timing of any features or functionality is preliminary and all future release dates are tentative and are subject to change.
2018 Roadmap
More regional
deployments
DCA
50
51. AMAZON CONFIDENTIAL
HOW IT WORKS
SOURCE
Input A
Input B
Output Group
Output 1
Output 2
Output 3
Output 4
Output A
Output B
Origin Amazon
CloudFront
CDN
Source Content AWS Elemental MediaLive
an AWS Elemental MediaLive channel
converts redundant live inputs into two
streams of multiple compressed outputs.
Origination and Packaging
- AWS Elemental MediaStore
- AWS Elemental MediaPackage
CDN
Amazon CloudFront
and/or other CDNs
deliver video to output
devices
55. AMAZON CONFIDENTIAL
§ Use Case: Broadcast live directly from consumer devices in a lightweight workflow
§ Workflow: Convert from RTMP to HLS using NGINX and MediaStore as an efficient origin
55
USER-GENERATED CONTENT WORKFLOW
CDN
Amazon
CloudFront
DISPLAY DEVICES
NGINX/RTMP
module
CONVERSION
ON EC2
RTMP
Output
SOURCE
AWS Elemental
MediaStore
MEDIA-OPTIMZED
STORAGE & ORIGIN
56. AMAZON CONFIDENTIAL
§ Use Case: Premium video with encoders at source delivering OTT to consumers
§ Workflow: Create ABR bouquet on ground and deliver using MediaStore as an efficient
origin
LIVE WORKFLOW WITH ENCODING ON GROUND
Live Channel
SOURCE
AWS Elemental
MediaStore
MEDIA-OPTIMZED
STORAGE & ORIGIN
AWS Elemental
LIVE
LIVE VIDEO
PROCESSING
CDN
Amazon
CloudFront
DISPLAY DEVICES
57. AMAZON CONFIDENTIAL
§ Use Case: (v)MVPD adding multiscreen to their offerings
§ Workflow: Source channels into AWS, transcode in MediaLive, package to multiple
devices, secure, originate in MediaPackage and deliver from the cloud
TYPICAL PAY-TV WORKFLOW
AWS
Elemental
LIVE
CDN
Amazon
CloudFrontLive Channel
SOURCE DISPLAY DEVICES
Multiple OTT Devices
AWS Elemental
MediaPackage
ORIGINATION AND
PACKAGING
AWS Elemental
MediaLive
LIVE VIDEO
PROCESSING
AWS
Elemental
LIVE
LIVE VIDEO
PROCESSING
58. AMAZON CONFIDENTIAL
DEMO WORKFLOW
AWS
Elemental
LIVE
Live Channel
SOURCE DISPLAY DEVICES
Multiple OTT Devices
AWS Elemental
MediaPackage
ORIGINATION AND
PACKAGING
AWS Elemental
MediaLive
LIVE VIDEO
PROCESSING
AWS
Elemental
LIVE
LIVE VIDEO
PROCESSING
CDN
Amazon
CloudFront
63. AMAZON CONFIDENTIAL 63
EVOLUTION OF OTT ADVERTISING
Burnt-in
static ads
Client-side
advertising
insertion
Server-side
advertising
Insertion
2007 2012 2017
64. AMAZON CONFIDENTIAL
§ Personalized viewing
experiences
§ Increased content value
§ Deep Analytics
§ AI/ML enabled services
§ Greater monetization
64
PERSONALIZATION VIDEO WORKFLOWS – CLOUD ENABLES:
“We used the new AWS
Elemental Media Tailor service,
which allows us to deliver
targeted ads that seamlessly
inserted into the main content for
a great viewer experience … I am
confident AWS Media Services is
the path to use the cloud for live
streaming at scale.”
- Greg Hart, Global Head of
Amazon Video
65. AMAZON CONFIDENTIAL 65
INTRODUCING AWS ELEMENTAL MEDIATAILOR
§ Ads are better monetized, more consistent in video quality
and easier to manage across multi-platform environments
§ Managed transcoding provides a better user experience
§ Standards-based client and server-side ad reporting within a
single service
§ More control over the player, origin and CDN while providing a
better end-viewer experience at scale
AWS Elemental MediaTailor is a content personalization and
monetization service that allows customers to implement server-side ad
insertion while maintaining high quality of service
66. AMAZON CONFIDENTIAL
REGION
AWS ELEMENTAL MEDIATAILOR TYPICAL WORKFLOW (LIVE)
Video with SCTE 35/104
Personalized
manifest
*.m3u8 (not
cacheable)
VAST request &
response (which
ad should I show
for this user?)
Conditioned manifest with
ad markers
Elemental
LIVE
Live Video Processing
Manifestcache
AWS Elemental MediaTailor
ON-THE-FLY/MANAGED TRANSCODING
Amazon
CLOUDFRONT
CDN
Amazon ETS Amazon S3
CDN
Amazon
CLOUDFRONT
DEVICES
DFP, Freewheel or
otherADS
ADS
AWS Elemental
LIVE
LIVE VIDEO PROCESSING
AWS Elemental
DELTA
VIDEO DELIVERY PLATFORM
AWS Elemental
MediaTailor
MANIFEST MANIPULATION
Fetch ad segments
(cacheable on CDN)
Fetch content
segments (cacheable
on CDN)
Hybrid client-side
and server side
reporting
LIVE VIDEO
PROCESSING
AWS Elemental
MediaLive
AWS Elemental
MediaPackage
JUST IN TIME
PACKAGING &
ORIGIN
67. AMAZON CONFIDENTIAL
TYPICAL WORKFLOW (VOD)
Personalized
manifest
*.m3u8 (not
cacheable)
Fetch ad segments
(cacheable on CDN)
Fetch content segments
(cacheable on CDN)
VAST request &
response (which ad
should I show for
this user?)Manifestcache
DISPLAY DEVICES
+ reporting library
AWS
ELEMENTAL
MEDIA TAILOR
MANIFEST
MANIPULATION
AWS ELEMENTAL MEDIA
TAILOR
ON-THE-FLY/MANAGED TRANSCODING
Amazon
CLOUDFRONT
CDN
ADS
DFP, Freewheel or
other ADS
Amazon ETS Amazon S3
STATIC ORIGIN
Amazon S3
STORAGESTORAGE
File-based media
Assets can have CUE markers or we
rely on VMAP timing to schedule ads
69. AMAZON CONFIDENTIAL
AWS ELEMENTAL MEDIATAILOR 2018 ROADMAP
Granular transcoding policies
instead of “default transcode”
available
Please note that this document is intended to outline our general product direction and should not be relied on in making a purchasing decision and does not
represent a commitment, promise or legal obligation to deliver any material, code or functionality. It is intended for information purposes only, and may not be
incorporated into any contract. Any forward-looking indication of plans for products including the development, release, and timing of any features or
functionality is preliminary and all future release dates are tentative and are subject to change.
2018 Roadmap
LIVE Multi-Period
DASH support
FRA, PDX regions
Audio normalization
available (Backend
migration to EMC)
Pre-transcode API
available
Other DASH fast follow
features (i.e. subtitling,
VOD support)
Pre-rolls in LIVE
70. AMAZON CONFIDENTIAL
WE’RE BUILT WITH SCALE IN MIND
Manifest Manipulation
Availability Zone
Availability Zone
Availability Zone
Elastic Load
Balancing
DynamoDB
Config
&Session
Store
Users
71. AMAZON CONFIDENTIAL
LIVE VIDEO
PROCESSING
AWS Elemental
MediaLive
AWS Elemental
MediaPackage
JUST IN TIME
PACKAGING &
ORIGIN
AWS Elemental
MediaTailor
PERSONALIZATION
& MONETIZATION
AWS Elemental
Live
ON-PREMISES
ENCODING
Amazon
CloudFront
CDN
DEVICES
Live Channel
SOURCE
MEDIATAILOR DEMO WORKFLOW
73. AMAZON CONFIDENTIAL
WHY MACHINE LEARNING FOR MEDIA?
Increase
engagement and
content discovery
Accelerate existing
production processes
Launch new content
or services
73
75. AMAZON CONFIDENTIAL 75
MACHINE LEARNING IS HARD
Deployment
• Setup and
manage
inference
clusters
• Manage and
auto scale
inference
APIs
• Testing,
versioning
and
monitoring
Data Wrangling
• Setup and
manage
Notebook
environments
• Get data to
notebooks
securely
Experimentation
• Setup and
manage clusters
• Scale/distribute
ML algorithms
81. AMAZON CONFIDENTIAL
AMAZON REKOGNITION
Object & scene
detection
Facial
analysis
Face
comparison
Face
search
Celebrity
detection
Image
moderation
Text
detection
"Amazon Rekognition allows us to scalably identify
and track actors across millions of frames of content
with much higher reliability than any other solution
we've used.”
- Jared Browarnik, Co-Founder & CTO, TheTake
“Amazon Rekognition enables us to quickly and
efficiently add value through various automated
metadata tagging processes, and images and video
segments are much easier to find for our enterprise
and our customers.”
- Shane Murphy, Solutions Engineer, Scrippsnetworks
81
82. AMAZON CONFIDENTIAL
AMAZON REKOGNITION VIDEO
Object, scene & activity detection Face search Facial analysis Activity pathing
Unsafe content detection Celebrity detection Text in images
82
83. AMAZON CONFIDENTIAL
AMAZON TRANSCRIBE
A fully managed and continuously trained automatic speech recognition (ASR)
service that takes in audio and automatically generates accurate transcripts
Support for audio in
many formats and
low fidelity
Amazon S3
integration
Time stamps and
confidence scores
Hello
/Hola
English and Spanish
§
Punctuation
83
85. AMAZON CONFIDENTIAL
USING AMAZON ML SERVICES FOR MEDIA
§ Use services such as Amazon Rekognition
& Amazon Transcribe to generate metadata
about your content
§ Store that metadata and make it searchable
§ Retrieve only the portion of the
content you want
§ Prepare it for timely use
Live and file
Sources
Amazon
ML Services
ML
Amazon
DynamoDB
Database
Live and file
Content
Content Indexing / Metadata Generation Content Retrieval / Action Metadata
AWS Elemental
Media Services
Media processing
AWS Elemental
Media Services
Media processing
85
86. AMAZON CONFIDENTIAL
CONTENT INDEXING / METADATA GENERATION – IMAGE
TheChallenge
§ A broadcaster wants to
add metadata to
existing archive of
video content
§ Index metadata and
video to make it
searchable
§ Keep costs low
TheSolution
§ Use AWS Elemental
MediaConvert to
extract frames from
video content
§ Use Amazon
Rekognition to analyze
and create metadata
for video content
TheBenefit
§ Video tagged with
objects, scene and
celebrities detection
§ Five-second frame
extraction keeps cost
low while providing
searching index
86
87. AMAZON CONFIDENTIAL
CONTENT INDEXING / METADATA GENERATION – IMAGE
AWS Elemental
MediaConvert job transcodes
file and extracts JPEG
frames to S3 bucket.
AWS Lambda function
triggered by Amazon S3
object-created event tells
Amazon Rekognition to
analyze the JPEG file.
Amazon Rekognition
performs requested
operation on image (i.e.,
object detection, celebrity
recognition, etc.).
Amazon Rekognition returns
result to AWS Lambda, which
stores tags and confidence
scores in Amazon DynamoDB,
Amazon Redshift, Amazon
Elasticsearch Service, Amazon
RDS, or whichever service best
suits the use case.
AWS Elemental
MediaConvert
File-based
processing
Amazon S3
Storage
AWS Lambda
Serverless
Amazon Rekognition
ML / AI
Amazon DynamoDB
Database
File
Source
87
88. AMAZON CONFIDENTIAL
CONTENT INDEXING / METADATA GENERATION – VIDEO
TheChallenge
§ A content producer
wants to log who is in
each scene of new
episode of a show
§ Raw video files are
~200 GB for 60 min
TheSolution
§ Use AWS Elemental
MediaConvert to
compress video
content (but retain
quality
§ Use Amazon
Rekognition Video to
analyze and create
metadata for video
content
TheBenefit
§ Video tagged
celebrities detection
and timing and position
of celebrity
§ Video files reduces to
<8 GB for 60 mins to
reduce costs
88
89. AMAZON CONFIDENTIAL
CONTENT INDEXING / METADATA GENERATION – VIDEO
AWS Elemental MediaConvert job
transcodes source file to
H.264/MP4 at a bit rate such that
the file size is <8 GB.
AWS Lambda function triggered by
Amazon S3 object-created event
tells Amazon Rekognition to
analyze the video file.
Amazon Rekognition Video
performs requested operation on
video (i.e., person tracking,
celebrity recognition, etc.).
Amazon Rekognition returns result to
AWS Lambda, which stores tags and
confidence scores in Amazon
DynamoDB, Amazon Redshift, Amazon
Elasticsearch Service, Amazon RDS, or
whichever service best suits the use
case.
AWS Elemental
MediaConvert
File-based
processing
Amazon S3
Storage
AWS Lambda
Serverless
Amazon Rekognition
ML / AI
Amazon DynamoDB
Database
File
Source
89
90. AMAZON CONFIDENTIAL
CONTENT INDEXING / METADATA GENERATION – TRANSCRIPTION
TheChallenge
§ An online training
provider has 1,000s of
hours of video that need
captions
§ Video is in a variety of
formats
TheSolution
§ Use AWS Elemental
MediaConvert create
audio only version of
content
§ Use Amazon Transcribe
to generate timestamped
transcription
§ Convert Amazon
Transcribe output to
captions file
TheBenefit
§ All formats of video
content get captions
added to make them
more accessible
§ Option to run Amazon
Transcribe output
through Amazon
Translate to get multi-
language captions
90
91. AMAZON CONFIDENTIAL
CONTENT INDEXING / METADATA GENERATION – TRANSCRIPTION
AWS Elemental MediaConvert
job transcodes source file,
creating audio-only rendition
for Amazon Transcribe.
AWS Elemental
MediaConvert also
creates normal
audio/video output.
AWS Lambda function
triggered by Amazon S3
object-created event creates a
new Transcribe job.
Amazon Transcribe outputs
JSON file of detected words
and timing.
Lambda function converts Amazon
Transcribe JSON into subtitle
format (such as WebVTT, SRT, or
TTML) and delivers to
Amazon S3 bucket with content.
AWS Elemental
MediaConvert
File-based
processing
AWS Lambda
Serverless
Amazon Transcribe
ML / AI
File
Source
Amazon S3
Storage
Amazon S3
Storage
91
92. AMAZON CONFIDENTIAL
CONTENT RETRIEVAL WITH AWS ELEMENTAL MEDIACONVERT CLIPPING
TheChallenge
§ The content producer
would like to create a
promo clip of all of the
scenes from their
episode that contain a
particular actor
§ Remember, the source
file is 60 minutes long
and 200 GB
TheSolution
§ Amazon Rekognition
video facial recognition
identifies when the star
appears in the source
video
§ AWS Elemental
MediaConvert uses
time references to
selectively transcode
source video
TheBenefit
§ Faster and more cost-
effective clip
generation as only the
video contents that has
been identified as
featuring the celebrity
is transcoded
92
93. AMAZON CONFIDENTIAL
CONTENT RETRIEVAL WITH AWS ELEMENTAL MEDIACONVERT CLIPPING
AWS Elemental MediaConvert
transcodes clips from the source file,
using only the time range(s)
specified.
AWS Elemental
MediaConvert
File-based
processing
Amazon S3
Storage
Clipped file
Output
Amazon DynamoDB
Database
AWS Lambda
Serverless
Lambda function
queries database for
metadata being
searched.
Lambda function creates
AWS Elemental MediaConvert
transcode job specifying time(s)
from source to clip.
93
94. AMAZON CONFIDENTIAL
§ Built in 3 weeks
§ Indexed against 99,000 people
§ Index created in one day
§ Saved ~9,000 hours a year in manual
curation costs
§ Live video with frame sampling
AUTOMATING FOOTAGE TAGGING WITH AMAZON REKOGNITION
Previously, only about half of all footage was indexed due to
the immense time requirements required by manual processes
94