SlideShare a Scribd company logo
1 of 21
Data Management and
Streaming Strategies
in Drakensang Online
Andre “Floh” Weissflog
Head of Development Berlin
Bigpoint GmbH
Browser Games Forum 2011
2
• Hack & Slay F2P MMO.
• Embedded in browser.
• No client download or installation.
• Impressive 3D graphics.
• 8 months from concept to Closed Beta.
3
4
Team & Technology Background
5
Drakensang Online is developed by the same team that also worked on the previous
single-player Drakensang “boxed titles”.
Very skilled and experienced team, very solid and proven production pipeline
and processes (and very little online experience).
The technology behind our client and server is the Nebula3 engine (~250k lines of C/C++, 12
years of continuous development).
Epic stories will be written about how we went from building offline boxed-titles to
creating an MMO running in a browser in 8 months.
…but not Today…
Today we’ll talk about data management
and streaming in Drakensang Online…
Data Management Core Demands
6
• Browser Game Experience Click & play, no installation.
• Fast Startup Time Start in seconds, not minutes
• On-Demand Streaming: Only download what’s needed
• Client-Side Caching: CDN traffic cost, user experience
• Robustness: The web is slow and unreliable.
• Minimize Data Size: But keep graphics quality high!
Drakensang Online: The Assets
7
• 50 interconnected maps
• 240 monsters
• 120 NPCs
• 1400 items
• 300 animated effects
317 MB uncompressed
139 MB compressed
27035 filesAsset Types
Textures
Audio
Meshes
Navigation
Locale
Maps
Anims
Other
Two Main Achievements
8
Aggressive Data Size Reduction
We’re working with 3%of the data size we’ve been used to.
This is mainly an organizational achievement.
All departments have to agree on data size budgets…
…even if it’s hard (e.g. for the graphics guys)
Seamless Data Streaming During Gameplay
That’s mainly a programming achievement.
Client engine must support asynchronous I/O from the
ground up.
Maps as “static web pages”
9
This was really the initial idea behind our data
management.
Our maps reference many small asset files “somewhere
on the web”, just like a static web page.
All asset files are loaded asynchronously. Render loop
can deal with assets which are not yet loaded.
Really clever local file caching prevents unnecessary
HTTP requests (down to no HTTP traffic at all if the
cache is completely up-to-date).
Asset Size Reduction: Extreme Granularity
10
Our maps are built from really small and simple 3D objects:
Extremely granular set of
reusable objects. Each with
only a few dozen to a few
hundred triangles.
One map may have 100k of those
micro 3D objects, but only a few
dozen DIFFERENT objects.
That’s very good for data
size. But…
OMG TEH
DRAW CALLS !!
Nebula3 uses hardware
instancing and runtime-
baking to reduce draw
calls from about 5000
down to 250 per frame.
Asset Size Reduction: Textures
11
Typical texture
dimensions:
256 x 256
128 x 128
64 x 64
All textures are DXT compressed (and then – like all other files – ZIP compressed)
Bump textures use DXT5NM compression, aka the Carmack-Trick
(that’s why they’re grey, not purple).
Texture types:
color
bump
emissive
specular
Typical download sizes for textures are somewhere between 5 kByte
and 40 kByte.
Textures are aggressively shared and reused.
Asset Size: Lights, Shadows& Decals
12
All lighting and shadowing is completely dynamic. No light-maps needed.
We use deferred Pre-Pass-Lighting to enable “infinite point-lights” with
moderate fill-rate requirements.
Volumetric Decals are used to hide the tile-nature of the ground.
Post-Effects add bloom, fog, color saturation and color balancing.
Two Streaming Strategies:
13
1) ON DEMAND:
Client requests data right when it’s needed.
IO requests are handled asynchronously.
May take several seconds until data is ready.
Until then:
Render a placeholder or…
Render nothing.
2) BACKGROUND STREAMING:
Low-priority background thread just for cache warm-up.
One streaming list per map created during build process.
Download and update items in local file cache.
Very speculative, can’t really predict what’s needed next.
The web as a (really unreliable) hard-disc
14
Nebula3 always had a powerful I/O system:
• async I/O is standard, not the exception
• “massively multi-threaded”, currently 10 I/O-threads
• pluggable filesystems
• all file paths are URLs
We built an “HTTP filesystem” on top of this:
• only uses HTTP GET requests
• transparently replaces local-disc file I/O
• MD5 checksums for all files
• ZIP compression for all files
• hierarchical caching system
• directory walking and file-exists checks
• CDN support
• reasonably fail-safe (high latencies, dropped
connections, corrupt
downloads, tampering…)
“Massively Multi-Threaded IO”
15
IO Dispatcher
Thread
Main Thread
Render Thread
Audio Thread
… Thread
Incoming IO Requests
IO Threads
Cache
(Local HD)
Cache
Hit?Yes
Web
Server
No
Update Cache
Served IO Requests
• up to 10 HTTP-GET requests concurrently in flight (tweakable)
• low-prio IO requests don’t block high-prio requests
• no difference in high-level code between file I/O and http I/O
Hierarchical File Cache
16
ROOT MD5 KEY
e4680c8372bc6c527043a97631b404d7
Directory “Table Of Content” File
Leaf TOCs Leaf TOCs Leaf TOCs
1. Daily-Build-Process: compress files and create per-file MD5 hashes.
2. Each directory has a “Table Of Content” file with all MD5 hashes.
3. TOCs are also compressed and MD5’d.
4. ...with one single “Directory TOC” at the top.
5. …and a single Root MD5 Key for the Directory TOC.
• Client receives Root MD5 Key at startup.
• Downloads TOC files if not in cache…
• TOC files provide MD5 keys for files.
• Zero HTTP GETs if cache is up-to-date!
The Cache Chain
17
We basically have a 3-level cache for assets:
RAM DISC
CDN
PROXY
CDN
ORIGIN
Level 1 Level 2 Level 3
RAM Cache Hit?
Resource objects are shared in RAM using their ResourceIDs.
DISC Cache Hit?
MD5-named file must exist in local file cache.
MD5 of actual file-content must match build-time MD5 hash.
CDN Proxy Cache Hit?
MD5-hash appended to HTTP GET URL, forces a cache miss if proxy only has
out-of-date file of the same name.
CDN Origin Server has the definite data.
404’s Considered Harmful
18
Remember this from the previous page:
RAM DISC
CDN
PROXY
CDN
ORIGIN
1st 2nd 3rd 4th
“Sh*t happens”.
Trying to load non-existing assets will happen with such a complex data set.
HTTP requests for non-existing files will cut through all the way to the origin server.
CDN’s usually cache 404’s for a few seconds, but it’s better to not bother the web servers with
404’s at all.
Our Table-Of-Content-Files let us detect non-existing files on the client without a server
roundtrip.
Conclusion
19
• Data management is the job of the whole team.
• Careful, early asset size planning.
• Team must commit on size budgets.
• “Programming Magic” alone doesn’t solve problems.
• Programming, Level Design and Art Department must work hand in hand.
• Oh, and: Check your CDN cost plan (if you pay per HTTP request, our solution is sub-optimal)
Thank You 
Questions?
20
Find us on
Bigpoint GmbH
Alexanderstraße 5
10178 Berlin
Germany
Bigpoint Inc.
500 Howard Street
Suite 300
San Francisco, CA 94105
Bigpoint Distribuição de
Entretenimento Online Ltda.
Av. Brig. Faria Lima
3729 cj. 528
04538-905 São Paulo
Brazil
Bigpoint GmbH
Andre Weissflog
Head of Development Berlin
Drehbahn 47-48
20354 Hamburg
Germany
Tel +49 40.88 14 13 - 0
Fax +49 40.88 14 13 - 11
info@bigpoint.net
www.bigpoint.net
Contact us
Bigpoint International Services
Limited
1 Villa Zimmermann
Ta’Xbiex Terrace
XBX 1035 Ta’Xbiex
Malta
21

More Related Content

What's hot

Spark performance tuning - Maksud Ibrahimov
Spark performance tuning - Maksud IbrahimovSpark performance tuning - Maksud Ibrahimov
Spark performance tuning - Maksud IbrahimovMaksud Ibrahimov
 
Introduction to Apache Spark Ecosystem
Introduction to Apache Spark EcosystemIntroduction to Apache Spark Ecosystem
Introduction to Apache Spark EcosystemBojan Babic
 
Emr spark tuning demystified
Emr spark tuning demystifiedEmr spark tuning demystified
Emr spark tuning demystifiedOmid Vahdaty
 
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDB
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDBScylla Summit 2022: New AWS Instances Perfect for ScyllaDB
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDBScyllaDB
 
Top 5 Mistakes to Avoid When Writing Apache Spark Applications
Top 5 Mistakes to Avoid When Writing Apache Spark ApplicationsTop 5 Mistakes to Avoid When Writing Apache Spark Applications
Top 5 Mistakes to Avoid When Writing Apache Spark ApplicationsCloudera, Inc.
 
Hardware Assisted Latency Investigations
Hardware Assisted Latency InvestigationsHardware Assisted Latency Investigations
Hardware Assisted Latency InvestigationsScyllaDB
 
Common Strategies for Improving Performance on Your Delta Lakehouse
Common Strategies for Improving Performance on Your Delta LakehouseCommon Strategies for Improving Performance on Your Delta Lakehouse
Common Strategies for Improving Performance on Your Delta LakehouseDatabricks
 
Performance Troubleshooting Using Apache Spark Metrics
Performance Troubleshooting Using Apache Spark MetricsPerformance Troubleshooting Using Apache Spark Metrics
Performance Troubleshooting Using Apache Spark MetricsDatabricks
 
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Ro...
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Ro...Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Ro...
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Ro...Spark Summit
 
Fine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsFine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsDatabricks
 
Apache Spark Performance tuning and Best Practise
Apache Spark Performance tuning and Best PractiseApache Spark Performance tuning and Best Practise
Apache Spark Performance tuning and Best PractiseKnoldus Inc.
 
On Improving Broadcast Joins in Apache Spark SQL
On Improving Broadcast Joins in Apache Spark SQLOn Improving Broadcast Joins in Apache Spark SQL
On Improving Broadcast Joins in Apache Spark SQLDatabricks
 
How to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache SparkHow to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache SparkDatabricks
 
Oracle Performance Tuning Fundamentals
Oracle Performance Tuning FundamentalsOracle Performance Tuning Fundamentals
Oracle Performance Tuning FundamentalsCarlos Sierra
 
Building a SIMD Supported Vectorized Native Engine for Spark SQL
Building a SIMD Supported Vectorized Native Engine for Spark SQLBuilding a SIMD Supported Vectorized Native Engine for Spark SQL
Building a SIMD Supported Vectorized Native Engine for Spark SQLDatabricks
 
HBaseCon2017 Community-Driven Graphs with JanusGraph
HBaseCon2017 Community-Driven Graphs with JanusGraphHBaseCon2017 Community-Driven Graphs with JanusGraph
HBaseCon2017 Community-Driven Graphs with JanusGraphHBaseCon
 
Dynamic filtering for presto join optimisation
Dynamic filtering for presto join optimisationDynamic filtering for presto join optimisation
Dynamic filtering for presto join optimisationOri Reshef
 
Deep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDeep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDatabricks
 
Troubleshooting MySQL from a MySQL Developer Perspective
Troubleshooting MySQL from a MySQL Developer PerspectiveTroubleshooting MySQL from a MySQL Developer Perspective
Troubleshooting MySQL from a MySQL Developer PerspectiveMarcelo Altmann
 

What's hot (20)

A G1GC Saga-KCJUG.pptx
A G1GC Saga-KCJUG.pptxA G1GC Saga-KCJUG.pptx
A G1GC Saga-KCJUG.pptx
 
Spark performance tuning - Maksud Ibrahimov
Spark performance tuning - Maksud IbrahimovSpark performance tuning - Maksud Ibrahimov
Spark performance tuning - Maksud Ibrahimov
 
Introduction to Apache Spark Ecosystem
Introduction to Apache Spark EcosystemIntroduction to Apache Spark Ecosystem
Introduction to Apache Spark Ecosystem
 
Emr spark tuning demystified
Emr spark tuning demystifiedEmr spark tuning demystified
Emr spark tuning demystified
 
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDB
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDBScylla Summit 2022: New AWS Instances Perfect for ScyllaDB
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDB
 
Top 5 Mistakes to Avoid When Writing Apache Spark Applications
Top 5 Mistakes to Avoid When Writing Apache Spark ApplicationsTop 5 Mistakes to Avoid When Writing Apache Spark Applications
Top 5 Mistakes to Avoid When Writing Apache Spark Applications
 
Hardware Assisted Latency Investigations
Hardware Assisted Latency InvestigationsHardware Assisted Latency Investigations
Hardware Assisted Latency Investigations
 
Common Strategies for Improving Performance on Your Delta Lakehouse
Common Strategies for Improving Performance on Your Delta LakehouseCommon Strategies for Improving Performance on Your Delta Lakehouse
Common Strategies for Improving Performance on Your Delta Lakehouse
 
Performance Troubleshooting Using Apache Spark Metrics
Performance Troubleshooting Using Apache Spark MetricsPerformance Troubleshooting Using Apache Spark Metrics
Performance Troubleshooting Using Apache Spark Metrics
 
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Ro...
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Ro...Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Ro...
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Ro...
 
Fine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsFine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark Jobs
 
Apache Spark Performance tuning and Best Practise
Apache Spark Performance tuning and Best PractiseApache Spark Performance tuning and Best Practise
Apache Spark Performance tuning and Best Practise
 
On Improving Broadcast Joins in Apache Spark SQL
On Improving Broadcast Joins in Apache Spark SQLOn Improving Broadcast Joins in Apache Spark SQL
On Improving Broadcast Joins in Apache Spark SQL
 
How to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache SparkHow to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache Spark
 
Oracle Performance Tuning Fundamentals
Oracle Performance Tuning FundamentalsOracle Performance Tuning Fundamentals
Oracle Performance Tuning Fundamentals
 
Building a SIMD Supported Vectorized Native Engine for Spark SQL
Building a SIMD Supported Vectorized Native Engine for Spark SQLBuilding a SIMD Supported Vectorized Native Engine for Spark SQL
Building a SIMD Supported Vectorized Native Engine for Spark SQL
 
HBaseCon2017 Community-Driven Graphs with JanusGraph
HBaseCon2017 Community-Driven Graphs with JanusGraphHBaseCon2017 Community-Driven Graphs with JanusGraph
HBaseCon2017 Community-Driven Graphs with JanusGraph
 
Dynamic filtering for presto join optimisation
Dynamic filtering for presto join optimisationDynamic filtering for presto join optimisation
Dynamic filtering for presto join optimisation
 
Deep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDeep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache Spark
 
Troubleshooting MySQL from a MySQL Developer Perspective
Troubleshooting MySQL from a MySQL Developer PerspectiveTroubleshooting MySQL from a MySQL Developer Perspective
Troubleshooting MySQL from a MySQL Developer Perspective
 

Viewers also liked

Credit Rating Assessment & Impact
Credit Rating Assessment & ImpactCredit Rating Assessment & Impact
Credit Rating Assessment & ImpactResurgent India
 
Huertas y figueras audiencia juvenil
Huertas y figueras   audiencia juvenilHuertas y figueras   audiencia juvenil
Huertas y figueras audiencia juvenilJesús Bustos García
 
Prospektus usaha bazzocha
Prospektus usaha bazzochaProspektus usaha bazzocha
Prospektus usaha bazzochaBasri Adhi
 
Going Global Innovation (GGI) - Innovation Information Forum
Going Global Innovation (GGI) - Innovation Information ForumGoing Global Innovation (GGI) - Innovation Information Forum
Going Global Innovation (GGI) - Innovation Information ForumMaRS Discovery District
 
Programmatic display ad clickers ARE your clients
Programmatic display ad clickers ARE your clientsProgrammatic display ad clickers ARE your clients
Programmatic display ad clickers ARE your clientsGilles Giudicelli
 
Centro educativo petén
Centro educativo peténCentro educativo petén
Centro educativo peténSoypattyGm
 
SIA311 Better Together: Microsoft Exchange Server 2010 and Microsoft Forefron...
SIA311 Better Together: Microsoft Exchange Server 2010 and Microsoft Forefron...SIA311 Better Together: Microsoft Exchange Server 2010 and Microsoft Forefron...
SIA311 Better Together: Microsoft Exchange Server 2010 and Microsoft Forefron...Louis Göhl
 
Devi ahilya vishwavidyalaya prospectus 2016 17 educationiconnect.com 786200...
Devi ahilya vishwavidyalaya prospectus 2016   17 educationiconnect.com 786200...Devi ahilya vishwavidyalaya prospectus 2016   17 educationiconnect.com 786200...
Devi ahilya vishwavidyalaya prospectus 2016 17 educationiconnect.com 786200...00007123
 
Presentacion club saldo movistar movilnet-digitel-henry
Presentacion club saldo movistar movilnet-digitel-henryPresentacion club saldo movistar movilnet-digitel-henry
Presentacion club saldo movistar movilnet-digitel-henryHenry Brito Delgado
 

Viewers also liked (20)

Usos del internet
Usos del internetUsos del internet
Usos del internet
 
Credit Rating Assessment & Impact
Credit Rating Assessment & ImpactCredit Rating Assessment & Impact
Credit Rating Assessment & Impact
 
Huertas y figueras audiencia juvenil
Huertas y figueras   audiencia juvenilHuertas y figueras   audiencia juvenil
Huertas y figueras audiencia juvenil
 
Silabo central
Silabo centralSilabo central
Silabo central
 
Alimentacionninos
AlimentacionninosAlimentacionninos
Alimentacionninos
 
Prospektus usaha bazzocha
Prospektus usaha bazzochaProspektus usaha bazzocha
Prospektus usaha bazzocha
 
Artículo rocas 2
Artículo rocas 2Artículo rocas 2
Artículo rocas 2
 
IurisTalent
IurisTalentIurisTalent
IurisTalent
 
Aprender vivir con toc
Aprender vivir con tocAprender vivir con toc
Aprender vivir con toc
 
Going Global Innovation (GGI) - Innovation Information Forum
Going Global Innovation (GGI) - Innovation Information ForumGoing Global Innovation (GGI) - Innovation Information Forum
Going Global Innovation (GGI) - Innovation Information Forum
 
Programmatic display ad clickers ARE your clients
Programmatic display ad clickers ARE your clientsProgrammatic display ad clickers ARE your clients
Programmatic display ad clickers ARE your clients
 
Las flakis
Las flakisLas flakis
Las flakis
 
Basta no-soy-indio
Basta no-soy-indioBasta no-soy-indio
Basta no-soy-indio
 
Fix 3M 2016
Fix 3M 2016Fix 3M 2016
Fix 3M 2016
 
Centro educativo petén
Centro educativo peténCentro educativo petén
Centro educativo petén
 
SIA311 Better Together: Microsoft Exchange Server 2010 and Microsoft Forefron...
SIA311 Better Together: Microsoft Exchange Server 2010 and Microsoft Forefron...SIA311 Better Together: Microsoft Exchange Server 2010 and Microsoft Forefron...
SIA311 Better Together: Microsoft Exchange Server 2010 and Microsoft Forefron...
 
Geografia
GeografiaGeografia
Geografia
 
Tabaquismo. belen
Tabaquismo. belenTabaquismo. belen
Tabaquismo. belen
 
Devi ahilya vishwavidyalaya prospectus 2016 17 educationiconnect.com 786200...
Devi ahilya vishwavidyalaya prospectus 2016   17 educationiconnect.com 786200...Devi ahilya vishwavidyalaya prospectus 2016   17 educationiconnect.com 786200...
Devi ahilya vishwavidyalaya prospectus 2016 17 educationiconnect.com 786200...
 
Presentacion club saldo movistar movilnet-digitel-henry
Presentacion club saldo movistar movilnet-digitel-henryPresentacion club saldo movistar movilnet-digitel-henry
Presentacion club saldo movistar movilnet-digitel-henry
 

Similar to Data Management and Streaming Strategies in Drakensang Online

Caching Methodology & Strategies
Caching Methodology & StrategiesCaching Methodology & Strategies
Caching Methodology & StrategiesTiệp Vũ
 
Caching methodology and strategies
Caching methodology and strategiesCaching methodology and strategies
Caching methodology and strategiesTiep Vu
 
Netflix Open Source Meetup Season 4 Episode 2
Netflix Open Source Meetup Season 4 Episode 2Netflix Open Source Meetup Season 4 Episode 2
Netflix Open Source Meetup Season 4 Episode 2aspyker
 
Frontera: open source, large scale web crawling framework
Frontera: open source, large scale web crawling frameworkFrontera: open source, large scale web crawling framework
Frontera: open source, large scale web crawling frameworkScrapinghub
 
The Pendulum Swings Back: Converged and Hyperconverged Environments
The Pendulum Swings Back: Converged and Hyperconverged EnvironmentsThe Pendulum Swings Back: Converged and Hyperconverged Environments
The Pendulum Swings Back: Converged and Hyperconverged EnvironmentsTony Pearson
 
Cloud computing UNIT 2.1 presentation in
Cloud computing UNIT 2.1 presentation inCloud computing UNIT 2.1 presentation in
Cloud computing UNIT 2.1 presentation inRahulBhole12
 
VMworld 2013: VMware Mirage Storage and Network Deduplication, DEMYSTIFIED
VMworld 2013: VMware Mirage Storage and Network Deduplication, DEMYSTIFIED VMworld 2013: VMware Mirage Storage and Network Deduplication, DEMYSTIFIED
VMworld 2013: VMware Mirage Storage and Network Deduplication, DEMYSTIFIED VMworld
 
Gears of Perforce: AAA Game Development Challenges
Gears of Perforce: AAA Game Development ChallengesGears of Perforce: AAA Game Development Challenges
Gears of Perforce: AAA Game Development ChallengesPerforce
 
Leveraging Structured Data To Reduce Disk, IO & Network Bandwidth
Leveraging Structured Data To Reduce Disk, IO & Network BandwidthLeveraging Structured Data To Reduce Disk, IO & Network Bandwidth
Leveraging Structured Data To Reduce Disk, IO & Network BandwidthPerforce
 
Alexander Sibiryakov- Frontera
Alexander Sibiryakov- FronteraAlexander Sibiryakov- Frontera
Alexander Sibiryakov- FronteraPyData
 
Experience In Building Scalable Web Sites Through Infrastructure's View
Experience In Building Scalable Web Sites Through Infrastructure's ViewExperience In Building Scalable Web Sites Through Infrastructure's View
Experience In Building Scalable Web Sites Through Infrastructure's ViewPhuwadon D
 
Design and Implementation of a High- Performance Distributed Web Crawler
Design and Implementation of a High- Performance Distributed Web CrawlerDesign and Implementation of a High- Performance Distributed Web Crawler
Design and Implementation of a High- Performance Distributed Web CrawlerGeorge Ang
 
PAC 2019 virtual Mark Tomlinson
PAC 2019 virtual Mark TomlinsonPAC 2019 virtual Mark Tomlinson
PAC 2019 virtual Mark TomlinsonNeotys
 
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, VectorizedData Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, VectorizedHostedbyConfluent
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservicesBigstep
 
Data Grids with Oracle Coherence
Data Grids with Oracle CoherenceData Grids with Oracle Coherence
Data Grids with Oracle CoherenceBen Stopford
 
OSDC 2017 | Something Openshift Kubernetes Containers by Kristian Köhntopp
OSDC 2017 | Something Openshift Kubernetes Containers by Kristian KöhntoppOSDC 2017 | Something Openshift Kubernetes Containers by Kristian Köhntopp
OSDC 2017 | Something Openshift Kubernetes Containers by Kristian KöhntoppNETWAYS
 

Similar to Data Management and Streaming Strategies in Drakensang Online (20)

A faster web
A faster webA faster web
A faster web
 
Caching Methodology & Strategies
Caching Methodology & StrategiesCaching Methodology & Strategies
Caching Methodology & Strategies
 
Caching methodology and strategies
Caching methodology and strategiesCaching methodology and strategies
Caching methodology and strategies
 
Netflix Open Source Meetup Season 4 Episode 2
Netflix Open Source Meetup Season 4 Episode 2Netflix Open Source Meetup Season 4 Episode 2
Netflix Open Source Meetup Season 4 Episode 2
 
Frontera: open source, large scale web crawling framework
Frontera: open source, large scale web crawling frameworkFrontera: open source, large scale web crawling framework
Frontera: open source, large scale web crawling framework
 
The Pendulum Swings Back: Converged and Hyperconverged Environments
The Pendulum Swings Back: Converged and Hyperconverged EnvironmentsThe Pendulum Swings Back: Converged and Hyperconverged Environments
The Pendulum Swings Back: Converged and Hyperconverged Environments
 
Cloud computing UNIT 2.1 presentation in
Cloud computing UNIT 2.1 presentation inCloud computing UNIT 2.1 presentation in
Cloud computing UNIT 2.1 presentation in
 
VMworld 2013: VMware Mirage Storage and Network Deduplication, DEMYSTIFIED
VMworld 2013: VMware Mirage Storage and Network Deduplication, DEMYSTIFIED VMworld 2013: VMware Mirage Storage and Network Deduplication, DEMYSTIFIED
VMworld 2013: VMware Mirage Storage and Network Deduplication, DEMYSTIFIED
 
Gears of Perforce: AAA Game Development Challenges
Gears of Perforce: AAA Game Development ChallengesGears of Perforce: AAA Game Development Challenges
Gears of Perforce: AAA Game Development Challenges
 
Leveraging Structured Data To Reduce Disk, IO & Network Bandwidth
Leveraging Structured Data To Reduce Disk, IO & Network BandwidthLeveraging Structured Data To Reduce Disk, IO & Network Bandwidth
Leveraging Structured Data To Reduce Disk, IO & Network Bandwidth
 
Alexander Sibiryakov- Frontera
Alexander Sibiryakov- FronteraAlexander Sibiryakov- Frontera
Alexander Sibiryakov- Frontera
 
Experience In Building Scalable Web Sites Through Infrastructure's View
Experience In Building Scalable Web Sites Through Infrastructure's ViewExperience In Building Scalable Web Sites Through Infrastructure's View
Experience In Building Scalable Web Sites Through Infrastructure's View
 
CERNBox: Site Report
CERNBox: Site ReportCERNBox: Site Report
CERNBox: Site Report
 
Design and Implementation of a High- Performance Distributed Web Crawler
Design and Implementation of a High- Performance Distributed Web CrawlerDesign and Implementation of a High- Performance Distributed Web Crawler
Design and Implementation of a High- Performance Distributed Web Crawler
 
PAC 2019 virtual Mark Tomlinson
PAC 2019 virtual Mark TomlinsonPAC 2019 virtual Mark Tomlinson
PAC 2019 virtual Mark Tomlinson
 
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, VectorizedData Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
 
globus.pptx
globus.pptxglobus.pptx
globus.pptx
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservices
 
Data Grids with Oracle Coherence
Data Grids with Oracle CoherenceData Grids with Oracle Coherence
Data Grids with Oracle Coherence
 
OSDC 2017 | Something Openshift Kubernetes Containers by Kristian Köhntopp
OSDC 2017 | Something Openshift Kubernetes Containers by Kristian KöhntoppOSDC 2017 | Something Openshift Kubernetes Containers by Kristian Köhntopp
OSDC 2017 | Something Openshift Kubernetes Containers by Kristian Köhntopp
 

Recently uploaded

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 

Data Management and Streaming Strategies in Drakensang Online

  • 1. Data Management and Streaming Strategies in Drakensang Online Andre “Floh” Weissflog Head of Development Berlin Bigpoint GmbH Browser Games Forum 2011
  • 2. 2 • Hack & Slay F2P MMO. • Embedded in browser. • No client download or installation. • Impressive 3D graphics. • 8 months from concept to Closed Beta.
  • 3. 3
  • 4. 4
  • 5. Team & Technology Background 5 Drakensang Online is developed by the same team that also worked on the previous single-player Drakensang “boxed titles”. Very skilled and experienced team, very solid and proven production pipeline and processes (and very little online experience). The technology behind our client and server is the Nebula3 engine (~250k lines of C/C++, 12 years of continuous development). Epic stories will be written about how we went from building offline boxed-titles to creating an MMO running in a browser in 8 months. …but not Today… Today we’ll talk about data management and streaming in Drakensang Online…
  • 6. Data Management Core Demands 6 • Browser Game Experience Click & play, no installation. • Fast Startup Time Start in seconds, not minutes • On-Demand Streaming: Only download what’s needed • Client-Side Caching: CDN traffic cost, user experience • Robustness: The web is slow and unreliable. • Minimize Data Size: But keep graphics quality high!
  • 7. Drakensang Online: The Assets 7 • 50 interconnected maps • 240 monsters • 120 NPCs • 1400 items • 300 animated effects 317 MB uncompressed 139 MB compressed 27035 filesAsset Types Textures Audio Meshes Navigation Locale Maps Anims Other
  • 8. Two Main Achievements 8 Aggressive Data Size Reduction We’re working with 3%of the data size we’ve been used to. This is mainly an organizational achievement. All departments have to agree on data size budgets… …even if it’s hard (e.g. for the graphics guys) Seamless Data Streaming During Gameplay That’s mainly a programming achievement. Client engine must support asynchronous I/O from the ground up.
  • 9. Maps as “static web pages” 9 This was really the initial idea behind our data management. Our maps reference many small asset files “somewhere on the web”, just like a static web page. All asset files are loaded asynchronously. Render loop can deal with assets which are not yet loaded. Really clever local file caching prevents unnecessary HTTP requests (down to no HTTP traffic at all if the cache is completely up-to-date).
  • 10. Asset Size Reduction: Extreme Granularity 10 Our maps are built from really small and simple 3D objects: Extremely granular set of reusable objects. Each with only a few dozen to a few hundred triangles. One map may have 100k of those micro 3D objects, but only a few dozen DIFFERENT objects. That’s very good for data size. But… OMG TEH DRAW CALLS !! Nebula3 uses hardware instancing and runtime- baking to reduce draw calls from about 5000 down to 250 per frame.
  • 11. Asset Size Reduction: Textures 11 Typical texture dimensions: 256 x 256 128 x 128 64 x 64 All textures are DXT compressed (and then – like all other files – ZIP compressed) Bump textures use DXT5NM compression, aka the Carmack-Trick (that’s why they’re grey, not purple). Texture types: color bump emissive specular Typical download sizes for textures are somewhere between 5 kByte and 40 kByte. Textures are aggressively shared and reused.
  • 12. Asset Size: Lights, Shadows& Decals 12 All lighting and shadowing is completely dynamic. No light-maps needed. We use deferred Pre-Pass-Lighting to enable “infinite point-lights” with moderate fill-rate requirements. Volumetric Decals are used to hide the tile-nature of the ground. Post-Effects add bloom, fog, color saturation and color balancing.
  • 13. Two Streaming Strategies: 13 1) ON DEMAND: Client requests data right when it’s needed. IO requests are handled asynchronously. May take several seconds until data is ready. Until then: Render a placeholder or… Render nothing. 2) BACKGROUND STREAMING: Low-priority background thread just for cache warm-up. One streaming list per map created during build process. Download and update items in local file cache. Very speculative, can’t really predict what’s needed next.
  • 14. The web as a (really unreliable) hard-disc 14 Nebula3 always had a powerful I/O system: • async I/O is standard, not the exception • “massively multi-threaded”, currently 10 I/O-threads • pluggable filesystems • all file paths are URLs We built an “HTTP filesystem” on top of this: • only uses HTTP GET requests • transparently replaces local-disc file I/O • MD5 checksums for all files • ZIP compression for all files • hierarchical caching system • directory walking and file-exists checks • CDN support • reasonably fail-safe (high latencies, dropped connections, corrupt downloads, tampering…)
  • 15. “Massively Multi-Threaded IO” 15 IO Dispatcher Thread Main Thread Render Thread Audio Thread … Thread Incoming IO Requests IO Threads Cache (Local HD) Cache Hit?Yes Web Server No Update Cache Served IO Requests • up to 10 HTTP-GET requests concurrently in flight (tweakable) • low-prio IO requests don’t block high-prio requests • no difference in high-level code between file I/O and http I/O
  • 16. Hierarchical File Cache 16 ROOT MD5 KEY e4680c8372bc6c527043a97631b404d7 Directory “Table Of Content” File Leaf TOCs Leaf TOCs Leaf TOCs 1. Daily-Build-Process: compress files and create per-file MD5 hashes. 2. Each directory has a “Table Of Content” file with all MD5 hashes. 3. TOCs are also compressed and MD5’d. 4. ...with one single “Directory TOC” at the top. 5. …and a single Root MD5 Key for the Directory TOC. • Client receives Root MD5 Key at startup. • Downloads TOC files if not in cache… • TOC files provide MD5 keys for files. • Zero HTTP GETs if cache is up-to-date!
  • 17. The Cache Chain 17 We basically have a 3-level cache for assets: RAM DISC CDN PROXY CDN ORIGIN Level 1 Level 2 Level 3 RAM Cache Hit? Resource objects are shared in RAM using their ResourceIDs. DISC Cache Hit? MD5-named file must exist in local file cache. MD5 of actual file-content must match build-time MD5 hash. CDN Proxy Cache Hit? MD5-hash appended to HTTP GET URL, forces a cache miss if proxy only has out-of-date file of the same name. CDN Origin Server has the definite data.
  • 18. 404’s Considered Harmful 18 Remember this from the previous page: RAM DISC CDN PROXY CDN ORIGIN 1st 2nd 3rd 4th “Sh*t happens”. Trying to load non-existing assets will happen with such a complex data set. HTTP requests for non-existing files will cut through all the way to the origin server. CDN’s usually cache 404’s for a few seconds, but it’s better to not bother the web servers with 404’s at all. Our Table-Of-Content-Files let us detect non-existing files on the client without a server roundtrip.
  • 19. Conclusion 19 • Data management is the job of the whole team. • Careful, early asset size planning. • Team must commit on size budgets. • “Programming Magic” alone doesn’t solve problems. • Programming, Level Design and Art Department must work hand in hand. • Oh, and: Check your CDN cost plan (if you pay per HTTP request, our solution is sub-optimal)
  • 21. Find us on Bigpoint GmbH Alexanderstraße 5 10178 Berlin Germany Bigpoint Inc. 500 Howard Street Suite 300 San Francisco, CA 94105 Bigpoint Distribuição de Entretenimento Online Ltda. Av. Brig. Faria Lima 3729 cj. 528 04538-905 São Paulo Brazil Bigpoint GmbH Andre Weissflog Head of Development Berlin Drehbahn 47-48 20354 Hamburg Germany Tel +49 40.88 14 13 - 0 Fax +49 40.88 14 13 - 11 info@bigpoint.net www.bigpoint.net Contact us Bigpoint International Services Limited 1 Villa Zimmermann Ta’Xbiex Terrace XBX 1035 Ta’Xbiex Malta 21