Enabling fast pages and furious development while supporting a billion users

Enabling Fast Pages and Furious
Development While Supporting a
Billion Users

Subbu Subramanian, Ph.D.
Software Engineer

Pottery Challenge: Day 3

Team 1: Make a PERFECT Pot Team 2: Make 20 pots

500M 700B 30B 2.5M
daily active users minutes spent pieces of content sites using
on the site every shared each month social plugins
month

Latests stats @ http://newsroom.fb.com/content/default.aspx?NewsAreaId=22

Require tackling some

UNIQUE TECHNOLOGY CHALLENGES

Scaling traditional websites

Bob’s data

Bob

Scaling Facebook: Interconnected data

Bob


Bob Brian


Felicia Bob Brian

News Feed

950 million unique home pages

Actor ID - Object ID - Story type Multifeed

Actor ID - Object ID - Story type Multifeed

Stories for up to 5000 friends in milliseconds

Memcache

TAO (Custom Ca

1 Billion operations per second

while supporting growth … ?
Uniﬁed Mobile
Sites
Questions
800M
New2011
Proﬁle
Messages
2010
Groups
2010
2010
Mobile Event
2010

</>
Places
2010
Photos Update
2010
Social Plugins
Open 2010
Graph
2010

The Stream
2009

Translations
2008
Sign Up
Platform launch
New Apps New Apps NewsFeed 2007
February 2004 2006
2004/2005

2004 2011

Pottery Challenge

Team 1: Make a PERFECT Pot Team 2: Make 20 pots

Scale

Move Fast
Photo by Eole: at http://www.ﬂickr.com/photos/eole/2193801804// and used
under Creative Commons license

Move Fast

Moving fast does not mean poor quality

We want a high ship rate

Invest in removing friction that slows us down

Empower Engineers

Starting on day ONE

Follow your passion – pick your team

Push any time you want to

…

Commits per Month

1/1/2006 1/1/2007 1/1/2008 1/1/2009 1/1/2010 1/1/2011 1/1/2012

Move Fast Scale Big Be Bold

and build things with min resources and innovate

OMG!

How can Infrastructure support these goals?
q  Pre-empt issues before they hit production

q  Know immediately when things go wrong

q  Know what to do when things go wrong


= LOTS OF INSTRUMENTATION, TOOLS and AUTOMATION

Perﬂab (aka Difﬂab)
• Performance test every commit
• Spot regressions before deploy

Perﬂab (aka Difﬂab)
• Also tracks slow drift regressions
• Helps us push thousands of revs per week

Gatekeeper
Simple code
but powerful check

if!
(gk_check(‘secret_project’,
$user) {
!
render_cool_feature();}
else {
!
render_normal_feature();}!

•  Many options for precise
targeting
•  500M+ gatekeeper checks
performed every second

Assigning ownership to failures


q  Pre-empt issues before they hit production

q  Know immediately when things go wrong

q  Know what to do when things go wrong


*Lots* of Instrumentation = Fire Hose of Data

Claspin
• High-density heatmap viewer for large services
• Find needles in a haystack - drilldown quickly

tasks sevmanager logview testconsole

differential wirehog domino groups

hipal hsh hud kobold

ods opsfeed scuba serf

Scuba: a tool for diving into an emerald sea of data

Requirements for data exploration

Need Don’t Need

ü  Speed ⤫  Replication

ü  Real-time data ⤫  Transactions

ü  Ad-hoc ﬁltering and grouping ⤫  Long retention

ü  AVG, SUM, COUNT, histograms percentiles ⤫  Table joins

⤫  Unique keys

⤫  Full map-reduce

Hive (hadoop)
•  “Unlimited” storage/CPU

•  Full-featured Query
Language

•  Numerous tools and
Frameworks

But Slow!

Scuba: Data Model

Few pre-deﬁned types and operations

“Data Sets”

- No upfront schema declaration

- Stored In memory

- Sorted by Time stamp

Scuba Data Types: Integers

VLQ encoded array of 64bit integers (single
char*)

O(1) lookup

Usage

Aggregate on these (SUM, AVG, etc...)

Filter (==, =, =)

“Which pages on the site have an average
wall time in the last hour 2 seconds”

Scuba Data Types: Normals
Strings mapped to ints, Stored as array of ints

size: 4 bytes

String Normalization

// char* = int32

'home.php', ’dc1', 'a2', 'en_US' = 32, 14, 3, 289

Usage

Group By value

Filter (==, !=, in set, not in set) “Top 10 countries that have the slowest pages today”

Scuba Data Types: Denorm

Array of plain ol' char*

To be used ONLY for unique identifiers that
would not benefit from normalization

size: 8 + strlen + 1 bytes

Usage

None other than displaying the value

Cannot filter or group by these. No native
regex support.

Scuba Data Types: Tagsets
A set of normals. Stored as bit vector

size: 8 + 2 + ceil(I / 8) bytes where I is the max
index represented

Usage

Filter (has all tags, has some tags, has none
of these tags)

Bit Set

'timeline', 'mercury', 'titan_drafts' = 0, 5, 14

“Is there a difference in cpu usage across users in test group A vs test group B”

Storage

1 aggregator / box

8+ leaves / box

Hundreds of boxes

Data stored over Terabytes of RAM!

Leaves

Multiple per machine ( #cores / N )

Only queried by the aggregator on the local
machine

Persist all write trafﬁc to disk (compressed).
Replay all writes on startup

Store all samples efﬁciently in memory

Leaves are independent; No shared state

Aggregation

Queries distributed, and aggregated as a
binary tree

(For now, there is no sorting of results. All
aggregation operations must be
commutative and associative.)

Operations

4 functions:

visit, summarize, combine, and ﬁnalize

Also a Hive-SQL like query language
interface

Enabling fast pages and furious development while supporting a billion users

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Andere mochten auch

Andere mochten auch (16)

Ähnlich wie Enabling fast pages and furious development while supporting a billion users

Ähnlich wie Enabling fast pages and furious development while supporting a billion users (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Enabling fast pages and furious development while supporting a billion users