15. Service-Oriented Architectures
[Diagram: Service A · Service B · Service C]
Loose coupling - separation of responsibilities
http://en.wikipedia.org/wiki/Service-oriented_architecture
16. Service-Oriented Architectures
[Diagram: Consumer → Service A · Service B · Service C]
Loose coupling - separation of responsibilities
Separate consumers from service implementation
http://en.wikipedia.org/wiki/Service-oriented_architecture
17. Service-Oriented Architectures
[Diagram: Consumer · Consumer → Proxy Cache → Service A · Service B · Service C]
Loose coupling - separation of responsibilities
Separate consumers from service implementation
Aggressive caching at application level
http://en.wikipedia.org/wiki/Service-oriented_architecture
18. Service-Oriented Architectures
[Diagram: Orchestrator → Service A · Service B · Service C]
Loose coupling - separation of responsibilities
Separate consumers from service implementation
Aggressive caching at application level
Orchestration of distinct units accessible over a network
http://en.wikipedia.org/wiki/Service-oriented_architecture
19. Service-Oriented Architectures
[Diagram: Orchestrator → Service A · Service B · Service C, exchanging JSON / XML / Thrift]
Loose coupling - separation of responsibilities
Separate consumers from service implementation
Aggressive caching at application level
Orchestration of distinct units accessible over a network
Communication via a well-defined, interoperable format
http://en.wikipedia.org/wiki/Service-oriented_architecture
22. Independent Horizontal Scaling
[Diagram: Orchestrator → Service A; Orchestrator → Load Balancer → Service B (nodes B1, B2)]
Load balancing - multiple nodes
23. Independent Horizontal Scaling
[Diagram: Rev. Proxy → Service A; Orchestrator → Load Balancer → Service B (nodes B1, B2)]
Better single-node performance with application-level caching
Load balancing - multiple nodes
24. Cell Architectures
N + 1 design: ensure that everything you develop has at least one additional instance of that system in the event of failure.
Multiple live nodes: have multiple live, isolated nodes of the same type to distribute the load.
http://highscalability.com/blog/2012/5/9/cell-architectures.html
25. Cardinality of Nodes on Each Service
[Diagram: per-service node counts, ranging from 2 to 60+ nodes per service]
http://highscalability.com/blog/2011/11/29/datasift-architecture-realtime-datamining-at-120000-tweets-p
30. Caching with Varnish
No special directives are required to cache normal requests.
Just use the defaults, and set Cache-Control headers.
<?php
$ttl = 300; // cache for 5 minutes
$ts = new DateTime('@' . (time() + $ttl));
header("Expires: " . $ts->format(DateTime::RFC1123));
header("Cache-Control: max-age=$ttl, must-revalidate");
?>
Warning: by default, pages with cookies are not cached.
35. Service Host Discovery - Config Mgr
GET /configuration/<servicename>/hosts
HTTP/1.1 200 OK
Content-Type: application/json; charset=UTF-8
{
  "service": "<servicename>",
  "hosts": [
    "10.0.1.33:80",
    "10.0.1.34:80"
  ],
  "base_path": "/svc/xyz/"
}
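A consumer can fetch this document and pick a host at random for simple client-side balancing. A minimal sketch of the parsing step, assuming the JSON shape shown above (`pickHost` is a hypothetical helper, not part of any library):

```php
<?php
// Hypothetical helper: parse the config manager's JSON response
// and pick one "host:port" entry at random.
function pickHost(string $json): string
{
    $config = json_decode($json, true);
    $hosts  = $config['hosts'];
    return $hosts[array_rand($hosts)];
}

// In a real consumer, $json would come from the discovery endpoint, e.g.
// $json = file_get_contents('http://configmgr/configuration/mysvc/hosts');
$json = '{"service":"mysvc","hosts":["10.0.1.33:80","10.0.1.34:80"],"base_path":"/svc/xyz/"}';
$host = pickHost($json);
// $host is now one of the advertised "host:port" entries
```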
36. Service Host Discovery - Zookeeper
ZooKeeper is a centralized service for maintaining configuration
information, naming, providing distributed synchronization, and
providing group services. http://zookeeper.apache.org/
<?php
$zk = new Zookeeper();
$zk->connect('localhost:2181');
// server (registration)
$params = array(array(
    'perms'  => Zookeeper::PERM_ALL,
    'scheme' => 'world',
    'id'     => 'anyone'
));
if (!$zk->exists('/services/mysvc/host')) {
    $zk->create('/services', 'config for internal services', $params);
    $zk->create('/services/mysvc', 'config for mysvc', $params);
    $zk->create('/services/mysvc/host', 'http://my.site.com', $params);
}
38. Service Host Discovery - Zookeeper
<?php
$zk = new Zookeeper();
$zk->connect('localhost:2181');
// client
$host = $zk->get('/services/mysvc/host');
...
39. SOA - Scale Each Component
http://www.thisnext.com/item/647CD0BE/Matryoshkas-Nesting-Dolls
41. SOA - Scale Each Component
SOA: independently scalable services.
Example of distributing processing load:
http://highscalability.com/blog/2011/11/29/datasift-architecture-realtime-datamining-at-120000-tweets-p
43. Workers for sharing processing load
Distribute processing load among workers.
Lightweight orchestration; heavy lifting in separate, asynchronous processes.
45. Scale all things!
Example of scaling large data volumes:
http://highscalability.com/blog/2011/11/29/datasift-architecture-realtime-datamining-at-120000-tweets-p
47. In Case of “Big Data”...
With lots of data, move the processing logic itself to the storage nodes (I/O is expensive).
Map/Reduce, parallel processing.
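The idea can be illustrated with PHP's own map and reduce primitives. A toy word-count sketch (in a real cluster, each "map" would run on the storage node holding its shard, and only the small partial counts would travel over the network):

```php
<?php
// Toy map/reduce word count. The point: ship the computation to the
// data, and move only the compact partial results between nodes.
function mapShard(string $shard): array
{
    // Map: count words locally on one shard.
    return array_count_values(str_word_count(strtolower($shard), 1));
}

function reduceCounts(array $partials): array
{
    // Reduce: merge the partial counts into a global tally.
    $total = [];
    foreach ($partials as $counts) {
        foreach ($counts as $word => $n) {
            $total[$word] = ($total[$word] ?? 0) + $n;
        }
    }
    return $total;
}

$shards   = ['to be or not to be', 'be quick'];
$partials = array_map('mapShard', $shards);   // runs per storage node
$counts   = reduceCounts($partials);          // merges small summaries
// $counts['be'] === 3, $counts['to'] === 2
```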
49. Messaging
ZeroMQ: PUSH-PULL, REQ-REP, PUB-SUB (multicast, broadcast).
Internal communication: pass messages to the next processing stage in the pipeline, control events, monitoring.
Very high throughput. Socket library.
Kafka/Redis: PUSH-PULL with persistence.
Internal message / workload buffering and distribution.
Node.js: WebSockets / HTTP Streaming.
Message delivery (output).
52. Message queues as Buffers (Decoupling)
[Diagram: Producer → queue → Consumer]
Unpredictable load spikes → load normalisation / smoothing
Batching ⇒ higher throughput
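Batching trades a little latency for throughput: a consumer drains several queued items per round trip instead of one. A minimal sketch of the batching step with plain arrays (`array_chunk` stands in for reading N items off the queue at once):

```php
<?php
// Minimal batching sketch: instead of handling queued items one by one,
// drain them in fixed-size batches to amortise per-round-trip overhead.
function makeBatches(array $queued, int $batchSize): array
{
    return array_chunk($queued, $batchSize);
}

$queued  = ['m1', 'm2', 'm3', 'm4', 'm5'];
$batches = makeBatches($queued, 2);
// $batches === [['m1', 'm2'], ['m3', 'm4'], ['m5']]
// Each batch is then processed (and acknowledged) in a single step.
```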
54. Redis Processing Queue
<?php // producer(s)
$redis = new Redis();
$redis->connect('127.0.0.1', 6379, 1.5); // timeout: 1.5 seconds
...
// push items to the queue as they are produced
$redis->lPush('queue:xyz', $item);
...
<?php // consumer(s)
...
while (true) {
    // read items off the queue as they become available;
    // block for up to 2 seconds (timeout)
    // brPop returns array(key, value), or an empty array on timeout
    $item = $redis->brPop('queue:xyz', 2);
    ...
}
https://github.com/nicolasff/phpredis
https://github.com/chrisboulton/php-resque
60. ZeroMQ Producer (PUSH)
<?php
$context = new ZMQContext();
$producer = $context->getSocket(ZMQ::SOCKET_PUSH);
$producer->bind('tcp://*:5555');
// send tasks to workers.
foreach ($tasks as $task) {
// Blocking operation until the message
// is received by one (and only one) worker
$producer->send($task);
}
...
http://zguide.zeromq.org/php:all
61. ZeroMQ Consumers (PULL)
<?php
$context = new ZMQContext();
$worker = $context->getSocket(ZMQ::SOCKET_PULL);
$worker->connect('tcp://myhost:5555');
// process tasks forever
while (true) {
// receive a message (blocking operation)
$task = $worker->recv();
...
}
62. 0mq PUSH-PULL (Mux)
[Diagram: Producer 1 pushes R1, R2, R3; Producer 2 pushes R4; Producer 3 pushes R5, R6; the Consumer pulls with fair-queuing: R1, R4, R5, R2, R6, R3]
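A ZeroMQ PULL socket fair-queues across its connected producers, i.e. it round-robins over their pending messages. A minimal sketch of that interleaving in plain PHP (no sockets involved), reproducing the ordering in the diagram:

```php
<?php
// Fair-queuing sketch: interleave pending messages from several
// producers round-robin, the way a PULL socket drains its peers.
function fairQueue(array $producers): array
{
    $out = [];
    while (array_filter($producers)) {        // while any producer has messages
        foreach ($producers as &$queue) {
            if ($queue) {
                $out[] = array_shift($queue); // one message per producer per round
            }
        }
        unset($queue);
    }
    return $out;
}

$producers = [
    ['R1', 'R2', 'R3'],  // Producer 1
    ['R4'],              // Producer 2
    ['R5', 'R6'],        // Producer 3
];
// Round 1: R1, R4, R5; round 2: R2, R6; round 3: R3
// fairQueue($producers) === ['R1', 'R4', 'R5', 'R2', 'R6', 'R3']
```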
69. Internal “Firehose”
[Diagram: publishers X, Y, Z → Data Bus → subscribers (Alice’s timeline, John’s Inbox, System Monitor, Fred’s Followers, Tech Blog Feed), each subscribing to topic X or topic Y]
70. Internal “Firehose”
Publishers: data feeds, user-generated content, system events, ...
[Diagram: publishers X, Y, Z → Data Bus → subscribers, each subscribing to topic X or topic Y]
71. Internal “Firehose”
Subscribers: applications, services, monitors, routers, repeaters, ...
[Diagram: publishers X, Y, Z → Data Bus → subscribers, each subscribing to topic X or topic Y]
72. Internal “Firehose”
Everyone is connected to the data bus; there is no directed graph.
[Diagram: publishers X, Y, Z → Data Bus → subscribers, each subscribing to topic X or topic Y]
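ZeroMQ PUB-SUB, one natural transport for such a bus, delivers a message to a subscriber whenever the subscription string is a prefix of the message's topic. A minimal sketch of that matching rule in plain PHP (no actual sockets; subscriber names are illustrative):

```php
<?php
// Prefix-matching sketch: a PUB-SUB bus delivers a message to every
// subscriber whose subscription string is a prefix of the message topic.
function interestedSubscribers(string $topic, array $subscriptions): array
{
    $hit = [];
    foreach ($subscriptions as $subscriber => $prefix) {
        if (strncmp($topic, $prefix, strlen($prefix)) === 0) {
            $hit[] = $subscriber;
        }
    }
    return $hit;
}

$subscriptions = [
    'alice_timeline' => 'X',  // topic X and anything under it
    'john_inbox'     => 'Y',
    'system_monitor' => '',   // empty prefix: receives everything
];
// interestedSubscribers('X.user.update', $subscriptions)
//   === ['alice_timeline', 'system_monitor']
```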
78. Monitoring: Measure Everything
StatsD
1. Is there a problem? User experience / business metrics monitors.
2. Where is the problem? System monitors (threshold - variance).
3. What is the problem? Application monitors.
Keep the signal-to-noise ratio high.
82. StatsD + Graphite
Example
StatsD: Node.js daemon. Listens for messages over a UDP port and extracts metrics, which are dumped to Graphite for further processing and visualisation.
Graphite: real-time graphing system. Data is sent to carbon (the processing back-end), which stores it in Graphite’s database. Data is visualised via Graphite’s web interface.
83. StatsD Metrics
; statsd.ini
[statsd]
host = yourhost
port = 8125

<?php
$statsTypePrefix = 'workerX.received.type.';
$statsTimeKey = 'workerX.processing_time';
while (true) {
    $batch = $worker->getBatchOfWork();
    foreach ($batch as $item) {
        // time how long it takes to process this item...
        $time_start = microtime(true);
        // ... process item here ...
        $time = (int)(1000 * (microtime(true) - $time_start));
        StatsD::timing($statsTimeKey, $time); // time in ms
        // count items by type
        StatsD::increment($statsTypePrefix . $item['type']);
    }
}
https://github.com/etsy/statsd/
88. Look! Rib cages! Network Load Viz
Not enough! Contextualise metrics.
http://www.network-weathermap.com/ http://cacti.net
89. Cacti + WeatherMap
Example
Cacti: network graphing solution harnessing the power of RRDTool’s data storage and graphing functionality. Provides a fast poller, graph templating, and multiple data-acquisition methods.
Weathermap: Cacti plugin to integrate network maps into the Cacti web UI. Includes a web-based map editor.
94. Monitoring Reporting Guidelines
Make the subtle obvious
Make the complex/busy simple/clean
Group information by context
Detect anomalies/deviation from norm
Turn raw numbers into graphs
Appeal to intuition
97. Lorenzo Alberton
@lorenzoalberton
Thank you!
lorenzo@alberton.info
http://www.alberton.info/talks
https://joind.in/6372
Speaker notes
I’m Lorenzo; I’m Italian but live in the UK. I’ve been working on several large-scale websites like the BBC, Channel 5, Ladbrokes, iPlayer. I spent the past two years as Chief Architect at DataSift, a hot big-data startup.
I’m going to introduce DataSift to explain what we do and how we do it. Don’t worry, this is not a sales pitch; I’m just using DataSift as an example of how to build a scalable architecture based on lessons learnt in the past.
Some architecture porn.
Sources are Twitter, Facebook, YouTube, Flickr, boards, forums, etc. News agencies: Thomson Reuters, Associated Press, Al-Jazeera, NYT, Chicago Tribune, etc. Data normalisation + augmentation: make data rich and structured. Language detection, demographics (gender detection), trends analysis, sentiment analysis, influence ranking, topic analysis, entities.
2nd stage: the core filtering engine. A scalable, highly parallel, custom-built C++ virtual machine. It can process thousands of incoming messages per second, and thousands of custom filters.
Web site, public API, output streams (HTTP Streaming, WebSockets), buffered streams (batches of messages), and finally...
...storage. We record everything in our Hadoop cluster (historical access, analytics). We also have watchdogs to keep track of usage limits, licenses, etc.
I’m going to give you some numbers to give you a sense of the scale we’re operating at. Between 3 and 9K/sec, depending on the time of the day.
Now, everyone here has heard about service-oriented architectures, but I’m going to share some of the lessons I learnt in the past on how to scale a platform, which helped me design and scale DataSift and other large enterprise sites before it.
The first characteristic of a SOA is having several loosely-coupled services. Separate consumers from service implementation. Orchestration of distinct units accessible over a network. Communication with data in a well-defined, interoperable format.
Having decoupled services means you can scale each one horizontally. If a service is under heavy load, on fire, you can add more nodes of the same kind to keep the service up, without having to duplicate the entire monolithic platform.
Avoid failover (hot-swap) configurations: they don’t work well and usually involve downtime or data loss. Cells provide a unit of parallelization that can be adjusted to any size as the user base grows. Cells are added incrementally as more capacity is required. Cells isolate failures: one cell failure does not impact other cells. Cells provide isolation, as the storage and application horsepower to process requests is independent of other cells. Cells enable nice capabilities like the ability to test upgrades, implement rolling upgrades, and test different versions of software. Cells can fail, be upgraded, and be distributed across datacenters independently of other cells.
As an example, this is the current cardinality of servers we have for each service. Each box in the diagram has between 2 and 60+ nodes.
Let’s have a look at how to practically implement load balancing and application caching.
You can buy a hardware appliance (excellent, expensive), or use software like HAProxy. Set the service nodes as backend servers. HAProxy will do health checks and reroute the traffic to the healthy nodes.
Use a random director to have weights (send more load to a more powerful machine). The random director uses a random number to seed the backend selection. The client director picks a backend based on the client’s identity; you can set the VCL variable client.identity to identify the client by picking up the value of a session cookie or similar. The hash director picks a backend based on the URL hash value (req.hash). The fallback director picks the first backend that is healthy; it considers them in the order in which they are listed in its definition.
It works out of the box: just set Cache-Control headers. It supports ETags to cache several versions of the same page for different customers. Edge-Side Includes.
We’ve seen some characteristics of service-oriented architectures: what they are and why they are useful. There’s another incredibly important defining characteristic of SOAs: the API, i.e. the contract between any two services. It’s a software-to-software interface, not a user interface.
Keep it simple: RESTful verbs, actions on resources, simple data structures in the exchange format. Define the action, the endpoint, the parameters, the response. Reserve an endpoint for a description of the service’s API. Use the response to generate API docs, and feed it to a test console as configuration.
I recommend a tool that really brings your API docs alive. Mashery I/O Docs: an example of working documentation. Define an API for all services (internal AND external). Reserve an endpoint to describe the API for the service itself. RESTful; personal preference for a plain-text format (XML or JSON).
Reserve the root endpoint (or a /discovery or /self endpoint) for a description of the service’s API. Bonus: if the response is in the Mashery I/O Docs format, you can have a web interface to document and test the API.
Instead of hard-coding the configuration of all the services everywhere, expose the configuration via a separate service.
ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. It looks like a distributed file system: each node can have children and properties. Each service can register itself at startup and become available to receive requests.
The consumer simply reads the properties of a node (file/path).
As we saw, each component should be able to scale horizontally.
There are two possible problems: when the processing itself is expensive, and when there’s too much data.
Internally, use queues and workers to make processes asynchronous and distribute data to parallel workers. curl_multi, low timeouts.
Don’t move the data to the processing nodes: I/O is very expensive.
2nd part of the talk: moving data around (communication across services). Asynchronous communication, decoupling (buffers), load balancing, distribution, high throughput; in-memory, persistent, distributed.
At DataSift we use different messaging systems, depending on volume, destination, and communication type.
Source/sink, producer/consumer. Asynchronous communication, decoupling (buffers), load balancing, distribution, high throughput; in-memory, persistent, distributed.
We’ve seen simple buffering; let’s now see a few more useful patterns. The first example shows how to move from one processor to several nodes, to distribute the data and process it in parallel. PUSH-PULL is an efficient pattern for workload distribution.
Workload distribution with workers.
You can also invert producers and consumers and have a multiplexer joining messages coming from several nodes back into a single one.
The second pattern shows how to distribute data in a non-exclusive way: each consumer gets a copy of the same data; items are not removed from the queue when one consumer gets them. The producer doesn’t need to know who’s listening; it doesn’t need a registry of addresses of connected consumers. Mongrel2.
You can also broadcast to different datacenters. Listeners can subscribe to one or more topics. Different output channels. ZeroMQ v3: filtering is done on the publisher side.
Broadcasting.
An interesting idea if you have a highly dynamic site / service, with each update affecting several other users / pages, is to have an internal data bus that carries all the information, with updates labelled with topics, and all the services/users subscribing to the relevant topics.\nThumbler: internal firehose. Each service subscribes to interesting events.\n
Statistics are better than logs. At certain volumes, logs are just noise (and a waste of space); make your application dynamically configurable so logging can be turned on only when strictly necessary. StatsD / Graphite.
Monitor everything. Set alerts based on deviation from the norm, not just on absolute thresholds.
Logging at scale is useless: too much noise. Instrumentation is essential.
You need to identify bottlenecks quickly or suffer prolonged and painful outages. The question "How come we didn't catch that earlier?" addresses the incident, not the problem. The alternative question, "What in our process is flawed that allowed us to launch the service without the appropriate monitoring to catch such an issue?", addresses the people and processes that allowed the event you just had, and every other event for which you didn't have appropriate monitoring.
Designing to be monitored is an approach wherein one builds monitoring into the application rather than around it. "How do we know when it's starting to behave poorly?" First, answer the question "Is there a problem?" with user-experience and business-metric monitors (lower click-through rate, shopping-cart abandonment rate, ...). Then identify WHERE the problem is with system monitors (the weakness here is that they usually rely on threshold alerts, i.e. checking whether something is behaving outside our expectations, rather than alerting when it is performing significantly differently than in the past). Finally, identify WHAT the problem is with application monitoring.
Not all monitoring data is valuable; too much of it only creates noise while wasting time and resources. It's advisable to save only a summary of the reports over time, to keep costs down while still providing value. In the ideal world, incidents and crises are predicted and avoided by a robust monitoring solution.
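Alerting on "performing significantly differently than in the past" can be sketched as a simple deviation-from-recent-history check. This is illustrative only; production systems typically use more robust baselines (e.g. seasonally adjusted ones):

```python
import math

def deviation_alert(history, value, k=3.0, min_samples=10):
    """Flag `value` if it deviates more than `k` standard deviations
    from the recent history -- alerting on "different from the past"
    rather than on a fixed absolute threshold."""
    if len(history) < min_samples:
        return False  # not enough history to know what "normal" is
    mean = sum(history) / len(history)
    variance = sum((x - mean) ** 2 for x in history) / len(history)
    std = math.sqrt(variance)
    if std == 0:
        return value != mean
    return abs(value - mean) > k * std


# Usage: a steady click-through rate around 10%, then a sudden drop.
ctr_history = [0.10, 0.11, 0.09, 0.10, 0.10, 0.11, 0.09, 0.10, 0.11, 0.10]
```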
We collect millions of events every second.
The importance of people: devops engineers who know what to monitor and how, who can use and write the tools, and who are 100% dedicated. Useful: mobile-phone apps receiving alerts from Zenoss.
We use different technologies. It's very easy to set up a new ZeroMQ listener.
We use StatsD (from Flickr/Etsy), Zenoss, and Graphite.
Here's a photo of our monitoring wall. We even have emergency lighting with a siren, triggered by Zenoss alerts.
With the Etsy library you can sample the sending rate; transport is UDP. We created a wrapper that buffers and aggregates stats in memory for a while and then flushes them at regular intervals, saving a LOT of bandwidth.
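A buffering wrapper of this kind can be sketched as follows. This is an illustrative reconstruction, not the actual wrapper described above; `transport` stands in for whatever sends the UDP packet:

```python
import time
from collections import defaultdict

class BufferedStats:
    """Sketch of a StatsD client wrapper that aggregates counters in
    memory and flushes them as one batch at a fixed interval, instead
    of sending one UDP packet per event."""

    def __init__(self, transport, flush_interval=10.0, clock=time.monotonic):
        self._transport = transport      # callable taking the payload string
        self._interval = flush_interval
        self._clock = clock
        self._counters = defaultdict(int)
        self._last_flush = clock()

    def incr(self, metric, n=1):
        self._counters[metric] += n
        if self._clock() - self._last_flush >= self._interval:
            self.flush()

    def flush(self):
        if self._counters:
            # One packet carrying many metrics, StatsD counter syntax.
            payload = "\n".join(
                f"{m}:{v}|c" for m, v in sorted(self._counters.items())
            )
            self._transport(payload)
            self._counters.clear()
        self._last_flush = self._clock()


# Usage: three increments become a single two-line payload on flush.
sent = []
stats = BufferedStats(sent.append, flush_interval=999)
stats.incr("hits")
stats.incr("hits")
stats.incr("errors")
stats.flush()
```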
Monitor at the application level, the system level, and the infrastructure level. Heatmap of every link in the pipeline (physical and logical). Network rib-cages like this one are NOT ENOUGH! You want to contextualise the metrics you receive.
+ Cacti
\n
When you process real-time data in a complex pipeline made of several stages, you need a way of immediately telling IF there is a problem and WHERE it is. You don't have time to debug; you need to SEE.
Measure throughput and latency.
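Per-stage throughput and latency measurement can be sketched like this (the class and stage names are illustrative):

```python
import time
from collections import defaultdict

class PipelineMonitor:
    """Toy per-stage instrumentation: count events and accumulate
    processing time for each stage, so a dashboard can show WHERE a
    pipeline is slowing down."""

    def __init__(self, clock=time.perf_counter):
        self._clock = clock
        self.counts = defaultdict(int)
        self.seconds = defaultdict(float)

    def timed(self, stage, func):
        """Wrap a stage function so every call is measured."""
        def wrapper(item):
            start = self._clock()
            result = func(item)
            self.seconds[stage] += self._clock() - start
            self.counts[stage] += 1
            return result
        return wrapper

    def report(self):
        return {
            stage: {"events": self.counts[stage],
                    "avg_latency_s": self.seconds[stage] / self.counts[stage]}
            for stage in self.counts
        }


# Usage: instrument a hypothetical "parse" stage and run three events.
monitor = PipelineMonitor()
parse = monitor.timed("parse", lambda text: text.upper())
outputs = [parse(w) for w in ["a", "b", "c"]]
```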
Information density is important, but don't overdo it: keep the signal-to-noise ratio high.
Use colours. Cognitive process: let the visual cortex do the work. Normalise.
Intuition is involuntary, fast, effortless, invisible.
Attention is voluntary, slow, difficult, visible.
\n
happy to talk about any of them\n
- N+1 design: ensure that everything you develop has at least one additional instance of that system in the event of failure.
- Designing the capability to roll back into an app helps limit the scalability impact of any given release.
- Designing to disable features adds the flexibility of keeping the most recent release in production while limiting/containing the impact of offending features or functionality.
- Design to be monitored: you want your system to identify when it's performing differently than it normally does, in addition to telling you when it's not functioning properly.
- Design for multiple live sites: it usually costs less than operating a hot site plus a cold disaster-recovery site.
- Use mature technology: early adopters risk a lot in finding the bugs; availability and reliability are important.
- Asynchronous design: asynchronous systems tend to be more fault-tolerant under extreme load.
- Stateless systems (if necessary, store state with the end users).
- Buy when non-core.
- Scale out, not up (with commodity hardware; horizontal splits in terms of data, transactions, and customers).
- Design for any technology, not for a specific product/vendor.
- N+1 design (ensure that everything you develop has at least one additional instance of that system in the event of failure)\n- Designing the capability to roll back into an app helps limit the scalability impact of any given release.\n- Designing to disable features adds the flexibility of keeping the most recent release in production while limiting / containing the impact of offending features or functionality.\n- Design to be monitored: you want your system to identify when it&#x2019;s performing differently than it normally operates in addition to telling you when it&#x2019;s not functioning properly.\n- Design for multiple live sites: it usually costs less than the operation of a hot site and a cold disaster recovery site.\n- Use mature technology: early adopters risk a lot in finding the bugs; availability and reliability are important.\n- Asynchronous design: asynchronous systems tend to be more fault tolerant to extreme load.\n- Stateless Systems (if necessary, store state with the end users)\n- Buy when non-core\n- Scale out not up (with commodity hardware; horizontal split in terms of data, transactions and customers).\n- Design for any technology, not for a specific product/vendor\n
- N+1 design (ensure that everything you develop has at least one additional instance of that system in the event of failure)\n- Designing the capability to roll back into an app helps limit the scalability impact of any given release.\n- Designing to disable features adds the flexibility of keeping the most recent release in production while limiting / containing the impact of offending features or functionality.\n- Design to be monitored: you want your system to identify when it&#x2019;s performing differently than it normally operates in addition to telling you when it&#x2019;s not functioning properly.\n- Design for multiple live sites: it usually costs less than the operation of a hot site and a cold disaster recovery site.\n- Use mature technology: early adopters risk a lot in finding the bugs; availability and reliability are important.\n- Asynchronous design: asynchronous systems tend to be more fault tolerant to extreme load.\n- Stateless Systems (if necessary, store state with the end users)\n- Buy when non-core\n- Scale out not up (with commodity hardware; horizontal split in terms of data, transactions and customers).\n- Design for any technology, not for a specific product/vendor\n
- N+1 design (ensure that everything you develop has at least one additional instance of that system in the event of failure)\n- Designing the capability to roll back into an app helps limit the scalability impact of any given release.\n- Designing to disable features adds the flexibility of keeping the most recent release in production while limiting / containing the impact of offending features or functionality.\n- Design to be monitored: you want your system to identify when it&#x2019;s performing differently than it normally operates in addition to telling you when it&#x2019;s not functioning properly.\n- Design for multiple live sites: it usually costs less than the operation of a hot site and a cold disaster recovery site.\n- Use mature technology: early adopters risk a lot in finding the bugs; availability and reliability are important.\n- Asynchronous design: asynchronous systems tend to be more fault tolerant to extreme load.\n- Stateless Systems (if necessary, store state with the end users)\n- Buy when non-core\n- Scale out not up (with commodity hardware; horizontal split in terms of data, transactions and customers).\n- Design for any technology, not for a specific product/vendor\n
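The "design to disable features" principle above can be sketched as a minimal feature-flag registry. This is an illustrative sketch with hypothetical names; real systems typically back the flags with a config service or database so operators can flip them without a redeploy.

```python
# Minimal feature-flag registry: a flag can be flipped off at runtime
# to contain a misbehaving feature without rolling back the release.
# (Illustrative sketch; class and flag names are hypothetical.)

class FeatureFlags:
    def __init__(self, defaults=None):
        self._flags = dict(defaults or {})

    def is_enabled(self, name):
        # Unknown flags default to off: the safe choice for new features.
        return self._flags.get(name, False)

    def enable(self, name):
        self._flags[name] = True

    def disable(self, name):
        self._flags[name] = False


flags = FeatureFlags({"new_checkout": True})

def checkout(cart):
    if flags.is_enabled("new_checkout"):
        return "new checkout flow"
    return "legacy checkout flow"   # graceful degradation path

print(checkout([]))                 # new checkout flow
flags.disable("new_checkout")       # ops flips the switch, no redeploy
print(checkout([]))                 # legacy checkout flow
```

The point is that the degradation path is designed in from the start: disabling the offending feature is a configuration change, not a release.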
Synchronous calls, if used excessively or incorrectly, place undue burden on the system and prevent it from scaling. Systems designed to interact synchronously have a higher failure rate than asynchronous ones, and their ability to scale is tied to the slowest system in the chain of communications. It's better to use callbacks, with timeouts so callers can recover gracefully should they not receive responses in a timely fashion.
Synchronisation is when two or more pieces of work must happen in a specific order to accomplish a task. Asynchronous coordination between the original method and the invoked method requires a mechanism by which the original method determines when, or if, a called method has completed executing (callbacks). Ensure callers have a chance to recover gracefully with timeouts should they not receive responses in a timely fashion.
A related problem is stateful versus stateless applications. An application that uses state relies on the current condition of execution as a determinant of the next action to be performed.
There are 3 basic approaches to solving the complexities of scaling an application that uses session data: 1) Avoidance (use no sessions, or sticky sessions, to avoid replication: share-nothing architecture); 2) Decentralisation (store session data in the browser's cookie, or in a db whose key is referenced by a hash in the cookie); 3) Centralisation (store sessions in the db / memcached).
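The timeouts-with-graceful-recovery pattern described above can be sketched with Python's asyncio; a minimal illustration, not tied to any particular framework, where the delays and the fallback value are invented for the example:

```python
import asyncio

async def call_service(delay):
    # Stand-in for a remote call whose latency we don't control.
    await asyncio.sleep(delay)
    return "real result"

async def resilient_call(delay, timeout=0.1):
    # Bound the wait: if the downstream service is slow, recover
    # gracefully with a fallback instead of stalling the caller.
    try:
        return await asyncio.wait_for(call_service(delay), timeout)
    except asyncio.TimeoutError:
        return "fallback result"

fast = asyncio.run(resilient_call(0.01))   # completes within the timeout
slow = asyncio.run(resilient_call(0.5))    # times out, degrades gracefully
print(fast, "/", slow)
```

The caller's latency is now bounded by its own timeout, not by the slowest system in the chain.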
You must be able to isolate and limit the effects of failures within any system by segmenting its components. Decouple, decouple, decouple! A swim lane represents both a barrier and a guide (ensure that swimmers don't interfere with each other; help guide the swimmer toward their objective with minimal effort). AKA shard.
Swim lanes increase availability by limiting the impact of failures to a subset of functionality, and they make incidents easier to detect, identify and resolve. The fewer things are shared between lanes, the more isolative and beneficial the swim lane becomes to both scalability and availability. Lines of communication should not cross lane boundaries, and should always move in the direction of the communication flow. When designing swim lanes, always address the transactions making the company money first (e.g. Search & Browse vs Shopping Cart); then move functions causing repetitive problems into their own swim lanes; finally, consider the natural layout or topology of the site for swim-laning opportunities (e.g. customer boundaries within an app / environment: if you have a tenant who is very busy, assign it its own swim lane; other tenants with low utilisation can all be put into another swim lane).
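The tenant-based swim-laning described above can be sketched as a simple routing table. The tenant and lane names are hypothetical, and in production this routing would live in a load balancer or service discovery layer rather than in application code:

```python
# Route each tenant to a swim lane: a busy tenant gets a dedicated lane,
# low-utilisation tenants share a default lane. (Illustrative sketch.)

DEDICATED_LANES = {
    "big_tenant": "lane-big",   # high-utilisation tenant, isolated
}
DEFAULT_LANE = "lane-shared"    # everyone else shares a lane

def lane_for(tenant):
    return DEDICATED_LANES.get(tenant, DEFAULT_LANE)

print(lane_for("big_tenant"))   # lane-big
print(lane_for("small_shop"))   # lane-shared
```

A failure inside `lane-big` now affects only the busy tenant; the shared lane keeps serving everyone else.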
What is the best way to handle large volumes of traffic? Answer: "Establish the right organisation, implement the right processes and follow the right architectural principles". Correct, but the best way is not to have to handle the traffic at all, and the key to achieving this is pervasive use of caching. The cache hit ratio is the key metric for understanding a cache's effectiveness. The cache can be updated / refreshed via a batch job or on a cache miss. If the cache is full, an eviction algorithm (LRU, MRU, ...) decides which entry to evict. When the data changes, the cache can be updated through a write-back or write-through policy. There are 3 cache types:
- Object caches: used to store objects for the app to reuse, usually serialised objects. The app must be aware of them. They layer in front of the db / external services. Marshalling is the process of transforming an object into a data format suitable for transmission or storage.
- Application caches: A) Proxy caches, usually run by ISPs, universities or corporations: they cache for a limited number of users across an unlimited number of sites. B) Reverse proxy caches (the opposite): they cache for an unlimited number of users across a limited number of applications; the configuration of the specific app determines what can be cached. HTTP headers give much control over caching (Last-Modified, ETag, Cache-Control).
- Content Delivery Networks: they speed up response times, offload requests from your application's origin server, and usually lower costs. The total capacity of the CDN's strategically placed servers can yield higher capacity and availability than the network backbone. The way it works is that you place the CDN's domain name as an alias for your server by using a canonical name (CNAME) in your DNS entry.
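An object cache with LRU eviction and a hit-ratio counter, as described above, can be sketched in a few lines. This is an illustrative in-process sketch; a real deployment would use memcached or a similar shared cache, with the `loader` standing in for the db or external-service call:

```python
from collections import OrderedDict

class LRUCache:
    """Tiny object cache: evicts the least-recently-used entry when full."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.data = OrderedDict()
        self.hits = self.misses = 0

    def get(self, key, loader):
        if key in self.data:
            self.hits += 1
            self.data.move_to_end(key)      # mark as recently used
            return self.data[key]
        self.misses += 1
        value = loader(key)                 # cache miss: fetch from origin
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)   # evict least-recently-used
        return value

    def hit_ratio(self):
        total = self.hits + self.misses
        return self.hits / total if total else 0.0


cache = LRUCache(capacity=2)
load = lambda k: f"value-of-{k}"  # stand-in for a db / service call
cache.get("a", load)   # miss
cache.get("a", load)   # hit
cache.get("b", load)   # miss
cache.get("c", load)   # miss -> evicts "a"
print(cache.hit_ratio())
```

Tracking the hit ratio from day one makes it obvious whether the cache is actually absorbing traffic or just adding a layer.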