Database API Design: Picking GraphQL, REST or gRPC

Designing APIs for Databases:
Which one should you pick: REST, gRPC or GraphQL?
@clunven | #voxxed_lu | @voxxed_lu
Casino 2000, Luxembourg

About me
Cedrick Lunven
Developer Advocate
Creator Contributor

© DataStax, All Rights Reserved.
Agenda
1 APACHE CASSANDRA™ OVERVIEW
2 API DESIGN METHODOLOGY
3 DEMONSTRATION AND CODE REVIEW
4 DECISION TREE

APACHE CASSANDRA™
Quick Overview

This is a DISTRIBUTED Database
Node
• Up to 1TB
• 3000 Tx/s/core

…which scales linearly

Data is distributed and replicated
[Figure: token ring with nodes at tokens 0, 13, 25, 38, 50, 63, 75 and 88; the data item with token 59 is shown replicated with RF=2 and with RF=3]
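The token values listed above can be turned into a small Python sketch of replica placement. This is a simplified illustration, not Cassandra's actual placement code: it assumes each node owns the tokens up to and including its own token, and that replicas are simply the next nodes clockwise on a single ring. The function name `replicas` is made up for the example.

```python
from bisect import bisect_left

# Node tokens taken from the ring shown on the slide.
RING = [0, 13, 25, 38, 50, 63, 75, 88]


def replicas(token, rf, ring=RING):
    """Return the node tokens that hold a row whose partition key hashes to `token`."""
    # First node whose token is >= the row token; wrap around past the last node.
    start = bisect_left(ring, token) % len(ring)
    # Walk clockwise to pick rf distinct nodes (never more than the ring holds).
    return [ring[(start + i) % len(ring)] for i in range(min(rf, len(ring)))]


print(replicas(59, rf=2))  # [63, 75]
print(replicas(59, rf=3))  # [63, 75, 88]
```

Raising RF only adds the next node on the ring, which is why replication does not require moving existing data.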

Cassandra is « AP »
[Figure: CAP triangle with Consistency, Availability and Partition Tolerance]

Tuneable Consistency
RF=3
Client: CL=ONE | CL=QUORUM | CL=ALL
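The arithmetic behind tuneable consistency fits in a few lines. The sketch below uses the usual definition of a quorum (a majority of the replicas) and the rule that a read is guaranteed to see the latest write when read replicas plus write replicas exceed RF. The helper names are illustrative and not part of any driver API.

```python
def quorum(rf):
    """A quorum is a strict majority of the replicas."""
    return rf // 2 + 1


def acks_required(cl, rf):
    """Number of replicas that must answer for a given consistency level."""
    return {"ONE": 1, "QUORUM": quorum(rf), "ALL": rf}[cl]


def read_sees_latest_write(read_cl, write_cl, rf):
    """True when the read set and the write set must share at least one replica."""
    return acks_required(read_cl, rf) + acks_required(write_cl, rf) > rf


print(acks_required("QUORUM", 3))                       # 2
print(read_sees_latest_write("QUORUM", "QUORUM", 3))    # True
print(read_sees_latest_write("ONE", "ONE", 3))          # False
```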

…anywhere
Cluster
Datacenter (Ring)

Sweet spots
1. High Throughput (because we can keep up)
2. High Volume (because we scale linearly and still OLTP)
3. High Availability (replication, masterless)
4. Data distribution (read/write around the globe)

Data Modelling
• A KEYSPACE is like a schema in Oracle: it isolates data.
[Figure: Projet_X Keyspace]
• A table has a PRIMARY KEY made of two parts: the partition key and the rest (clustering columns). Each partition key value is hashed into a token.
• Rows that share the same partition key form a partition.

Cassandra Query Language
• Syntax close to SQL for relational databases
• Objects are created with DDL; familiar statements and clauses include:
  CREATE, INSERT, UPDATE, DELETE, GRANT, REVOKE, SELECT, WHERE
Example
CREATE TABLE market_prices (
symbol TEXT,
date TIMESTAMP,
price DECIMAL,
side INT,
PRIMARY KEY (symbol, date)
) WITH CLUSTERING ORDER BY (date DESC);
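To make the role of the primary key concrete, here is an in-memory Python sketch of the market_prices table. It is only a model of the semantics, written under two assumptions that hold for Cassandra: rows are grouped by partition key (symbol), and writing a row with an existing primary key overwrites it (upsert). The class and method names are invented for the illustration.

```python
from collections import defaultdict
from datetime import datetime
from decimal import Decimal


class MarketPrices:
    """Toy model of PRIMARY KEY (symbol, date) WITH CLUSTERING ORDER BY (date DESC)."""

    def __init__(self):
        # partition key (symbol) -> {clustering column (date): row}
        self._partitions = defaultdict(dict)

    def insert(self, symbol, date, price, side):
        # Same (symbol, date) replaces the previous row: an upsert.
        self._partitions[symbol][date] = {
            "symbol": symbol, "date": date, "price": price, "side": side,
        }

    def select(self, symbol, limit=None):
        # A query targets one partition and returns rows newest first.
        rows = sorted(self._partitions.get(symbol, {}).values(),
                      key=lambda row: row["date"], reverse=True)
        return rows if limit is None else rows[:limit]


table = MarketPrices()
table.insert("AAPL", datetime(2019, 6, 20), Decimal("199.46"), 1)
table.insert("AAPL", datetime(2019, 6, 21), Decimal("198.78"), 1)
table.insert("AAPL", datetime(2019, 6, 20), Decimal("200.00"), 1)  # overwrites
print([str(r["price"]) for r in table.select("AAPL")])  # ['198.78', '200.00']
```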

API DESIGN METHODOLOGY
From zero to hero

Reference Application
http://killrvideo.github.io

API Design Methodology
[Figure: numbered steps 1 to 3 around the modules killrvideo-dse (Drivers, DAO) and killrvideo-api-rest, killrvideo-api-grpc, killrvideo-api-graphql]

Designing Data Model
Entities & Relationships
Queries

Conceptual Data Model

Application Workflow
R1: Find comments related to target video using its identifier
• Get most recent first
• Implement Paging
R2: Find comments related to target user using its identifier
• Get most recent first
• Implement Paging
R3: Implement CRUD operations

Mapping
Q1: Find comments for a video with a known id (show most recent first) → comments_by_video
Q2: Find comments posted for a user with a known id (show most recent first) → comments_by_user
Q3: CRUD Operations

Logical Data Model
comments_by_user
  userid        K
  creationdate  C ↑
  commentid     C ↑
  videoid
  comment
comments_by_video
  videoid       K
  creationdate  C ↑
  commentid     C ↑
  userid
  comment

Physical Data Model
comments_by_user
  userid     UUID      K
  commentid  TIMEUUID  C ↑
  videoid    UUID
  comment    TEXT
comments_by_video
  videoid    UUID      K
  commentid  TIMEUUID  C ↑
  userid     UUID
  comment    TEXT

Schema DDL
CREATE TABLE IF NOT EXISTS comments_by_user (
userid uuid,
commentid timeuuid,
videoid uuid,
comment text,
PRIMARY KEY ((userid), commentid)
) WITH CLUSTERING ORDER BY (commentid DESC);
CREATE TABLE IF NOT EXISTS comments_by_video (
videoid uuid,
commentid timeuuid,
userid uuid,
comment text,
PRIMARY KEY ((videoid), commentid)
) WITH CLUSTERING ORDER BY (commentid DESC);
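The two tables hold the same comments, keyed once per query. The Python sketch below mimics that denormalization in memory: every write goes to both tables, and each read hits exactly one partition, newest first, one page at a time. It rests on several assumptions that are not in the slides: an increasing integer stands in for the TIMEUUID, and paging is done by passing the last commentid seen (`before`) rather than a driver paging state. All names are hypothetical.

```python
from collections import defaultdict
from itertools import count


class CommentStore:
    """In-memory stand-in for comments_by_user and comments_by_video."""

    def __init__(self):
        # partition key -> {commentid: row}; one dict per table
        self.comments_by_user = defaultdict(dict)
        self.comments_by_video = defaultdict(dict)
        # Assumption: an increasing integer plays the role of the TIMEUUID.
        self._ids = count(1)

    def add_comment(self, userid, videoid, comment):
        commentid = next(self._ids)
        row = {"commentid": commentid, "userid": userid,
               "videoid": videoid, "comment": comment}
        # One logical write, two physical writes (one copy per table).
        self.comments_by_user[userid][commentid] = dict(row)
        self.comments_by_video[videoid][commentid] = dict(row)
        return commentid

    @staticmethod
    def _page(partition, page_size, before):
        ids = sorted(partition, reverse=True)          # most recent first
        if before is not None:
            ids = [i for i in ids if i < before]       # resume after last row seen
        return [partition[i] for i in ids[:page_size]]

    def find_by_video(self, videoid, page_size=10, before=None):
        return self._page(self.comments_by_video.get(videoid, {}), page_size, before)

    def find_by_user(self, userid, page_size=10, before=None):
        return self._page(self.comments_by_user.get(userid, {}), page_size, before)


store = CommentStore()
for text in ("first", "second", "third"):
    store.add_comment("user-1", "video-1", text)
page1 = store.find_by_video("video-1", page_size=2)
page2 = store.find_by_video("video-1", page_size=2, before=page1[-1]["commentid"])
print([r["comment"] for r in page1])  # ['third', 'second']
print([r["comment"] for r in page2])  # ['first']
```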

How?
Conceptual Data Model (Entities, Relations)
Application Workflow (Queries)
Database Family (Technos + Table)

DECISION TREE
Because we are serious

Analysis Criteria
📋 Conceptual Data Model
Application Workflow (Queries)
Database Family (Technos + Table)
Caching
Sync vs Async / Reactive
SLA (Volume)
Data Integrity
Filters
Paging
Sort
Latencies
Throughput
Versioning
Confidentiality
Atomicity
Cardinality
Developers
👤 Users/Consumers
Language
Code First vs Schema First
Documentation
Test
Build
Packaging
Api Catalog
Internal vs Public
Techno
Profile
XP

Analysis Matrix

Decision Tree

REST
• Decoupling Client / Server (Schema on read)
• Flexibility: Sync, Async, Reactive + Multi payload
• API lifecycle (Versioning)
• Tooling (API Management, Serverless)
• Verbose payloads (JSON, XML)
• No discoverability
• Not the best fit for command-like (functions) APIs (RPC)
• CRUD superstar
• Relevant for OLTP mutations and statuses
• Public and web APIs
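As a reminder of why REST fits CRUD so naturally, the sketch below maps HTTP verbs and status codes onto the four operations for a comment resource. It is a framework-free toy, not the KillrVideo implementation: the `CommentResource` class, the `handle` method and the integer ids are all assumptions made for the example.

```python
from itertools import count


class CommentResource:
    """Maps HTTP-style verbs onto CRUD for a hypothetical comments resource."""

    def __init__(self):
        self._rows = {}
        self._ids = count(1)

    def handle(self, method, commentid=None, body=None):
        if method == "POST":                       # Create -> 201 Created
            commentid = next(self._ids)
            self._rows[commentid] = dict(body)
            return 201, {"id": commentid, **body}
        if commentid not in self._rows:            # unknown resource
            return 404, None
        if method == "GET":                        # Read -> 200 OK
            return 200, {"id": commentid, **self._rows[commentid]}
        if method == "PUT":                        # Update -> 200 OK
            self._rows[commentid] = dict(body)
            return 200, {"id": commentid, **body}
        if method == "DELETE":                     # Delete -> 204 No Content
            del self._rows[commentid]
            return 204, None
        return 405, None                           # verb not supported


api = CommentResource()
status, created = api.handle("POST", body={"comment": "Nice video"})
print(status, created)                             # 201 {'id': 1, 'comment': 'Nice video'}
print(api.handle("DELETE", commentid=created["id"])[0])  # 204
print(api.handle("GET", commentid=created["id"])[0])     # 404
```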

gRPC
• High performance (HTTP/2, binary serialisation)
• Multiple stubs: Sync, Async, Streaming
• Multi-language (Interoperability)
• Strongly coupled (schema with proto files)
• No discoverability
• Protobuf serialization format
• Distributed network of services (no waits)
• High throughput & streaming use cases
• Command-like, RPC
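The gap between verbose text payloads and binary serialisation is easy to measure. The sketch below encodes the same comment as JSON and as a hand-rolled fixed layout built with the standard `struct` module. This layout is only a stand-in chosen to keep the example dependency-free; it is not the Protobuf wire format, and the function names are invented.

```python
import json
import struct
from uuid import UUID


def encode_json(videoid, userid, comment):
    """Text encoding: field names and UUIDs are spelled out as strings."""
    doc = {"videoid": str(videoid), "userid": str(userid), "comment": comment}
    return json.dumps(doc).encode("utf-8")


def encode_binary(videoid, userid, comment):
    """Binary encoding: 16 raw bytes per UUID, a 2-byte length, then UTF-8 text."""
    text = comment.encode("utf-8")
    return struct.pack(f"!16s16sH{len(text)}s",
                       videoid.bytes, userid.bytes, len(text), text)


def decode_binary(payload):
    """Both sides must agree on the layout up front, hence the strong coupling."""
    vid, uid, size = struct.unpack_from("!16s16sH", payload)
    text = payload[34:34 + size]  # header is 16 + 16 + 2 = 34 bytes
    return UUID(bytes=vid), UUID(bytes=uid), text.decode("utf-8")


video, user = UUID(int=1), UUID(int=2)
as_json = encode_json(video, user, "Nice video")
as_binary = encode_binary(video, user, "Nice video")
print(len(as_binary))                 # 44
print(len(as_binary) < len(as_json))  # True
```

The binary form is smaller because neither field names nor textual UUIDs travel on the wire, and that is also why client and server need the shared schema.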

GraphQL
• Discoverability, documentation
• Custom payloads
• Matches standards (JSON | HTTP)
• Single endpoint (versioning, monitoring, security)
• Complex implementation (tooling, still young)
• Nice for customers, nasty for the DB (N+1 selects)
• BFF: Backend for Frontend
• Service aggregation | composition (joins)
• When bandwidth matters (mobile phones)
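The N+1 select problem is easiest to see by counting queries. In the sketch below a fake database counts every call: the naive resolver fetches the comments of a video and then one author per comment, while the batched resolver collects the author ids first and looks them up in one call. Everything here is hypothetical, including `CountingDb` and its `users_by_ids` batch lookup; in a real GraphQL server a batching helper (a data loader) usually plays that role.

```python
class CountingDb:
    """Fake data source that counts how many queries it receives."""

    def __init__(self, comments, users):
        self.comments, self.users = comments, users
        self.queries = 0

    def comments_for_video(self, videoid):
        self.queries += 1
        return [c for c in self.comments if c["videoid"] == videoid]

    def user_by_id(self, userid):
        self.queries += 1
        return self.users[userid]

    def users_by_ids(self, userids):
        self.queries += 1  # one round trip for the whole batch
        return {uid: self.users[uid] for uid in userids}


def resolve_naive(db, videoid):
    # 1 query for the comments + 1 query per comment for its author.
    return [{"comment": c["comment"], "author": db.user_by_id(c["userid"])["name"]}
            for c in db.comments_for_video(videoid)]


def resolve_batched(db, videoid):
    # 1 query for the comments + 1 query for all distinct authors.
    comments = db.comments_for_video(videoid)
    authors = db.users_by_ids({c["userid"] for c in comments})
    return [{"comment": c["comment"], "author": authors[c["userid"]]["name"]}
            for c in comments]


COMMENTS = [{"videoid": "v1", "userid": u, "comment": t}
            for u, t in (("u1", "first"), ("u2", "second"), ("u3", "third"))]
USERS = {u: {"name": u.upper()} for u in ("u1", "u2", "u3")}

naive_db, batched_db = CountingDb(COMMENTS, USERS), CountingDb(COMMENTS, USERS)
resolve_naive(naive_db, "v1")
resolve_batched(batched_db, "v1")
print(naive_db.queries, batched_db.queries)  # 4 2
```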

RESOURCES
Because COPY/PASTE is the most important skill for developers

References
https://github.com/clun/api_Rest_Grpc_GraphQL
