1. Database as a Service
Seminar, ICDE 2010, Long Beach, March 04
Wolfgang Lehner | Dresden University of Technology, Germany
Kai-Uwe Sattler | Ilmenau University of Technology, Germany
5. Avoid Hidden Costs of Traditional SW
Traditional software: SW licenses, training, customization, hardware, IT staff, maintenance.
SaaS: subscription fee, training, customization.
6. The Long Tail
Dozens of markets of millions, or millions of markets of dozens?
What if you lower your cost of sale (i.e., lower the barrier to entry) and also lower the cost of operations?
New addressable market >> current market.
[Chart: $/customer vs. # of customers, from your large customers through your typical customers to the (currently) "non-addressable" customers in the long tail.]
7. What is Cloud? – Gartner's Definition
Cloud Computing: a style of computing where massively scalable, IT-enabled capabilities are provided "as a service" across the Internet to multiple external customers.
Acquisition model: service ("All that matters is results; I don't care how it is done").
Business model: pay for usage ("I don't want to own assets; I want to pay for elastic usage, like a utility").
Access model: Internet ("I want accessibility from anywhere, from any device").
Technical model: scalable, elastic, shareable, e.g., EC2 & S3 ("It's about economies of scale, with effective and dynamic sharing").
8. To Qualify as a Cloud
Common, location-independent, online utility on demand:*
Common implies multi-tenancy, not single or isolated tenancy.
Utility implies pay-for-use pricing.
On demand implies ~infinite, ~immediate, ~invisible scalability.
Alternatively, a "Zero-One-Infinity" definition:**
0: on-premise infrastructure, acquisition cost, adoption cost, support cost.
1: coherent and resilient environment, not a brittle "software stack".
Infinity: scalability in response to changing need; integratability/interoperability with legacy assets and other services; customizability/programmability from data, through logic, up into the user interface, without compromising robust multi-tenancy.
* Joe Weinman, Vice President of Solutions Sales, AT&T, 3 Nov. 2008
** From the Jargon File: "Allow none of foo, one of foo, or any number of foo"
9. Cloud Differentials: Service Models
Cloud Software as a Service (SaaS): use the provider's applications over a network.
Cloud Platform as a Service (PaaS): deploy customer-created applications to a cloud.
Cloud Infrastructure as a Service (IaaS): rent processing, storage, network capacity, and other fundamental computing resources.
10. Cloud Differentials: Characteristics
Platform: physical – virtual; homogeneous – heterogeneous.
Design paradigms: storage, CPU, bandwidth.
Usage model: exclusive, shared, pseudo-shared.
Size/location: large scale (AWS, Google, IBM/Google), small scale (SMB, academia).
Purpose: general purpose, special purpose (e.g., DB cloud).
Administration/jurisdiction: public, private.
11. Use Cases: Large-Scale Data Analytics
Outsource your data and use cloud resources for analysis.
Historical and mostly non-critical data.
Parallelizable, read-mostly, highly variable workloads.
Relaxed ACID guarantees.
Examples (Hadoop PoweredBy):
Yahoo!: research for ad systems and Web search.
Facebook: reporting and analytics.
Netseer.com: crawling and log analysis.
Journey Dynamics: traffic speed forecasting.
12. Use Cases: Database Hosting
Public datasets:
Biological databases: a single repository instead of >700 separate databases.
Semantic Web data, Linked Data, ...
Sloan Digital Sky Survey.
Twitter cache.
Already on Amazon AWS: annotated human genome data, US census, Freebase, ...
Archiving, metadata indexing, ...
13. Use Cases: Service Hosting
Data management for SaaS solutions.
Run the services near the data (= ASP).
Already many existing applications:
CRM, e.g., Salesforce, SugarCRM.
Web analytics.
Supply chain management.
Help desk management.
Enterprise resource planning, e.g., SAP Business ByDesign.
...
15. Topics Covered in this Seminar
Query & programming model, logical data model, virtualization, multi-tenancy, service level agreements, storage model, distributed storage, replication, security.
16. Current Solutions
User perspective: from one DB for all clients to one DB per client.
Physical perspective: virtualization, replication, distributed storage.
18. Virtualization
Separating the abstract view of computing resources from the implementation of these resources:
adds flexibility and agility to the computing infrastructure,
softens problems related to provisioning, manageability, ...,
lowers TCO: fewer computing resources.
Classical driving factor: server consolidation.
[Figure, EDBT 2008 tutorial (Aboulnaga et al.): e-mail, Web, and database servers, each on its own Linux machine, consolidated as virtual machines onto fewer physical machines; improved utilization through consolidation.]
20. Different Types of Virtualization
[Figure: applications run on guest operating systems inside virtual machines; the virtual machine monitor (VMM) maps virtual CPUs, memory, and network interfaces onto the physical machine and physical storage.]
21. Virtual Machines
Technique with a long history (since the 1960s); prominent since the IBM System/370 mainframe series; today on large-scale commodity hardware and operating systems.
Virtual machine monitor (hypervisor):
strong isolation between virtual machines (security, privacy, fault tolerance),
flexible mapping between virtual machines and physical resources,
classical operations: pause, resume, checkpoint, migrate (administration / load balancing).
Software deployment:
preconfigured virtual appliances,
repositories of virtual appliances on the Web.
22. DBMS on Top of Virtual Machines
... yet another application? ... overhead?
[Figure: SQL Server running within VMware.]
23. Virtualization Design Advisor
Configuring VM parameters: what fraction of node resources goes to which DBMS?
Configuring the DBMS parameters: which parameter settings are best for a given resource configuration?
Example:
Workload 1: TPC-H (10 GB).
Workload 2: TPC-H (10 GB), only Q18 (132 copies).
Virtualization design advisor: 20% of CPU to workload 1, 80% of CPU to workload 2.
24. Some Experiments
Workload definition based on TPC-H:
Q18 is one of the most CPU-intensive queries; Q21 is one of the least CPU-intensive queries.
Workload units: C = 25x Q18, I = 1x Q21.
Experiment: sensitivity to workload resource needs.
W1 = 5C + 5I; W2 = kC + (10-k)I (increasing k -> more CPU-intensive).
[Charts: results for Postgres and DB2.]
26. Virtualization in DBaaS Environments
[Figure, layers top to bottom: DB layer (databases), instance layer (DB instances), DB server layer, VM layer, HW layer.]
35. Layer Interactions (2)
Experiment: DB2 on Linux, TPC-H workload on a 1 GB database.
Ranges for resource grants: main memory (buffer pool) 50 MB to 1 GB; additional storage (indexes) 5% to 30% of DB size.
Varying advisor output (17-26 indexes): different possible improvement, different expected performance after improvement.
[Charts: possible improvement and expected performance of the DB advisor as functions of the VM configuration, i.e., buffer pool size (200 MB to 1 GB) and index storage.]
36. Storage Virtualization
General goals:
provide a layer of indirection to allow the definition of virtual storage devices,
minimize/avoid downtime (local and remote mirroring),
improve performance (distribution/balancing, provisioning, control placement),
reduce the cost of storage administration.
Operations:
create, destroy, grow, shrink virtual devices,
change size, performance, reliability, ... (workload fluctuations, hierarchical storage management),
versioning, snapshots, point-in-time copies (backup, checkpoints),
exploit CPU and memory in the storage system (caching, execute low-level DBMS functions).
37. Virtualization in DBaaS Environments (2)
[Figure, layers top to bottom: DB layer, instance layer, DB server layer, VM layer, HW layer, storage layer (local disk or shared disk).]
44. One Way to Go? Paravirtualization
CPU and memory paravirtualization:
extends the guest to allow direct interaction with the underlying hypervisor,
reduces the monitor cost, including memory and system call operations,
gains from paravirtualization are workload-specific.
Device paravirtualization:
places a high-performance, virtualization-aware device driver into the guest,
paravirtualized drivers are more CPU-efficient (less CPU overhead for virtualization),
paravirtualized drivers can also take advantage of HW features, like partial offload.
45. Outline
Query & programming model, logical data model, virtualization, multi-tenancy, service level agreements, storage model, distributed storage, replication, security.
52. Flexible Schema Approaches: Comparison
Spectrum from best performance to flexible schema evolution: private tables (application owns the schema), extension tables, pivot tables / chunk folding, universal table, XML columns (database owns the schema).
Universal table: requires techniques for handling sparse data; fine-grained index support is not possible.
Pivot table: requires joins for reconstructing logical tuples.
Chunk folding: similar to pivot tables; groups of columns are combined in a chunk and mapped into a chunk table; requires complex query transformation.
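As a minimal sketch (not the authors' implementation; table and column names are illustrative), a pivot table stores every logical cell as one narrow row (tenant, table, row id, column, value), and reconstructing a logical tuple means regrouping those rows, which is the per-column join a real DBMS would have to perform:

```python
from collections import defaultdict

# Pivot-table sketch: every logical cell becomes one narrow row
# (tenant_id, table, row_id, column, value). Names are hypothetical.
pivot_rows = [
    (42, "accounts", 1, "name",  "Acme"),
    (42, "accounts", 1, "phone", "555-0100"),
    (42, "accounts", 2, "name",  "Globex"),
]

def reconstruct(rows, tenant_id, table):
    """Rebuild logical tuples by grouping pivot rows on row_id
    (the joins a real DBMS performs to reconstruct a tuple)."""
    tuples = defaultdict(dict)
    for t, tab, row_id, col, val in rows:
        if t == tenant_id and tab == table:
            tuples[row_id][col] = val
    return dict(tuples)

print(reconstruct(pivot_rows, 42, "accounts"))
# {1: {'name': 'Acme', 'phone': '555-0100'}, 2: {'name': 'Globex'}}
```

Note how tenant 42's second row simply omits the `phone` column: sparse, per-tenant schemas come for free, at the price of the reconstruction work.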
53. Access Control in Multi-Tenant DBs
Shared-DB approaches require row-level access control.
Query transformation: ... WHERE TenantID = 42 ...; potential security risks.
DBMS-level control, e.g., IBM DB2 LBAC (label-based access control):
controls read/write access to individual rows and columns,
security labels with policies,
requires a separate account for each tenant.
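The query-transformation approach can be sketched as follows (a toy string rewriter for illustration only; a production system must rewrite the parsed query tree, which is exactly where the "potential security risks" of naive rewriting come from):

```python
import re

def add_tenant_filter(sql: str, tenant_id: int) -> str:
    """Inject the tenant predicate into a single-table SELECT.
    Purely illustrative: string rewriting is fragile and unsafe
    compared to rewriting the parsed query tree."""
    if re.search(r"\bwhere\b", sql, re.IGNORECASE):
        return re.sub(r"\bwhere\b", f"WHERE TenantID = {tenant_id} AND",
                      sql, count=1, flags=re.IGNORECASE)
    return f"{sql} WHERE TenantID = {tenant_id}"

print(add_tenant_filter("SELECT * FROM orders", 42))
# SELECT * FROM orders WHERE TenantID = 42
print(add_tenant_filter("SELECT * FROM orders WHERE total > 10", 42))
# SELECT * FROM orders WHERE TenantID = 42 AND total > 10
```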
54. In a Nutshell
How shall virtualization be handled on the machine level (VM to HW), the DBMS level (database to instance to database server), and the schema level (multi-tenancy) ...
... using allocation between layers, configuration inside layers, flexible schemas ...
... when characteristics of the workloads are known, virtual machines are transparent, and tenant-specific schema extensions exist ...
... demanding that SLAs and security are respected, each node's utilization is maximized, and the number of nodes is minimized.
55. Outline
Query & programming model, logical data model, virtualization, multi-tenancy, service level agreements, storage model, distributed storage, replication, security.
56. MapReduce Background
Programming model and associated implementation for large-scale data processing; Google, plus related approaches: Apache Hadoop and Microsoft Dryad.
User-defined map and reduce functions; the infrastructure hides the details of parallelization and provides fault tolerance, data distribution, I/O scheduling, load balancing, ...
map(in_key, in_value) -> list of (out_key, intermediate_value)
reduce(out_key, list of intermediate_value) -> list of out_value
57. Logic Flow of WordCount
Input: "Hadoop Map/Reduce is a software framework for easily writing applications which process vast amounts of data (multi-terabyte data-sets) in parallel on large clusters (thousands of nodes) of commodity hardware in a reliable, fault-tolerant manner ..."
Mapper: emits (word, 1) for every word, e.g., (Hadoop, 1), (Map, 1), (Reduce, 1), (is, 1), (a, 1), ...
Sort/shuffle: groups the values per word, e.g., Hadoop -> [1, 1, 1, ..., 1].
Reducer: sums the values per word, e.g., (Hadoop, 5), (Map, 12), (Reduce, 12), (is, 42), (a, 23).
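The word-count flow above can be simulated in a few lines of plain Python. This is a toy model of the map, shuffle, and reduce phases, not Hadoop's API:

```python
from itertools import groupby

def map_fn(_, line):
    # Map phase: emit (word, 1) for every word in the input line.
    for word in line.split():
        yield word.lower(), 1

def reduce_fn(word, counts):
    # Reduce phase: sum all counts shuffled to this word.
    yield word, sum(counts)

def run_mapreduce(records, map_fn, reduce_fn):
    # Map phase over all (key, value) input records.
    intermediate = [kv for k, v in records for kv in map_fn(k, v)]
    # Shuffle/sort: bring all values for the same key together.
    intermediate.sort(key=lambda kv: kv[0])
    out = []
    for key, group in groupby(intermediate, key=lambda kv: kv[0]):
        out.extend(reduce_fn(key, (v for _, v in group)))
    return out

result = dict(run_mapreduce([(0, "hadoop map reduce map")], map_fn, reduce_fn))
print(result)  # {'hadoop': 1, 'map': 2, 'reduce': 1}
```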
58. MapReduce Disadvantages
Extremely rigid data flow; common operations must be coded by hand: join, filter, split, projection, aggregates, sorting, distinct.
User plans may be suboptimal and lead to performance degradation.
Semantics are hidden inside the map and reduce functions: inflexible, difficult to maintain, extend, and optimize.
Remedy: combine high-level declarative querying with low-level MapReduce programming; dataflow programming languages: Hive, JAQL, and Pig.
59. Pig Latin
On top of MapReduce/Hadoop; a mix of the declarative style of SQL and the procedural style of MapReduce.
Consists of two parts:
Pig Latin: a data processing language.
Pig infrastructure: an evaluator for Pig Latin programs; Pig compiles Pig Latin into physical plans that are executed over Hadoop.
30% of all queries at Yahoo! are in Pig Latin.
Open source: http://incubator.apache.org/pig
64. Compilation in MapReduce
Every group or join operation forms a map-reduce boundary; other operations are pipelined into the map and reduce phases.
Example plan: Map1: load Visits; Reduce1: group by url, foreach url generate count; Map2: load URL Info; Reduce2: join on url; Map3/Reduce3: group by category, foreach category generate top-10 URLs.
65. Hive
Data warehouse infrastructure built on top of Hadoop, providing data summarization and ad hoc querying.
Simple query language: Hive QL (based on SQL); extensible via custom mappers and reducers.
Subproject of Hadoop; no special "Hive format" for stored data.
http://hadoop.apache.org/hive/
66. Hive - Example
LOAD DATA INPATH '/data/visits' INTO TABLE visits;
INSERT OVERWRITE TABLE visitCounts
SELECT url, category, count(*) FROM visits GROUP BY url, category;

LOAD DATA INPATH '/data/urlInfo' INTO TABLE urlInfo;
INSERT OVERWRITE TABLE visitCounts
SELECT vc.*, ui.* FROM visitCounts vc JOIN urlInfo ui ON (vc.url = ui.url);

INSERT OVERWRITE TABLE gCategories
SELECT category, count(*) FROM visitCounts GROUP BY category;

INSERT OVERWRITE TABLE topUrls
SELECT TRANSFORM (visitCounts) USING 'top10';
67. JAQL
Higher-level query language for JSON documents, developed at IBM's Almaden research center.
Supports several operations known from SQL: grouping, joining, sorting.
Built-in support for loops, conditionals, recursion; custom Java methods extend JAQL.
JAQL scripts are compiled to MapReduce jobs.
Various I/O: local FS, HDFS, HBase, custom I/O adapters.
http://www.jaql.org/
68. JAQL - Example
registerFunction("top10", "de.tuberlin.cs.dima.jaqlextensions.top10");
$visits = hdfsRead("/data/visits");
$visitCounts = $visits -> group by $url = $ into { $url, num: count($) };
$urlInfo = hdfsRead("/data/urlInfo");
$visitCounts = join $visitCounts, $urlInfo where $visitCounts.url == $urlInfo.url;
$gCategories = $visitCounts -> group by $category = $ into { $category, num: count($) };
$topUrls = top10($gCategories);
hdfsWrite("/data/topUrls", $topUrls);
69. Outline
Query & programming model, logical data model, virtualization, multi-tenancy, service level agreements, storage model, distributed storage, replication, security.
70. ACID vs. BASE
ACID (traditional distributed data management): strong consistency; isolation; focus on "commit"; availability?; pessimistic; difficult evolution (e.g., schema).
BASE = Basically Available, Soft state, Eventually consistent (Web-scale data management): weak consistency; availability first; best effort; optimistic (aggressive); fast and simple; easier evolution.
71. CAP Theorem [Brewer 2000]
Consistency: all clients have the same view, even in case of updates.
Availability: all clients find a replica of the data, even in the presence of failures.
Tolerance to network partitions: system properties hold even when the network (system) is partitioned.
You can have at most two of these properties for any shared-data system.
72. CAP Theorem: Trade-Offs
Forfeit consistency: no consistency guarantees ➟ updates with conflict resolution.
Forfeit availability: on a partition event, simply wait until the data is consistent again ➟ pessimistic locking.
Forfeit partition tolerance: keep all nodes in contact with each other, or put everything in a single box ➟ 2-phase commit.
73. CAP: Explanations
Scenario: (1) process PA := update(o) on one node; (2) a message M propagates the update to the replica; (3) process PB := read(o) on the other node.
Network partitions ➫ M is not delivered.
Solutions?
Synchronous message: <PA, M> is atomic; possible latency problems (availability).
Transaction <PA, M, PB>: requires control over when PB happens; impacts partition tolerance or availability.
74. Consistency Models [Vogels 2008]
Scenario: process A performs the update D0 -> D1 on a distributed storage system; processes B and C then read(D).
Strong consistency: after the update completes, any subsequent access from A, B, or C will return D1.
Weak consistency: does not guarantee that subsequent accesses will return D1 -> a number of conditions need to be met before D1 is returned.
Eventual consistency: a special form of weak consistency; guarantees that, if no new updates are made, eventually all accesses will return D1.
75. Variations of Eventual Consistency
Causal consistency: if A notifies B about the update, B will read D1 (but C need not).
Read-your-writes: A will always read D1 after its own update.
Session consistency: read-your-writes inside a session.
Monotonic reads: if a process has seen Dk, any subsequent access will never return any Di with i < k.
Monotonic writes: guarantees to serialize the writes of the same process.
96. WLM: Model
Workload classification assigns incoming transactions to classes; admission control & scheduling uses the multiprogramming level (MPL) to keep response times within the target.
Admission control: limit the number of simultaneously executing requests (multiprogramming level = MPL).
Scheduling: order requests by priority.
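The admission-control idea above can be sketched with a bounded semaphore: at most MPL requests execute concurrently, and further requests block until a slot frees up (an illustrative sketch, not a production workload manager):

```python
import threading

class AdmissionController:
    """Sketch: cap the number of simultaneously executing requests
    at a fixed multiprogramming level (MPL)."""
    def __init__(self, mpl: int):
        self._slots = threading.BoundedSemaphore(mpl)

    def run(self, request):
        with self._slots:        # blocks while MPL is exhausted
            return request()

ctrl = AdmissionController(mpl=2)
print(ctrl.run(lambda: "done"))  # done
```

A real workload manager would combine this gate with a priority queue in front of it, so high-priority transaction classes acquire the free slots first.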
103. Outline
Query & programming model, logical data model, virtualization, multi-tenancy, service level agreements, storage model, distributed storage, replication, security.
104. Overview and Challenges
The data owner outsources data through a data preprocessor to an un-trusted service provider running the query engine; users send queries through a query pre/post-processor and receive query results.
Challenges: data confidentiality/privacy, private information retrieval / access privacy, and completeness and correctness of query results.
105. Challenges I - Data Confidentiality/Privacy
We need to store data in the cloud, but we do not trust the service providers with sensitive information:
encrypt the data and store it, but still be able to run queries over the encrypted data,
do most of the work at the server.
Two issues: privacy during transmission (well studied, e.g., through SSL/TLS) and privacy of stored data.
Querying over encrypted data is challenging:
needs to maintain content information on the server side, e.g., range queries require order-preserving data encryption mechanisms,
privacy-performance trade-off.
106. Query Processing on Encrypted Data
At the client site, a query translator uses metadata to split the original query into a server-side query and a client-side query. The un-trusted service provider's query engine executes the server-side query over the encrypted data and returns encrypted results into a temporary result; the client-side query executor prunes and post-processes them and returns the final result to the user.
107. Executing SQL over Encrypted Data
Hacigumus et al. (SIGMOD 2002). Main steps:
Partition sensitive domains: order-preserving partitioning supports comparison; with random partitioning, query rewriting becomes hard.
Rewrite queries to target partitions.
Execute queries at the server and return results.
Prune/post-process results on the client.
Privacy-precision trade-off: larger segments/partitions mean increased privacy, decreased precision, and increased overheads in query processing.
109. Create a coarse index for each (or selected) attribute(s) in the original table.
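A coarse index of this kind can be sketched as bucketization (bucket boundaries and labels below are invented for illustration): the server stores only an order-preserving bucket label per value, a range query is rewritten into the set of overlapping buckets, and the client post-filters the false positives:

```python
# Hypothetical order-preserving buckets over a numeric domain: [lo, hi).
BUCKETS = [(0, 200, "B1"), (200, 400, "B2"), (400, 600, "B3")]

def bucket_of(value):
    """Map a plaintext value to its bucket label (stored at the server
    next to the encrypted tuple)."""
    for lo, hi, label in BUCKETS:
        if lo <= value < hi:
            return label
    raise ValueError(value)

def rewrite_range(lo, hi):
    """Server-side rewriting: the range predicate [lo, hi) becomes the
    set of bucket labels it overlaps; the client must prune the
    false positives from the decrypted result."""
    return {label for blo, bhi, label in BUCKETS if blo < hi and lo < bhi}

print(sorted(rewrite_range(150, 450)))  # ['B1', 'B2', 'B3']
```

Coarser buckets leak less about the data distribution but force the client to decrypt and discard more tuples, which is exactly the privacy-precision trade-off stated above.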
112. Challenges II - Private Information Retrieval (PIR)
User queries should be invisible to the service provider.
More formally: the database is modeled as a string x of length n stored at a remote server; the user wants to retrieve the bit xi for some i without disclosing any information about i to the server.
Paradox: imagine buying in a store without the seller knowing what you buy.
113. Information-Theoretic 2-Server PIR
The user picks a random subset Q1 ⊆ {1, ..., n} and sends Q1 to server 1 and Q2 = Q1 Δ {i} (Q1 with the membership of position i flipped) to server 2.
Server 1 answers a1 = XOR of xl for l ∈ Q1; server 2 answers a2 = XOR of xl for l ∈ Q2.
The user computes xi = a1 XOR a2; each server alone sees only a uniformly random subset and learns nothing about i.
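The scheme is a few lines of executable Python (a sketch assuming two honest, non-colluding servers; in this toy version one function plays both servers and the user):

```python
import random
from functools import reduce

def xor_bits(bits, idxs):
    # A server's answer: XOR of the database bits in the query set.
    return reduce(lambda a, b: a ^ b, (bits[j] for j in idxs), 0)

def pir_2server(x, i):
    """Information-theoretic 2-server PIR: the two query sets differ
    only in position i, so XORing the answers cancels every other bit
    and leaves x[i]. Each set alone is uniformly random."""
    n = len(x)
    q1 = {j for j in range(n) if random.random() < 0.5}
    q2 = q1 ^ {i}              # flip membership of position i
    a1 = xor_bits(x, q1)       # computed by server 1
    a2 = xor_bits(x, q2)       # computed by server 2
    return a1 ^ a2             # equals x[i]

x = [1, 0, 0, 1, 1, 0, 1]
assert all(pir_2server(x, i) == x[i] for i in range(len(x)))
```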
114. Conclusion & Outlook
Current infrastructures: MS Azure, Amazon RDS + SimpleDB, Amazon Dynamo, Google BigTable, Yahoo! PNUTS.
Conclusion; challenges & trends.
115. Current Solutions
Systems positioned between "one DB for all clients" and "one DB per client", and across the physical techniques of virtualization, replication, and distributed storage: Amazon SimpleDB / Dynamo, Amazon RDS, Yahoo! PNUTS, Google Bigtable, Cassandra, Voldemort, Amazon S3, Microsoft SQL Azure.
116. Microsoft SQL Azure
Cloud database service for the Azure platform.
Allows creating a SQL server = a group of databases spread across multiple physical machines (incl. geo-location).
Supports the relational model and T-SQL (tables, views, indices, triggers, stored procedures).
Deployment and administration using SQL Server Management Studio.
Current limitations: individual database size max. 10 GB; no support for CLR, distributed queries & transactions, spatial data.
117. Microsoft SQL Azure: Details
Databases implemented as replicated data partitions across multiple physical nodes, providing load balancing and failover.
API: SQL, ADO.NET, ODBC; Tabular Data Streams; SQL Server authentication; Sync Framework.
Prices: 1 GB database $9.99/month, 10 GB $99.99/month, plus data transfer; SLA: 99.9% availability.
122. Amazon RDS
Amazon Relational Database Service: a Web service to set up and operate a MySQL database.
Full-featured MySQL 5.1; automated database backup.
Java-based command-line tools and Web service API for instance administration; native DB access.
Prices: small DB instance (1.7 GB memory, 1 ECU) $0.11/hour; largest DB instance (68 GB, 26 ECU) $3.10/hour; plus $0.10 per GB-month of storage and data transfer.
123. Amazon Data Services
Amazon Simple Storage Service (S3):
distributed blob storage for objects (1 byte ... 5 GB of data),
REST-based interface to read, write, and delete objects identified by a unique, user-defined key,
atomic single-key updates; no locking,
eventual consistency (partially read-after-write),
Aug 2009: more than 64 billion objects.
Amazon SimpleDB (= Amazon Dynamo???):
distributed structured storage, Web service API for access, eventual consistency.
128. Amazon Dynamo
Highly available and scalable key-value data store for the Amazon platform.
Manages the state of Amazon services providing best-seller lists, shopping carts, customer preferences, product catalogs -> these require only primary-key access (e.g., product id, customer id).
Completely decentralized; minimal need for manual administration (e.g., partitioning, redistribution).
Assumptions:
simple query model: put/get operations on keys, small objects (< 1 MB),
weaker consistency but high availability ("always writable" data store), no isolation guarantees,
efficiency: running on commodity hardware, guaranteed latency = SLAs, e.g., 300 ms response time for 99.9% of requests at a peak load of 500 requests/sec.
129. Dynamo: Partitioning and Replication
Partitioning scheme based on consistent hashing; virtual nodes: each physical node is responsible for more than one virtual node.
Replication: each data item is replicated at N nodes.
[Figure: key space as a ring with nodes A-E; node C is responsible for the range (B, C]; replicas of keys from range (B, C) are stored on the successor nodes.]
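Consistent hashing with virtual nodes can be sketched as follows (an illustrative model, not Dynamo's implementation; MD5 and 8 virtual nodes per physical node are arbitrary choices here):

```python
import bisect
import hashlib

class ConsistentHashRing:
    """Sketch of Dynamo-style partitioning: each physical node owns
    several virtual nodes (tokens) on a hash ring; a key is stored on
    the first virtual node clockwise from the key's hash."""
    def __init__(self, nodes, vnodes=8):
        self.ring = sorted(
            (self._hash(f"{n}#{v}"), n) for n in nodes for v in range(vnodes)
        )
        self.tokens = [t for t, _ in self.ring]

    @staticmethod
    def _hash(key):
        return int(hashlib.md5(key.encode()).hexdigest(), 16)

    def owner(self, key):
        # Walk clockwise to the next token; wrap around at the end.
        i = bisect.bisect(self.tokens, self._hash(key)) % len(self.ring)
        return self.ring[i][1]

ring = ConsistentHashRing(["A", "B", "C"])
print(ring.owner("cart:1234"))  # one of A, B, C, stable for this key
```

Adding or removing a physical node only remaps the key ranges adjacent to its tokens, which is why Dynamo needs so little manual redistribution; the N-1 replicas would go to the next distinct physical nodes clockwise.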
130. Dynamo: Data Versioning
Provides eventual consistency -> asynchronous propagation of updates; updates result in a new version of the data.
Vector clocks for capturing causality between different versions of the same object: a vector clock is a list of (node, counter) pairs; it determines causal ordering / parallel branches of versions.
Update requests have to specify which version is to be updated; reconciliation happens during client reads!
[Example: write(D)@NA yields D1([NA,1]), then D2([NA,2]); write(D)@NB yields D3([NA,2],[NB,1]) in parallel with write(D)@NC yielding D4([NA,2],[NC,1]); reconcile(D)@NA merges them into D5([NA,3],[NB,1],[NC,1]).]
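The causality check on vector clocks can be sketched in a few lines (an illustrative model using the D2/D3/D4 versions from the example above):

```python
def descends(a, b):
    """True if vector clock a (dict: node -> counter) dominates b,
    i.e. the version stamped a causally follows the version stamped b."""
    return all(a.get(n, 0) >= c for n, c in b.items())

def concurrent(a, b):
    # Neither dominates the other: parallel branches that a client
    # read must reconcile.
    return not descends(a, b) and not descends(b, a)

d2 = {"NA": 2}
d3 = {"NA": 2, "NB": 1}   # write at NB on top of D2
d4 = {"NA": 2, "NC": 1}   # parallel write at NC on top of D2
print(descends(d3, d2))   # True: D3 supersedes D2, discard D2
print(concurrent(d3, d4)) # True: conflicting branches, reconcile on read
```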
131. Dynamo: Replica Maintenance
Consistency among replicas via a quorum protocol: R nodes must participate in a read, W nodes in a write; R + W > N.
Sloppy quorum: reads/writes are performed on the first N healthy nodes from the preference list (the list of nodes responsible for storing a given key); for highest availability: W = 1.
Replica synchronization via anti-entropy with Merkle trees: hash trees where leaves are hashes of keys and non-leaves are hashes of their children; if the hash values of two nodes are equal, there is no need to check their children.
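The Merkle-tree comparison can be sketched as follows (a simplified model: only the root is compared here, whereas real anti-entropy descends the tree to locate the differing key ranges):

```python
import hashlib

def h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def merkle_root(leaves):
    """Build a Merkle tree bottom-up: leaves are hashes of key-value
    entries, parents are hashes of their two children. Equal roots
    mean the replicas agree on the whole key range."""
    level = [h(x) for x in leaves]
    while len(level) > 1:
        if len(level) % 2:
            level.append(level[-1])   # duplicate last hash on odd levels
        level = [h(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]

replica1 = [b"k1=v1", b"k2=v2", b"k3=v3"]
replica2 = [b"k1=v1", b"k2=STALE", b"k3=v3"]
print(merkle_root(replica1) == merkle_root(replica1))  # True: in sync
print(merkle_root(replica1) == merkle_root(replica2))  # False: diverged
```

Because only subtrees with differing hashes need to be exchanged, two replicas that agree on most keys synchronize with logarithmic, not linear, comparison cost.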
132. Google BigTable
Fast and large-scale DBMS for Google applications and services, designed to scale into the PB range.
Uses the distributed Google File System (GFS) for storing data and log files.
Depends on a cluster management system for managing resources, monitoring state, scheduling, ...
Can be used as input source and output target for MapReduce programs.
133. BigTable: Data Model
Bigtable = a sparse, distributed, multi-dimensional sorted map, indexed by (row key, column key, timestamp) -> value (an array of bytes).
Row keys up to 64 KB; column keys grouped into column families; timestamps (64-bit integers) used for versioning.
Data is maintained in lexicographic order by row key; the row range is dynamically partitioned ➪ a tablet is the unit of distribution and load balancing.
Read/write operations under a single row key are atomic.
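The data model is essentially a versioned, sorted map, which a small sketch makes concrete (an in-memory toy, not Bigtable's storage layer; the row and column names follow the paper's web-table example):

```python
from collections import defaultdict

class SparseTable:
    """Sketch of the Bigtable data model: a sparse, sorted map
    (row key, column key, timestamp) -> value, versioned per cell."""
    def __init__(self):
        self._cells = defaultdict(list)   # (row, column) -> [(ts, value)]

    def put(self, row, column, ts, value):
        self._cells[(row, column)].append((ts, value))
        self._cells[(row, column)].sort(reverse=True)   # newest first

    def get(self, row, column):
        versions = self._cells.get((row, column))
        return versions[0][1] if versions else None     # latest version

    def scan(self, row_prefix):
        # Rows kept in lexicographic order make prefix scans cheap,
        # which is why related rows (e.g., one domain) cluster together.
        return sorted(k for k in self._cells if k[0].startswith(row_prefix))

t = SparseTable()
t.put("com.cnn.www", "contents:", 1, b"<html>v1")
t.put("com.cnn.www", "contents:", 2, b"<html>v2")
print(t.get("com.cnn.www", "contents:"))  # b'<html>v2'
```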
134. BigTable: System Architecture
Single-master distributed storage system.
The master server is responsible for assigning tablets to tablet servers, load balancing across tablet servers, detecting the addition and expiration of tablet servers, and garbage collection of GFS files.
Tablet servers manage sets of tablets (10...1000 tablets per server, 100...200 MB per tablet), handle read/write requests, and split tablets.
The distributed, persistent lock/name service Chubby uses Paxos for replica consistency (5 replicas); it provides a namespace consisting of directories and files and allows discovery of tablet servers.
135. BigTable: Tablets
Tablets are internally stored in SSTables: immutable, sorted files of key-value pairs, organized in 64 KB blocks plus an index (block ranges).
Tablet location: Chubby contains the location of the root tablet; the root tablet contains the locations of all tablets in a METADATA table; each METADATA tablet contains the locations of user tablets plus the end row key (a sparse index).
This three-level scheme addresses 2^34 tablets; locations are cached by the client library.
136. BigTable: Tablets /2
Tablet assignment: starting tablet servers acquire an exclusive lock in Chubby -> allows discovery of tablet servers; the master periodically checks the lock status of tablet servers; replication of data is performed by GFS.
Tablet serving: updates (mutations) are logged and then applied to an in-memory version (the memtable).
Compactions: convert the memtable into an SSTable; merge SSTables.
137. Yahoo! PNUTS
Yahoo!'s data serving platform.
Data & query model:
simple relational model: tables of records with attributes (incl. blob types),
flexible schema evolution by adding attributes at any time,
queries: single-table selection & projection,
updates & deletions based on primary-key access.
Storage model: records as parsed JSON objects; filesystem-based hash tables or the MySQL InnoDB engine.
147. References
G. DeCandia et al.: Dynamo: Amazon's Highly Available Key-value Store. SOSP 2007.
P. Bernstein et al.: Data Management Issues in Supporting Large-scale Web Services. IEEE Data Engineering Bulletin, Dec. 2006.
M. Brantner et al.: Building a Database on S3. SIGMOD 2008.
A. Aboulnaga, C. Amza, K. Salem: Virtualization and Databases: State of the Art and Research Challenges. EDBT 2008: 746-747.
A. A. Soror, U. F. Minhas, A. Aboulnaga, K. Salem, P. Kokosielis, S. Kamath: Automatic Virtual Machine Configuration for Database Workloads. SIGMOD 2008: 953-966.
C. Olston, B. Reed, U. Srivastava, R. Kumar, A. Tomkins: Pig Latin: A Not-So-Foreign Language for Data Processing. SIGMOD 2008.
R. Pike, S. Dorward, R. Griesemer, S. Quinlan: Interpreting the Data: Parallel Analysis with Sawzall. Scientific Programming 13(4): 277-298, October 2005.
148. References (continued)
R. Chaiken, B. Jenkins, P.-Å. Larson, B. Ramsey, D. Shakib, S. Weaver, J. Zhou: SCOPE: Easy and Efficient Parallel Processing of Massive Datasets. PVLDB 1(2), August 2008.
B. Hore, S. Mehrotra, G. Tsudik: A Privacy-Preserving Index for Range Queries. VLDB 2004: 720-731.
H. Hacigümüş, B. Iyer, C. Li, S. Mehrotra: Executing SQL over Encrypted Data in the Database-Service-Provider Model. SIGMOD 2002.
D. Agrawal, A. El Abbadi, F. Emekçi, A. Metwally: Database Management as a Service: Challenges and Opportunities. ICDE 2009: 1709-1716.
A. Shamir: How to Share a Secret. Communications of the ACM 22(11): 612-613, Nov. 1979.
F. Kerschbaum, J. Vayssière: Privacy-Preserving Data Analytics as an Outsourced Service. ACM Workshop on Secure Web Services 2008.
B. Chor, O. Goldreich, E. Kushilevitz, M. Sudan: Private Information Retrieval. FOCS 1995: 41.