SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Downloaden Sie, um offline zu lesen
Modern Database Systems
CS-E5040
Aristides Gionis
Michael Mathioudakis
T.A.: Orestis Kostakis
Spring 2016
what is a Database Management System (DBMS)?
what is a database?
a collection of data
what is a database management system?
... a.k.a. ‘database system’
software to store, access, administer a database
not just a collection of files
provides mechanism to query the data
transfers data between main memory and secondary storage (disk)
enables concurrent access, offers guarantees for data consistency
provides crash recovery mechanisms
provides security and access control
2
why use a DBMS?
discuss
3
why use a DBMS?
separate logical from physical data organization
efficient data access
guarantee data integrity and security
reduce application development time
data administration
4
why study database systems?
discuss
5
why study database systems?
to manage data efficiently
6
consider the following task…
data records that contain information about products viewed or purchased from an online store
task for each pair of Games products, count the number of customers that have purchased both
7
Product Category Customer Date Price Action other...
Portal 2 Games Michael M. 12/01/2015 10€ Purchase
...
FLWR Plant Food Garden Aris G. 19/02/2015 32€ View
Chase the Rabbit Games Michael M. 23/04/2015 1€ View
Portal 2 Games Orestis K. 13/05/2015 10€ Purchase
...
> what challenges does case B pose compared to case A?
hint limited main memory, disk access, distributed setting
case A
10,000 records (0.5MB per record, 5GB total disk space)
10GB of main memory
case B
10,000,000 records (~5TB total disk space) stored across 100 nodes (50GB per node),
10GB of main memory per node
the main message
to manage data efficiently
minimize expensive operations
e.g., disk access
parallelize computation
8
why study database systems?
to manage data efficiently ...
... from different roles
•  develop database systems that match application requirements
•  use database systems efficiently
o  … knowing how a DBMS
•  stores data,
•  processes queries,
•  and accesses data
o  … allows us to
ü  organize data appropriately
ü  design efficient algorithms to process the data
•  combine existing database systems to match requirements
•  large variety of data and applications
•  “one size fits none” - Michael Stonebraker
9
the relational database system
10
database
(data stored on disk)
DBMS
query optimization & execution
relational operators
files and access methods
buffer (memory) management
disk space management
query interface
database
design
application
database user
introductory course
relational dbms
our course
relational dbms
our course
non-relational dbms
previously on ‘database systems’...
§  relational data model
•  relation, attribute, tuple, schema, domain, keys
§  relational algebra
•  projection, selection,
•  cartesian product, natural joins, theta joins, outer joins
•  renaming, constraints
§  structured query language (SQL)
§  schema design
•  functional dependencies, normalization
§  applications
•  embedded sql, drivers
11
modern database systems
beyond the typical relational DBMS setting...
different data models
semi-structured data, unstructured text, graphs
operations at massive scale
big data platforms & map-reduce paradigm,
hadoop and spark, cloud computing
tailored performance
key-value stores, column-stores,
in-memory databases, streaming systems
12
about this course
familiarize ourselves with modern database systems
principles and practice
database models: data, queries, and computation
algorithms - simple queries (e.g., joins) to complex algorithms
experience with real technologies
emphasis is on understanding of core issues…
essentially: the cost of algorithms for different database models and settings
you can use what you learn here to:
select a database system that fits the demands of your application...
… based on supported data model, functionality, optimizations, scalability
design your database to fit the needs of your application
… e.g., by building appropriate index structures
write fast algorithms to process your data…
… and estimate their running time
adapt your knowledge to the database system you’ll be using 5 years from now
13
syllabus
14
part 1: relational database systems (Jan 22 & Jan 29 - Michael)
topics relational model (SQL), indexing (b+ trees, hash tables), join algorithms, query optimization
technology MySQL
part 2: semi-structured data (Feb 5 - Aris)
topics semi-structured data abstraction, representation, search, indexing and pipeline aggregation
technology MongoDB
part 3: unstructured text and information retrieval (Feb 12 & Feb 26 - Aris)
topics querying text data, inverted indexes, compression, ranking and evaluation, rank aggregation
technology Lucene
part 4a: big data platforms - mapreduce (Mar 4 & Mar 11 - Michael)
topics mapreduce paradigm, algorithms in mapreduce
technology Hadoop
part 4b: big data platforms - graph databases (Mar 18 & Apr 1 - Aris)
topics the pregel paradigm, algorithms on pregel (pagerank, centrality)
technology hadoop giraph, spark graphx
logistics
15
instructors Aristides Gionis, Michael Mathioudakis
teaching assistant Orestis Kostakis
lectures Friday 10-12, Room T3
weeks 2 - 6, 8 - 11, 13
office hours by appointment starting on Monday January 25th
curriculum
•  slides and course notes
•  no single textbook, but slides will provide references for further study
announcements •  follow course website on mycourses.aalto.fi
when you send email •  aristides.gionis / michael.mathioudakis / orestis.kostakis @ aalto.fi
•  subject: [ModernDB] your topic
programming
assignments
•  we’ll provide instances of VirtualBox, ready for use
•  you can use campus labs or own laptop
•  access to CSC, if needed
workload & grading scheme
●  3 assignments + exam
●  25% each
●  need to have at least 50% on each
○  i.e., cannot skip some of the course
●  assignments:
○  pen & paper
■  based on slides + references
○  programming
■  real-world tools, e.g., MySQL, MongoDB, Spark
■  will provide tutorials
16
that’s all for now!
questions?
next week
relational model and SQL
indexing
access cost analysis
what to do until then
(if you want)
SQL
17
Credits
for some of these slides, we used material from
“Database Systems: The Complete Book”, by Garcia-Mollina, Ullman, Widom
“Database Management Systems”, by Ramakrishnan and Gehrke
18

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction to databases
Introduction to databasesIntroduction to databases
Introduction to databasesBryan Corpuz
 
Week 1 Before the Advent of Database Systems & Fundamental Concepts
Week 1 Before the Advent of Database Systems & Fundamental ConceptsWeek 1 Before the Advent of Database Systems & Fundamental Concepts
Week 1 Before the Advent of Database Systems & Fundamental Conceptsoudesign
 
Week 1 Lab Directions
Week 1 Lab DirectionsWeek 1 Lab Directions
Week 1 Lab Directionsoudesign
 
data resource management
 data resource management data resource management
data resource managementsoodsurbhi123
 
Database system concepts
Database system conceptsDatabase system concepts
Database system conceptsKumar
 
ความรู้เบื้องต้นฐานข้อมูล 1
ความรู้เบื้องต้นฐานข้อมูล 1ความรู้เบื้องต้นฐานข้อมูล 1
ความรู้เบื้องต้นฐานข้อมูล 1Witoon Thammatuch-aree
 
Chapter 6 Database SC025 2017/2018
Chapter 6 Database SC025 2017/2018Chapter 6 Database SC025 2017/2018
Chapter 6 Database SC025 2017/2018Fizaril Amzari Omar
 
Database Concepts and Components
Database Concepts and ComponentsDatabase Concepts and Components
Database Concepts and ComponentsRIAH ENCARNACION
 
Introduction to DBMS(For College Seminars)
Introduction to DBMS(For College Seminars)Introduction to DBMS(For College Seminars)
Introduction to DBMS(For College Seminars)Naman Joshi
 
Basic Concept of Database
Basic Concept of DatabaseBasic Concept of Database
Basic Concept of DatabaseMarlon Jamera
 
database management system (DBMS)
database management system (DBMS)database management system (DBMS)
database management system (DBMS)BigyanDhital1
 
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtapADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtapVikas Jagtap
 
Big data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBig data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBhavya Gulati
 

Was ist angesagt? (20)

Introduction to databases
Introduction to databasesIntroduction to databases
Introduction to databases
 
Week 1 Before the Advent of Database Systems & Fundamental Concepts
Week 1 Before the Advent of Database Systems & Fundamental ConceptsWeek 1 Before the Advent of Database Systems & Fundamental Concepts
Week 1 Before the Advent of Database Systems & Fundamental Concepts
 
Lecture1 data structure(introduction)
Lecture1 data structure(introduction)Lecture1 data structure(introduction)
Lecture1 data structure(introduction)
 
Week 1 Lab Directions
Week 1 Lab DirectionsWeek 1 Lab Directions
Week 1 Lab Directions
 
data resource management
 data resource management data resource management
data resource management
 
Database system concepts
Database system conceptsDatabase system concepts
Database system concepts
 
ความรู้เบื้องต้นฐานข้อมูล 1
ความรู้เบื้องต้นฐานข้อมูล 1ความรู้เบื้องต้นฐานข้อมูล 1
ความรู้เบื้องต้นฐานข้อมูล 1
 
Chapter 6 Database SC025 2017/2018
Chapter 6 Database SC025 2017/2018Chapter 6 Database SC025 2017/2018
Chapter 6 Database SC025 2017/2018
 
Database Concepts and Components
Database Concepts and ComponentsDatabase Concepts and Components
Database Concepts and Components
 
Introduction to DBMS(For College Seminars)
Introduction to DBMS(For College Seminars)Introduction to DBMS(For College Seminars)
Introduction to DBMS(For College Seminars)
 
Database an introduction
Database an introductionDatabase an introduction
Database an introduction
 
11 Database Concepts
11 Database Concepts11 Database Concepts
11 Database Concepts
 
Basic Concept of Database
Basic Concept of DatabaseBasic Concept of Database
Basic Concept of Database
 
database management system (DBMS)
database management system (DBMS)database management system (DBMS)
database management system (DBMS)
 
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtapADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtap
 
Database Part 2
Database Part 2Database Part 2
Database Part 2
 
Unit 01 dbms
Unit 01 dbmsUnit 01 dbms
Unit 01 dbms
 
Database concepts
Database conceptsDatabase concepts
Database concepts
 
Big data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBig data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edge
 
Database Part 1
Database Part 1Database Part 1
Database Part 1
 

Ähnlich wie Modern Database Systems - Lecture 00

Big Data with Not Only SQL
Big Data with Not Only SQLBig Data with Not Only SQL
Big Data with Not Only SQLPhilippe Julio
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 
Dwdmunit1 a
Dwdmunit1 aDwdmunit1 a
Dwdmunit1 abhagathk
 
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera, Inc.
 
Lecture_1_Intro.pdf
Lecture_1_Intro.pdfLecture_1_Intro.pdf
Lecture_1_Intro.pdfpaijitk
 
Minimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationMinimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationDenodo
 
Data Science & Big Data - Theory.pdf
Data Science & Big Data - Theory.pdfData Science & Big Data - Theory.pdf
Data Science & Big Data - Theory.pdfRAKESHG79
 
CH005
CH005CH005
CH005JUC
 
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptxHEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptxssuser0d9ec0
 
Ch 2-introduction to dbms
Ch 2-introduction to dbmsCh 2-introduction to dbms
Ch 2-introduction to dbmsRupali Rana
 
Laudon_MIS13_ch06.ppt
Laudon_MIS13_ch06.pptLaudon_MIS13_ch06.ppt
Laudon_MIS13_ch06.pptanisur_rehman
 
Databases and Information Management (1).ppt
Databases and Information Management (1).pptDatabases and Information Management (1).ppt
Databases and Information Management (1).pptAlaaShaqfa2
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 

Ähnlich wie Modern Database Systems - Lecture 00 (20)

Big Data with Not Only SQL
Big Data with Not Only SQLBig Data with Not Only SQL
Big Data with Not Only SQL
 
DBMS introduction
DBMS introductionDBMS introduction
DBMS introduction
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Database part1-
Database part1-Database part1-
Database part1-
 
Dwdmunit1 a
Dwdmunit1 aDwdmunit1 a
Dwdmunit1 a
 
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
 
Lecture_1_Intro.pdf
Lecture_1_Intro.pdfLecture_1_Intro.pdf
Lecture_1_Intro.pdf
 
Minimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationMinimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data Virtualization
 
Data Science & Big Data - Theory.pdf
Data Science & Big Data - Theory.pdfData Science & Big Data - Theory.pdf
Data Science & Big Data - Theory.pdf
 
CH005
CH005CH005
CH005
 
Introduction to Databases by Dr. Kamal Gulati
Introduction to Databases by Dr. Kamal GulatiIntroduction to Databases by Dr. Kamal Gulati
Introduction to Databases by Dr. Kamal Gulati
 
Database management system
Database management systemDatabase management system
Database management system
 
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptxHEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
HEDW-2020-Using-Data-Virtualization-to-Break-Down-Data-Silos.pptx
 
Ch 2-introduction to dbms
Ch 2-introduction to dbmsCh 2-introduction to dbms
Ch 2-introduction to dbms
 
Laudon_MIS13_ch06.ppt
Laudon_MIS13_ch06.pptLaudon_MIS13_ch06.ppt
Laudon_MIS13_ch06.ppt
 
Databases and Information Management (1).ppt
Databases and Information Management (1).pptDatabases and Information Management (1).ppt
Databases and Information Management (1).ppt
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Big data.ppt
Big data.pptBig data.ppt
Big data.ppt
 
Lecture1
Lecture1Lecture1
Lecture1
 
Foundations of business intelligence databases and information management
Foundations of business intelligence databases and information managementFoundations of business intelligence databases and information management
Foundations of business intelligence databases and information management
 

Mehr von Michael Mathioudakis

Measuring polarization on social media
Measuring polarization on social mediaMeasuring polarization on social media
Measuring polarization on social mediaMichael Mathioudakis
 
Lecture 07 - CS-5040 - modern database systems
Lecture 07 -  CS-5040 - modern database systemsLecture 07 -  CS-5040 - modern database systems
Lecture 07 - CS-5040 - modern database systemsMichael Mathioudakis
 
Lecture 06 - CS-5040 - modern database systems
Lecture 06  - CS-5040 - modern database systemsLecture 06  - CS-5040 - modern database systems
Lecture 06 - CS-5040 - modern database systemsMichael Mathioudakis
 
Modern Database Systems - Lecture 02
Modern Database Systems - Lecture 02Modern Database Systems - Lecture 02
Modern Database Systems - Lecture 02Michael Mathioudakis
 
Mining the Social Web - Lecture 3 - T61.6020
Mining the Social Web - Lecture 3 - T61.6020Mining the Social Web - Lecture 3 - T61.6020
Mining the Social Web - Lecture 3 - T61.6020Michael Mathioudakis
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Michael Mathioudakis
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMichael Mathioudakis
 
Bump Hunting in the Dark - ICDE15 presentation
Bump Hunting in the Dark - ICDE15 presentationBump Hunting in the Dark - ICDE15 presentation
Bump Hunting in the Dark - ICDE15 presentationMichael Mathioudakis
 

Mehr von Michael Mathioudakis (9)

Measuring polarization on social media
Measuring polarization on social mediaMeasuring polarization on social media
Measuring polarization on social media
 
Lecture 07 - CS-5040 - modern database systems
Lecture 07 -  CS-5040 - modern database systemsLecture 07 -  CS-5040 - modern database systems
Lecture 07 - CS-5040 - modern database systems
 
Lecture 06 - CS-5040 - modern database systems
Lecture 06  - CS-5040 - modern database systemsLecture 06  - CS-5040 - modern database systems
Lecture 06 - CS-5040 - modern database systems
 
Modern Database Systems - Lecture 02
Modern Database Systems - Lecture 02Modern Database Systems - Lecture 02
Modern Database Systems - Lecture 02
 
Mining the Social Web - Lecture 3 - T61.6020
Mining the Social Web - Lecture 3 - T61.6020Mining the Social Web - Lecture 3 - T61.6020
Mining the Social Web - Lecture 3 - T61.6020
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020
 
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slidesMining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
Mining the Social Web - Lecture 1 - T61.6020 lecture-01-slides
 
Absorbing Random Walk Centrality
Absorbing Random Walk CentralityAbsorbing Random Walk Centrality
Absorbing Random Walk Centrality
 
Bump Hunting in the Dark - ICDE15 presentation
Bump Hunting in the Dark - ICDE15 presentationBump Hunting in the Dark - ICDE15 presentation
Bump Hunting in the Dark - ICDE15 presentation
 

Kürzlich hochgeladen

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 

Kürzlich hochgeladen (20)

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 

Modern Database Systems - Lecture 00

  • 1. Modern Database Systems CS-E5040 Aristides Gionis Michael Mathioudakis T.A.: Orestis Kostakis Spring 2016
  • 2. what is a Database Management System (DBMS)? what is a database? a collection of data what is a database management system? ... a.k.a. ‘database system’ software to store, access, administer a database not just a collection of files provides mechanism to query the data transfers data between main memory and secondary storage (disk) enables concurrent access, offers guarantees for data consistency provides crash recovery mechanisms provides security and access control 2
  • 3. why use a DBMS? discuss 3
  • 4. why use a DBMS? separate logical from physical data organization efficient data access guarantee data integrity and security reduce application development time data administration 4
  • 5. why study database systems? discuss 5
  • 6. why study database systems? to manage data efficiently 6
  • 7. consider the following task… data records that contain information about products viewed or purchased from an online store task for each pair of Games products, count the number of customers that have purchased both 7 Product Category Customer Date Price Action other... Portal 2 Games Michael M. 12/01/2015 10€ Purchase ... FLWR Plant Food Garden Aris G. 19/02/2015 32€ View Chase the Rabbit Games Michael M. 23/04/2015 1€ View Portal 2 Games Orestis K. 13/05/2015 10€ Purchase ... > what challenges does case B pose compared to case A? hint limited main memory, disk access, distributed setting case A 10,000 records (0.5MB per record, 5GB total disk space) 10GB of main memory case B 10,000,000 records (~5TB total disk space) stored across 100 nodes (50GB per node), 10GB of main memory per node
  • 8. the main message to manage data efficiently minimize expensive operations e.g., disk access parallelize computation 8
  • 9. why study database systems? to manage data efficiently ... ... from different roles •  develop database systems that match application requirements •  use database systems efficiently o  … knowing how a DBMS •  stores data, •  processes queries, •  and accesses data o  … allows us to ü  organize data appropriately ü  design efficient algorithms to process the data •  combine existing database systems to match requirements •  large variety of data and applications •  “one size fits none” - Michael Stonebraker 9
  • 10. the relational database system 10 database (data stored on disk) DBMS query optimization & execution relational operators files and access methods buffer (memory) management disk space management query interface database design application database user introductory course relational dbms our course relational dbms our course non-relational dbms
  • 11. previously on ‘database systems’... §  relational data model •  relation, attribute, tuple, schema, domain, keys §  relational algebra •  projection, selection, •  cartesian product, natural joins, theta joins, outer joins •  renaming, constraints §  structured query language (SQL) §  schema design •  functional dependencies, normalization §  applications •  embedded sql, drivers 11
  • 12. modern database systems beyond the typical relational DBMS setting... different data models semi-structured data, unstructured text, graphs operations at massive scale big data platforms & map-reduce paradigm, hadoop and spark, cloud computing tailored performance key-value stores, column-stores, in-memory databases, streaming systems 12
  • 13. about this course familiarize ourselves with modern database systems principles and practice database models: data, queries, and computation algorithms - simple queries (e.g., joins) to complex algorithms experience with real technologies emphasis is on understanding of core issues… essentially: the cost of algorithms for different database models and settings you can use what you learn here to: select a database system that fits the demands of your application... … based on supported data model, functionality, optimizations, scalability design your database to fit the needs of your application … e.g., by building appropriate index structures write fast algorithms to process your data… … and estimate their running time adapt your knowledge to the database system you’ll be using 5 years from now 13
  • 14. syllabus 14 part 1: relational database systems (Jan 22 & Jan 29 - Michael) topics relational model (SQL), indexing (b+ trees, hash tables), join algorithms, query optimization technology MySQL part 2: semi-structured data (Feb 5 - Aris) topics semi-structured data abstraction, representation, search, indexing and pipeline aggregation technology MongoDB part 3: unstructured text and information retrieval (Feb 12 & Feb 26 - Aris) topics querying text data, inverted indexes, compression, ranking and evaluation, rank aggregation technology Lucene part 4a: big data platforms - mapreduce (Mar 4 & Mar 11 - Michael) topics mapreduce paradigm, algorithms in mapreduce technology Hadoop part 4b: big data platforms - graph databases (Mar 18 & Apr 1 - Aris) topics the pregel paradigm, algorithms on pregel (pagerank, centrality) technology hadoop giraph, spark graphx
  • 15. logistics 15 instructors Aristides Gionis, Michael Mathioudakis teaching assistant Orestis Kostakis lectures Friday 10-12, Room T3 weeks 2 - 6, 8 - 11, 13 office hours by appointment starting on Monday January 25th curriculum •  slides and course notes •  no single textbook, but slides will provide references for further study announcements •  follow course website on mycourses.aalto.fi when you send email •  aristides.gionis / michael.mathioudakis / orestis.kostakis @ aalto.fi •  subject: [ModernDB] your topic programming assignments •  we’ll provide instances of VirtualBox, ready for use •  you can use campus labs or own laptop •  access to CSC, if needed
  • 16. workload & grading scheme ●  3 assignments + exam ●  25% each ●  need to have at least 50% on each ○  i.e., cannot skip some of the course ●  assignments: ○  pen & paper ■  based on slides + references ○  programming ■  real-world tools, e.g., MySQL, MongoDB, Spark ■  will provide tutorials 16
  • 17. that’s all for now! questions? next week relational model and SQL indexing access cost analysis what to do until then (if you want) SQL 17
  • 18. Credits for some of these slides, we used material from “Database Systems: The Complete Book”, by Garcia-Mollina, Ullman, Widom “Database Management Systems”, by Ramakrishnan and Gehrke 18