Data Abstraction for Large Web Applications

•Als KEY, PDF herunterladen•

0 gefällt mir•2,837 views

brandonsavage

Technologie

Who Am I?

• Software developer at Mozilla working on Socorro

• Author of the PHP Playbook

• Former frequent blogger on PHP topics

• Private pilot in my spare time

Data Abstraction For LARGE
Web Applications

Once upon a time...

In a galaxy far far away...

Eventually the web grew up.

And grew larger.

Most webapps still start as
though they’ll always use a
database.

Socorro Data Sources
• Postgres

• REST API (Middleware)

• Elastic Search

• Hbase

• Bugzilla REST API

• Memcache

A database-centric model just
doesn’t work anymore.

Solving the problem

• Separate the use of data from the retrieval of data.

• Think in terms of actions.

• Build our applications to be storage agnostic.

• Use the correct data storage medium.

#1 Separate the use of data
from the retrieval of data

$<?php class MainPage_Controller { /* ... */ public function do_something(){ /* ... */ $sql = ‘SELECT * FROM database”; $results = $this->execute($sql); return $this->executeView(‘index’, array(‘results’ => $results)); } }$

$<?php class Data_Model { /* ... */ public function get_some_data() { $sql = ‘SELECT * FROM database”; $results = $this->execute($sql); /** process results **/ return $processedResults } }$

Processing the data is a
separate layer.

$<?php class Data_Model { /* ... */ public function getSomeData() { $data = $this->adapter->queryData(); ! /** process data here **/ ! return $processedData; } } class Data_Model_Adapter extends MySQL_Adapter implements Adapter { public function queryData() { $sql = ‘SELECT * FROM table’; /** turn into common format **/ return $commonFormatData; } }$

Swapping out data sources
becomes very simple.

Make life easier on yourself: do
it right the ﬁrst time!

Actions move beyond SELECT,
INSERT, UPDATE and DELETE.

Domain Modeling:
“What are you modeling?”

What do I want?
What do I need?
What does this data represent?

Django Models:
One model per table.
All methods relate to SQL.
That sucks.

$<?php abstract class User_Model { public function loadUser(); public function authenticateUser(); public function showUserPhones(); }$

#3 Build our applications to be
storage agnostic

Create custom objects for
typehinting or additional
methods

Avoid expecting built-ins like
PDOStatement and
MongoCursor outside
retrieval layer

Example: memcache isn’t for
long-term storage.

Example: MongoDB is not for
relational data storage.

Relational data goes in relational
databases!

Choose the correct NoSQL
database for your needs.

Availability, reliability, and
consistency.

Pick two.

Consider data storage that isn’t
a database at all.

Alternative data options

• Elastic Search

• Redis

• S3

• The File System (Yes! It still exists!)

Fix it now or ﬁx it later.

But you will have to ﬁx it.

Weitere ähnliche Inhalte

Was ist angesagt?

Advance java session 16Smita B Kumar

Web Programming - 5 Passing and Request DataAndiNurkholis1

Survey on NoSQL integrationLuiz Henrique Zambom Santana

Entities in Drupal 8 - Drupal Tech Talk - Bart FeenstraTriquanta

Building nTier Applications with Entity Framework Services (Part 1)David McCarter

An Introduction to Spring DataOliver Gierke

[Mas 500] Data Basicsrahulbot

Entity Framework Database and Code FirstJames Johnson

#MongoDB indexesDaniele Graziani

NOSQL vs SQLMohammed Fazuluddin

Mysqlguest817344

SQL & NoSQLAhmad Awsaf-uz-zaman

01 nosql and multi model databaseMahdi Atawneh

Big data technologies and databasesHariniA7

Spring Test DBUnitJaran Flaath

Do’s and don’ts of a hybrid environmentRick Vasquez

Tech Gupshup Meetup On MongoDB - 24/06/2016Mukesh Tilokani

Appache Cassandra nehabsairam

SQL vs NoSQLJacinto Limjap

Multi model-databasesMichael Hackstein

Was ist angesagt? (20)

Advance java session 16

Web Programming - 5 Passing and Request Data

Survey on NoSQL integration

Entities in Drupal 8 - Drupal Tech Talk - Bart Feenstra

Building nTier Applications with Entity Framework Services (Part 1)

An Introduction to Spring Data

[Mas 500] Data Basics

Entity Framework Database and Code First

#MongoDB indexes

NOSQL vs SQL

Mysql

SQL & NoSQL

01 nosql and multi model database

Big data technologies and databases

Spring Test DBUnit

Do’s and don’ts of a hybrid environment

Tech Gupshup Meetup On MongoDB - 24/06/2016

Appache Cassandra

SQL vs NoSQL

Multi model-databases

Andere mochten auch

Applications for the Enterprise with PHP (CPEurope)Robert Lemke

Beyond MVC: from Model to DomainJeremy Cook

Software Engineering In PHPRalph Schindler

PHP deployment, 2016 flavor - cakefest 2016Quentin Adam

Advanced PHP: Design Patterns - Dennis-Jan Broersedpc

Proved PHP Design Patterns for Data PersistenceGjero Krsteski

Taming the resource tigerElizabeth Smith

Building Data Mapper PHP5Vance Lucas

Asynchronous I/O in PHPThomas Weinert

Driving Design through ExamplesCiaranMcNulty

PHP Strings and PatternsHenry Osborne

Some REST Design Patterns (and Anti-Patterns) - SOA Symposium 2009Cesare Pautasso

Enterprise PHP: mappers, models and servicesAaron Saray

ORM: Object-relational mappingAbhilash M A

Elegant Ways of Handling PHP Errors and ExceptionsZendCon

Design Patterns avec PHP 5.3, Symfony et PimpleHugo Hamon

Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...Aaron Saray

Database Design PatternsHugo Hamon

Patterns of Enterprise Application Architecture (by example)Paulo Gandra de Sousa

Writing and using php streams and socketsElizabeth Smith

Andere mochten auch (20)

Applications for the Enterprise with PHP (CPEurope)

Beyond MVC: from Model to Domain

Software Engineering In PHP

PHP deployment, 2016 flavor - cakefest 2016

Advanced PHP: Design Patterns - Dennis-Jan Broerse

Proved PHP Design Patterns for Data Persistence

Taming the resource tiger

Building Data Mapper PHP5

Asynchronous I/O in PHP

Driving Design through Examples

PHP Strings and Patterns

Some REST Design Patterns (and Anti-Patterns) - SOA Symposium 2009

Enterprise PHP: mappers, models and services

ORM: Object-relational mapping

Elegant Ways of Handling PHP Errors and Exceptions

Design Patterns avec PHP 5.3, Symfony et Pimple

Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...

Database Design Patterns

Patterns of Enterprise Application Architecture (by example)

Writing and using php streams and sockets

Ähnlich wie Data Abstraction for Large Web Applications

Spring data presentationOleksii Usyk

Elements for an iOS BackendLaurent Cerveau

Minerva: Drill Storage Plugin for IPFSBowenDing4

Data accessJoshua Yoon

Drupal performance and scalabilityTwinbit

Java Developers, make the database work for you (NLJUG JFall 2010)Lucas Jellema

Staying Sane with Drupal NEPHPOscar Merida

BackboneJS Training - Giving Backbone to your applicationsJoseph Khan

Microsoft Entity FrameworkMahmoud Tolba

Machine Learning with ML.NET and Azure - Andy CrossAndrew Flatters

[2015/2016] Local data storage for web-based mobile appsIvano Malavolta

CrawlerLD - Distributed crawler for linked dataRaphael do Vale

Java Web Programming on Google Cloud Platform [2/3] : DatastoreIMC Institute

La sqlJames Johnson

Being RDBMS Free -- Alternate Approaches to Data PersistenceDavid Hoerster

Core data WIPJam workshop @ MWC'14Diego Freniche Brito

Spring Data - Intro (Odessa Java TechTalks)Igor Anishchenko

Midao JDBC presentationZachar Prychoda

Dao examplemyrajendra

Academy PRO: HTML5 Data storageBinary Studio

Ähnlich wie Data Abstraction for Large Web Applications (20)

Spring data presentation

Elements for an iOS Backend

Minerva: Drill Storage Plugin for IPFS

Data access

Drupal performance and scalability

Java Developers, make the database work for you (NLJUG JFall 2010)

Staying Sane with Drupal NEPHP

BackboneJS Training - Giving Backbone to your applications

Microsoft Entity Framework

Machine Learning with ML.NET and Azure - Andy Cross

[2015/2016] Local data storage for web-based mobile apps

CrawlerLD - Distributed crawler for linked data

Java Web Programming on Google Cloud Platform [2/3] : Datastore

La sql

Being RDBMS Free -- Alternate Approaches to Data Persistence

Core data WIPJam workshop @ MWC'14

Spring Data - Intro (Odessa Java TechTalks)

Midao JDBC presentation

Dao example

Academy PRO: HTML5 Data storage

Kürzlich hochgeladen

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Histor y of HAM Radio presentation slidevu2urc

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer

A Call to Action for Generative AI in 2024Results

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Kürzlich hochgeladen (20)

Axa Assurance Maroc - Insurer Innovation Award 2024

Presentation on how to chat with PDF using ChatGPT code interpreter

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Boost PC performance: How more available memory can improve productivity

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

2024: Domino Containers - The Next Step. News from the Domino Container commu...

What Are The Drone Anti-jamming Systems Technology?

08448380779 Call Girls In Friends Colony Women Seeking Men

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Histor y of HAM Radio presentation slide

[2024]Digital Global Overview Report 2024 Meltwater.pdf

Handwritten Text Recognition for manuscripts and early printed texts

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

A Call to Action for Generative AI in 2024

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Tata AIG General Insurance Company - Insurer Innovation Award 2024

Breaking the Kubernetes Kill Chain: Host Path Mount

Data Abstraction for Large Web Applications

1. Data Abstraction for Large Web Applications By Brandon Savage

2. Who Am I? • Software developer at Mozilla working on Socorro • Author of the PHP Playbook • Former frequent blogger on PHP topics • Private pilot in my spare time

3. Data Abstraction For LARGE Web Applications

4. No magic bullets

5. Once upon a time... In a galaxy far far away...

6. Eventually the web grew up. And grew larger.

7. Most webapps still start as though they’ll always use a database.

8. We need to change our thinking.

9. Socorro

10. Socorro Data Sources • Postgres • REST API (Middleware) • Elastic Search • Hbase • Bugzilla REST API • Memcache

11. A database-centric model just doesn’t work anymore.

12. Solving the problem • Separate the use of data from the retrieval of data. • Think in terms of actions. • Build our applications to be storage agnostic. • Use the correct data storage medium.

13. #1 Separate the use of data from the retrieval of data

14. <?php class MainPage_Controller { /* ... */ public function do_something(){ /* ... */ $sql = ‘SELECT * FROM database”; $results = $this->execute($sql); return $this->executeView(‘index’, array(‘results’ => $results)); } }

15. <?php class Data_Model { /* ... */ public function get_some_data() { $sql = ‘SELECT * FROM database”; $results = $this->execute($sql); /** process results **/ return $processedResults } }

16. Processing the data is a separate layer.

17. <?php class Data_Model { /* ... */ public function getSomeData() { $data = $this->adapter->queryData(); ! /** process data here **/ ! return $processedData; } } class Data_Model_Adapter extends MySQL_Adapter implements Adapter { public function queryData() { $sql = ‘SELECT * FROM table’; /** turn into common format **/ return $commonFormatData; } }

18. Swapping out data sources becomes very simple.

19. A cautionary tale

20. Move to middleware in Socorro

21. Make life easier on yourself: do it right the ﬁrst time!

22. #2 Think in terms of actions.

23. Actions move beyond SELECT, INSERT, UPDATE and DELETE.

24. Domain Modeling: “What are you modeling?”

25. What do I want? What do I need? What does this data represent?

26. Django Models: One model per table. All methods relate to SQL. That sucks.

27. <?php abstract class User_Model { public function loadUser(); public function authenticateUser(); public function showUserPhones(); }

28. #3 Build our applications to be storage agnostic

29. Use a standard data format

30. stdClass()

31. Create custom objects for typehinting or additional methods

32. Avoid expecting built-ins like PDOStatement and MongoCursor outside retrieval layer

33. #4 Use the correct storage medium.

34. Example: memcache isn’t for long-term storage.

35. Example: MongoDB is not for relational data storage.

36. Relational data goes in relational databases!

37. Choose the correct NoSQL database for your needs.

38. Availability, reliability, and consistency. Pick two.

39. Consider data storage that isn’t a database at all.

40. Alternative data options • Elastic Search • Redis • S3 • The File System (Yes! It still exists!)

41. Fix it now or ﬁx it later. But you will have to ﬁx it.

42. Question time

Hinweis der Redaktion

\n
\n
\n
\n
Years and years ago, when the web was young, state was maintained simply by the creation of a database. Web applications were mostly small, and databases could easily handle the traffic that was being sent their way. Most of us learned how to write web applications against a database. Most of us used the &#x201C;LAMP stack&#x201D; or Linux Apache MySQL PHP.\n
As the web grew up, and grew bigger, methods for obtaining, storing and using data changed.\n\nDevelopers began using data sources provided by others, first over SOAP then REST. Other data stores like NoSQL, Redis, Elastic Search and Memcache came along to complicate things. \n\nIt was no longer all about the database. The database was just one piece of the puzzle.\n
Yet if we take a good look at most of the frameworks available, they&#x2019;re database-centric. For a long time, Doctrine support for other data layers was non-existent. Support for something other than a database in Django is non-existent. We still think in a database-centric way. Or data layers are still database-focused.\n
The bottom line: we need to change our thinking.\n\nDatabases are not it. Even for applications that start against a database (and that&#x2019;s most if not all of them), we need to think about the other ways that we&#x2019;ll ingest data.\n
This lesson was painful for those of us working on Socorro. Initially built as a database-centric application we&#x2019;ve slowly expanded our technology stack as new needs have arisen. While much of our webapp data comes from Postgres, we&#x2019;ve begun a process of moving our data layer to a more source-agnostic middleware layer.\n
\n
It&#x2019;s clear for us that a database centric model doesn&#x2019;t work anymore. We can&#x2019;t think of data in concepts of rows and columns. It doesn&#x2019;t work like that. \n\nSo how do we solve this problem?\n
\n
Large web applications don&#x2019;t pursue abstraction as an art form. They pursue it as a necessity. Failing to properly abstract a large web application can result in catastrophic failure. It is therefore important to abstract the layer that gets data from a data storage unit from the layers that use the data.\n\nHere&#x2019;s an example...\n
When programmers are in a hurry they often don&#x2019;t take the time to abstract their code in a way that makes it easy to come along later and make changes. I&#x2019;ve seen this example hundreds of time in codebases I&#x2019;ve worked on; many of you probably have too. But the problem here is that if ever the data source changes from some SQL-based database to something else, a programmer will have to rewrite the logic here and everywhere else all over again. This makes the cost of transition much higher than it has to be.\n
When programmers are in a hurry they often don&#x2019;t take the time to abstract their code in a way that makes it easy to come along later and make changes. I&#x2019;ve seen this example hundreds of time in codebases I&#x2019;ve worked on; many of you probably have too. But the problem here is that if ever the data source changes from some SQL-based database to something else, a programmer will have to rewrite the logic here and everywhere else all over again. This makes the cost of transition much higher than it has to be.\n
It would make good sense to therefore abstract the process of \n
We should instead use adapters to query the data and return it in an agreed upon format. The processing takes place elsewhere.\n
\n
NAP story. Data layer Postgres focused.\n
When the retrieval and processing are combined, it makes it that much harder to remove one from the other in the future.\n
\n
\n
\n
\n
\n
\n
When you think in terms of actions, rather than data sources, you don&#x2019;t care what happens behind the scenes. Instead, you start caring about the finished product. In Socorro, we have reports that use both Hbase and Postgres data. If we cared about the data source, we&#x2019;d have many more calls than we need.\n
\n
If we use JSON as a standard data format throughout our app, we can construct generic objects easily without worrying about what methods are automatically available to us.\n
\n
Rather than relying upon model-constructed or ORM-built objects, we should create our own when and if the need arises. \n
It&#x2019;s okay to process the results from a database query into some standard format or create an object using the data. But once the data has been retrieved, it should be pushed into a standard format that can be used in the app without caring about what the data source was.\n
Developers are drawn to things that are new, cool, or otherwise unique and special. But it&#x2019;s important to use the correct storage medium for development.\n
\n
\n
\n
Socorro uses ElasticSearch (not a NoSQL database) and Hbase. We should have used Cassandra, but we have Hbase instead.\n
\n
External APIs, the file system, all are valid data storage mechanisms. Just because we write database-driven applications doesn&#x2019;t mean our data storage has to be entirely a database. A REST API to an external resource is a valid data storage mechanism, that isn&#x2019;t database-driven (at least as far as your app is concerned).\n
\n
\n
\n

Data Abstraction for Large Web Applications

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Andere mochten auch

Andere mochten auch (20)

Ähnlich wie Data Abstraction for Large Web Applications

Ähnlich wie Data Abstraction for Large Web Applications (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Data Abstraction for Large Web Applications

Hinweis der Redaktion