SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Modern Operational Data
Architecture
Arthur Gimpel, DataZone
About Me
• Name: Arthur Gimpel
• Position: Technology Evangelist, Solutions
Architect, Trainer
• Tech Stack: MongoDB, SQL Server,
Couchbase, Elastic Stack, Redis, Kafka,
Python, .NET
Relational Databases
• First RDBMS was introduced in late 1970s
• Exist in all possible flavors but share one
thing - ACID
• Still dominate the database market
RDBMS In Theory
• Atomicity: All or nothing approach, transactions
• Consistency: Hard state, every transaction
changes the whole DBMS
• Isolation: Transactions cannot interfere with
each other
• Durability: Every transaction is persisted
RDBMS Is Not Perfect
• Everything is persisted, synchronously.
Limited by IO performance
• All data is bound to a tabular schema,
hard to make changes in big databases
• ACID makes horizontal scaling nearly*
impossible
• Complex schema slows down aggregations
and queries drastically
NoSQL
• Distributed / Horizontal Scalability
• Mostly Open Source
• Mostly schema less:
• Key - Value
• Document
• Graph
• Serves specific purposes
NoSQL - Key Value Stores
• Key:
• Usually string, equivalent to primary key in a
relational database
• Value:
• Simple values: Int, Float, DateTime
• Complex values: Array, Binary, XML, JSON
Key Value - Characteristics
• Database is usually a set of unique keys,
and its values
• KV data stores are usually easy to
distribute
• Key Value access usually is VERY fast
• Indexing and querying values is usually
challenging
Key Value - Use Cases
• Distributed caching
• Session / temporary user data
• Ad tech: Impressions
• Ad tech: Serving data - profiles, segments
• Recommendation engines - main data store
NoSQL - Graph Stores
“In computing, a graph database is a database
that uses graph structures for semantic
queries with nodes, edges and properties to
represent and store data” (Wikipedia)
Graph - Characteristics
• Nodes are entities - for example a person
• Properties describe nodes - for example
age, name
• Edges are relations between nodes and/or
properties
Graph - Use Cases
• Fraud detection
• Recommendation engines - link analysis
• Intelligence systems
• Social Networks
• Medical Research
NoSQL - Document Stores
• Document databases usually store JSON
• Used to store object oriented data
• Usually used to avoid relational - object
mismatch
• Document stores have the highest
adoption rate among NoSQL databases
Document Store - Characteristics
• Information is stored in JSON variations
• Some document stores support secondary
indexes for easier querying
• Documents are usually divided to logical
groups (collections, buckets, types -
instead of RDBMS tables)
Document Store - Use Cases
• “Relational” use cases where there is a
need for high scale (volume, velocity,
variety)
• Hierarchal data - aggregations
• Search use cases
NoSQL - Challenges
• Every data store has its purpose. There is
no single solution to all database needs
• NoSQL does not implement all of RDBMS’s
abilities (CDC, Jobs, Stored Procedures,
Triggers)
• Every data store has its own languages,
and APIs. There is no ANSI SQL
Not Only SQL
Polyglot Persistence
Sample Use Cases
• Add search capabilities to your database
• Split session / temporary data processing
to key value stores
• Add Graph analysis capabilities to your
operational database
Search Use Case
Search: Architecture #1
Search: Architecture #2
Architecture Comparison
Architecture #1 Architecture #2
Data distribution
strategy
Data store based Application based
Data distribution
component
Data Pipeline Message Queue
Implementation Team Data Engineers / DevOps DevOps / Developers
Implementation
Complexity
Low: Data pipeline
development
High: data access layer
refactor
Scalability Limited to RDBMS Scale
Fully scalable regardless
of RDBMS
Summary
• Chose the relevant database engine for
the right mission - replacing databases is
not easy
• Do not hesitate to use more than one
database engine in your operational
application, single point of truth will be
created in the analytical stack
• Sizing is no replacement for benchmark.
Check your deployment carefully
DataZone
Advanced Data Solutions
Enterprise
Search
Data Flow
Management
Centralized
Logging
Operational
Analytics
Polyglot
Persistence
Business
Analytics
DataZone
Scale With Confidence
Troubleshooting 

& Tuning
Technological 

Evaluation
Training
Services
Architecture
Review
Cost
Management
End-to-End
Implementations
Infrastructure
Support / DevOps
Our Ecosystem
Keep in touch: contact@DataZone.io

Weitere ähnliche Inhalte

Was ist angesagt?

Elasticsearch { "Meetup" : "talk" }
Elasticsearch { "Meetup" : "talk" }Elasticsearch { "Meetup" : "talk" }
Elasticsearch { "Meetup" : "talk" }Lutf Ur Rehman
 
NoSQL - Not Only SQL
NoSQL - Not Only SQLNoSQL - Not Only SQL
NoSQL - Not Only SQLEasyData
 
SQL, NoSQL, Distributed SQL: Choose your DataStore carefully
SQL, NoSQL, Distributed SQL: Choose your DataStore carefullySQL, NoSQL, Distributed SQL: Choose your DataStore carefully
SQL, NoSQL, Distributed SQL: Choose your DataStore carefullyMd Kamaruzzaman
 
Coming to cassandra from relational world (New)
Coming to cassandra from relational world (New)Coming to cassandra from relational world (New)
Coming to cassandra from relational world (New)Nenad Bozic
 
Demystfying nosql databases
Demystfying nosql databasesDemystfying nosql databases
Demystfying nosql databasesMike King
 
Automate your data flows with Apache NIFI
Automate your data flows with Apache NIFIAutomate your data flows with Apache NIFI
Automate your data flows with Apache NIFIAdam Doyle
 
Sebastian Cohnen – Building a Startup with NoSQL - NoSQL matters Barcelona 2014
Sebastian Cohnen – Building a Startup with NoSQL - NoSQL matters Barcelona 2014Sebastian Cohnen – Building a Startup with NoSQL - NoSQL matters Barcelona 2014
Sebastian Cohnen – Building a Startup with NoSQL - NoSQL matters Barcelona 2014NoSQLmatters
 
REDIS (Remote Dictionary Server)
REDIS (Remote Dictionary Server)REDIS (Remote Dictionary Server)
REDIS (Remote Dictionary Server)Ameya Vijay Gokhale
 
Deven s presentation
Deven s   presentationDeven s   presentation
Deven s presentationdshastri001
 
Continuous Optimization for Distributed BigData Analysis
Continuous Optimization for Distributed BigData AnalysisContinuous Optimization for Distributed BigData Analysis
Continuous Optimization for Distributed BigData AnalysisKai Sasaki
 
UNC Chapel Hill Ctc Retreat 2014 SAS Visual Analytics and Business Intelligence
UNC Chapel Hill Ctc Retreat 2014 SAS Visual Analytics and Business IntelligenceUNC Chapel Hill Ctc Retreat 2014 SAS Visual Analytics and Business Intelligence
UNC Chapel Hill Ctc Retreat 2014 SAS Visual Analytics and Business IntelligenceJonathan Pletzke
 
Data Ingestion Engine
Data Ingestion EngineData Ingestion Engine
Data Ingestion EngineAdam Doyle
 
Application Development with Apache Cassandra as a Service
Application Development with Apache Cassandra as a ServiceApplication Development with Apache Cassandra as a Service
Application Development with Apache Cassandra as a ServiceWSO2
 
HBase from the Trenches - Phoenix Data Conference 2015
HBase from the Trenches - Phoenix Data Conference 2015HBase from the Trenches - Phoenix Data Conference 2015
HBase from the Trenches - Phoenix Data Conference 2015Avinash Ramineni
 
Big Data Architecture For enterprise
Big Data Architecture For enterpriseBig Data Architecture For enterprise
Big Data Architecture For enterpriseWei Zhang
 

Was ist angesagt? (19)

Elasticsearch { "Meetup" : "talk" }
Elasticsearch { "Meetup" : "talk" }Elasticsearch { "Meetup" : "talk" }
Elasticsearch { "Meetup" : "talk" }
 
NoSQL - Not Only SQL
NoSQL - Not Only SQLNoSQL - Not Only SQL
NoSQL - Not Only SQL
 
SQL, NoSQL, Distributed SQL: Choose your DataStore carefully
SQL, NoSQL, Distributed SQL: Choose your DataStore carefullySQL, NoSQL, Distributed SQL: Choose your DataStore carefully
SQL, NoSQL, Distributed SQL: Choose your DataStore carefully
 
Coming to cassandra from relational world (New)
Coming to cassandra from relational world (New)Coming to cassandra from relational world (New)
Coming to cassandra from relational world (New)
 
AWS Database Services
AWS Database ServicesAWS Database Services
AWS Database Services
 
Demystfying nosql databases
Demystfying nosql databasesDemystfying nosql databases
Demystfying nosql databases
 
Automate your data flows with Apache NIFI
Automate your data flows with Apache NIFIAutomate your data flows with Apache NIFI
Automate your data flows with Apache NIFI
 
Sebastian Cohnen – Building a Startup with NoSQL - NoSQL matters Barcelona 2014
Sebastian Cohnen – Building a Startup with NoSQL - NoSQL matters Barcelona 2014Sebastian Cohnen – Building a Startup with NoSQL - NoSQL matters Barcelona 2014
Sebastian Cohnen – Building a Startup with NoSQL - NoSQL matters Barcelona 2014
 
REDIS (Remote Dictionary Server)
REDIS (Remote Dictionary Server)REDIS (Remote Dictionary Server)
REDIS (Remote Dictionary Server)
 
Deven s presentation
Deven s   presentationDeven s   presentation
Deven s presentation
 
Continuous Optimization for Distributed BigData Analysis
Continuous Optimization for Distributed BigData AnalysisContinuous Optimization for Distributed BigData Analysis
Continuous Optimization for Distributed BigData Analysis
 
UNC Chapel Hill Ctc Retreat 2014 SAS Visual Analytics and Business Intelligence
UNC Chapel Hill Ctc Retreat 2014 SAS Visual Analytics and Business IntelligenceUNC Chapel Hill Ctc Retreat 2014 SAS Visual Analytics and Business Intelligence
UNC Chapel Hill Ctc Retreat 2014 SAS Visual Analytics and Business Intelligence
 
Data Ingestion Engine
Data Ingestion EngineData Ingestion Engine
Data Ingestion Engine
 
Application Development with Apache Cassandra as a Service
Application Development with Apache Cassandra as a ServiceApplication Development with Apache Cassandra as a Service
Application Development with Apache Cassandra as a Service
 
HBase from the Trenches - Phoenix Data Conference 2015
HBase from the Trenches - Phoenix Data Conference 2015HBase from the Trenches - Phoenix Data Conference 2015
HBase from the Trenches - Phoenix Data Conference 2015
 
No sql databases
No sql databasesNo sql databases
No sql databases
 
Big Data Architecture For enterprise
Big Data Architecture For enterpriseBig Data Architecture For enterprise
Big Data Architecture For enterprise
 
Koha System Architecture
Koha System ArchitectureKoha System Architecture
Koha System Architecture
 
MySQL 101
MySQL 101MySQL 101
MySQL 101
 

Andere mochten auch

Andere mochten auch (11)

View - intranet portfolio
View - intranet portfolioView - intranet portfolio
View - intranet portfolio
 
Accueil intranet 2016
Accueil intranet 2016Accueil intranet 2016
Accueil intranet 2016
 
Webinar: The Rise of NextGen Intranets: Introducing OneWindow Workplace
Webinar: The Rise of NextGen Intranets: Introducing OneWindow Workplace Webinar: The Rise of NextGen Intranets: Introducing OneWindow Workplace
Webinar: The Rise of NextGen Intranets: Introducing OneWindow Workplace
 
Intranet Governance by Toby Ward, Prescient Digital Media
Intranet Governance by Toby Ward, Prescient Digital Media Intranet Governance by Toby Ward, Prescient Digital Media
Intranet Governance by Toby Ward, Prescient Digital Media
 
Webinar: NextGen Portals: What? Why? When?
Webinar: NextGen Portals: What? Why? When?Webinar: NextGen Portals: What? Why? When?
Webinar: NextGen Portals: What? Why? When?
 
Webinar: Ignite Your Intranet with OneWindow Workplace
Webinar: Ignite Your Intranet with OneWindow WorkplaceWebinar: Ignite Your Intranet with OneWindow Workplace
Webinar: Ignite Your Intranet with OneWindow Workplace
 
Beyond Intranets -Enabling a Digital Workplace
Beyond Intranets -Enabling a Digital WorkplaceBeyond Intranets -Enabling a Digital Workplace
Beyond Intranets -Enabling a Digital Workplace
 
From Intranets to the Digital Workplace - how far have we really come so far?
From Intranets to the Digital Workplace - how far have we really come so far?From Intranets to the Digital Workplace - how far have we really come so far?
From Intranets to the Digital Workplace - how far have we really come so far?
 
Secrets of successful SharePoint Intranets
Secrets of successful SharePoint IntranetsSecrets of successful SharePoint Intranets
Secrets of successful SharePoint Intranets
 
What a modern intranet home page looks like
What a modern intranet home page looks likeWhat a modern intranet home page looks like
What a modern intranet home page looks like
 
Key Digital Trends for 2017
Key Digital Trends for 2017Key Digital Trends for 2017
Key Digital Trends for 2017
 

Ähnlich wie Modern Operational Data Architecture

UNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxUNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxRahul Borate
 
UNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxUNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxRahul Borate
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Databasenehabsairam
 
Sql vs NoSQL
Sql vs NoSQLSql vs NoSQL
Sql vs NoSQLRTigger
 
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
 How to use Big Data and Data Lake concept in business using Hadoop and Spark... How to use Big Data and Data Lake concept in business using Hadoop and Spark...
How to use Big Data and Data Lake concept in business using Hadoop and Spark...Institute of Contemporary Sciences
 
No SQL- The Future Of Data Storage
No SQL- The Future Of Data StorageNo SQL- The Future Of Data Storage
No SQL- The Future Of Data StorageBethmi Gunasekara
 
Dropping ACID: Wrapping Your Mind Around NoSQL Databases
Dropping ACID: Wrapping Your Mind Around NoSQL DatabasesDropping ACID: Wrapping Your Mind Around NoSQL Databases
Dropping ACID: Wrapping Your Mind Around NoSQL DatabasesKyle Banerjee
 
Comparative study of modern databases
Comparative study of modern databasesComparative study of modern databases
Comparative study of modern databasesAnirban Konar
 
Non-Relational Databases at ACCU2011
Non-Relational Databases at ACCU2011Non-Relational Databases at ACCU2011
Non-Relational Databases at ACCU2011Gavin Heavyside
 
SQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureSQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureVenu Anuganti
 
Scaling the Web: Databases & NoSQL
Scaling the Web: Databases & NoSQLScaling the Web: Databases & NoSQL
Scaling the Web: Databases & NoSQLRichard Schneeman
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology LandscapeShivanandaVSeeri
 
Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5Mike King
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabasesAdi Challa
 

Ähnlich wie Modern Operational Data Architecture (20)

NOsql Presentation.pdf
NOsql Presentation.pdfNOsql Presentation.pdf
NOsql Presentation.pdf
 
UNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxUNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptx
 
NoSQL.pptx
NoSQL.pptxNoSQL.pptx
NoSQL.pptx
 
UNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxUNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptx
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Database
 
Database Technologies
Database TechnologiesDatabase Technologies
Database Technologies
 
Sql vs NoSQL
Sql vs NoSQLSql vs NoSQL
Sql vs NoSQL
 
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
 How to use Big Data and Data Lake concept in business using Hadoop and Spark... How to use Big Data and Data Lake concept in business using Hadoop and Spark...
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
Revision
RevisionRevision
Revision
 
No SQL- The Future Of Data Storage
No SQL- The Future Of Data StorageNo SQL- The Future Of Data Storage
No SQL- The Future Of Data Storage
 
Dropping ACID: Wrapping Your Mind Around NoSQL Databases
Dropping ACID: Wrapping Your Mind Around NoSQL DatabasesDropping ACID: Wrapping Your Mind Around NoSQL Databases
Dropping ACID: Wrapping Your Mind Around NoSQL Databases
 
Comparative study of modern databases
Comparative study of modern databasesComparative study of modern databases
Comparative study of modern databases
 
Non-Relational Databases at ACCU2011
Non-Relational Databases at ACCU2011Non-Relational Databases at ACCU2011
Non-Relational Databases at ACCU2011
 
SQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureSQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data Architecture
 
Scaling the Web: Databases & NoSQL
Scaling the Web: Databases & NoSQLScaling the Web: Databases & NoSQL
Scaling the Web: Databases & NoSQL
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology Landscape
 
Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
Nosql data models
Nosql data modelsNosql data models
Nosql data models
 

Kürzlich hochgeladen

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 

Kürzlich hochgeladen (20)

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

Modern Operational Data Architecture

  • 2. About Me • Name: Arthur Gimpel • Position: Technology Evangelist, Solutions Architect, Trainer • Tech Stack: MongoDB, SQL Server, Couchbase, Elastic Stack, Redis, Kafka, Python, .NET
  • 3. Relational Databases • First RDBMS was introduced in late 1970s • Exist in all possible flavors but share one thing - ACID • Still dominate the database market
  • 4. RDBMS In Theory • Atomicity: All or nothing approach, transactions • Consistency: Hard state, every transaction changes the whole DBMS • Isolation: Transactions cannot interfere with each other • Durability: Every transaction is persisted
  • 5. RDBMS Is Not Perfect • Everything is persisted, synchronously. Limited by IO performance • All data is bound to a tabular schema, hard to make changes in big databases • ACID makes horizontal scaling nearly* impossible • Complex schema slows down aggregations and queries drastically
  • 6. NoSQL • Distributed / Horizontal Scalability • Mostly Open Source • Mostly schema less: • Key - Value • Document • Graph • Serves specific purposes
  • 7. NoSQL - Key Value Stores • Key: • Usually string, equivalent to primary key in a relational database • Value: • Simple values: Int, Float, DateTime • Complex values: Array, Binary, XML, JSON
  • 8. Key Value - Characteristics • Database is usually a set of unique keys, and its values • KV data stores are usually easy to distribute • Key Value access usually is VERY fast • Indexing and querying values is usually challenging
  • 9. Key Value - Use Cases • Distributed caching • Session / temporary user data • Ad tech: Impressions • Ad tech: Serving data - profiles, segments • Recommendation engines - main data store
  • 10. NoSQL - Graph Stores “In computing, a graph database is a database that uses graph structures for semantic queries with nodes, edges and properties to represent and store data” (Wikipedia)
  • 11. Graph - Characteristics • Nodes are entities - for example a person • Properties describe nodes - for example age, name • Edges are relations between nodes and/or properties
  • 12. Graph - Use Cases • Fraud detection • Recommendation engines - link analysis • Intelligence systems • Social Networks • Medical Research
  • 13. NoSQL - Document Stores • Document databases usually store JSON • Used to store object oriented data • Usually used to avoid relational - object mismatch • Document stores have the highest adoption rate among NoSQL databases
  • 14. Document Store - Characteristics • Information is stored in JSON variations • Some document stores support secondary indexes for easier querying • Documents are usually divided to logical groups (collections, buckets, types - instead of RDBMS tables)
  • 15. Document Store - Use Cases • “Relational” use cases where there is a need for high scale (volume, velocity, variety) • Hierarchal data - aggregations • Search use cases
  • 16. NoSQL - Challenges • Every data store has its purpose. There is no single solution to all database needs • NoSQL does not implement all of RDBMS’s abilities (CDC, Jobs, Stored Procedures, Triggers) • Every data store has its own languages, and APIs. There is no ANSI SQL
  • 18. Polyglot Persistence Sample Use Cases • Add search capabilities to your database • Split session / temporary data processing to key value stores • Add Graph analysis capabilities to your operational database
  • 22. Architecture Comparison Architecture #1 Architecture #2 Data distribution strategy Data store based Application based Data distribution component Data Pipeline Message Queue Implementation Team Data Engineers / DevOps DevOps / Developers Implementation Complexity Low: Data pipeline development High: data access layer refactor Scalability Limited to RDBMS Scale Fully scalable regardless of RDBMS
  • 23. Summary • Chose the relevant database engine for the right mission - replacing databases is not easy • Do not hesitate to use more than one database engine in your operational application, single point of truth will be created in the analytical stack • Sizing is no replacement for benchmark. Check your deployment carefully
  • 24. DataZone Advanced Data Solutions Enterprise Search Data Flow Management Centralized Logging Operational Analytics Polyglot Persistence Business Analytics
  • 25. DataZone Scale With Confidence Troubleshooting 
 & Tuning Technological 
 Evaluation Training Services Architecture Review Cost Management End-to-End Implementations Infrastructure Support / DevOps
  • 27. Keep in touch: contact@DataZone.io