SlideShare ist ein Scribd-Unternehmen logo
1 von 20
IN-MEMORY BI
Opensource friends
2
AGENDA
1. Opensource
2. In-memory solutions
3. BI definition
4. Components
5. In-memory databases
6. Data visualization
7. Conclusion
8. Q&A
3
AGENDA
Open code
• Core code open
• Enterprise features closed
Community editions
• All code closed
• Reach features are opened
with limitations (user coun,
database size)
• Enterprise features closed
4
1. Object grids
• java/.net objects in memory
• access via programming language
2. Databases
• tables
• access via sql/nosql
IN-MEMORY SOLUTIONS
5
6
IN-MEMORY DATABASES
1. сache-intended
2. analytical
7
TARANTOOL
1. Intended for cache
2. Opensourced core
3. SQL support in enterprise edition
4. Customers: Megafon, Yota, Avito
8
APACHE IGNITE
1. Intended as operational database
2. Opensourced core
3. SQL support
4. Customers: Sberbank, Barclays
9
ARENADATA
1. Intended as analytical solution
2. Opensourced core
3. SQL support
4. Customers: Gazpromneft
10
ARENADATA
11
EXASOL
1. Intended as analytical solution
2. Community edition: 1 Tb data single node
3. Full SQL support
4. Customers: Badoo
12
EXASOL
Pros
1. ETL efforts are intact mostly
2. BI tools efforts are intact mostly
3. Decent data volume in community edition
4. Perfect SQL support
5. 4-5 times faster than Oracle
Cons
1. Unclear pricing policy
2. Community support is slow
13
DATA VISUALIZATION
1. Sql-driven
2. Meta-data driven
14
PENTAHO REPORTING
1. Metadata-driven
2. Opensourced core
3. Simple metadata layer
15
SAIKU BI
1. Metadata-driven
2. Opensourced core
3. Sophisticated metadata layer
16
APACHE ZEPPELIN
1. Sql+java driven
2. Opensourced core
17
REPORTING SERVER
1. Sql+metadata+groovy driven
2. Opensourced core
3. Unifies a lot of opensource reporting tools in 1 bundle
4. All reporting under security rules
18
REPORTING SERVER
Pros
1. Very easy setup
2. Default capabilities are good
3. Extremely competitive enterprise edition price
Cons
1. Not sophisticated metadata layer
19
CONCLUSION
1. Full opensource in-memory BI is possible
2. Recommendations:
• Community edition of in-memory database
• Opensource/enterprise edition of visualization toolkit
THANK YOU
questions
20

Weitere ähnliche Inhalte

Was ist angesagt?

Membase Meetup 2010
Membase Meetup 2010Membase Meetup 2010
Membase Meetup 2010
Membase
 

Was ist angesagt? (20)

A Technical Introduction to WiredTiger
A Technical Introduction to WiredTigerA Technical Introduction to WiredTiger
A Technical Introduction to WiredTiger
 
JSSUG: SQL Sever Performance Tuning
JSSUG: SQL Sever Performance TuningJSSUG: SQL Sever Performance Tuning
JSSUG: SQL Sever Performance Tuning
 
Let the Tiger Roar - MongoDB 3.0
Let the Tiger Roar - MongoDB 3.0Let the Tiger Roar - MongoDB 3.0
Let the Tiger Roar - MongoDB 3.0
 
MySQL 5.7 New Features for Developers
MySQL 5.7 New Features for DevelopersMySQL 5.7 New Features for Developers
MySQL 5.7 New Features for Developers
 
MongoDB Capacity Planning
MongoDB Capacity PlanningMongoDB Capacity Planning
MongoDB Capacity Planning
 
Sql server 2016 it just runs faster sql bits 2017 edition
Sql server 2016 it just runs faster   sql bits 2017 editionSql server 2016 it just runs faster   sql bits 2017 edition
Sql server 2016 it just runs faster sql bits 2017 edition
 
Connecting Hadoop and Oracle
Connecting Hadoop and OracleConnecting Hadoop and Oracle
Connecting Hadoop and Oracle
 
Hardware Provisioning
Hardware ProvisioningHardware Provisioning
Hardware Provisioning
 
Membase Meetup 2010
Membase Meetup 2010Membase Meetup 2010
Membase Meetup 2010
 
MongoDB Pros and Cons
MongoDB Pros and ConsMongoDB Pros and Cons
MongoDB Pros and Cons
 
FOSSASIA 2016 - 7 Tips to design web centric high-performance applications
FOSSASIA 2016 - 7 Tips to design web centric high-performance applicationsFOSSASIA 2016 - 7 Tips to design web centric high-performance applications
FOSSASIA 2016 - 7 Tips to design web centric high-performance applications
 
Exploring the replication in MongoDB
Exploring the replication in MongoDBExploring the replication in MongoDB
Exploring the replication in MongoDB
 
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.
 
3 scenarios when to use MongoDB!
3 scenarios when to use MongoDB!3 scenarios when to use MongoDB!
3 scenarios when to use MongoDB!
 
Oracle Database Performance Tuning Advanced Features and Best Practices for DBAs
Oracle Database Performance Tuning Advanced Features and Best Practices for DBAsOracle Database Performance Tuning Advanced Features and Best Practices for DBAs
Oracle Database Performance Tuning Advanced Features and Best Practices for DBAs
 
MongoDB Aggregation Performance
MongoDB Aggregation PerformanceMongoDB Aggregation Performance
MongoDB Aggregation Performance
 
Oracle result cache highload 2017
Oracle result cache highload 2017Oracle result cache highload 2017
Oracle result cache highload 2017
 
In-memory Databases
In-memory DatabasesIn-memory Databases
In-memory Databases
 
Why PostgreSQL for Analytics Infrastructure (DW)?
Why PostgreSQL for Analytics Infrastructure (DW)?Why PostgreSQL for Analytics Infrastructure (DW)?
Why PostgreSQL for Analytics Infrastructure (DW)?
 
WiredTiger Overview
WiredTiger OverviewWiredTiger Overview
WiredTiger Overview
 

Ähnlich wie Inmemory BI based on opensource stack

Improving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch FixImproving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch Fix
Stitch Fix Algorithms
 
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
ACTUONDA
 

Ähnlich wie Inmemory BI based on opensource stack (20)

Open Data Node - Platform and Methodology - 2015-May
Open Data Node - Platform and Methodology - 2015-MayOpen Data Node - Platform and Methodology - 2015-May
Open Data Node - Platform and Methodology - 2015-May
 
SAP S/4 Hana:Key User Extensibility Overview
SAP S/4 Hana:Key User Extensibility OverviewSAP S/4 Hana:Key User Extensibility Overview
SAP S/4 Hana:Key User Extensibility Overview
 
ODN - Technical introduction of the platform
ODN - Technical introduction of the platformODN - Technical introduction of the platform
ODN - Technical introduction of the platform
 
Improving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch FixImproving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch Fix
 
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
 
Net Beans Jasig Jun2006
Net Beans Jasig Jun2006Net Beans Jasig Jun2006
Net Beans Jasig Jun2006
 
An Enterprise Analytics Platform with Jupyter Notebooks and Apache Spark
An Enterprise Analytics Platform with Jupyter Notebooks and Apache SparkAn Enterprise Analytics Platform with Jupyter Notebooks and Apache Spark
An Enterprise Analytics Platform with Jupyter Notebooks and Apache Spark
 
MySQL :What's New #GIDS16
MySQL :What's New #GIDS16MySQL :What's New #GIDS16
MySQL :What's New #GIDS16
 
Maria_Colgan_2.pdf
Maria_Colgan_2.pdfMaria_Colgan_2.pdf
Maria_Colgan_2.pdf
 
Oasis – data analysis platform for enterprise
Oasis – data analysis platform for enterpriseOasis – data analysis platform for enterprise
Oasis – data analysis platform for enterprise
 
SODA Framework Projects 25 Sep 2022 v1.pptx
SODA Framework Projects 25 Sep 2022 v1.pptxSODA Framework Projects 25 Sep 2022 v1.pptx
SODA Framework Projects 25 Sep 2022 v1.pptx
 
Moodle performance optimizations
Moodle performance optimizationsMoodle performance optimizations
Moodle performance optimizations
 
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a... The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 
Geode Meetup Apachecon
Geode Meetup ApacheconGeode Meetup Apachecon
Geode Meetup Apachecon
 
2015: Whats New in MySQL 5.7, At Oracle Open World, November 3rd, 2015
2015: Whats New in MySQL 5.7, At Oracle Open World, November 3rd, 2015 2015: Whats New in MySQL 5.7, At Oracle Open World, November 3rd, 2015
2015: Whats New in MySQL 5.7, At Oracle Open World, November 3rd, 2015
 
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
 
Monitoring IO performance with iostat and pt-diskstats
Monitoring IO performance with iostat and pt-diskstatsMonitoring IO performance with iostat and pt-diskstats
Monitoring IO performance with iostat and pt-diskstats
 
Sitecore user group mumbai sitecore commerce extension
Sitecore user group mumbai  sitecore commerce extensionSitecore user group mumbai  sitecore commerce extension
Sitecore user group mumbai sitecore commerce extension
 
The Complete MariaDB Server tutorial
The Complete MariaDB Server tutorialThe Complete MariaDB Server tutorial
The Complete MariaDB Server tutorial
 
OpenPOWER Foundation Overview
OpenPOWER Foundation OverviewOpenPOWER Foundation Overview
OpenPOWER Foundation Overview
 

Mehr von Alexander Tokarev

Mehr von Alexander Tokarev (18)

Rate limits and all about
Rate limits and all aboutRate limits and all about
Rate limits and all about
 
rnd teams.pptx
rnd teams.pptxrnd teams.pptx
rnd teams.pptx
 
FinOps for private cloud
FinOps for private cloudFinOps for private cloud
FinOps for private cloud
 
Graph ql and enterprise
Graph ql and enterpriseGraph ql and enterprise
Graph ql and enterprise
 
FinOps introduction
FinOps introductionFinOps introduction
FinOps introduction
 
Open Policy Agent for governance as a code
Open Policy Agent for governance as a code Open Policy Agent for governance as a code
Open Policy Agent for governance as a code
 
Relational databases for BigData
Relational databases for BigDataRelational databases for BigData
Relational databases for BigData
 
Cloud DWH deep dive
Cloud DWH deep diveCloud DWH deep dive
Cloud DWH deep dive
 
Cloud dwh
Cloud dwhCloud dwh
Cloud dwh
 
Row Level Security in databases advanced edition
Row Level Security in databases advanced editionRow Level Security in databases advanced edition
Row Level Security in databases advanced edition
 
Tagging search solution design Advanced edition
Tagging search solution design Advanced editionTagging search solution design Advanced edition
Tagging search solution design Advanced edition
 
Tagging search solution design
Tagging search solution designTagging search solution design
Tagging search solution design
 
Oracle JSON internals advanced edition
Oracle JSON internals advanced editionOracle JSON internals advanced edition
Oracle JSON internals advanced edition
 
Oracle Result Cache deep dive
Oracle Result Cache deep diveOracle Result Cache deep dive
Oracle Result Cache deep dive
 
Oracle json caveats
Oracle json caveatsOracle json caveats
Oracle json caveats
 
Apache Solr for begginers
Apache Solr for begginersApache Solr for begginers
Apache Solr for begginers
 
Data structures for cloud tag storage
Data structures for cloud tag storageData structures for cloud tag storage
Data structures for cloud tag storage
 
Oracle High Availabiltity for application developers
Oracle High Availabiltity for application developersOracle High Availabiltity for application developers
Oracle High Availabiltity for application developers
 

Kürzlich hochgeladen

AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
Alluxio, Inc.
 

Kürzlich hochgeladen (20)

SQL Injection Introduction and Prevention
SQL Injection Introduction and PreventionSQL Injection Introduction and Prevention
SQL Injection Introduction and Prevention
 
AI/ML Infra Meetup | ML explainability in Michelangelo
AI/ML Infra Meetup | ML explainability in MichelangeloAI/ML Infra Meetup | ML explainability in Michelangelo
AI/ML Infra Meetup | ML explainability in Michelangelo
 
Workforce Efficiency with Employee Time Tracking Software.pdf
Workforce Efficiency with Employee Time Tracking Software.pdfWorkforce Efficiency with Employee Time Tracking Software.pdf
Workforce Efficiency with Employee Time Tracking Software.pdf
 
AI Hackathon.pptx
AI                        Hackathon.pptxAI                        Hackathon.pptx
AI Hackathon.pptx
 
Crafting the Perfect Measurement Sheet with PLM Integration
Crafting the Perfect Measurement Sheet with PLM IntegrationCrafting the Perfect Measurement Sheet with PLM Integration
Crafting the Perfect Measurement Sheet with PLM Integration
 
AI/ML Infra Meetup | Perspective on Deep Learning Framework
AI/ML Infra Meetup | Perspective on Deep Learning FrameworkAI/ML Infra Meetup | Perspective on Deep Learning Framework
AI/ML Infra Meetup | Perspective on Deep Learning Framework
 
IT Software Development Resume, Vaibhav jha 2024
IT Software Development Resume, Vaibhav jha 2024IT Software Development Resume, Vaibhav jha 2024
IT Software Development Resume, Vaibhav jha 2024
 
Top Mobile App Development Companies 2024
Top Mobile App Development Companies 2024Top Mobile App Development Companies 2024
Top Mobile App Development Companies 2024
 
INGKA DIGITAL: Linked Metadata by Design
INGKA DIGITAL: Linked Metadata by DesignINGKA DIGITAL: Linked Metadata by Design
INGKA DIGITAL: Linked Metadata by Design
 
OpenChain @ LF Japan Executive Briefing - May 2024
OpenChain @ LF Japan Executive Briefing - May 2024OpenChain @ LF Japan Executive Briefing - May 2024
OpenChain @ LF Japan Executive Briefing - May 2024
 
Tree in the Forest - Managing Details in BDD Scenarios (live2test 2024)
Tree in the Forest - Managing Details in BDD Scenarios (live2test 2024)Tree in the Forest - Managing Details in BDD Scenarios (live2test 2024)
Tree in the Forest - Managing Details in BDD Scenarios (live2test 2024)
 
AI/ML Infra Meetup | Reducing Prefill for LLM Serving in RAG
AI/ML Infra Meetup | Reducing Prefill for LLM Serving in RAGAI/ML Infra Meetup | Reducing Prefill for LLM Serving in RAG
AI/ML Infra Meetup | Reducing Prefill for LLM Serving in RAG
 
StrimziCon 2024 - Transition to Apache Kafka on Kubernetes with Strimzi.pdf
StrimziCon 2024 - Transition to Apache Kafka on Kubernetes with Strimzi.pdfStrimziCon 2024 - Transition to Apache Kafka on Kubernetes with Strimzi.pdf
StrimziCon 2024 - Transition to Apache Kafka on Kubernetes with Strimzi.pdf
 
KLARNA - Language Models and Knowledge Graphs: A Systems Approach
KLARNA -  Language Models and Knowledge Graphs: A Systems ApproachKLARNA -  Language Models and Knowledge Graphs: A Systems Approach
KLARNA - Language Models and Knowledge Graphs: A Systems Approach
 
How to pick right visual testing tool.pdf
How to pick right visual testing tool.pdfHow to pick right visual testing tool.pdf
How to pick right visual testing tool.pdf
 
Entropy, Software Quality, and Innovation (presented at Princeton Plasma Phys...
Entropy, Software Quality, and Innovation (presented at Princeton Plasma Phys...Entropy, Software Quality, and Innovation (presented at Princeton Plasma Phys...
Entropy, Software Quality, and Innovation (presented at Princeton Plasma Phys...
 
A Guideline to Zendesk to Re:amaze Data Migration
A Guideline to Zendesk to Re:amaze Data MigrationA Guideline to Zendesk to Re:amaze Data Migration
A Guideline to Zendesk to Re:amaze Data Migration
 
What need to be mastered as AI-Powered Java Developers
What need to be mastered as AI-Powered Java DevelopersWhat need to be mastered as AI-Powered Java Developers
What need to be mastered as AI-Powered Java Developers
 
CompTIA Security+ (Study Notes) for cs.pdf
CompTIA Security+ (Study Notes) for cs.pdfCompTIA Security+ (Study Notes) for cs.pdf
CompTIA Security+ (Study Notes) for cs.pdf
 
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
 

Inmemory BI based on opensource stack

Hinweis der Redaktion

  1. Так как мы говорим о таких вещах как BI, то опенсурса в них мало, преобладает Community editions, которые не так и плохи. Самые яркие представители – это аналитические базы данных Vertica и Exasol. Я знаю много больших контор, зарабатывающих миллионы долларов, но использующих Community edition. Хорошо это или нет это очень спорно, но так есть.
  2. Разработка мейл.ру Ядро полностью опенсурсное, энтерпрайзные фичи а-ля репликации закрыты Чисто для кэша, sql якобы есть, но я его не видел Заказчиков в России весьма много
  3. Эта бд планировалась как замена операционных баз данных как оракл и мсскул, ядро опенсурсно, энтерпрайзные фичи нет. Есть более менее адекватный sql, но я бы не сказал, что он очень богат как нормальные БД. Ключевой заказчик в России – сбер. Они хотели им заменить процессинг и оракл, но по-факту ничего не вышло.
  4. Есть интересное решение, где Apache Ignite стоит как кэш перед mpp базой данных Greenplum. Это opensource дистрибутив для обработки данных. Как всегда всё, что интересно enterprise отдельно.
  5. Для заливки данных у них есть Apache Nifi, далее тот самый Ignite, потом база данных Greenplum, потом хадуп. Но для малых предприятий это весьма тяжеловесно.
  6. Очень классная аналитическая база данных, заточенная под inmemory. Программировалась как inmemory убийца оракла, поэтому очень похож синтаксис и возможности. Можно хранить до 1 Тб данных, это примерно 3-4 несжатых.
  7. Очень классная аналитическая база данных, заточенная под inmemory. Программировалась как inmemory убийца оракла, поэтому очень похож синтаксис и возможности. Можно хранить до 1 Тб данных, это примерно 3-4 несжатых.
  8. Sql просты для первичной настройки и быстры Metadata driven пишут sql сами, но позволяют бизнес-пользователям строить более сложные отчёты
  9. Разработка мейл.ру Ядро полностью опенсурсное, энтерпрайзные фичи а-ля репликации закрыты Чисто для кэша, sql якобы есть, но я его не видел Заказчиков в России весьма много
  10. Разработка мейл.ру Ядро полностью опенсурсное, энтерпрайзные фичи а-ля репликации закрыты Чисто для кэша, sql якобы есть, но я его не видел Заказчиков в России весьма много
  11. Разработка мейл.ру Ядро полностью опенсурсное, энтерпрайзные фичи а-ля репликации закрыты Чисто для кэша, sql якобы есть, но я его не видел Заказчиков в России весьма много
  12. По-факту это дистрибутив различных опенсурс решений, объединённых 1 UI с общей security + свой очень легковесный движёк отчётности
  13. Очень классная аналитическая база данных, заточенная под inmemory. Программировалась как inmemory убийца оракла, поэтому очень похож синтаксис и возможности. Можно хранить до 1 Тб данных, это примерно 3-4 несжатых.
  14. Разработка мейл.ру Ядро полностью опенсурсное, энтерпрайзные фичи а-ля репликации закрыты Чисто для кэша, sql якобы есть, но я его не видел Заказчиков в России весьма много