Suche senden
Hochladen
Apache Hive ACID Project
âą
Als PPTX, PDF herunterladen
âą
4 gefÀllt mir
âą
2,692 views
DataWorks Summit/Hadoop Summit
Folgen
Apache Hive ACID Project
Weniger lesen
Mehr lesen
Technologie
Melden
Teilen
Melden
Teilen
1 von 19
Jetzt herunterladen
Empfohlen
Hive acid and_2.x new_features
Hive acid and_2.x new_features
Alberto Romero
Â
Major advancements in Apache Hive towards full support of SQL compliance
Major advancements in Apache Hive towards full support of SQL compliance
DataWorks Summit/Hadoop Summit
Â
Hive Does ACID
Hive Does ACID
DataWorks Summit
Â
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
DataWorks Summit/Hadoop Summit
Â
Streaming SQL with Apache Calcite
Streaming SQL with Apache Calcite
Julian Hyde
Â
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
DataWorks Summit/Hadoop Summit
Â
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
gluent.
Â
Transactional operations in Apache Hive: present and future
Transactional operations in Apache Hive: present and future
DataWorks Summit
Â
Empfohlen
Hive acid and_2.x new_features
Hive acid and_2.x new_features
Alberto Romero
Â
Major advancements in Apache Hive towards full support of SQL compliance
Major advancements in Apache Hive towards full support of SQL compliance
DataWorks Summit/Hadoop Summit
Â
Hive Does ACID
Hive Does ACID
DataWorks Summit
Â
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
DataWorks Summit/Hadoop Summit
Â
Streaming SQL with Apache Calcite
Streaming SQL with Apache Calcite
Julian Hyde
Â
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
DataWorks Summit/Hadoop Summit
Â
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
gluent.
Â
Transactional operations in Apache Hive: present and future
Transactional operations in Apache Hive: present and future
DataWorks Summit
Â
LLAP: Building Cloud First BI
LLAP: Building Cloud First BI
DataWorks Summit
Â
Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
Â
From Device to Data Center to Insights
From Device to Data Center to Insights
DataWorks Summit/Hadoop Summit
Â
Apache Hive on ACID
Apache Hive on ACID
DataWorks Summit/Hadoop Summit
Â
A TPC Benchmark of Hive LLAP and Comparison with Presto
A TPC Benchmark of Hive LLAP and Comparison with Presto
Yu Liu
Â
Llap: Locality is Dead
Llap: Locality is Dead
t3rmin4t0r
Â
An Overview on Optimization in Apache Hive: Past, Present Future
An Overview on Optimization in Apache Hive: Past, Present Future
DataWorks Summit/Hadoop Summit
Â
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
DataWorks Summit/Hadoop Summit
Â
Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for ...
Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for ...
DataWorks Summit
Â
ORC File - Optimizing Your Big Data
ORC File - Optimizing Your Big Data
DataWorks Summit
Â
Hive: Loading Data
Hive: Loading Data
Benjamin Leonhardi
Â
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
DataWorks Summit/Hadoop Summit
Â
Transactional SQL in Apache Hive
Transactional SQL in Apache Hive
DataWorks Summit
Â
Hive acid-updates-summit-sjc-2014
Hive acid-updates-summit-sjc-2014
alanfgates
Â
High throughput data replication over RAFT
High throughput data replication over RAFT
DataWorks Summit
Â
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
DataWorks Summit/Hadoop Summit
Â
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
DataWorks Summit/Hadoop Summit
Â
Data organization: hive meetup
Data organization: hive meetup
t3rmin4t0r
Â
Hive - 1455: Cloud Storage
Hive - 1455: Cloud Storage
Hortonworks
Â
The Heterogeneous Data lake
The Heterogeneous Data lake
DataWorks Summit/Hadoop Summit
Â
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
DataWorks Summit
Â
Hive acid-updates-strata-sjc-feb-2015
Hive acid-updates-strata-sjc-feb-2015
alanfgates
Â
Weitere Àhnliche Inhalte
Was ist angesagt?
LLAP: Building Cloud First BI
LLAP: Building Cloud First BI
DataWorks Summit
Â
Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
Â
From Device to Data Center to Insights
From Device to Data Center to Insights
DataWorks Summit/Hadoop Summit
Â
Apache Hive on ACID
Apache Hive on ACID
DataWorks Summit/Hadoop Summit
Â
A TPC Benchmark of Hive LLAP and Comparison with Presto
A TPC Benchmark of Hive LLAP and Comparison with Presto
Yu Liu
Â
Llap: Locality is Dead
Llap: Locality is Dead
t3rmin4t0r
Â
An Overview on Optimization in Apache Hive: Past, Present Future
An Overview on Optimization in Apache Hive: Past, Present Future
DataWorks Summit/Hadoop Summit
Â
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
DataWorks Summit/Hadoop Summit
Â
Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for ...
Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for ...
DataWorks Summit
Â
ORC File - Optimizing Your Big Data
ORC File - Optimizing Your Big Data
DataWorks Summit
Â
Hive: Loading Data
Hive: Loading Data
Benjamin Leonhardi
Â
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
DataWorks Summit/Hadoop Summit
Â
Transactional SQL in Apache Hive
Transactional SQL in Apache Hive
DataWorks Summit
Â
Hive acid-updates-summit-sjc-2014
Hive acid-updates-summit-sjc-2014
alanfgates
Â
High throughput data replication over RAFT
High throughput data replication over RAFT
DataWorks Summit
Â
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
DataWorks Summit/Hadoop Summit
Â
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
DataWorks Summit/Hadoop Summit
Â
Data organization: hive meetup
Data organization: hive meetup
t3rmin4t0r
Â
Hive - 1455: Cloud Storage
Hive - 1455: Cloud Storage
Hortonworks
Â
The Heterogeneous Data lake
The Heterogeneous Data lake
DataWorks Summit/Hadoop Summit
Â
Was ist angesagt?
(20)
LLAP: Building Cloud First BI
LLAP: Building Cloud First BI
Â
Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem
Â
From Device to Data Center to Insights
From Device to Data Center to Insights
Â
Apache Hive on ACID
Apache Hive on ACID
Â
A TPC Benchmark of Hive LLAP and Comparison with Presto
A TPC Benchmark of Hive LLAP and Comparison with Presto
Â
Llap: Locality is Dead
Llap: Locality is Dead
Â
An Overview on Optimization in Apache Hive: Past, Present Future
An Overview on Optimization in Apache Hive: Past, Present Future
Â
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
Â
Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for ...
Comparative Performance Analysis of AWS EC2 Instance Types Commonly Used for ...
Â
ORC File - Optimizing Your Big Data
ORC File - Optimizing Your Big Data
Â
Hive: Loading Data
Hive: Loading Data
Â
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
Â
Transactional SQL in Apache Hive
Transactional SQL in Apache Hive
Â
Hive acid-updates-summit-sjc-2014
Hive acid-updates-summit-sjc-2014
Â
High throughput data replication over RAFT
High throughput data replication over RAFT
Â
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
Â
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
Â
Data organization: hive meetup
Data organization: hive meetup
Â
Hive - 1455: Cloud Storage
Hive - 1455: Cloud Storage
Â
The Heterogeneous Data lake
The Heterogeneous Data lake
Â
Andere mochten auch
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
DataWorks Summit
Â
Hive acid-updates-strata-sjc-feb-2015
Hive acid-updates-strata-sjc-feb-2015
alanfgates
Â
Toward Better Multi-Tenancy Support from HDFS
Toward Better Multi-Tenancy Support from HDFS
DataWorks Summit/Hadoop Summit
Â
Machine Learning for Any Size of Data, Any Type of Data
Machine Learning for Any Size of Data, Any Type of Data
DataWorks Summit/Hadoop Summit
Â
A New "Sparkitecture" for modernizing your data warehouse
A New "Sparkitecture" for modernizing your data warehouse
DataWorks Summit/Hadoop Summit
Â
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
DataWorks Summit/Hadoop Summit
Â
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
DataWorks Summit/Hadoop Summit
Â
Keep your Hadoop Cluster at its Best
Keep your Hadoop Cluster at its Best
DataWorks Summit/Hadoop Summit
Â
From Zero to Data Flow in Hours with Apache NiFi
From Zero to Data Flow in Hours with Apache NiFi
DataWorks Summit/Hadoop Summit
Â
Producing Spark on YARN for ETL
Producing Spark on YARN for ETL
DataWorks Summit/Hadoop Summit
Â
File Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & Parquet
DataWorks Summit/Hadoop Summit
Â
How to build a successful Data Lake
How to build a successful Data Lake
DataWorks Summit/Hadoop Summit
Â
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
DataWorks Summit/Hadoop Summit
Â
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
DataWorks Summit
Â
Apache HBase: State of the Union
Apache HBase: State of the Union
DataWorks Summit/Hadoop Summit
Â
Extreme Analytics @ eBay
Extreme Analytics @ eBay
DataWorks Summit/Hadoop Summit
Â
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
DataWorks Summit/Hadoop Summit
Â
Intro to Spark with Zeppelin Crash Course Hadoop Summit SJ
Intro to Spark with Zeppelin Crash Course Hadoop Summit SJ
Daniel Madrigal
Â
Simplified Cluster Operation & Troubleshooting
Simplified Cluster Operation & Troubleshooting
DataWorks Summit/Hadoop Summit
Â
YARN Federation
YARN Federation
DataWorks Summit/Hadoop Summit
Â
Andere mochten auch
(20)
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
Â
Hive acid-updates-strata-sjc-feb-2015
Hive acid-updates-strata-sjc-feb-2015
Â
Toward Better Multi-Tenancy Support from HDFS
Toward Better Multi-Tenancy Support from HDFS
Â
Machine Learning for Any Size of Data, Any Type of Data
Machine Learning for Any Size of Data, Any Type of Data
Â
A New "Sparkitecture" for modernizing your data warehouse
A New "Sparkitecture" for modernizing your data warehouse
Â
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
Â
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
Â
Keep your Hadoop Cluster at its Best
Keep your Hadoop Cluster at its Best
Â
From Zero to Data Flow in Hours with Apache NiFi
From Zero to Data Flow in Hours with Apache NiFi
Â
Producing Spark on YARN for ETL
Producing Spark on YARN for ETL
Â
File Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & Parquet
Â
How to build a successful Data Lake
How to build a successful Data Lake
Â
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
Â
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
Adding ACID Transactions, Inserts, Updates, and Deletes in Apache Hive
Â
Apache HBase: State of the Union
Apache HBase: State of the Union
Â
Extreme Analytics @ eBay
Extreme Analytics @ eBay
Â
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
Â
Intro to Spark with Zeppelin Crash Course Hadoop Summit SJ
Intro to Spark with Zeppelin Crash Course Hadoop Summit SJ
Â
Simplified Cluster Operation & Troubleshooting
Simplified Cluster Operation & Troubleshooting
Â
YARN Federation
YARN Federation
Â
Ăhnlich wie Apache Hive ACID Project
Hive ACID Apache BigData 2016
Hive ACID Apache BigData 2016
alanfgates
Â
Apache Hive on ACID
Apache Hive on ACID
Hortonworks
Â
What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?
DataWorks Summit
Â
Hive 3 New Horizons DataWorks Summit Melbourne February 2019
Hive 3 New Horizons DataWorks Summit Melbourne February 2019
alanfgates
Â
What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?
DataWorks Summit
Â
Apache Hive 2.0; SQL, Speed, Scale
Apache Hive 2.0; SQL, Speed, Scale
Hortonworks
Â
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
DataWorks Summit/Hadoop Summit
Â
Put is the new rename: San Jose Summit Edition
Put is the new rename: San Jose Summit Edition
Steve Loughran
Â
Hive2.0 sql speed-scale--hadoop-summit-dublin-apr-2016
Hive2.0 sql speed-scale--hadoop-summit-dublin-apr-2016
alanfgates
Â
Hadoop 3 in a Nutshell
Hadoop 3 in a Nutshell
DataWorks Summit/Hadoop Summit
Â
IoT:what about data storage?
IoT:what about data storage?
DataWorks Summit/Hadoop Summit
Â
Hive 3 a new horizon
Hive 3 a new horizon
Artem Ervits
Â
Hive2.0 big dataspain-nov-2016
Hive2.0 big dataspain-nov-2016
alanfgates
Â
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Big Data Spain
Â
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
DataWorks Summit/Hadoop Summit
Â
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
DataWorks Summit/Hadoop Summit
Â
Paris FOD Meetup #5 Hortonworks Presentation
Paris FOD Meetup #5 Hortonworks Presentation
Abdelkrim Hadjidj
Â
What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?
DataWorks Summit
Â
What's New in Apache Hive 3.0 - Tokyo
What's New in Apache Hive 3.0 - Tokyo
DataWorks Summit
Â
Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017
alanfgates
Â
Ăhnlich wie Apache Hive ACID Project
(20)
Hive ACID Apache BigData 2016
Hive ACID Apache BigData 2016
Â
Apache Hive on ACID
Apache Hive on ACID
Â
What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?
Â
Hive 3 New Horizons DataWorks Summit Melbourne February 2019
Hive 3 New Horizons DataWorks Summit Melbourne February 2019
Â
What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?
Â
Apache Hive 2.0; SQL, Speed, Scale
Apache Hive 2.0; SQL, Speed, Scale
Â
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
Â
Put is the new rename: San Jose Summit Edition
Put is the new rename: San Jose Summit Edition
Â
Hive2.0 sql speed-scale--hadoop-summit-dublin-apr-2016
Hive2.0 sql speed-scale--hadoop-summit-dublin-apr-2016
Â
Hadoop 3 in a Nutshell
Hadoop 3 in a Nutshell
Â
IoT:what about data storage?
IoT:what about data storage?
Â
Hive 3 a new horizon
Hive 3 a new horizon
Â
Hive2.0 big dataspain-nov-2016
Hive2.0 big dataspain-nov-2016
Â
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Â
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
Â
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
Â
Paris FOD Meetup #5 Hortonworks Presentation
Paris FOD Meetup #5 Hortonworks Presentation
Â
What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0?
Â
What's New in Apache Hive 3.0 - Tokyo
What's New in Apache Hive 3.0 - Tokyo
Â
Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017
Â
Mehr von DataWorks Summit/Hadoop Summit
Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
Â
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
Â
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
Â
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
Â
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
Â
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
Â
Hadoop Crash Course
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
Â
Data Science Crash Course
Data Science Crash Course
DataWorks Summit/Hadoop Summit
Â
Apache Spark Crash Course
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
Â
Dataflow with Apache NiFi
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
Â
Schema Registry - Set you Data Free
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
Â
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
Â
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
Â
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
Â
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
Â
HBase in Practice
HBase in Practice
DataWorks Summit/Hadoop Summit
Â
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
Â
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
Â
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
Â
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
Â
Mehr von DataWorks Summit/Hadoop Summit
(20)
Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
Â
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
Â
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
Â
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
Â
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
Â
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
Â
Hadoop Crash Course
Hadoop Crash Course
Â
Data Science Crash Course
Data Science Crash Course
Â
Apache Spark Crash Course
Apache Spark Crash Course
Â
Dataflow with Apache NiFi
Dataflow with Apache NiFi
Â
Schema Registry - Set you Data Free
Schema Registry - Set you Data Free
Â
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Â
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Â
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
Â
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
Â
HBase in Practice
HBase in Practice
Â
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
Â
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Â
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
Â
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
Â
KĂŒrzlich hochgeladen
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
The Digital Insurer
Â
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
Â
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Edi Saputra
Â
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
MadyBayot
Â
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
apidays
Â
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
debabhi2
Â
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
Zilliz
Â
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
MIND CTI
Â
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(âïž+971_581248768%)**%*]'#abortion pills for sale in dubai@
Â
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
The Digital Insurer
Â
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
apidays
Â
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
The Digital Insurer
Â
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
apidays
Â
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
Overkill Security
Â
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
apidays
Â
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Drew Madelung
Â
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
Dropbox
Â
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
Product Anonymous
Â
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
Khushali Kathiriya
Â
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
apidays
Â
KĂŒrzlich hochgeladen
(20)
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
Â
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Â
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Â
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
Â
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Â
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
Â
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
Â
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
Â
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
Â
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
Â
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
Â
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
Â
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Â
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
Â
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Â
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Â
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
Â
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
Â
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
Â
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Â
Apache Hive ACID Project
1.
Apache Hive ACID
Project Eugene Koifman June 2016
2.
2 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Agenda ï Motivations/Goals ï What is included in the project ï End user point of view ï Architecture ï Recent Progress ï Possible future directions
3.
3 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Motivations/Goals ï Continuously adding new data to Hive in the past â INSERT INTO Target as SELECT FROM Staging â ALTER TABLE Target ADD PARTITION (dt=â2016-06-30â) âą Lots of files â bad for performance âą Fewer files âusers wait longer to see latest data ï Modifying existing data â Analyzing log files â not that important. Sourcing data from an Operational Data Store â may be really important. â INSERT OVERWRITE TABLE Target SELECT * FROM Target WHERE ⊠⹠Concurrency â Hope for the best (multiple updates) â ZooKeeper lock manager S/X locks â restrictive âą Expensive to do repeatedly (write side)
4.
4 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Goals ï Make above use cases easy and efficient ï Key Requirement â Long running analytics queries should run concurrently with update commands ï NOT OLTP!!! â Support slowly changing tables â Not for 100s of concurrent queries trying to update the same partition
5.
5 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved ACID at High Level ï A new type of table that supports Insert/Update/Delete SQL operations ï Concept of ACID transaction â Atomic, Consistent, Isolated, Durable ï Streaming Ingest API â Write a continuous stream of events to Hive in micro batches with transactional semantics
6.
6 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved ACID at High Level RDMS Compute Nodes HDFS Streaming Client SQL Client Meta Store openTxn/commit/abort Data txnID
7.
7 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved User Point of View ï CREATE TABLE T(a int, b int) CLUSTERED BY (b) INTO 8 BUCKETS STORED AS ORC TBLPROPERTIES ('transactional'='true'); ï Not all tables support transactional semantics ï Table must be bucketed â important for query performance ï Table cannot be sorted â ACID implementation requires its own sort order ï Currently requires ORC File but anything implementing format â AcidInputFormat/AcidOutputFormat ï Snapshot Isolation â Lock in the state of the DB as of the start of the query for the duration of the query ï autoCommit=true
8.
8 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Design â Storage Layer ï Storage layer enhanced to support MVCC architecture â Multiple versions of each row â Allows concurrent readers/writers ï HDFS â append only file system â All update operations are written to a delta file first â Files are combined on read and compaction ï Even if you could update a file in the middle â The architecture of choice for analytics is columnar storage (ORC File) â Compresses by column â difficult to update ï Random data access is prohibitively slow
9.
9 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Storage Layer Example ï CREATE TABLE T(a int, b int) CLUSTERED BY (b) INTO 1 BUCKETS STORED AS ORC TBLPROPERTIES ('transactional'='true'); ï Suppose the table contains (1,2),(3,4) hive> update T set a = -3 where a = 3; hive> update T set a = -1 where a = 1; Now the table has (-1,2),(-3,4) ï hive> dfs -ls -R /user/hive/warehouse/t; /user/hive/warehouse/t/base_0000022/bucket_00000 /user/hive/warehouse/t/delta_0000023_0000023_0000/bucket_00000 /user/hive/warehouse/t/delta_0000024_0000024_0000/bucket_00000
10.
10 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Example Continued ï bin/hive --orcfiledump -j -d /user/hive/warehouse/t/base_0000022/bucket_00000 {"operation":0,"originalTransaction":22,"bucket":0,"rowId":0,"currentTransaction":22,"row":{"a":3,"b":4}} {"operation":0,"originalTransaction":22,"bucket":0,"rowId":1,"currentTransaction":22,"row":{"a":1,"b":2}} ï bin/hive --orcfiledump -j -d /âŠ/t/delta_0000023_0000023_0000/bucket_00000 {"operation":1,"originalTransaction":22,"bucket":0,"rowId":0,"currentTransaction":23,"row":{"_col1":-3,"_col2":4}} ï Each file is sorted by PK: originalTransaction,bucket,rowid ï On read base & deltas are stitched together to produce correct version of each row. ï Each read operation âknowsâ the state of all transactions up to the moment it started
11.
11 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Producing The Snapshot base_0000022/bucket_00000 oTxn bucket rowId cTxn a b 22 0 0 22 3 4 22 0 1 22 1 2 select * from T a b -3 4 -1 2 delta_0000023_0000023_0000 oTxn bucket rowId cTxn a b 22 0 0 23 - 3 4 delta_0000024_0000024_0000 oTxn bucket rowId cTxn a b 22 0 1 24 -1 2
12.
12 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Design - Compactor ï More operations = more delta files ï Compactor rewrites the table in the background â Minor compaction - merges delta files into fewer deltas â Major compactor merges deltas with base - more expensive â This amortizes the cost of updates and self tunes the tables âą Makes ORC more efficient - larger stripes, better compression ï Compaction can be triggered automatically or on demand â There are various configuration options to control when the process kicks in. â Compaction itself is a Map-Reduce job ï Key design principle is that compactor does not affect readers/writers ï Cleaner process â removes obsolete files
13.
13 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Design - Concurrency ï Transaction Manager â manages transaction ID assignment â keeps track of transaction state: open, committed, aborted ï Lock Manager â DDL operations acquire eXclusive locks â Read operations acquire Shared locks. â Main goal is to prevent someone dropping a table while a query is in progress ï State of both persisted in Hive Metastore ï Write Set tracking to prevent Write-Write conflicts in concurrent transactions ï Note that 2 Inserts are never in conflict since Hive does not enforce unique constraints.
14.
14 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved ï You are allowed to read acid and non-acid tables in same query. ï You cannot write to acid and non-acid tables at the same time (multi-insert statement)
15.
15 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Design - Streaming Ingest ï Allows you to continuously write events to a hive table â Can commit periodically to make writes durable/visible â Can also call abort to make writes since last commit/abort invisible. â Optimized so that it can handle writing micro batches of events - every second. âą Multiple transactions are written to one file â Only supports adding new data ï Streaming tools like Storm and Flume rely on this API to ingest data into hive ï This API is public so it can be used directly ï Data written via Streaming API has the same transactional semantics as SQL side
16.
16 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Recent improvements ï PPD ï Schema Evolution ï Split computation ( Tez version 0.7 required) ï Usability â better lock info â compaction history â show locks filtering ï Various safety checks - open txn limit ï Metastore side processes like compaction are no longer singletons ï Metastore scalability ï Bug fixes (Hive, Flume, Storm)
17.
17 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Future Work (Uncommitted transaction⊠may be rolled back) ï Automatic/Dynamic bucketing ï Merge statement (SQL Standard 2003) - HIVE-10924 ï Performance â Better Vectorization; some operations over acid tables donât vectorize at all â Some do but not as well as they could ï HCatalog integration (at least read side) to read from Pig/MR ï Multi statement transactions, i.e. BEGIN TRANSACTION/COMMIT/ROLLBACK ï Finer grained concurrency management/conflict detection ï Better Monitoring/Alerting
18.
18 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Etc ï Documentaton â https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions â https://cwiki.apache.org/confluence/display/Hive/Streaming+Data+Ingest ï Follow/Contribute â https://issues.apache.org/jira/browse/HIVE- 14004?jql=project%20%3D%20HIVE%20AND%20component%20%3D%20Transactions ï user@hive.apache.org ï dev@hive.apache.org
19.
19 © Hortonworks
Inc. 2011 â 2016. All Rights Reserved Thank You
Hinweis der Redaktion
Easiest way to explain this is to talk about how you used to do some things in Hive before Hive ACID project.
Jetzt herunterladen