Open Source SQL for Hadoop: Where are we and Where are we Going?
1. Open Source SQL for Hadoop
Where are we now and where are we going
2. Teradata Center for Hadoop
• History of Hadapt
– Founded in July 2010 by Borgman, Bajda-Pawlikowski, and Abadi
– Pioneered the SQL-on-Hadoop market
– Based on work done by the database research group in the Yale Computer Science Department
– Hybrid of Hadoop scalability and DBMS performance
• Today
– Hadapt acquired by Teradata in July 2014 and renamed Teradata Center for Hadoop
– 40 developers with deep Hadoop and database expertise
– Headquarters in Boston, MA
Justin Borgman
VP & GM, Teradata Center for Hadoop
Former Founder & CEO, Hadapt
4. What is Teradata Announcing?
• 100% open source contributions to Presto to increase adoption in the enterprise
• A multi-year roadmap commitment to phased enhancements of the open source code
• The first ever commercial support offering for Presto
Available starting June 8, 2015
#presto
5. Presto
100% open source SQL query engine
– Modern code base
– Proven scalability
– Interactive querying
Cross-platform query capability, not only SQL on Hadoop
Licensed under the Apache License 2.0
Not supported by a major vendor
Used by a community of well known, well respected technology companies
7. What is Presto?
Distributed SQL analytics engine
Optimized for low-latency, interactive analysis
ANSI SQL
Extensible
8. Presto timeline
Fall 2008: Facebook open sources Hive
August 2012: 4 developers start Presto development
December 2012: Presto rolled out within Facebook
November 2013: Facebook open sources Presto
June 2014: 68 releases, 30 contributors, 2796 commits
March 2015: 98 releases, 65 contributors, 4587 commits
9. Presto @ Facebook
1000s of internal daily active users
Millions of queries each month
Multiple PBs scanned every day
Trillions of rows a day
14. What makes Presto fast?
Data in memory during execution
Pipelining and streaming
Very careful coding of inner loops
Efficient flat-memory data structures
Bytecode generation
Custom ORC reader
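The "pipelining and streaming" point above can be illustrated with a small Python sketch. This is not Presto's implementation; it is a toy model, assuming a hypothetical row layout (dictionaries with an `amount` field) and hypothetical operator names (`scan`, `filter_pages`, `sum_amount`). It shows operators passing small pages of rows downstream as soon as they are ready, so no intermediate result is fully materialized.

```python
from typing import Dict, Iterable, Iterator, List

# A "page" is a small batch of rows. The layout is hypothetical,
# chosen only to keep the example short.
Page = List[Dict[str, int]]


def scan(rows: Iterable[Dict[str, int]], page_size: int) -> Iterator[Page]:
    """Yield fixed-size pages of rows instead of loading the whole table."""
    page: Page = []
    for row in rows:
        page.append(row)
        if len(page) == page_size:
            yield page
            page = []
    if page:
        yield page


def filter_pages(pages: Iterator[Page], min_amount: int) -> Iterator[Page]:
    """Pass each page downstream as soon as it has been filtered."""
    for page in pages:
        kept = [row for row in page if row["amount"] >= min_amount]
        if kept:
            yield kept


def sum_amount(pages: Iterator[Page]) -> int:
    """Consume pages one at a time, keeping only a running total."""
    total = 0
    for page in pages:
        total += sum(row["amount"] for row in page)
    return total


def run_query(rows: Iterable[Dict[str, int]], min_amount: int, page_size: int = 2) -> int:
    # Roughly: SELECT sum(amount) FROM rows WHERE amount >= min_amount
    return sum_amount(filter_pages(scan(rows, page_size), min_amount))
```

Because each stage is a generator, the aggregation starts consuming pages while the scan is still producing them, which is the general idea behind a pipelined execution model.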
15. Next
More SQL features
Planner/execution engine improvements
Native columnar store
Security
16. Open source
Apache License 2.0
Open development
Releases every 1-2 weeks
Contributors welcome!
19. Early Feedback is Extremely Positive
“Presto is an integral part of the Airbnb data infrastructure stack with hundreds of employees running queries each day with the technology. We are excited to see Teradata joining the Presto open source community and are encouraged by the direction of their contributions.”
- James Mayfield, product lead, Airbnb
“We are excited to see Teradata's commitment to Presto and adding capabilities in the open source domain. This will create interesting opportunities within our technical and business teams to open up more access options to our critical data. We think this is a positive for Teradata and for the community as a whole.”
- Steve Deasy, vice president of Engineering, Groupon
20. Download Presto version 101t today!
www.teradata.com/presto
Contribute to Presto today!
www.github.com/facebook/presto
www.prestodb.io
#presto
Editor's notes
Interactive performance of execution engine
Code generation for operators (similar to Impala)
Data is pipelined MPP-style
Runs at Facebook scale
*Capable of querying other non-HDFS data stores as well*
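
The note on code generation for operators can be made concrete with a rough analogy. Presto is written in Java and the slides refer to bytecode generation; the Python sketch below only mimics the idea and is not how Presto does it. The predicate format and the names `interpret` and `compile_predicate` are assumptions made for this example. The generic path re-walks the predicate for every row, while the generated path builds a function specialized to one predicate and compiles it once.

```python
import operator
from typing import Callable, Dict, List, Tuple

# A predicate is a list of (column, op, constant) terms joined by AND.
# This representation is hypothetical and exists only for the example.
Predicate = List[Tuple[str, str, int]]

OPS = {
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
    "==": operator.eq,
}


def interpret(predicate: Predicate, row: Dict[str, int]) -> bool:
    """Generic path: look up and apply every term again for each row."""
    return all(OPS[op](row[col], const) for col, op, const in predicate)


def compile_predicate(predicate: Predicate) -> Callable[[Dict[str, int]], bool]:
    """Generated path: emit source for this one predicate, compile it once."""
    for col, op, const in predicate:
        # The terms are spliced into source code, so validate them first.
        if op not in OPS or not col.isidentifier() or not isinstance(const, int):
            raise ValueError(f"unsupported term: {(col, op, const)!r}")
    terms = [f"row[{col!r}] {op} {const}" for col, op, const in predicate]
    body = " and ".join(terms) or "True"
    source = f"def matches(row):\n    return {body}\n"
    namespace: dict = {}
    exec(compile(source, "<generated>", "exec"), namespace)
    return namespace["matches"]
```

A caller would build the specialized function once per query, for example `matches = compile_predicate([("amount", ">=", 10)])`, and then apply `matches(row)` to each row.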