Apache Kylin
Open Source Journey
韩卿 | Luke Han
Co-Creator & PMC Member
lukehan@apache.org
2015-04-25
Agenda
• About Apache Kylin
• Kylin Open Source Journey
• Apache Incubating
• Build Community and Ecosystem
• The Good, The Bad and The Ugly
• Q&A
About Apache Kylin (麒麟)
Extreme OLAP Engine for Big Data
http://kylin.io
Kylin is an open source distributed analytics engine that provides a SQL interface and multi-dimensional analysis (OLAP) on Hadoop, supporting extremely large datasets.
• First Apache project open sourced by eBay Inc.
• First Apache project fully contributed by eBay CCOE
• Open sourced on Oct 1st, 2014
• Accepted as an Apache Incubator project on Nov 25th, 2014
• Apache Kylin is an effort undergoing incubation at The Apache Software Foundation (ASF), sponsored by the Apache Incubator.
Technical Challenges
• Huge volume data
– Table scan
• Big table joins
– Data shuffling
• Analysis on different granularity
– Runtime aggregation expensive
• Map Reduce job
– Batch processing
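The "runtime aggregation expensive" challenge above is what motivates pre-computing a cube. The following is a minimal sketch of that idea, not Kylin's actual build engine: the fact rows, the dimension names, and the `build_cube` / `query` helpers are all hypothetical and exist only for illustration.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical fact rows: (year, country, category, sales)
ROWS = [
    (2014, "US", "Toys", 10.0),
    (2014, "US", "Books", 5.0),
    (2014, "CN", "Toys", 7.0),
    (2015, "CN", "Books", 3.0),
]
DIMENSIONS = ("year", "country", "category")


def build_cube(rows):
    """Pre-aggregate SUM(sales) for every subset of dimensions (every cuboid)."""
    cube = {}
    for size in range(len(DIMENSIONS) + 1):
        for dims in combinations(range(len(DIMENSIONS)), size):
            cuboid = defaultdict(float)
            for row in rows:
                key = tuple(row[i] for i in dims)
                cuboid[key] += row[-1]
            cube[tuple(DIMENSIONS[i] for i in dims)] = dict(cuboid)
    return cube


def query(cube, group_by):
    """Answer a GROUP BY with a dictionary lookup instead of a table scan."""
    return cube[tuple(d for d in DIMENSIONS if d in group_by)]


cube = build_cube(ROWS)
print(query(cube, {"country"}))
```

The build cost is paid once, offline; each query then reads a pre-aggregated cuboid. The trade-off is storage: with n dimensions there are 2^n cuboids, which is why a real engine has to prune or compress them.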
Apache Kylin Architecture
[Architecture diagram]
• Clients: 3rd Party App (Web App, Mobile…) via REST API; SQL-Based Tool (BI Tools: Tableau…) via JDBC/ODBC
• Kylin components: REST Server, Query Engine, Routing, Metadata, Cube Build Engine (MapReduce, Streaming…)
• Storage: Star Schema Data (Hadoop, Hive); Key Value Data (OLAP Cube on HBase)
• Query paths: Low Latency - Seconds; Mid Latency - Minutes
➢ Online Analysis Data Flow
➢ Offline Data Flow
➢ Clients/Users interact with Kylin via SQL
➢ OLAP Cube is transparent to users
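The "Routing" box in the architecture can be pictured with a toy sketch. It assumes, as the diagram's labels suggest, that the low-latency path is served from the pre-built cube and the mid-latency path falls back to Hive. The `Query`, `CubeDescriptor`, and `route` names are hypothetical and do not reflect Kylin's real classes or routing rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    dimensions: frozenset  # columns used in GROUP BY / WHERE
    measures: frozenset    # aggregations requested, e.g. "sum_sales"


@dataclass(frozen=True)
class CubeDescriptor:
    name: str
    dimensions: frozenset
    measures: frozenset

    def covers(self, q: Query) -> bool:
        """A cube can answer a query only if it pre-computed everything asked."""
        return q.dimensions <= self.dimensions and q.measures <= self.measures


def route(q: Query, cubes):
    """Return ("cube", name) for the low-latency path, else ("hive", None)."""
    for cube in cubes:
        if cube.covers(q):
            return ("cube", cube.name)
    return ("hive", None)  # assumed fallback: mid-latency scan of source data


cubes = [
    CubeDescriptor(
        name="sales_cube",
        dimensions=frozenset({"year", "country", "category"}),
        measures=frozenset({"sum_sales"}),
    )
]
covered = Query(frozenset({"country"}), frozenset({"sum_sales"}))
uncovered = Query(frozenset({"seller_id"}), frozenset({"sum_sales"}))
print(route(covered, cubes), route(uncovered, cubes))
```

Because routing happens behind the SQL interface, the client sends the same SQL either way, which is one way to read "OLAP Cube is transparent to users".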
Features
• Extremely fast OLAP engine at scale
• ANSI SQL interface on Hadoop
• Seamless integration with BI tools such as Tableau
• Interactive query capability
• MOLAP cube
• Compression and encoding support
• Incremental build of cubes
• Approximate query capability for distinct count (HyperLogLog)
• Leverages HBase coprocessors to reduce query latency
• Job management and monitoring
• User-friendly web GUI to manage, build, monitor, and query cubes
• Security capability to set ACLs at the cube/project level
• LDAP integration support
• Streaming support coming soon!
90th-percentile query latency < 5s
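The approximate distinct count mentioned in the feature list relies on HyperLogLog. Below is a minimal textbook-style sketch of the algorithm, assuming a 64-bit hash taken from SHA-1 and 2^10 registers. It is not Kylin's implementation, and the class and variable names are illustrative only.

```python
import hashlib
import math


class HyperLogLog:
    def __init__(self, p=10):
        self.p = p                      # number of index bits
        self.m = 1 << p                 # number of registers
        self.registers = [0] * self.m

    def add(self, item):
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")      # 64-bit hash value
        idx = h >> (64 - self.p)                   # first p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)      # remaining 64 - p bits
        # Rank = 1-based position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[idx] = max(self.registers[idx], rank)

    def merge(self, other):
        """Union of two sketches: element-wise max of the registers."""
        self.registers = [max(a, b) for a, b in zip(self.registers, other.registers)]

    def count(self):
        alpha = 0.7213 / (1 + 1.079 / self.m)      # bias constant, valid for m >= 128
        raw = alpha * self.m ** 2 / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            return self.m * math.log(self.m / zeros)   # small-range correction
        return raw


# Two hypothetical daily segments with overlapping users.
day1, day2 = HyperLogLog(), HyperLogLog()
for i in range(0, 6000):
    day1.add(f"user-{i}")
for i in range(4000, 10000):
    day2.add(f"user-{i}")
day1.merge(day2)
print(round(day1.count()))  # approximate count of the 10000 distinct users
```

The sketch uses fixed memory (1024 small registers here) regardless of cardinality, and two sketches merge losslessly. That property makes it a natural fit for pre-aggregated and incrementally built cubes, where exact distinct counts cannot simply be added across segments.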
Kylin Open Source Journey
• Sep 2013: Initiative
• Jan 2014: POC Completed
• Jun 2014: US Patent Filed
• Jul 2014: V1.0 Beta Released
• Oct 2014: V1.0 GA Released, Open Sourced
• Nov 2014: Apache Incubator Project
• Goal: Apache Top Project
Ready for Open Source
• Open Source from Day One
• Internal vs External
• Intellectual Property
• Legal
• Domain
• License
– Apache/MIT/BSD/GPL…
• Team
Patent
• Why?
• How?
• Patent vs Open Source
Phase I: Open Source on Github
• Code pushed to github.com on Oct 1st, 2014
Phase II: Apache Incubator
• Accepted as an Apache Incubator project on Nov 25th, 2014
Why & How Apache?
• Hadoop Ecosystem Home
• Branding
• Community
• The Apache Way
Incubation Progress
Incubator Project Proposal
• IPMC & PPMC
• Mentors and Champion
• Committers
Infrastructure Setup
• Mailing List
– private@
– dev@
• Source Code Repo
– git & svn
– Migration
• Website
• JIRA
• Wiki
IP Clearance & Release
• Kylin for brand name?
• Apache License
• GPL Dependency?
• Apache Release
• README, LICENSE, NOTICE, DISCLAIMER
• Source Headers
• Licensing of dependencies
• Binaries
Onboarding the Team to the Apache Way
• Community then Code
• Mailing list discussions
• Vote
• Code Quality and Style
• JIRA for each issue, feature
• Merge Pull Request
• Recruiting contributors/committers
How to contribute?
• Join mailing list:
– dev@kylin.incubator.apache.org
• Create JIRA or Leave Comments
• Pull Request/Patch to Apache Github Mirror
Graduate to Top Project
• Diversity
• Complete (and sign off) tasks documented in the status file
• Ensure suitability for project name and product name
• Demonstrate ability to create Apache releases
• Demonstrate community readiness
• Ensure that mentors and the IPMC have no remaining issues
Ready for Apache?
Build Community and Ecosystem
• What’s community?
• How to grow community?
• Community over Code!
Marketing - Website
• http://kylin.io
– Hosted on github.io (Github Pages)
– Hosted on Apache Infra Server
– http://kylin.incubator.apache.org
Marketing - Blog
• Publish via eBay Tech Blog to gain attention from the industry
• http://www.ebaytechblog.com/2014/10/20/announcing-kylin-extreme-olap-engine-for-big-data
“Like arch-rival Amazon.com, the soon-to-split eBay Inc. is something of an oddity in that it hasn’t historically been a big contributor to the open-source community. But the e-commerce pioneer hopes to change that with the release of the source-code for a homegrown online analytics processing (OLAP) engine that promises to speed up Hadoop while also making it more accessible to everyday enterprise users.”
-- siliconangle.com
Marketing – Social Media
• Github
– KylinOLAP
• Twitter
– @ApacheKylin
• Hacker News
• Facebook
– Page: kylin.io
• LinkedIn
– Group: Kylin
• WeChat(微信)
– ApacheKylin
• …
Marketing - Media
• InfoQ
• CSDN
• OSChina
• …
Build Community – Mailing List
Build Community – Meetup
• Hive Meetup Bay Area, Dec 2014
• Apache Kylin Meetup Bay Area, Dec 2014
• Apache Kylin Tech Talk @AWS Seattle, Dec 2014
• Apache Kylin Meetup Beijing, Dec 2014
• Spark Meetup Bay Area, March 2015
• Kylin Meetup in China, coming soon
• …
Build Community – Conference
• Big Data Summit Shanghai, Oct 2014
• Big Data Technology Conference Beijing, Dec 2014
• Database Technology Conference Beijing, April 2015
• Hadoop Summit Europe, April 2015
• QCon Beijing, April 2015
• Strata+Hadoop World London, May 2015
• HBaseCon San Francisco, May 2015
• Hadoop Summit San Jose, June 2015
• …
Know your community
• Google Analytics
• Github Statistics
• Mailing List
• WeChat
• …
Apache Kylin Ecosystem
• Kylin Core (Kylin OLAP Core)
– Fundamental framework of Kylin OLAP Engine
• Extension (Security, Redis Storage, Spark Engine, Docker)
– Plugins to support additional functions and features
• Integration (ODBC Driver, ETL, Drill, SparkSQL)
– Lifecycle Management Support to integrate with other applications like BI tools
• Interface (Web Console, Customized BI, Ambari/Hue Plugin)
– Allows third-party users to build more features via user interfaces atop Kylin core
Apache Kylin Evolution Roadmap
• Initial Prototype for MOLAP
– Basic end-to-end POC
• MOLAP
– Incremental Refresh
– ANSI SQL
– ODBC Driver
– Web GUI
– ACL
– Open Source
• HOLAP
– Streaming OLAP
– JDBC Driver
– New GUI
– Excel Support
– SparkSQL
– … more
• Next Gen
– Lambda Arch
– Automation
– Capacity Management
– In-Memory Analysis (TBD)
– Spark (TBD)
– Mobile (TBD)
– … more
• Future… (TBD)
Timeline markers: Sep 2013, Jan 2014, Sep 2014, H1 2015 (year axis: 2013, 2014, 2015)
Excellence of Engineering
Recruit best people
Done is better than perfect
Do academic research
Explain design in simple words
Everyone does dirty work
You write first version, I write second one
Debate, Decision & Delivery
Team Philosophy
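The Good
• Visibility
• Personal growth
• Team culture
• Project quality
• Sense of achievement
• Working alongside top engineers
The whole world is watching you and your code!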
The Bad
• Lower development efficiency
• Internal project schedule vs. external support and issues
• Spare time
• Roadmap and feature requests from outside
The Ugly
• Open source does not mean free
• Please respect open source authors
• Ask questions the right way
If you want to go fast, go alone.
If you want to go far, go together.
-- African Proverb
Apache Kylin
• Kylin Site:
– http://kylin.incubator.apache.org
– http://kylin.io
• Twitter:
– @ApacheKylin
• WeChat(微信)
– ApacheKylin
@InfoQ infoqchina

Apache Kylin Open Source Journey for QCon2015 Beijing
