OSC 2012 Tokyo

openstack
Open source software to build public and private clouds.

Hadoop on OpenStack Swift
- An experiment in using Swift as storage for Apache Hadoop
2012.09.08
OpenStack Japan
Zheng Xu

Self introduction

● Software designer (engineer) for embedded systems and web systems (60% hobby, 40% job).
● Main interests: openstack, linux, web browser, html, epub, OSS
● Contact
    ● @xz911
    ● https://www.facebook.com/xuzheng2001

Abstract

● These slides introduce how to use OpenStack Swift as the storage service for Apache Hadoop in place of HDFS (the Hadoop project's own storage service).
● They are based on http://bigdatacraft.com/archives/349. Many thanks to Constantine Peresypkin and David Gruzman for sharing their idea and implementation.

Agenda

● OpenStack Swift
● Apache Hadoop and HDFS
● Experiment: replacing HDFS with OpenStack Swift

What is OpenStack and Swift

[Figure from http://www.openstack.org/]

What is OpenStack and Swift

[Diagram: the User Application talks to the Proxy Servers over http; the Proxy Servers talk over http to the Account Servers, Container Servers, and Object Servers.]

What is OpenStack and Swift

● Open source, written in Python
● Diversity
    ● Swift can run as part of OpenStack or as a standalone service.
● Zones, devices, partitions, and replicas
● No SPOF (single point of failure)

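To make "zones, devices, partitions, and replicas" a little more concrete, here is a minimal Python sketch of how an object name can be hashed to a partition and how a partition can be given several replicas on different devices. The partition power, the device list, and the round-robin replica placement are all assumptions made for illustration. A real Swift ring also mixes a secret hash suffix into the path and uses a precomputed partition-to-device table built by its ring builder.

```python
import hashlib
import struct

# Hypothetical values for illustration only.
PART_POWER = 4   # 2**4 = 16 partitions
REPLICAS = 3     # copies kept of every partition
DEVICES = [
    {"id": 0, "zone": 1},
    {"id": 1, "zone": 2},
    {"id": 2, "zone": 3},
    {"id": 3, "zone": 4},
]


def partition_for(account, container, obj):
    """Hash the object path and keep the top PART_POWER bits as the partition."""
    path = "/%s/%s/%s" % (account, container, obj)
    digest = hashlib.md5(path.encode("utf-8")).digest()
    return struct.unpack(">I", digest[:4])[0] >> (32 - PART_POWER)


def devices_for(partition):
    """Simplified placement: consecutive devices, one per replica.

    This is NOT Swift's real algorithm; it only shows the idea that each
    replica of a partition lives on a different device (and zone).
    """
    return [DEVICES[(partition + r) % len(DEVICES)] for r in range(REPLICAS)]
```

Calling `devices_for(partition_for("AUTH_test", "test_container", "core-site.xml"))` returns three device entries. Because every device in this toy list sits in its own zone, the three replicas end up in three different zones, which is the property that lets the cluster survive the loss of a zone.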
Apache Hadoop and HDFS

[Figure from http://hadoop.apache.org/]

Apache Hadoop and HDFS

[Diagram: User Application, Map-Reduce, and Hive working against HDFS, shown as one Name Node and three Data Nodes.]

Experiment (Concept)

[Diagram, before: User Application, Map-Reduce, and Hive working against one Name Node and three Data Nodes.]

Experiment (Concept)

[Diagram, after: User Application, Map-Reduce, and Hive each use java-cloudfiles to reach Swift over http. The Name Node no longer appears; two Data Node boxes are still drawn next to Swift.]

Experiment (Software)

● Swift v1.6
    ● https://github.com/openstack/swift.git
    ● r21616cf, Jul 25
● Java client: java-cloudfiles
    ● https://github.com/rackspace/java-cloudfiles
    ● r0807fa6, Jun 4
● Apache Hadoop
    ● 1.0.3
● Swift fs for Apache Hadoop (only part of the following source code is used)
    ● https://github.com/Dazo-org/hadoop-common.git (branch-0.20-security-205.swift)

Experiment (infra)

[Diagram: two machines, 192.168.0.4 and 192.168.0.9. The following slides use 192.168.0.9 as the address of the Swift proxy.]

Experiment (install swift)

● Install Swift by following http://docs.openstack.org/developer/swift/development_saio.html
● Do not forget to set bind_ip in proxy-server.conf
    ● 192.168.0.9 in my case
    ● From here on, assume the user name is "test:tester" with the password "testing", the account name is AUTH_test, and some containers have been created by following the steps at the URL above.

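As a rough illustration of what those credentials are used for, the sketch below builds a v1.0 auth request against the proxy and extracts the storage URL and token from the response headers. It assumes the SAIO setup uses Swift's tempauth-style v1.0 endpoint, which takes the user and key in the X-Auth-User and X-Auth-Key headers and answers with X-Storage-Url and X-Auth-Token. The function names are made up for this example, and nothing is sent over the network until the request is actually opened.

```python
import urllib.request


def auth_headers(user, key):
    """Headers for a v1.0 auth request (assumed tempauth-style endpoint)."""
    return {"X-Auth-User": user, "X-Auth-Key": key}


def build_auth_request(auth_url, user, key):
    """Prepare, but do not send, the GET request for the auth endpoint."""
    return urllib.request.Request(
        auth_url, headers=auth_headers(user, key), method="GET"
    )


def parse_auth_response(headers):
    """Pick the storage URL and token out of the response headers."""
    lowered = {name.lower(): value for name, value in headers.items()}
    return lowered["x-storage-url"], lowered["x-auth-token"]
```

With the values from this slide, `build_auth_request("http://192.168.0.9:8080/auth/v1.0", "test:tester", "testing")` can be passed to `urllib.request.urlopen`, and `parse_auth_response(dict(response.headers))` then yields the URL and token that later requests would carry.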
Experiment (cloudfiles)

● Run "ant compile"
● Change cloudfiles.properties to the following
      # Auth info
      auth_url=http://192.168.0.9:8080/auth/v1.0
      auth_token_name=X-Auth-Token
      #auth_user_header=X-Storage-User
      #auth_pass_header=X-Storage-Pass

      # user properties
      username=test:tester
      password=testing

      # cloudfs properties
      version=v1
      connection_timeout=15000

Experiment (cloudfiles)

● Connect cloudfiles to Swift (this step is optional)
    ● Change cloudfiles.sh as follows and run it to test the connection to Swift
        #!/bin/sh

        export CLASSPATH=lib/httpcore-4.1.4.jar:lib/commons-cli-1.1.jar:lib/httpclient-4.1.3.jar:lib/commons-lang-2.4.jar:lib/junit.jar:lib/commons-codec-1.3.jar:lib/commons-io-1.4.jar:lib/commons-logging-1.1.1.jar:lib/log4j-1.2.15.jar:dist/java-cloudfiles.jar:.
        java com.rackspacecloud.client.cloudfiles.sample.FilesCli "$@"

Experiment (cloudfiles)

● Package java-cloudfiles as a jar file for Apache Hadoop (with java-cloudfiles cloned to ~/java-cloudfiles)
    ● The *.properties files need to be inside java-cloudfiles.jar
        $ ant package
        $ cd cloudfiles/dist
        $ cp ../*.properties .
        $ rm java-cloudfiles.jar
        $ jar cvf java-cloudfiles.jar ./*

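Since the whole point of repackaging is to get the *.properties files into the jar, it can be worth checking the result before copying it into Hadoop. A jar is a zip archive, so a small sketch like the following lists the properties files it contains. The function name is made up for this example.

```python
import zipfile


def properties_in_jar(jar_path):
    """Return the names of all *.properties entries inside the jar."""
    with zipfile.ZipFile(jar_path) as jar:
        return sorted(
            name for name in jar.namelist() if name.endswith(".properties")
        )
```

Run against the rebuilt java-cloudfiles.jar, the returned list should include cloudfiles.properties. An empty list suggests the copy step was skipped.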
Experiment (hadoop)

● Prepare
    ● Download Hadoop to ~/hadoop-1.0.3 (the newest stable version of the original Hadoop) and git clone https://github.com/Dazo-org/hadoop-common.git to ~/hadoop-common (older Hadoop source code that contains the swift fs plugin)
    ● In ~/hadoop-1.0.3, copy java-cloudfiles and its related libraries into the Hadoop lib folder
        – cd lib; cp ~/java-cloudfiles/cloudfiles/dist/java-cloudfiles.jar .
        – cp ~/java-cloudfiles/lib/httpc* .

Experiment (setting hadoop)

● ./hadoop-1.0.3/src/core/core-default.xml
    ● Add the following so that Hadoop maps the "swift://" scheme to the SwiftFileSystem class
    <property>
      <name>fs.swift.impl</name>
      <value>org.apache.hadoop.fs.swift.SwiftFileSystem</value>
      <description>The FileSystem for swift: uris.</description>
    </property>

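The effect of this property can be pictured as a simple lookup: Hadoop takes the scheme of a URI and looks for a configuration key of the form fs.<scheme>.impl to find the class to load. The Python sketch below imitates that lookup on a configuration fragment. It only illustrates the naming convention and is not Hadoop's actual code. The helper names are made up for this example.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# The property from this slide, wrapped in a <configuration> root.
CORE_DEFAULT = """<configuration>
  <property>
    <name>fs.swift.impl</name>
    <value>org.apache.hadoop.fs.swift.SwiftFileSystem</value>
  </property>
</configuration>"""


def load_properties(xml_text):
    """Read <property> name/value pairs into a dict."""
    root = ET.fromstring(xml_text)
    return {
        prop.findtext("name").strip(): prop.findtext("value").strip()
        for prop in root.iter("property")
    }


def filesystem_class_for(uri, props):
    """Look up the class name registered for the URI's scheme."""
    scheme = urlparse(uri).scheme
    key = "fs.%s.impl" % scheme
    if key not in props:
        raise LookupError("No FileSystem for scheme: " + scheme)
    return props[key]
```

For the URI used later in core-site.xml, `filesystem_class_for("swift://192.168.0.9:8080/v1/AUTH_test", load_properties(CORE_DEFAULT))` gives the SwiftFileSystem class name. A scheme without a matching key raises an error, which is roughly what happens when this property is missing.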
Experiment (hadoop)

● Copy the swift fs implementation into Hadoop 1.0.3 and build
    ● cp -R ../hadoop-common/src/core/org/apache/hadoop/fs/swift ./src/core/org/apache/hadoop/fs
    ● ant

Experiment (hadoop setting)

● ./conf/core-site.xml (part 1)
    ● Add the following property, for example
    <property>
        <name>fs.swift.userName</name>
        <value>test:tester</value>
    </property>

Experiment (hadoop setting)

● ./conf/core-site.xml (part 2)
    ● Add the following properties, for example
    <property>
        <name>fs.swift.userPassword</name>
        <value>testing</value>
    </property>

    <property>
        <name>fs.swift.acccountname</name>
        <value>AUTH_test</value>
    </property>

Experiment (hadoop setting)

● ./conf/core-site.xml (part 3)
    ● Add the following properties, for example
    <property>
        <name>fs.swift.authUrl</name>
        <value>http://192.168.0.9:8080/auth/v1.0</value>
    </property>

    <property>
        <name>fs.default.name</name>
        <value>swift://192.168.0.9:8080/v1/AUTH_test</value>
    </property>

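The three core-site.xml slides together add five properties. As a convenience, the sketch below assembles them into a single configuration document and reads them back, which is an easy way to catch a mistyped name or value. The property names are copied exactly as they appear on the slides. The helper names and the idea of generating the file from Python are assumptions of this example, not part of the original setup.

```python
import xml.etree.ElementTree as ET

# Names and values exactly as shown on the three core-site.xml slides.
SETTINGS = [
    ("fs.swift.userName", "test:tester"),
    ("fs.swift.userPassword", "testing"),
    ("fs.swift.acccountname", "AUTH_test"),
    ("fs.swift.authUrl", "http://192.168.0.9:8080/auth/v1.0"),
    ("fs.default.name", "swift://192.168.0.9:8080/v1/AUTH_test"),
]


def build_core_site(settings):
    """Render name/value pairs as a Hadoop-style <configuration> document."""
    root = ET.Element("configuration")
    for name, value in settings:
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    return ET.tostring(root, encoding="unicode")


def read_core_site(xml_text):
    """Parse the document back into a dict for checking."""
    return {
        prop.findtext("name"): prop.findtext("value")
        for prop in ET.fromstring(xml_text).iter("property")
    }
```

`read_core_site(build_core_site(SETTINGS))` returns the same five pairs, so the generated text can be written to ./conf/core-site.xml or compared with a hand-edited file.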
Experiment (check swift fs)

● At this point, we should be able to list the account's contents with the following command
    ● ./bin/hadoop fs -ls /
    ● or upload a file with ./bin/hadoop fs -put ./conf/core-site.xml /test_container/core-site.xml (test_container is a test container created after Swift was installed)

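To see where the uploaded file should end up, recall that Swift addresses objects as /v1/<account>/<container>/<object>. The sketch below splits a Hadoop path into a container and an object name and joins them with the account URL from fs.default.name. That the plugin treats the first path component as the container is an assumption based on the -put example above. The function names are made up for this example.

```python
from urllib.parse import quote, urlparse


def split_hadoop_path(path):
    """Split "/container/some/object" into ("container", "some/object")."""
    parts = path.lstrip("/").split("/", 1)
    container = parts[0]
    obj = parts[1] if len(parts) > 1 else ""
    return container, obj


def swift_object_url(default_fs, path):
    """Build the http URL Swift would use for a Hadoop path (assumed mapping)."""
    base = urlparse(default_fs)
    container, obj = split_hadoop_path(path)
    url = "http://%s%s/%s" % (base.netloc, base.path.rstrip("/"), quote(container))
    if obj:
        url += "/" + quote(obj)
    return url
```

With the settings from the previous slides, `swift_object_url("swift://192.168.0.9:8080/v1/AUTH_test", "/test_container/core-site.xml")` gives http://192.168.0.9:8080/v1/AUTH_test/test_container/core-site.xml, which can be checked directly against Swift with any HTTP client and a valid token.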
Finally

● We installed Swift as the storage service for Hadoop
● We built the original java-cloudfiles and packaged it for Hadoop
● We copied the fs.swift plugin from https://github.com/Dazo-org/hadoop-common.git into the newer Hadoop source tree and built Hadoop
● We set up Hadoop's core-site.xml to connect to Swift via java-cloudfiles

Thank you for listening.
