SlideShare ist ein Scribd-Unternehmen logo
1 von 52
Downloaden Sie, um offline zu lesen
A Model for the Systems
                             Architecture of the Future



         Prof. Paul A. Strassmann
         George Mason University, December 5, 2005
                                                                          1
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Data-Centric Era; IBM Dominates




                Hundred Sources

                   1950-1980




                Months⇒Weeks


                                                                          2
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup-Centric Era; Microsoft, INTEL Dominate




                                                        Million Sources

               Hundred Sources

                   1950-1980                             1980-2010


                                                        Weeks⇒Days

                Months⇒Weeks


                                                                          3
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Network-Centric Era; Google and Cisco?
                                                                          +Multi-Media
                                                              +Text       Billions Sources
                        Data
                                                        Million Sources

               Hundred Sources

                   1950-1980                             1980-2010         2010-

                                                                          Days⇒Real-Time

                                                      Weeks⇒Days

                Months⇒Weeks


                                                                                             4
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Example of a Network-Centric System




                                                                          5
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Network-Centric Requirements (2010)

        •     Downtime (< 5 min/yr);
        •     Display (200 Billion ops/sec);
        •     Connectivity (> 1 Gigabyte/sec);
        •     Access (< 0.25 sec);
        •     Innovation (< 1 day);
        •     Security (> 8 sigma).



                                                                          6
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Performance (2005)


         •      Infrastructure = > 50% of spending;
         •      Security = ?;
         •      Integration = > 50% of applications;
         •      Network downtime = > 1 hour/year;
         •      Innovation = > 1 year.




                                                                          7
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Conclusion




         • Network-Centric systems cannot be
           built on Workgroup-Centric architecture.




                                                                          8
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Network-Centric Principles (Google)

1. Build & operate protected information
   network;
2. Offer universal connectivity for:
   – Collection, processing and storing of
      information;
   – Provide secured communications.
3. Maintain shared data models;
4. Require continued upgrading & innovation.


                                               9
Google Principle #1




Build & operate protected information network




                                          10
Standard Google Clusters Operate Net

                •         359 racks
                •         31,654 machines
                •         63,184 CPUs
                •         126,368 Ghz of processing power
                •         63,184 Gb of RAM
                •         2,527 Tb of Hard Drive space

         • Appx. 40 million searches/day

                                                                          11
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Clusters Have Identical Architecture



                                 Index
                                Servers

   Web             Web
  Switch          Servers



                                Document
                                 Servers


                                           12
Google Cluster Set-Up = Three Days




                                     13
Google Infrastructure: Key to Growth (1)
        • >200,000 custom-built commodity servers;
        •       Acting as one parallel supercomputer;
        • Fault tolerant hardware.
        • Storage capacity >5 petabytes; low response latency
          (0.2 sec); >80GB per server, distributed;
        • Indexed >8 billion web pages; Indexing is
          computationally complex (>500M * > 2B matrix)
        • Capital and operating costs at fraction of large scale
          commercial servers; traffic growth 20-30%/month; data
          centers (>12); in US, Europe and Asia.


                                                                          14
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Infrastructure: Key to Growth (2)
        •     >200,000 commodity Linux servers built to custom
              specifications; distributed cluster architecture; acting as one
              parallel supercomputer; scaleable;
        •     >50,000 requests/sec; fault tolerant (no single point of failure);
              diverse hardware; stripped version of Red Hat;
        •     Storage capacity >5 petabytes;
        •     >80GB per server;
        •     Indexed >8 billion web pages;
        •     Indexing is complex (500M x 2B matrix)
        •     Capital and operating costs at fraction of large scale commercial
              servers; traffic growth 20-30%/month; data centers (>12); in US,
              Europe and Asia.



                                                                          15
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Infrastructure: Key to Growth (3)
        •     >200,000 commodity Linux servers built to custom
              specifications; distributed cluster architecture; acting as one
              parallel supercomputer; scaleable;
        •     >50,000 requests/sec; fault tolerant (no single point of failure);
              diverse hardware; stripped version of Red Hat;
        •     Storage capacity >5 petabytes; low response latency (0.2 sec);
              >80GB per server, distributed;
        •     Indexed >8 billion web pages; Indexing is computationally
              complex (>500M * > 2B matrix)
        •     Capital and operating costs a fraction of commercial servers;
        •     Traffic growth 20-30%/month;
        •     Data centers (>20), in US, Europe and Asia.



                                                                          16
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Architecture for Reliability


  • Replication (3x+) for redundancy;
  • Replication for proximity and response;
  • Reliability with software and architecture,
    not with hardware.




                                                                          17
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Indexing for Response


         •      Dynamic indexing of 8B+ pages;
         •      Dynamic indexing of 1B+ images;
         •      Indexing of 1B+ messages;
         •      Index broken into “shards” and
                distributed across data centers.




                                                                          18
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Query Serving Infrastructure

     • Processing a single query may involve
       1000+ servers;
     • Index Servers access Index Shards;
     • Document Servers access Doc Shards;
     • Response times monitored to assure
       <0.25 sec latency.



                                                                          19
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google MapReduce System (1)

 •     Coordinates servers in real-time;
 •     Automates distribution of workload;
 •     Fault tolerance and service reconstitution;
 •     Systems-wide I/O cluster scheduling;
 •     Status and performance monitoring.




                                                                          20
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google MapReduce System (2)


 •     Coordination of servers in real-time;
 •     Automates distribution of workload;
 •     Fault tolerance & service reconstitution;
 •     Systems-wide cluster scheduling;
 •     Status and performance monitoring.




                                                                          21
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Principle #2




      Universal connectivity




                               22
Multi-Lingual Services




                                                                          23
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Search in Arabic Media




                                                                          24
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Video Searches




                                                                          25
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Base - Connecting Diverse Sources

                                                                           Locate events within
                                                                          45 miles of New York in
                                                                             November, 2005




                                                                                                    26
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Semantic Parsing


  • Tools parse millions of documents;
  • Automated learning for related information.
           – Query: “Bay Area Cooking Classes”

           – Finds: “San Francisco College Classes”;
                    “The Magic of Thai Cuisine”




                                                                          27
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Principle #3




         Shared data models




                              28
Data Engineering

    • Standard file management: The Google
      File System (GFS);
    • Standard job scheduling: The Global Work
      Queue (GWQ);
    • Standard network management: The
      Google MapReduce system.



                                                                          29
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google File System (GFS)

        • Replicated Masters manage MetaData
          directories;
        • Data transfers directly at the machine
          level within 2,000+ clusters;
        • File broken into 64 MB chunks for
          2000+ MB/second read/write load;
        • All file chunks at least triplicate for
          safety.

                                                                          30
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Data Dictionary for Interoperability




                                                                          31
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Application Interfaces




                                                                          32
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Principle #4




        Upgrading & Innovation




                                 33
Deliver On-Line Services




                                                                          34
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Shopping Services




                                                                          35
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Environment for Rapid Innovation




                                                                          36
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
A New Application Launched in 15 Minutes




                                                                          37
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Occupy the Desktop




                                                                          38
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Multimedia Services




                                                                          39
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures




    Comparison Summaries




                                      40
Workgroup Computing Today:
        Millions of Local Applications+Local Data




                                                                          41
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Work-Groups Vulnerable Today




                                                                          42
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
New Internet: Billions of Browsers,
        Secure Shared Applications+Data




     Browsers




                     Application
                     Application
                     Application                                          43
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (1)

  Workgroup Centric                                                       Network Centric

 Strategy: Capture Desktop                                                Strategy: Occupy Internet
 Customer’s labor and capital                                             Labor and capital in network
 User-specific infrastructures                                            Infrastructure is universal
 Systems controls by user                                                 Network controls in network
 Operating system dependency                                              Open source browsers
 License Software                                                         Pay for Use




                                                                                             44
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (2)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network
Operating system dependency                                               Open source browsers
License Software                                                          Pay for Use




                                                                                            45
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (3)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network
Operating system dependency                                               Open source browsers
License Software                                                          Pay for Use




                                                                                              46
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (4)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network
Operating system dependency                                               Open source browsers
License Software                                                          Pay for Use




                                                                                            47
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (5)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network
Operating system dependent                                                Open source browsers
License Software                                                          Pay for Use




                                                                                              48
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (6)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network

License Software                                                          Pay for Use




                                                                                              49
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (7)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network

License Software                                                          Pay for Use
Data read from files                                                      Data assembled in context



                                                                                              50
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
The Future

  Technology
  • All electronic devices on Internet.
  • Data, voice, video, sensor inputs accessible.
  • Phone, TV and print media displaced.

  Services
  • Systems respond to questions.
  • Information is displayed in context.
  • Applications for decision-making.

                                                                          51
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Relevance for National Security Systems

       • Workgroups to Network-Centric services.
       • Migrate through displacement.
       • Invest savings in innovation.




                                                                          52
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY

Weitere ähnliche Inhalte

Ähnlich wie Lecture v4

IBM Tape the future of tape
IBM Tape the future of tapeIBM Tape the future of tape
IBM Tape the future of tapeJosef Weingand
 
Vortrag ralph behrens_ibm-data
Vortrag ralph behrens_ibm-dataVortrag ralph behrens_ibm-data
Vortrag ralph behrens_ibm-dataAravindharamanan S
 
RAMCloud: Scalable Datacenter Storage Entirely in DRAM
RAMCloud: Scalable  Datacenter Storage  Entirely in DRAMRAMCloud: Scalable  Datacenter Storage  Entirely in DRAM
RAMCloud: Scalable Datacenter Storage Entirely in DRAMGeorge Ang
 
Horizon 20110928
Horizon 20110928Horizon 20110928
Horizon 20110928Mike Miller
 
Why z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsWhy z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsTeodoro Cipresso
 
KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.Kyong-Ha Lee
 
S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2Tony Pearson
 
Flash Ahead: IBM Flash System Selling Point
Flash Ahead: IBM Flash System Selling PointFlash Ahead: IBM Flash System Selling Point
Flash Ahead: IBM Flash System Selling PointCTI Group
 
USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)Ryousei Takano
 
Webinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be BackupsWebinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be BackupsStorage Switzerland
 
Compare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL ServerCompare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL ServerAlexDepo
 
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...Paul Hofmann
 
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019javier ramirez
 
Research and technology explosion in scale-out storage
Research and technology explosion in scale-out storageResearch and technology explosion in scale-out storage
Research and technology explosion in scale-out storageJeff Spencer
 
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...Aerospike
 

Ähnlich wie Lecture v4 (20)

The future of tape
The future of tapeThe future of tape
The future of tape
 
IBM Tape the future of tape
IBM Tape the future of tapeIBM Tape the future of tape
IBM Tape the future of tape
 
Vortrag ralph behrens_ibm-data
Vortrag ralph behrens_ibm-dataVortrag ralph behrens_ibm-data
Vortrag ralph behrens_ibm-data
 
RAMCloud: Scalable Datacenter Storage Entirely in DRAM
RAMCloud: Scalable  Datacenter Storage  Entirely in DRAMRAMCloud: Scalable  Datacenter Storage  Entirely in DRAM
RAMCloud: Scalable Datacenter Storage Entirely in DRAM
 
Horizon 20110928
Horizon 20110928Horizon 20110928
Horizon 20110928
 
Why z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsWhy z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIs
 
KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.
 
S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2
 
Big Data and OSS at IBM
Big Data and OSS at IBMBig Data and OSS at IBM
Big Data and OSS at IBM
 
Flash Ahead: IBM Flash System Selling Point
Flash Ahead: IBM Flash System Selling PointFlash Ahead: IBM Flash System Selling Point
Flash Ahead: IBM Flash System Selling Point
 
USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)
 
Webinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be BackupsWebinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be Backups
 
Compare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL ServerCompare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL Server
 
Storage overview
Storage overviewStorage overview
Storage overview
 
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
 
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
 
Research and technology explosion in scale-out storage
Research and technology explosion in scale-out storageResearch and technology explosion in scale-out storage
Research and technology explosion in scale-out storage
 
Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
 
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data EngineNZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
 
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
 

Kürzlich hochgeladen

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Kürzlich hochgeladen (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Lecture v4

  • 1. A Model for the Systems Architecture of the Future Prof. Paul A. Strassmann George Mason University, December 5, 2005 1 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 2. Data-Centric Era; IBM Dominates Hundred Sources 1950-1980 Months⇒Weeks 2 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 3. Workgroup-Centric Era; Microsoft, INTEL Dominate Million Sources Hundred Sources 1950-1980 1980-2010 Weeks⇒Days Months⇒Weeks 3 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 4. Network-Centric Era; Google and Cisco? +Multi-Media +Text Billions Sources Data Million Sources Hundred Sources 1950-1980 1980-2010 2010- Days⇒Real-Time Weeks⇒Days Months⇒Weeks 4 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 5. Example of a Network-Centric System 5 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 6. Network-Centric Requirements (2010) • Downtime (< 5 min/yr); • Display (200 Billion ops/sec); • Connectivity (> 1 Gigabyte/sec); • Access (< 0.25 sec); • Innovation (< 1 day); • Security (> 8 sigma). 6 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 7. Performance (2005) • Infrastructure = > 50% of spending; • Security = ?; • Integration = > 50% of applications; • Network downtime = > 1 hour/year; • Innovation = > 1 year. 7 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 8. Conclusion • Network-Centric systems cannot be built on Workgroup-Centric architecture. 8 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 9. Network-Centric Principles (Google) 1. Build & operate protected information network; 2. Offer universal connectivity for: – Collection, processing and storing of information; – Provide secured communications. 3. Maintain shared data models; 4. Require continued upgrading & innovation. 9
  • 10. Google Principle #1 Build & operate protected information network 10
  • 11. Standard Google Clusters Operate Net • 359 racks • 31,654 machines • 63,184 CPUs • 126,368 Ghz of processing power • 63,184 Gb of RAM • 2,527 Tb of Hard Drive space • Appx. 40 million searches/day 11 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 12. Clusters Have Identical Architecture Index Servers Web Web Switch Servers Document Servers 12
  • 13. Google Cluster Set-Up = Three Days 13
  • 14. Google Infrastructure: Key to Growth (1) • >200,000 custom-built commodity servers; • Acting as one parallel supercomputer; • Fault tolerant hardware. • Storage capacity >5 petabytes; low response latency (0.2 sec); >80GB per server, distributed; • Indexed >8 billion web pages; Indexing is computationally complex (>500M * > 2B matrix) • Capital and operating costs at fraction of large scale commercial servers; traffic growth 20-30%/month; data centers (>12); in US, Europe and Asia. 14 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 15. Google Infrastructure: Key to Growth (2) • >200,000 commodity Linux servers built to custom specifications; distributed cluster architecture; acting as one parallel supercomputer; scaleable; • >50,000 requests/sec; fault tolerant (no single point of failure); diverse hardware; stripped version of Red Hat; • Storage capacity >5 petabytes; • >80GB per server; • Indexed >8 billion web pages; • Indexing is complex (500M x 2B matrix) • Capital and operating costs at fraction of large scale commercial servers; traffic growth 20-30%/month; data centers (>12); in US, Europe and Asia. 15 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 16. Google Infrastructure: Key to Growth (3) • >200,000 commodity Linux servers built to custom specifications; distributed cluster architecture; acting as one parallel supercomputer; scaleable; • >50,000 requests/sec; fault tolerant (no single point of failure); diverse hardware; stripped version of Red Hat; • Storage capacity >5 petabytes; low response latency (0.2 sec); >80GB per server, distributed; • Indexed >8 billion web pages; Indexing is computationally complex (>500M * > 2B matrix) • Capital and operating costs a fraction of commercial servers; • Traffic growth 20-30%/month; • Data centers (>20), in US, Europe and Asia. 16 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 17. Architecture for Reliability • Replication (3x+) for redundancy; • Replication for proximity and response; • Reliability with software and architecture, not with hardware. 17 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 18. Indexing for Response • Dynamic indexing of 8B+ pages; • Dynamic indexing of 1B+ images; • Indexing of 1B+ messages; • Index broken into “shards” and distributed across data centers. 18 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 19. Query Serving Infrastructure • Processing a single query may involve 1000+ servers; • Index Servers access Index Shards; • Document Servers access Doc Shards; • Response times monitored to assure <0.25 sec latency. 19 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 20. Google MapReduce System (1) • Coordinates servers in real-time; • Automates distribution of workload; • Fault tolerance and service reconstitution; • Systems-wide I/O cluster scheduling; • Status and performance monitoring. 20 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 21. Google MapReduce System (2) • Coordination of servers in real-time; • Automates distribution of workload; • Fault tolerance & service reconstitution; • Systems-wide cluster scheduling; • Status and performance monitoring. 21 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 22. Google Principle #2 Universal connectivity 22
  • 23. Multi-Lingual Services 23 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 24. Search in Arabic Media 24 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 25. Video Searches 25 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 26. Google Base - Connecting Diverse Sources Locate events within 45 miles of New York in November, 2005 26 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 27. Semantic Parsing • Tools parse millions of documents; • Automated learning for related information. – Query: “Bay Area Cooking Classes” – Finds: “San Francisco College Classes”; “The Magic of Thai Cuisine” 27 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 28. Google Principle #3 Shared data models 28
  • 29. Data Engineering • Standard file management: The Google File System (GFS); • Standard job scheduling: The Global Work Queue (GWQ); • Standard network management: The Google MapReduce system. 29 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 30. Google File System (GFS) • Replicated Masters manage MetaData directories; • Data transfers directly at the machine level within 2,000+ clusters; • File broken into 64 MB chunks for 2000+ MB/second read/write load; • All file chunks at least triplicate for safety. 30 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 31. Data Dictionary for Interoperability 31 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 32. Application Interfaces 32 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 33. Google Principle #4 Upgrading & Innovation 33
  • 34. Deliver On-Line Services 34 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 35. Shopping Services 35 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 36. Environment for Rapid Innovation 36 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 37. A New Application Launched in 15 Minutes 37 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 38. Occupy the Desktop 38 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 39. Multimedia Services 39 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 40. Workgroup vs. Network Architectures Comparison Summaries 40
  • 41. Workgroup Computing Today: Millions of Local Applications+Local Data 41 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 42. Work-Groups Vulnerable Today 42 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 43. New Internet: Billions of Browsers, Secure Shared Applications+Data Browsers Application Application Application 43 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 44. Workgroup vs. Network Architectures (1) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependency Open source browsers License Software Pay for Use 44 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 45. Workgroup vs. Network Architectures (2) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependency Open source browsers License Software Pay for Use 45 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 46. Workgroup vs. Network Architectures (3) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependency Open source browsers License Software Pay for Use 46 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 47. Workgroup vs. Network Architectures (4) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependency Open source browsers License Software Pay for Use 47 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 48. Workgroup vs. Network Architectures (5) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependent Open source browsers License Software Pay for Use 48 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 49. Workgroup vs. Network Architectures (6) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network License Software Pay for Use 49 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 50. Workgroup vs. Network Architectures (7) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network License Software Pay for Use Data read from files Data assembled in context 50 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 51. The Future Technology • All electronic devices on Internet. • Data, voice, video, sensor inputs accessible. • Phone, TV and print media displaced. Services • Systems respond to questions. • Information is displayed in context. • Applications for decision-making. 51 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 52. Relevance for National Security Systems • Workgroups to Network-Centric services. • Migrate through displacement. • Invest savings in innovation. 52 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY