SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Downloaden Sie, um offline zu lesen
Evolution of the mashup ecosystem
             by copying


    Michael Weiss                       Technology Innovation
                                            Management (TIM)
    Solange Sari
                                          www.carleton.ca/tim
 weiss@sce.carleton.ca   Mashups 2010                           1
Objective

  • Mashups are applications that combine data and
    services provided through APIs with user data
  • New application development model: opportunistic
    programming, uses a bricolage approach
  • Creation of mashups supported by an ecosystem of
    data providers, mashup platforms, and users

  • Research questions
       – How do mashup developers select APIs?
       – How do mashup developers learn to develop mashups?



  weiss@sce.carleton.ca       Mashups 2010                    2
Relevance

 • Users/platforms: can benefit from/offer tools that
   better support the way users work
 • Directory providers: their role is to facilitate the
   selection of APIs and learning of developers
 • Data providers: need to understand which APIs their
   APIs are used together most (interoperability)




  weiss@sce.carleton.ca      Mashups 2010                 3
Previous work

  • Examined structure and growth of mashup
    ecosystem using visualization and network analysis
    to identify members and their relationships
  • Opportunistic programming studies how developers
    use online resources in problem solving
  • Research on innovation: (re)combination shortens
    learning curve, modularity allows mix-and-match
  • Models of network growth: preferential attachment
  • Copying and duplication mechanisms in describing
    the growth of the web and biological networks


  weiss@sce.carleton.ca   Mashups 2010                   4
Hypothesis

                     • As answer to research questions, we examine to
                       what degree developers create mashups by copying
                       other mashups: copy of the mashup “blueprint”
                                       Number of copies/mashup
Not copied
                                                                                                              Snapshot on 08/16/10
                                            Amazon/GoogleMaps/YouTube
                                                                                                              ProgrammableWeb
                               5e-01




                                                                GoogleMaps/Twitter                            Mashups      4983 100%
      Cumulative probability

                               5e-02




                                                                          Flickr/GoogleMaps                   Not copied   1528   31%
                                       Amazon/
                                                                                Flickr
                                                                                                              Blueprints    341      7%
                               5e-03




                                       GoogleMaps
                                                                                                              Copies of    3114   62%
                               GoogleMaps/YouTube
                                                                                              GoogleMaps
                                                                                                              blueprints
                               5e-04




                                                      YouTube
                                       1     5   10             50       100     500   1000

                                                      Number of copies




                                weiss@sce.carleton.ca                                          Mashups 2010                           5
Copying model

 • Mashup ecosystem as network of mashups and APIs:
   a link indicates that a mashup uses an API
 • Assumption: mashups all have m APIs
 • Initialize network:
       – Create m0 ≥ m APIs, one mashup

 • Grow network from t=m0 + 1 to t=N:
       – Add new API with probability p
       – With probability 1-p, choose a mashup as a template
       – For each API in template, copy the API with probability α, or
         choose a new API at random with probability 1-Îą


  weiss@sce.carleton.ca          Mashups 2010                        6
Example

 • Initial network: APIs 1 and 2, mashup 3
 • Thin solid lines indicate random selection



                                  1




                          3

                                                     t   API


                              2                      t   Mashup



  weiss@sce.carleton.ca               Mashups 2010                7
Example

 • Growth: add a new mashup (4)
 • Thick solid lines indicate “copies” relationship
 • Thin dashed lines indicate copying

                                  1




                          3           4    Full copy


                                                       t   API


                              2                        t   Mashup



  weiss@sce.carleton.ca               Mashups 2010                  8
Example

 • Growth: add a new API (5)



                                                            5
                                  1




                          3           4    Full copy


                                                       t   API

                              2                        t   Mashup



  weiss@sce.carleton.ca               Mashups 2010                  9
Example

 • Growth: add a new mashup (6)
 • Thin solid lines indicate random selection

                                                            5
                                  1
                                               6     Partial copy



                          3           4    Full copy


                                                       t   API

                              2                        t   Mashup



  weiss@sce.carleton.ca               Mashups 2010                  10
Research method

 • Calibrate simulation parameters
       – N: combined actual number of APIs and mashups
       – m = 2: good approximation of average actual APIs / mashup
       – p: number of APIs / N (all as of 08/16/10)
 • Simulate mashup ecosystem evolution
       – Vary α over range 0.0 to 1.0, keep m = 2 fixed
       – Run each simulation multiple times and terminate when 95%
         confidence interval is sufficient for the optimization

 • Determine best fit of simulated distribution of
   mashups / API with actual using two fitting methods:
   sum of squared error fit, and power law fit

  weiss@sce.carleton.ca          Mashups 2010                   11
Actual distribution

  • Distribution of mashups / API follows Zipf’s law:
    plotting frequency of mashups relative to rank results
    in a line with slope close to -1 in a log-log plot

      GoogleMaps                                                                             Actual
                                                    Flickr
                                                          Twitter
                                      500




                                            YouTube
                  Number of mashups

                                      100




                                                                                -0.990
                                      50
                                      10
                                      5
                                      1




                                            1   2     5    10   20         50    100   200   500

                                                                    Rank



   weiss@sce.carleton.ca                                                    Mashups 2010              12
Sum of squared error fit
                • Underestimates contribution of top-ranked API
                • Overestimates the number of APIs used by at least
                  one mashup by 45% (1020 vs 703)
                                                                                                                                        Actual

                                Îą = 0.798
                                                                                                                                        Simulated (sum of squared error)
                        1e+07




                                                                                                         500
                        8e+06
 Sum of squared error




                                                                                     Number of mashups

                                                                                                         100
                        6e+06




                                                                                                         50
                        4e+06




                                                                                                         10
                                                                                                         5
                        2e+06




                                                                                                         1




                                  0.2           0.4                0.6     0.8                                 1   2   5   10   20      50     100    200       500

                                                      Copying factor (!)                                                         Rank


                        weiss@sce.carleton.ca                                    Mashups 2010                                                                     13
Power law fit
                      • Slightly overestimates contribution of top API
                      • Overestimates the number of APIs used by at least
                        one mashup by 22% (859 vs 703)
                               2.5




                                                                                                                                                          Actual

                                                                                  Îą = 0.855
                                                                                                                                                          Simulated (power law)
                               2.0




                                                                                                                    500
 Power law coefficient error

                               1.5




                                                                                                Number of mashups

                                                                                                                    100
                                                                                                                    50
                               1.0




                                                                                                                    10
                               0.5




                                                                                                                    5
                               0.0




                                                                                                                    1




                                         0.2           0.4                0.6         0.8                                 1   2   5   10   20      50   100   200      500

                                                             Copying factor (!)                                                             Rank


                               weiss@sce.carleton.ca                                        Mashups 2010                                                                 14
Cumulative contribution of APIs
  • Sum of squared error fit underestimates number of
    APIs that contributed to 50% of API uses
  • Power law fit overestimates number of APIs that
    contributed to 50% of API uses
                          Cumulative contribution

                                                    1.0
                                                    0.8
                                                    0.6
                                                    0.4
                                                    0.2




                                                          1   2   5   10   20      50   100   200   500

                                                                            Rank


  weiss@sce.carleton.ca                                               Mashups 2010                        15
Discussion

  • Both methods obtained their best fit for a high
    copying factor: this suggests that most mashups are
    created by modifying the an existing blueprint
  • Power law fit more closely approximates actual Zipf
    distribution, however, sum of squared error fit offers a
    better match of actual degrees of APIs in midrange




  weiss@sce.carleton.ca     Mashups 2010                  16
Insights for stakeholders

  • Confirmation of practices directories follow:
       – List combinations of APIs into mashups
       – Keep track of developers of mashups
       – Provide tutorials on mashup development
  • Directory providers should make blueprints more
    apparent: also list frequency of blueprints
  • Users benefit as they can look at blueprints to select
    APIs that work well together and as examples
  • API providers learn which other APIs are frequently
    combined with their API: incentive to interoperate


  weiss@sce.carleton.ca         Mashups 2010                 17
Conclusion
  • Results indicate that copying plays a significant role
    in the evolution of the mashup ecosystem
  • However, we cannot rule out other factors that could
    explain how mashup ecosystem grows
  • Copying hypothesis in line with current thinking about
    innovation: eg MacArthur’s Nature of Technology
  • Other current and future work:
       – Extend simulation to include mashups of different size
       – Test copying hypothesis empirically: we currently examine
         hereditary relationships between mashups
       – Examine link between copying and diversity of ecosystem


  weiss@sce.carleton.ca          Mashups 2010                        18

Weitere ähnliche Inhalte

Ähnlich wie Evolution of the mashup ecosystem by copying

“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOpsRui Quintino
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsApache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsAnyscale
 
magellan_mongodb_workload_analysis
magellan_mongodb_workload_analysismagellan_mongodb_workload_analysis
magellan_mongodb_workload_analysisPraveen Narayanan
 
Pitfalls of machine learning in production
Pitfalls of machine learning in productionPitfalls of machine learning in production
Pitfalls of machine learning in productionAntoine Sauray
 
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxImprove ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxNeo4j
 
Asynchronous Hyperparameter Search with Spark on Hopsworks and Maggy
Asynchronous Hyperparameter Search with Spark on Hopsworks and MaggyAsynchronous Hyperparameter Search with Spark on Hopsworks and Maggy
Asynchronous Hyperparameter Search with Spark on Hopsworks and MaggyJim Dowling
 
Using Apache Spark with IBM SPSS Modeler
Using Apache Spark with IBM SPSS ModelerUsing Apache Spark with IBM SPSS Modeler
Using Apache Spark with IBM SPSS ModelerGlobal Knowledge Training
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...Bill Liu
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScienceStepan Pushkarev
 
Real-time Machine Learning Analytics Using Structured Streaming and Kinesis F...
Real-time Machine Learning Analytics Using Structured Streaming and Kinesis F...Real-time Machine Learning Analytics Using Structured Streaming and Kinesis F...
Real-time Machine Learning Analytics Using Structured Streaming and Kinesis F...Databricks
 
A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...butest
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-Systeminside-BigData.com
 
Asynchronous Hyperparameter Optimization with Apache Spark
Asynchronous Hyperparameter Optimization with Apache SparkAsynchronous Hyperparameter Optimization with Apache Spark
Asynchronous Hyperparameter Optimization with Apache SparkDatabricks
 
Scipy, numpy and friends
Scipy, numpy and friendsScipy, numpy and friends
Scipy, numpy and friendsMichele Mattioni
 
Scalable machine learning
Scalable machine learningScalable machine learning
Scalable machine learningArnaud Rachez
 
ODSC West 2022 – Kitbashing in ML
ODSC West 2022 – Kitbashing in MLODSC West 2022 – Kitbashing in ML
ODSC West 2022 – Kitbashing in MLBryan Bischof
 
Ml ops intro session
Ml ops   intro sessionMl ops   intro session
Ml ops intro sessionAvinash Patil
 

Ähnlich wie Evolution of the mashup ecosystem by copying (20)

“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsApache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
 
magellan_mongodb_workload_analysis
magellan_mongodb_workload_analysismagellan_mongodb_workload_analysis
magellan_mongodb_workload_analysis
 
Pitfalls of machine learning in production
Pitfalls of machine learning in productionPitfalls of machine learning in production
Pitfalls of machine learning in production
 
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxImprove ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
 
Asynchronous Hyperparameter Search with Spark on Hopsworks and Maggy
Asynchronous Hyperparameter Search with Spark on Hopsworks and MaggyAsynchronous Hyperparameter Search with Spark on Hopsworks and Maggy
Asynchronous Hyperparameter Search with Spark on Hopsworks and Maggy
 
Using Apache Spark with IBM SPSS Modeler
Using Apache Spark with IBM SPSS ModelerUsing Apache Spark with IBM SPSS Modeler
Using Apache Spark with IBM SPSS Modeler
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScience
 
Real-time Machine Learning Analytics Using Structured Streaming and Kinesis F...
Real-time Machine Learning Analytics Using Structured Streaming and Kinesis F...Real-time Machine Learning Analytics Using Structured Streaming and Kinesis F...
Real-time Machine Learning Analytics Using Structured Streaming and Kinesis F...
 
Model checking
Model checkingModel checking
Model checking
 
A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-System
 
Asynchronous Hyperparameter Optimization with Apache Spark
Asynchronous Hyperparameter Optimization with Apache SparkAsynchronous Hyperparameter Optimization with Apache Spark
Asynchronous Hyperparameter Optimization with Apache Spark
 
Developing apps faster
Developing apps fasterDeveloping apps faster
Developing apps faster
 
Recommendation engine
Recommendation engineRecommendation engine
Recommendation engine
 
Scipy, numpy and friends
Scipy, numpy and friendsScipy, numpy and friends
Scipy, numpy and friends
 
Scalable machine learning
Scalable machine learningScalable machine learning
Scalable machine learning
 
ODSC West 2022 – Kitbashing in ML
ODSC West 2022 – Kitbashing in MLODSC West 2022 – Kitbashing in ML
ODSC West 2022 – Kitbashing in ML
 
Ml ops intro session
Ml ops   intro sessionMl ops   intro session
Ml ops intro session
 

KĂźrzlich hochgeladen

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 

KĂźrzlich hochgeladen (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

Evolution of the mashup ecosystem by copying

  • 1. Evolution of the mashup ecosystem by copying Michael Weiss Technology Innovation Management (TIM) Solange Sari www.carleton.ca/tim weiss@sce.carleton.ca Mashups 2010 1
  • 2. Objective • Mashups are applications that combine data and services provided through APIs with user data • New application development model: opportunistic programming, uses a bricolage approach • Creation of mashups supported by an ecosystem of data providers, mashup platforms, and users • Research questions – How do mashup developers select APIs? – How do mashup developers learn to develop mashups? weiss@sce.carleton.ca Mashups 2010 2
  • 3. Relevance • Users/platforms: can benefit from/offer tools that better support the way users work • Directory providers: their role is to facilitate the selection of APIs and learning of developers • Data providers: need to understand which APIs their APIs are used together most (interoperability) weiss@sce.carleton.ca Mashups 2010 3
  • 4. Previous work • Examined structure and growth of mashup ecosystem using visualization and network analysis to identify members and their relationships • Opportunistic programming studies how developers use online resources in problem solving • Research on innovation: (re)combination shortens learning curve, modularity allows mix-and-match • Models of network growth: preferential attachment • Copying and duplication mechanisms in describing the growth of the web and biological networks weiss@sce.carleton.ca Mashups 2010 4
  • 5. Hypothesis • As answer to research questions, we examine to what degree developers create mashups by copying other mashups: copy of the mashup “blueprint” Number of copies/mashup Not copied Snapshot on 08/16/10 Amazon/GoogleMaps/YouTube ProgrammableWeb 5e-01 GoogleMaps/Twitter Mashups 4983 100% Cumulative probability 5e-02 Flickr/GoogleMaps Not copied 1528 31% Amazon/ Flickr Blueprints 341 7% 5e-03 GoogleMaps Copies of 3114 62% GoogleMaps/YouTube GoogleMaps blueprints 5e-04 YouTube 1 5 10 50 100 500 1000 Number of copies weiss@sce.carleton.ca Mashups 2010 5
  • 6. Copying model • Mashup ecosystem as network of mashups and APIs: a link indicates that a mashup uses an API • Assumption: mashups all have m APIs • Initialize network: – Create m0 ≥ m APIs, one mashup • Grow network from t=m0 + 1 to t=N: – Add new API with probability p – With probability 1-p, choose a mashup as a template – For each API in template, copy the API with probability Îą, or choose a new API at random with probability 1-Îą weiss@sce.carleton.ca Mashups 2010 6
  • 7. Example • Initial network: APIs 1 and 2, mashup 3 • Thin solid lines indicate random selection 1 3 t API 2 t Mashup weiss@sce.carleton.ca Mashups 2010 7
  • 8. Example • Growth: add a new mashup (4) • Thick solid lines indicate “copies” relationship • Thin dashed lines indicate copying 1 3 4 Full copy t API 2 t Mashup weiss@sce.carleton.ca Mashups 2010 8
  • 9. Example • Growth: add a new API (5) 5 1 3 4 Full copy t API 2 t Mashup weiss@sce.carleton.ca Mashups 2010 9
  • 10. Example • Growth: add a new mashup (6) • Thin solid lines indicate random selection 5 1 6 Partial copy 3 4 Full copy t API 2 t Mashup weiss@sce.carleton.ca Mashups 2010 10
  • 11. Research method • Calibrate simulation parameters – N: combined actual number of APIs and mashups – m = 2: good approximation of average actual APIs / mashup – p: number of APIs / N (all as of 08/16/10) • Simulate mashup ecosystem evolution – Vary Îą over range 0.0 to 1.0, keep m = 2 fixed – Run each simulation multiple times and terminate when 95% confidence interval is sufficient for the optimization • Determine best fit of simulated distribution of mashups / API with actual using two fitting methods: sum of squared error fit, and power law fit weiss@sce.carleton.ca Mashups 2010 11
  • 12. Actual distribution • Distribution of mashups / API follows Zipf’s law: plotting frequency of mashups relative to rank results in a line with slope close to -1 in a log-log plot GoogleMaps Actual Flickr Twitter 500 YouTube Number of mashups 100 -0.990 50 10 5 1 1 2 5 10 20 50 100 200 500 Rank weiss@sce.carleton.ca Mashups 2010 12
  • 13. Sum of squared error fit • Underestimates contribution of top-ranked API • Overestimates the number of APIs used by at least one mashup by 45% (1020 vs 703) Actual Îą = 0.798 Simulated (sum of squared error) 1e+07 500 8e+06 Sum of squared error Number of mashups 100 6e+06 50 4e+06 10 5 2e+06 1 0.2 0.4 0.6 0.8 1 2 5 10 20 50 100 200 500 Copying factor (!) Rank weiss@sce.carleton.ca Mashups 2010 13
  • 14. Power law fit • Slightly overestimates contribution of top API • Overestimates the number of APIs used by at least one mashup by 22% (859 vs 703) 2.5 Actual Îą = 0.855 Simulated (power law) 2.0 500 Power law coefficient error 1.5 Number of mashups 100 50 1.0 10 0.5 5 0.0 1 0.2 0.4 0.6 0.8 1 2 5 10 20 50 100 200 500 Copying factor (!) Rank weiss@sce.carleton.ca Mashups 2010 14
  • 15. Cumulative contribution of APIs • Sum of squared error fit underestimates number of APIs that contributed to 50% of API uses • Power law fit overestimates number of APIs that contributed to 50% of API uses Cumulative contribution 1.0 0.8 0.6 0.4 0.2 1 2 5 10 20 50 100 200 500 Rank weiss@sce.carleton.ca Mashups 2010 15
  • 16. Discussion • Both methods obtained their best fit for a high copying factor: this suggests that most mashups are created by modifying the an existing blueprint • Power law fit more closely approximates actual Zipf distribution, however, sum of squared error fit offers a better match of actual degrees of APIs in midrange weiss@sce.carleton.ca Mashups 2010 16
  • 17. Insights for stakeholders • Confirmation of practices directories follow: – List combinations of APIs into mashups – Keep track of developers of mashups – Provide tutorials on mashup development • Directory providers should make blueprints more apparent: also list frequency of blueprints • Users benefit as they can look at blueprints to select APIs that work well together and as examples • API providers learn which other APIs are frequently combined with their API: incentive to interoperate weiss@sce.carleton.ca Mashups 2010 17
  • 18. Conclusion • Results indicate that copying plays a significant role in the evolution of the mashup ecosystem • However, we cannot rule out other factors that could explain how mashup ecosystem grows • Copying hypothesis in line with current thinking about innovation: eg MacArthur’s Nature of Technology • Other current and future work: – Extend simulation to include mashups of different size – Test copying hypothesis empirically: we currently examine hereditary relationships between mashups – Examine link between copying and diversity of ecosystem weiss@sce.carleton.ca Mashups 2010 18