SlideShare ist ein Scribd-Unternehmen logo
1 von 56
Downloaden Sie, um offline zu lesen
From Galapagos to Twitter:
Darwin, Natural Selection,
and Web 2.0


Xavier Llorà

National Center for Supercomputing Applications
University of Illinois at Urbana-Champaign
Urbana, Illinois, 61801

xllora@ncsa.illinois.edu
http://www.ncsa.illinois.edu/~xllora
What is this talk about?

• Simple answer:
  Darwin’s natural selection concept and information technologies

• Natural selection?
       Natural selection is the process by which heritable traits that
       make it more likely for an organism to survive and successfully
     reproduce become more common in a population over successive
                                 generations.

• Remember, back on Darwin’s time genetics were unknown
Darwin on HMS Beagle
A quick Darwin background check

• In Edinburgh Charles Darwin
   • Learnt standard geology (Robert Jameson)
   • Exposed to British catastrophism of 1820s
   • Run into Lamarckian ideas with Robert Grant
• Cambridge provide him a chance to
   • Under John Henslow, explored boundaries between varieties and
     species
   • Read and accepted traditional theological explanations of the world
• Evolution existed before and after Darwin
• HMS Beagle
   • Left Plymouth 2pm on December 27, 1831
   • Returned October 2, 1836
HMS Beagle




             http://www.solarnavigator.net/history/hms_beagle.htm
And Galapagos happened
Massive diversity




              http://www.flickr.com/photos/ole/1040941928/
A key realization

• Knowledge on the boundaries of variation/species
• The facts
   •   Large number of islands
   •   Islands are spatially separated
   •   Wide diversities of species (different sources of food)
   •   Very close species (beyond variation)
• Start accumulating descriptions of specimens
http://commons.wikimedia.org/wiki/File:Darwin's_finches_by_Gould.jpg
The basic building blocks

• Large variation of features
• Highly correlated to food sources
• The basic sketch of a crazy thought:
   •   Assume you start with a diverse population of As
   •   Split As and place them in different environments
   •   Individual ai features may or may not help live in the environment
   •   If individual ai features help in the environment ai as a result has
       increased chances to reproduce
• The underlying distilled idea of natural selection
   • Variation will arise in a population
   • Selection by environment adaptation
• The uncanny idea:Variation can be random
A simple abstraction



         Recombination agent




                               Selection agent
[Fast forward] ... and genomics arise
DNA and basics of reproduction

• Watson & Crick (1953)
• Four bases:
  Adenine (A), cytosine (C), guanine (G), thymine (T)
• Groups in to base pairs: A-T, C-G
• Reproduction of DNA (crossover and mutation)




                    http://en.wikipedia.org/wiki/DNA#cite_note-FWPUB-6
[Fast forward] ... and computers happen
A quick stop in the late 60s

•   Computers were the next cool thing
•   Genetics were on the rise
•   A professor at University of Michigan
•   Natural evolution
    • Diverse population of individual
    • Individual ai features may or may not help
      live in the environment
    • If individual ai features help in the
      environment ai as a result has increased
      chances to reproduce
• Artificial evolution?
    • Model of such evolutionary process
                                                                                       Prof. John H. Holland
    • Use it to solve some real problem

             Image from http://www.slideshare.net/deg511/not-your-grandmothers-genetic-algorithm
What is a genetic algorithm?	


• ANAS published in 1975
• Chromosomes encode a solution
  to a problem
• Evaluation of fitness based on the
  quality solving the problem
• The process
   •   Survival of the fitness
   •   Mutation
   •   Crossover
   •   Repeat until good enough
A simple example

•   Imagine you have a box with an array of switches
•   0 off, 1on
•   Depending on the switch arrangement you get some payoff
•   Payoff is equivalent to the number of switches that are on
•   The goal: Maximize the payoff
Encoding the problem

• Solutions are a string of 0s and 1s.
• We need a population



    01010101
    10101101
    11111000
    00011111
    01111100
       ...
    10101010
Evaluate solutions performance

• Solutions represent a solution to a particular environment
• Solutions get measured based on their performance



    01010101                01010101 : 4
    10101101                10101101 : 5
    11111000                11111000 : 5
    00011111
                Evaluate    00001111 : 4
    01111100                01111100 : 5
       ...                       ...
    10101010                10100010 : 3
Selection based on performance

• Proportional to their performance
• Proportionate roulette wheel selection



    01010101               01010101 : 4             00011111 : 5
    10101101               10101101 : 5             11111000 : 5
    11111000               11111000 : 5             11111000 : 5
    00011111
                Evaluate   00001111 : 4
                                           Select   01111100 : 5
    01111100               01111100 : 5             10100010 : 3
       ...                      ...                      ...
    10101010               10100010 : 3             01010101 : 4
Reproduction

• Mutation introduce random   • Crossover recombines
  changes in the solution       parent solutions


         10101101                10101101     00011111


             Mutation                                  Crossover

         11101101                10101111     00011101
A nagging question

• In early 80s by David E. Goldberg
• How do uninteresting operators yield interesting behavior?
• 1983 proposed the innovation intuition:
  “Genetic algorithm power like that of human innovation”
• The underlying elements:
   • Selection + mutation as hillclimbing or kaizen
   • Selection + crossover = innovation
• In another words
   • Introducing small changes and keeping them if the improve performance
   • [...] Takes two to invent anything. One makes up combinations; the other
     one chooses [...] (Paul Valéry)
Remember the simple abstraction



        Recombination agent




                              Selection agent
Goldberg’s intuition

• Interesting behavior (emergence) is a balance
• Two components
   • Takeover time (t*)
   • Time to innovate (ti)
Goldberg’s intuition

• A design theory for scalable and efficient GAs
   •   Understanding building blocks (concepts or ideas)
   •   Ensure building block supply
   •   Ensure building block growth
   •   Control building block speed
   •   Ensure good building block decisions
   •   Ensure good building block mixing
   •   Know building block challengers
An interesting social insight

• GAs used for optimization, search, machine learning, ...
• But Valéry’s quote also implied a social aspect to innovation
   • Solutions may be created in the vacuum, randomly, or carefully crafted
   • There is a social element of acceptance/rejection
• Paul Valéry (1871-1945) lived after Darwin published his ideas
[Fast forward] ... The 2.0 information age
Adding social to genetic algorithms



                           computational
     Recombination agent

                                               Standard            Interactive
                                           Genetic Algorithms   Genetic Algorithms
                           human




                                            Computer Aided        Human-Based
                                             Design (CAD)       Genetic Algorithms



                                            computational               human
                                                      Selection agent

                                                                                     (Kosorukoff & Goldberg, 2002)
Social aspects of genetic algorithms
• Population-based method
• Parallel genetic algorithms
    • Groups of working units
    • Communication between groups for a common goal
    • Models for collaborative work
• Social network of interactions




                                      29
The DISCUS project

• IlliGAL & NCSA collaboration started in 2003
• Computers have become mediators of collaborations
   • Email, chat rooms, blogs, wikis…
   • A flood of available information
   • Different modes of communication
• Let’s take advantage of such information
   •   Logs of conversations
   •   Archive of documents (email attachments, blogs, personal web pages…)
   •   Human-computer interactions
   •   Social aspect of the communication and collaboration




                                     30
The question

• Could we help build a system using genetic algorithms/natural
  selection ideas that could foster innovation and creativity?
• The era of available Web 2.0 tools
   • Web portals
   • Collaboration and communication tools (message boards, chats, blogs,
     Twitter…)
   • Genetic algorithm (Interactive, Human-based, and Parallel GAs)
   • Chance discovery (KeyGraphs and Influence Diffusion Models)
   • Data and text mining
   • Information retrieval
   • Web services
   • Data-intensive computing



                                    31
In a picture
                       Analysis


                                            Data mining




     Social networks




                                                          Content




                                  Knowledge management



    Social aspects


                                                                    (Llorà et al., 2004)
A simple exercise

• Imagine you are chatting with your friends
• Your ideas are bundled in messages
   • You pass them your your friends
   • They read them, think about them
   • Your friends reply to you
• Could we model it using natural selection?
   • Valéry suggest so
   • Goldberg proposed a simple model using the human-based GA analogy
• If so, Goldberg’s methodology should apply
People interacting online
Concepts as building blocks




                              (Llorà et al., 2004)
Things you can measure: Concept diffusion




                                     (Llorà et al., 2005)
Social acceptance and roles




                              (Yasui et al., 2006)
Everything together




                      (Saruwatari et al., 2008)
Everything together




                      (Saruwatari et al., 2008)
In action




            (Saruwatari et al., 2008)
Other possible usages

•   Tracking influence on social media (e.g. Twitter)
•   User preference modeling
•   Topic overlap
•   Topic/trend dynamics (how conversation is changing on the fly)
•   Blogosphere and company message misalignment
•   ...
... and beyond
SW development and Darwin’s ideas?

• Data deluge, processing large volumes of data starting to
  become the norm, not the exception
• Another nagging question back on 2007
• Could we use the basic design principles by Goldberg to build
  software?
• A subset of those could have the answer
   •   Understanding building blocks (concepts or ideas)
   •   Ensure building block supply
   •   Ensure building block growth
   •   Ensure good building block mixing
• Also
   • Exploiting parallelism efficiently is becoming a more pressing issue
   • SW is a social activity
A little thinking

• Understanding building blocks (concepts or ideas)
   • Basic units of data processing
   • Building blocks should be easy to group and swap
• Ensure building block supply
   • Create a basic set of building block
• Ensure building block growth
   • Provide a set of tools for developers to add more building blocks
• Ensure good building block mixing
   • Facilitate the reproduction of solutions (programs)
   • Provide means to make those recombination possible
Modeling Meandre

• The basic outline
   •   Data-flow execution paradigm
   •   Semantic-web driven
   •   Web oriented
   •   Facilitate distributed computing
   •   Support publishing services
   •   Promote reuse, sharing, and collaboration
• Fall 2007 Meandre was adopted as the data-intensive platform
  for a collection of projects
• More information at http://seasr.org/meandre
Data flow execution in Meandre

• A simple example c ← a+b
• A traditional control-driven language

        a = 1
        b = 2
        c = a+b


• Execution following the sequence of instructions
• One step at a time
      • a+b+c+d requires 3 steps
      • Could be easily parallelized
Data flow execution in Meandre

• Data flow execution is driven by data
• The previous example may have 2 possible data flow versions

  Stateless data flow

    value
                               +                     value
    value


  State-based data flow

    value       value          +                     value
                                   ?
The basic building blocks: Components




                        Component




      RDF descriptor of the         The component
      components behavior           implementation
Go with the flow: Creating complex tasks

• Directed multigraph of components creates a flow



  Push
  Text
                 Concatenate          To Upper      Print
                    Text              Case Text     Text
  Push
  Text
Automatic parallelization:
Speed and robustness
• Meandre ZigZag language allow automatic parallelization

                                      To Upper
                                      Case Text
 Push
 Text                                 To Upper
               Concatenate            Case Text             Print
                  Text                                      Text
 Push
 Text

                                       To Upper
                                       Case Text
Visual building block recombination
Where is natural selection?

•   More DISCUS-like evolutionary modeling
•   Variations (components) in a population of solutions (flows)
•   Selection by usage/adoption
•   Recombination of flow available (encapsulation)
The take away message
Basic food for thought

• The underlying distilled idea of natural selection
   • Variation will arise in a population
   • Selection by environment adaptation
• Powerful modeling tool
   •   Problem solving (optimization and search)
   •   Machine learning
   •   Human social interaction modeling
   •   Defining new ways to explore computer programming
   •   ...
• Don’t be afraid to try and explore
More information

• Illinois Genetic Algorithms Laboratory
   • http://www.illigal.uiuc.edu/web
• The DISCUS project
   • http://www.i-discus.org
• The Meandre project
   • http://seasr.org/meandre
   • http://seasr.org
• National Center for Supercomputing Applications
   • http://www.ncsa.illinois.edu
• Homepage
   • http://www.xavierllora.net
• Some related Twitter users
   • People: @xllora, @deg511
   • Labs and projects: @illigal, @projectmeandre, @seasrproject
From Galapagos to Twitter:
Darwin, Natural Selection,
and Web 2.0


Xavier Llorà

National Center for Supercomputing Applications
University of Illinois at Urbana-Champaign
Urbana, Illinois, 61801

xllora@ncsa.illinois.edu
http://www.ncsa.illinois.edu/~xllora

Weitere ähnliche Inhalte

Ähnlich wie From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0

How2research
How2researchHow2research
How2researchkewuc
 
Future Things/Robotic Products
Future Things/Robotic ProductsFuture Things/Robotic Products
Future Things/Robotic ProductsSusie Herbstritt
 
Genetic algorithm raktim
Genetic algorithm raktimGenetic algorithm raktim
Genetic algorithm raktimRaktim Halder
 
Genetic algorithm_raktim_IITKGP
Genetic algorithm_raktim_IITKGP Genetic algorithm_raktim_IITKGP
Genetic algorithm_raktim_IITKGP Raktim Halder
 
2012 My Top 10 Tools
2012 My Top 10 Tools2012 My Top 10 Tools
2012 My Top 10 ToolsHowie DiBlasi
 
Tale of the Knowledge Organization In an Age of Wicked Problems
Tale of the Knowledge Organization In an Age of Wicked ProblemsTale of the Knowledge Organization In an Age of Wicked Problems
Tale of the Knowledge Organization In an Age of Wicked ProblemsGomindSHIFT
 
Search, citation and plagiarism: skills for a digital age have to be taught!
Search, citation and plagiarism: skills for a digital age have to be taught!Search, citation and plagiarism: skills for a digital age have to be taught!
Search, citation and plagiarism: skills for a digital age have to be taught!CIT, NUS
 
Researching people: using questionnaires and interviews
Researching people: using questionnaires and interviewsResearching people: using questionnaires and interviews
Researching people: using questionnaires and interviewsJenna Condie
 
Innovation and Creativity in the Workplace
Innovation and Creativity in the Workplace Innovation and Creativity in the Workplace
Innovation and Creativity in the Workplace Michael McEachern
 
Stc tc open_documentation
Stc tc open_documentationStc tc open_documentation
Stc tc open_documentationJanet Swisher
 
Jeff Borden: Education 3.0: Learn Better
Jeff Borden: Education 3.0: Learn BetterJeff Borden: Education 3.0: Learn Better
Jeff Borden: Education 3.0: Learn BetterPearson North America
 
We've Got Issues: Issue Tracking and Workflow in the Digital Library
We've Got Issues: Issue Tracking and Workflow in the Digital LibraryWe've Got Issues: Issue Tracking and Workflow in the Digital Library
We've Got Issues: Issue Tracking and Workflow in the Digital LibraryElectronic Resources & Libraries
 
Sensemaker for Partos Plaza - Irene Guyt
Sensemaker for Partos Plaza  - Irene GuytSensemaker for Partos Plaza  - Irene Guyt
Sensemaker for Partos Plaza - Irene Guytannepartos
 
Michael Bolton - Testing Through The Qualitive Lens - EuroSTAR 2012
Michael Bolton - Testing Through The Qualitive Lens - EuroSTAR 2012Michael Bolton - Testing Through The Qualitive Lens - EuroSTAR 2012
Michael Bolton - Testing Through The Qualitive Lens - EuroSTAR 2012TEST Huddle
 
Infinithree: Beyond the Wiki Encyclopedia
Infinithree: Beyond the Wiki EncyclopediaInfinithree: Beyond the Wiki Encyclopedia
Infinithree: Beyond the Wiki Encyclopediagojomo
 
Introduction to genetic algorithms
Introduction to genetic algorithmsIntroduction to genetic algorithms
Introduction to genetic algorithmsshadanalam
 
How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19
How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19
How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19jodischneider
 
From documents to datasets -- mining the Junius Henderson Field Notes for spe...
From documents to datasets -- mining the Junius Henderson Field Notes for spe...From documents to datasets -- mining the Junius Henderson Field Notes for spe...
From documents to datasets -- mining the Junius Henderson Field Notes for spe...andrea thomer
 
The Allure of Gothic Markup
The Allure of Gothic MarkupThe Allure of Gothic Markup
The Allure of Gothic MarkupSimon St.Laurent
 

Ähnlich wie From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0 (20)

How2research
How2researchHow2research
How2research
 
Future Things/Robotic Products
Future Things/Robotic ProductsFuture Things/Robotic Products
Future Things/Robotic Products
 
Genetic algorithm raktim
Genetic algorithm raktimGenetic algorithm raktim
Genetic algorithm raktim
 
Genetic algorithm_raktim_IITKGP
Genetic algorithm_raktim_IITKGP Genetic algorithm_raktim_IITKGP
Genetic algorithm_raktim_IITKGP
 
2012 My Top 10 Tools
2012 My Top 10 Tools2012 My Top 10 Tools
2012 My Top 10 Tools
 
Tale of the Knowledge Organization In an Age of Wicked Problems
Tale of the Knowledge Organization In an Age of Wicked ProblemsTale of the Knowledge Organization In an Age of Wicked Problems
Tale of the Knowledge Organization In an Age of Wicked Problems
 
Search, citation and plagiarism: skills for a digital age have to be taught!
Search, citation and plagiarism: skills for a digital age have to be taught!Search, citation and plagiarism: skills for a digital age have to be taught!
Search, citation and plagiarism: skills for a digital age have to be taught!
 
Researching people: using questionnaires and interviews
Researching people: using questionnaires and interviewsResearching people: using questionnaires and interviews
Researching people: using questionnaires and interviews
 
Innovation and Creativity in the Workplace
Innovation and Creativity in the Workplace Innovation and Creativity in the Workplace
Innovation and Creativity in the Workplace
 
Stc tc open_documentation
Stc tc open_documentationStc tc open_documentation
Stc tc open_documentation
 
Jeff Borden: Education 3.0: Learn Better
Jeff Borden: Education 3.0: Learn BetterJeff Borden: Education 3.0: Learn Better
Jeff Borden: Education 3.0: Learn Better
 
We've Got Issues: Issue Tracking and Workflow in the Digital Library
We've Got Issues: Issue Tracking and Workflow in the Digital LibraryWe've Got Issues: Issue Tracking and Workflow in the Digital Library
We've Got Issues: Issue Tracking and Workflow in the Digital Library
 
Sensemaker for Partos Plaza - Irene Guyt
Sensemaker for Partos Plaza  - Irene GuytSensemaker for Partos Plaza  - Irene Guyt
Sensemaker for Partos Plaza - Irene Guyt
 
Michael Bolton - Testing Through The Qualitive Lens - EuroSTAR 2012
Michael Bolton - Testing Through The Qualitive Lens - EuroSTAR 2012Michael Bolton - Testing Through The Qualitive Lens - EuroSTAR 2012
Michael Bolton - Testing Through The Qualitive Lens - EuroSTAR 2012
 
Infinithree: Beyond the Wiki Encyclopedia
Infinithree: Beyond the Wiki EncyclopediaInfinithree: Beyond the Wiki Encyclopedia
Infinithree: Beyond the Wiki Encyclopedia
 
Introduction to genetic algorithms
Introduction to genetic algorithmsIntroduction to genetic algorithms
Introduction to genetic algorithms
 
How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19
How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19
How communities curate knowledge & how ontologists can help -Eurecom--2015-01-19
 
From documents to datasets -- mining the Junius Henderson Field Notes for spe...
From documents to datasets -- mining the Junius Henderson Field Notes for spe...From documents to datasets -- mining the Junius Henderson Field Notes for spe...
From documents to datasets -- mining the Junius Henderson Field Notes for spe...
 
Swarm intelligence
Swarm intelligenceSwarm intelligence
Swarm intelligence
 
The Allure of Gothic Markup
The Allure of Gothic MarkupThe Allure of Gothic Markup
The Allure of Gothic Markup
 

Mehr von Xavier Llorà

Meandre 2.0 Alpha Preview
Meandre 2.0 Alpha PreviewMeandre 2.0 Alpha Preview
Meandre 2.0 Alpha PreviewXavier Llorà
 
Soaring the Clouds with Meandre
Soaring the Clouds with MeandreSoaring the Clouds with Meandre
Soaring the Clouds with MeandreXavier Llorà
 
Large Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using   Genetics-Based Machine LearningLarge Scale Data Mining using   Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine LearningXavier Llorà
 
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...
Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...Xavier Llorà
 
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new TrendsScalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new TrendsXavier Llorà
 
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...Xavier Llorà
 
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...
Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...Xavier Llorà
 
Learning Classifier Systems for Class Imbalance Problems
Learning Classifier Systems  for Class Imbalance  ProblemsLearning Classifier Systems  for Class Imbalance  Problems
Learning Classifier Systems for Class Imbalance ProblemsXavier Llorà
 
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...
A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...Xavier Llorà
 
XCS: Current capabilities and future challenges
XCS: Current capabilities and future  challengesXCS: Current capabilities and future  challenges
XCS: Current capabilities and future challengesXavier Llorà
 
Negative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly DetectionNegative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly DetectionXavier Llorà
 
Searle, Intentionality, and the Future of Classifier Systems
Searle, Intentionality, and the  Future of Classifier SystemsSearle, Intentionality, and the  Future of Classifier Systems
Searle, Intentionality, and the Future of Classifier SystemsXavier Llorà
 
Computed Prediction: So far, so good. What now?
Computed Prediction:  So far, so good. What now?Computed Prediction:  So far, so good. What now?
Computed Prediction: So far, so good. What now?Xavier Llorà
 
Linkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems TractableLinkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems TractableXavier Llorà
 
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsMeandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsXavier Llorà
 
ZigZag: The Meandring Language
ZigZag: The Meandring LanguageZigZag: The Meandring Language
ZigZag: The Meandring LanguageXavier Llorà
 
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...Xavier Llorà
 
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...Xavier Llorà
 
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...Xavier Llorà
 

Mehr von Xavier Llorà (20)

Meandre 2.0 Alpha Preview
Meandre 2.0 Alpha PreviewMeandre 2.0 Alpha Preview
Meandre 2.0 Alpha Preview
 
Soaring the Clouds with Meandre
Soaring the Clouds with MeandreSoaring the Clouds with Meandre
Soaring the Clouds with Meandre
 
Large Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using   Genetics-Based Machine LearningLarge Scale Data Mining using   Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine Learning
 
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...
Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...
 
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new TrendsScalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
 
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...
 
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...
Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...
 
Learning Classifier Systems for Class Imbalance Problems
Learning Classifier Systems  for Class Imbalance  ProblemsLearning Classifier Systems  for Class Imbalance  Problems
Learning Classifier Systems for Class Imbalance Problems
 
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...
A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...
 
XCS: Current capabilities and future challenges
XCS: Current capabilities and future  challengesXCS: Current capabilities and future  challenges
XCS: Current capabilities and future challenges
 
Negative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly DetectionNegative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly Detection
 
Searle, Intentionality, and the Future of Classifier Systems
Searle, Intentionality, and the  Future of Classifier SystemsSearle, Intentionality, and the  Future of Classifier Systems
Searle, Intentionality, and the Future of Classifier Systems
 
Computed Prediction: So far, so good. What now?
Computed Prediction:  So far, so good. What now?Computed Prediction:  So far, so good. What now?
Computed Prediction: So far, so good. What now?
 
NIGEL 2006 welcome
NIGEL 2006 welcomeNIGEL 2006 welcome
NIGEL 2006 welcome
 
Linkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems TractableLinkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems Tractable
 
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsMeandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
 
ZigZag: The Meandring Language
ZigZag: The Meandring LanguageZigZag: The Meandring Language
ZigZag: The Meandring Language
 
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
 
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
 
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
 

Kürzlich hochgeladen

(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 

Kürzlich hochgeladen (20)

(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 

From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0

  • 1. From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0 Xavier Llorà National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Urbana, Illinois, 61801 xllora@ncsa.illinois.edu http://www.ncsa.illinois.edu/~xllora
  • 2. What is this talk about? • Simple answer: Darwin’s natural selection concept and information technologies • Natural selection? Natural selection is the process by which heritable traits that make it more likely for an organism to survive and successfully reproduce become more common in a population over successive generations. • Remember, back on Darwin’s time genetics were unknown
  • 3. Darwin on HMS Beagle
  • 4. A quick Darwin background check • In Edinburgh Charles Darwin • Learnt standard geology (Robert Jameson) • Exposed to British catastrophism of 1820s • Run into Lamarckian ideas with Robert Grant • Cambridge provide him a chance to • Under John Henslow, explored boundaries between varieties and species • Read and accepted traditional theological explanations of the world • Evolution existed before and after Darwin • HMS Beagle • Left Plymouth 2pm on December 27, 1831 • Returned October 2, 1836
  • 5. HMS Beagle http://www.solarnavigator.net/history/hms_beagle.htm
  • 7. Massive diversity http://www.flickr.com/photos/ole/1040941928/
  • 8. A key realization • Knowledge on the boundaries of variation/species • The facts • Large number of islands • Islands are spatially separated • Wide diversities of species (different sources of food) • Very close species (beyond variation) • Start accumulating descriptions of specimens
  • 10. The basic building blocks • Large variation of features • Highly correlated to food sources • The basic sketch of a crazy thought: • Assume you start with a diverse population of As • Split As and place them in different environments • Individual ai features may or may not help live in the environment • If individual ai features help in the environment ai as a result has increased chances to reproduce • The underlying distilled idea of natural selection • Variation will arise in a population • Selection by environment adaptation • The uncanny idea:Variation can be random
  • 11. A simple abstraction Recombination agent Selection agent
  • 12. [Fast forward] ... and genomics arise
  • 13. DNA and basics of reproduction • Watson & Crick (1953) • Four bases: Adenine (A), cytosine (C), guanine (G), thymine (T) • Groups in to base pairs: A-T, C-G • Reproduction of DNA (crossover and mutation) http://en.wikipedia.org/wiki/DNA#cite_note-FWPUB-6
  • 14. [Fast forward] ... and computers happen
  • 15. A quick stop in the late 60s • Computers were the next cool thing • Genetics were on the rise • A professor at University of Michigan • Natural evolution • Diverse population of individual • Individual ai features may or may not help live in the environment • If individual ai features help in the environment ai as a result has increased chances to reproduce • Artificial evolution? • Model of such evolutionary process Prof. John H. Holland • Use it to solve some real problem Image from http://www.slideshare.net/deg511/not-your-grandmothers-genetic-algorithm
  • 16. What is a genetic algorithm? • ANAS published in 1975 • Chromosomes encode a solution to a problem • Evaluation of fitness based on the quality solving the problem • The process • Survival of the fitness • Mutation • Crossover • Repeat until good enough
  • 17. A simple example • Imagine you have a box with an array of switches • 0 off, 1on • Depending on the switch arrangement you get some payoff • Payoff is equivalent to the number of switches that are on • The goal: Maximize the payoff
  • 18. Encoding the problem • Solutions are a string of 0s and 1s. • We need a population 01010101 10101101 11111000 00011111 01111100 ... 10101010
  • 19. Evaluate solutions performance • Solutions represent a solution to a particular environment • Solutions get measured based on their performance 01010101 01010101 : 4 10101101 10101101 : 5 11111000 11111000 : 5 00011111 Evaluate 00001111 : 4 01111100 01111100 : 5 ... ... 10101010 10100010 : 3
  • 20. Selection based on performance • Proportional to their performance • Proportionate roulette wheel selection 01010101 01010101 : 4 00011111 : 5 10101101 10101101 : 5 11111000 : 5 11111000 11111000 : 5 11111000 : 5 00011111 Evaluate 00001111 : 4 Select 01111100 : 5 01111100 01111100 : 5 10100010 : 3 ... ... ... 10101010 10100010 : 3 01010101 : 4
  • 21. Reproduction • Mutation introduce random • Crossover recombines changes in the solution parent solutions 10101101 10101101 00011111 Mutation Crossover 11101101 10101111 00011101
  • 22. A nagging question • In early 80s by David E. Goldberg • How do uninteresting operators yield interesting behavior? • 1983 proposed the innovation intuition: “Genetic algorithm power like that of human innovation” • The underlying elements: • Selection + mutation as hillclimbing or kaizen • Selection + crossover = innovation • In another words • Introducing small changes and keeping them if the improve performance • [...] Takes two to invent anything. One makes up combinations; the other one chooses [...] (Paul Valéry)
  • 23. Remember the simple abstraction Recombination agent Selection agent
  • 24. Goldberg’s intuition • Interesting behavior (emergence) is a balance • Two components • Takeover time (t*) • Time to innovate (ti)
  • 25. Goldberg’s intuition • A design theory for scalable and efficient GAs • Understanding building blocks (concepts or ideas) • Ensure building block supply • Ensure building block growth • Control building block speed • Ensure good building block decisions • Ensure good building block mixing • Know building block challengers
  • 26. An interesting social insight • GAs used for optimization, search, machine learning, ... • But Valéry’s quote also implied a social aspect to innovation • Solutions may be created in the vacuum, randomly, or carefully crafted • There is a social element of acceptance/rejection • Paul Valéry (1871-1945) lived after Darwin published his ideas
  • 27. [Fast forward] ... The 2.0 information age
  • 28. Adding social to genetic algorithms computational Recombination agent Standard Interactive Genetic Algorithms Genetic Algorithms human Computer Aided Human-Based Design (CAD) Genetic Algorithms computational human Selection agent (Kosorukoff & Goldberg, 2002)
  • 29. Social aspects of genetic algorithms • Population-based method • Parallel genetic algorithms • Groups of working units • Communication between groups for a common goal • Models for collaborative work • Social network of interactions 29
  • 30. The DISCUS project • IlliGAL & NCSA collaboration started in 2003 • Computers have become mediators of collaborations • Email, chat rooms, blogs, wikis… • A flood of available information • Different modes of communication • Let’s take advantage of such information • Logs of conversations • Archive of documents (email attachments, blogs, personal web pages…) • Human-computer interactions • Social aspect of the communication and collaboration 30
  • 31. The question • Could we help build a system using genetic algorithms/natural selection ideas that could foster innovation and creativity? • The era of available Web 2.0 tools • Web portals • Collaboration and communication tools (message boards, chats, blogs, Twitter…) • Genetic algorithm (Interactive, Human-based, and Parallel GAs) • Chance discovery (KeyGraphs and Influence Diffusion Models) • Data and text mining • Information retrieval • Web services • Data-intensive computing 31
  • 32. In a picture Analysis Data mining Social networks Content Knowledge management Social aspects (Llorà et al., 2004)
  • 33. A simple exercise • Imagine you are chatting with your friends • Your ideas are bundled in messages • You pass them your your friends • They read them, think about them • Your friends reply to you • Could we model it using natural selection? • Valéry suggest so • Goldberg proposed a simple model using the human-based GA analogy • If so, Goldberg’s methodology should apply
  • 35. Concepts as building blocks (Llorà et al., 2004)
  • 36. Things you can measure: Concept diffusion (Llorà et al., 2005)
  • 37. Social acceptance and roles (Yasui et al., 2006)
  • 38. Everything together (Saruwatari et al., 2008)
  • 39. Everything together (Saruwatari et al., 2008)
  • 40. In action (Saruwatari et al., 2008)
  • 41. Other possible usages • Tracking influence on social media (e.g. Twitter) • User preference modeling • Topic overlap • Topic/trend dynamics (how conversation is changing on the fly) • Blogosphere and company message misalignment • ...
  • 43. SW development and Darwin’s ideas? • Data deluge, processing large volumes of data starting to become the norm, not the exception • Another nagging question back on 2007 • Could we use the basic design principles by Goldberg to build software? • A subset of those could have the answer • Understanding building blocks (concepts or ideas) • Ensure building block supply • Ensure building block growth • Ensure good building block mixing • Also • Exploiting parallelism efficiently is becoming a more pressing issue • SW is a social activity
  • 44. A little thinking • Understanding building blocks (concepts or ideas) • Basic units of data processing • Building blocks should be easy to group and swap • Ensure building block supply • Create a basic set of building block • Ensure building block growth • Provide a set of tools for developers to add more building blocks • Ensure good building block mixing • Facilitate the reproduction of solutions (programs) • Provide means to make those recombination possible
  • 45. Modeling Meandre • The basic outline • Data-flow execution paradigm • Semantic-web driven • Web oriented • Facilitate distributed computing • Support publishing services • Promote reuse, sharing, and collaboration • Fall 2007 Meandre was adopted as the data-intensive platform for a collection of projects • More information at http://seasr.org/meandre
  • 46. Data flow execution in Meandre • A simple example c ← a+b • A traditional control-driven language a = 1 b = 2 c = a+b • Execution following the sequence of instructions • One step at a time • a+b+c+d requires 3 steps • Could be easily parallelized
  • 47. Data flow execution in Meandre • Data flow execution is driven by data • The previous example may have 2 possible data flow versions Stateless data flow value + value value State-based data flow value value + value ?
  • 48. The basic building blocks: Components Component RDF descriptor of the The component components behavior implementation
  • 49. Go with the flow: Creating complex tasks • Directed multigraph of components creates a flow Push Text Concatenate To Upper Print Text Case Text Text Push Text
  • 50. Automatic parallelization: Speed and robustness • Meandre ZigZag language allow automatic parallelization To Upper Case Text Push Text To Upper Concatenate Case Text Print Text Text Push Text To Upper Case Text
  • 51. Visual building block recombination
  • 52. Where is natural selection? • More DISCUS-like evolutionary modeling • Variations (components) in a population of solutions (flows) • Selection by usage/adoption • Recombination of flow available (encapsulation)
  • 53. The take away message
  • 54. Basic food for thought • The underlying distilled idea of natural selection • Variation will arise in a population • Selection by environment adaptation • Powerful modeling tool • Problem solving (optimization and search) • Machine learning • Human social interaction modeling • Defining new ways to explore computer programming • ... • Don’t be afraid to try and explore
  • 55. More information • Illinois Genetic Algorithms Laboratory • http://www.illigal.uiuc.edu/web • The DISCUS project • http://www.i-discus.org • The Meandre project • http://seasr.org/meandre • http://seasr.org • National Center for Supercomputing Applications • http://www.ncsa.illinois.edu • Homepage • http://www.xavierllora.net • Some related Twitter users • People: @xllora, @deg511 • Labs and projects: @illigal, @projectmeandre, @seasrproject
  • 56. From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0 Xavier Llorà National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Urbana, Illinois, 61801 xllora@ncsa.illinois.edu http://www.ncsa.illinois.edu/~xllora