the DISCUS project & SEASR

Xavier Llorà (1,2), David E. Goldberg (1) & Michael Welge (2)

(1) Illinois Genetic Algorithms Lab, Department of Industrial and Enterprise Systems Engineering, University of Illinois at Urbana-Champaign

(2) Data-Intensive Technologies and Applications, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign

The Vision

•  Computers have become mediators of collaborations
   –  Email, chat rooms, blogs, wikis…

   –  A flood of available information
   –  Different modes of communication

•  Let’s take advantage of such information 
   –  Logs of conversations

   –  Archive of documents (email attachments, blogs, personal web
      pages…)

   –  Human-computer interactions

   –  Social aspect of the communication and collaboration

   –  Needs to work for multiple languages
The Project

•  DISCUS started in 2003 as an IlliGAL & NCSA collaboration

•  Supports innovation and creativity:

  DISCUS: Distributed Innovation and Scalable Collaboration in Uncertain Settings

•  Basic research components

     –  Competent genetic algorithms (HBGA, iGA)

     –  Advance chance discovery components

     –  Adapt and expand the analysis of social interaction

     –  Efficient data mining techniques for conversations 

     –  Develop a social network analysis for creativity and innovation processes 



the DISCUS project (May 2007), Xavier Llorà

The Project

•  Technology development

     –  Infrastructure to support creativity and innovation processes

     –  Reusable repositories of analytic components 

     –  Standardize heterogeneous data storage to boost interoperability

     –  Create hooks for non-intrusive usage and deployment

     –  Rapid adaptation cycle to new technologies





Research and Commercial Partners

•  Some research partners along the way
   –  University of Illinois (IlliGAL, NCSA & CEE)

   –  University of Osaka

   –  University of Tokyo (School of Management, School of Engineering)

   –  University of Kyushu

•  Commercial partners
   –  Hakuhodo Inc and HOW

   –  Mazda

   –  Toyota
The Research Picture

[Diagram labels: Analysis, Data mining, Social networks, Content, Knowledge management, Social aspects]


DISCUS in Action
Online Communities





Online Communities





Content Analysis





Social Network Analysis





Topic Overlap





Topic Dynamics





CSPAN

•  CSPAN digital library
   –  Videos

   –  Transcripts
   –  Annotations

•  Example of real-time analysis
•  Crawling and results
Some Facts 


•  Number of documents: 110,234
•  Number of persons: 78,915
•  Total number of sentences: 252,132
•  Total number of words: 2,034,209
Documents per Year



[Plot: number of documents (logarithmic axis, 1 to 5000) versus year, 1940 to 2000]
Words per Year

[Plot: number of words (logarithmic axis, 1e+01 to 1e+05) versus year, 1940 to 2000]
