SlideShare ist ein Scribd-Unternehmen logo
1 von 21
DS
RC

Data Science
Research Center

An Introduction
Marcel Worring
University of Amsterdam
DS
RC

Outline

1. Goals of the DSRC
2. Embedding and organization
3. Realizing the DSRC
4. The roadmap
5. How to get involved
DS
RC

The goal

• Become a leading center on data science by
developing the new data science discipline
Leveraging our scientific excellence
Leveraging our tools and infrastructure
Reaching out
Educating talents
DS
RC

Our contributions to data
science
Visual
Analytics
Business
Analytics

Decision
Theory

Understand
and decide

Distributed
Processing
Large Scale
Databases

Store and
process
Software
Eng.
System /
Network
Eng.

10010001010010001010010101
01010001010101010101010101
01000101010101010101010101
00101010010101010101010101
Security
01001010100000101010101001
Privacy
01010100010100101010101000
11110101010101010101010101
Provenance
01010100010101010101010101
11110101010101010101010101
00101010101010101010101010
00101010101010101010101010

Data

Reasoning
Knowledge
representati
on

Analyze
and model

Multimedia
Retrieval

Modeling
and
simulation

Information
Retrieval
Machine
Learning
DS
RC

Goals in data science
research
Visual
Analytics
Decision
Theory

Business
Analytics

Insight
Impact
Understand

and decide

Distributed
Processing

Data
Conformance

Reasoning
Knowledge
representati
on

Large Scale
Databases

Precision
Analyze
and model

Speed and
Store
process
Efficiency
Software
Scalability
Eng.
System /
Network
Eng.

Recall
Model fit

Multimedia
Retrieval

Modeling
and
simulation

Information
Retrieval
Machine
Learning
DS
RC

Data Science

• Characteristics
– All are connected
– All driven by data and its use
– A holistic approach is needed

• Our answer
– The Data Science Research Center
DS
RC

Research in data science
Visual
Analytics
Business
Analytics

Decision
Theory

Understand
and decide

Distributed
Processing
Large Scale
Databases

Store and
process
Software
Eng.
System /
Network
Eng.

Data
Science
Data
Research
Center

Reasoning
Knowledge
representati
on

Analyze
and model

Multimedia
Retrieval

Modeling
and
simulation

Information
Retrieval
Machine
Learning
DS
RC

Embedding and relations
Center for content,
creation and technology

Department of mathematics

Network Institute

CWI
Center for
Digital Humanities

ILLC
HvA

Informatics Institute
UvA

DSRC
Department of Computer Science VU

E-science center

SURFsara

CLHC
Forensic Science

Amsterdam Business
School
DS
RC

Organization
Daily management team

Marcel Worring

Paul Groth

Sanne Veenenbos

Management board

Max Welling

Henri Bal

Bert Bredeweg

Ger Koole

Leading researchers

Cees de Laat

Sander Klous

Jacopo Urbani

Frank van Harmelen

Peter Boncz

Maarten de Rijke Arnold Smeulders
DS
RC

Realization: four aims
Research: a platform for research in data science
connecting people and methodologies.

Infrastructure: a data-driven infrastructure for
experimenting with realistic complex data sets.
Valorization: a channel between scientific research
and third party applications.
Education: data-science curricula with realistic data
experimentation throughout the program.
DS
RC

Research

• Focus on research with a holistic view on
data science
– Connecting the different disciplines
– From data to domain impact

• Start
– Seed projects: small projects bringing together two
or more ICT disciplines

• Workshops
– Domain workshops: with all stakeholders define
research topics leading to data science project
proposals
DS
RC

Infrastructure
“In a sense, the physical and technical infrastructure
becomes invisible and the data themselves become
the infrastructure – a valuable asset, on which
science, technology, the economy and society can
advance.”

[“Riding the wave” EU High Level Expert Group]

Shared domain driven tasks
Shared large
scale
datasources
Transparant access
to distributed computing
infrastructure

Common tools and
code bases
DS
RC

Valorization

• Joint full projects
– Within the DSRC
– With industry / govermental organizations

• Small-scale projects
– From data and problem to solution with quick
turnaround

• Competitions
– From data and problem to innovative solutions
worked on by a number of teams

• Spin-offs and startups
DS
RC

Education

• Infrastructure yields platform for education in
– Informatics
• Information Science, Artificial Intelligence, Software
Engineering, Computer Science, Business Analytics

– Domain specific courses
• E.g. Minor Data Science for X (your favorite discipline)

– Commercial courses

• The objective of DSRC
– to introduce a full data science program

• With hands-on experience
– On real data and real problems or innovations
DS
RC

Finance
Projects
Projects

Projects

AFS profiling funds
UvA
Faculty
Research
Cluster

VU
Matching
Funds

Projects
Projects

Projects
DS
RC
•
•
•
•
•
•
•
•

Domains

Digital humanities
Computational social science
Digital forensics and security
City technology
Physical sciences
Life sciences
Business analytics
.......
DS
RC

Roadmap

Holistic data science

Year 1

Fully functional
Basic transparent
infrastructure

Start of new PhD projects
Seed projects

Infrastructure design

Domain workshops

Data acquisition

Invited talks

Year 2
Self-sustained

Data science program
Embedding
infrastructure

Start of projects

Courses

Domain workshops
Project acquisition
RDA

Minor(s) data science

Year 3 and 4

PIRE
DS
RC

How to get involved

• As a data science researcher
–
–
–
–
–

See what we can offer and what you can offer
Define one of the seed projects
Participate/propose workshops
Acknowledge DSRC in publications
Bring in (existing or new) projects

• Contact us via
– info@dsrc.nl
DS
RC

How to get involved

• As a data-driven application holder
–
–
–
–

Participate in the workshops and events
See how you can share your data
See whether we can develop joint projects
Link domain knowledge and data science
research

• Contact us via
– info@dsrc.nl
DS
RC

Overview of the talks
See some of the demos
Visual
Analytics
Business
Analytics

Decision
Theory

Understand
and decide

Henri Bal

Peter Boncz

Distributed
Processing
Large Scale
Databases

Store and
process
Software
Eng.
System /
Network
Eng.

10010001010010001010010101
01010001010101010101010101
01000101010101010101010101
00101010010101010101010101
01001010100000101010101001
01010100010100101010101000
11110101010101010101010101
01010100010101010101010101
11110101010101010101010101
00101010101010101010101010
00101010101010101010101010

Data

Reasoning
Knowledge
representati
on

Analyze
and model

Multimedia
Retrieval

Modeling
and
simulation

Information
Retrieval
Machine
Learning

Max Welling
Cees de Laat

Maarten de Rijke
DS
RC

Thanks for you attention

info@dsrc.nl
www.dsrc.nl

Weitere ähnliche Inhalte

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

Empfohlen

Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Empfohlen (20)

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 

Data Science Research Center - Overview and Mission

  • 1. DS RC Data Science Research Center An Introduction Marcel Worring University of Amsterdam
  • 2. DS RC Outline 1. Goals of the DSRC 2. Embedding and organization 3. Realizing the DSRC 4. The roadmap 5. How to get involved
  • 3. DS RC The goal • Become a leading center on data science by developing the new data science discipline Leveraging our scientific excellence Leveraging our tools and infrastructure Reaching out Educating talents
  • 4. DS RC Our contributions to data science Visual Analytics Business Analytics Decision Theory Understand and decide Distributed Processing Large Scale Databases Store and process Software Eng. System / Network Eng. 10010001010010001010010101 01010001010101010101010101 01000101010101010101010101 00101010010101010101010101 Security 01001010100000101010101001 Privacy 01010100010100101010101000 11110101010101010101010101 Provenance 01010100010101010101010101 11110101010101010101010101 00101010101010101010101010 00101010101010101010101010 Data Reasoning Knowledge representati on Analyze and model Multimedia Retrieval Modeling and simulation Information Retrieval Machine Learning
  • 5. DS RC Goals in data science research Visual Analytics Decision Theory Business Analytics Insight Impact Understand and decide Distributed Processing Data Conformance Reasoning Knowledge representati on Large Scale Databases Precision Analyze and model Speed and Store process Efficiency Software Scalability Eng. System / Network Eng. Recall Model fit Multimedia Retrieval Modeling and simulation Information Retrieval Machine Learning
  • 6. DS RC Data Science • Characteristics – All are connected – All driven by data and its use – A holistic approach is needed • Our answer – The Data Science Research Center
  • 7. DS RC Research in data science Visual Analytics Business Analytics Decision Theory Understand and decide Distributed Processing Large Scale Databases Store and process Software Eng. System / Network Eng. Data Science Data Research Center Reasoning Knowledge representati on Analyze and model Multimedia Retrieval Modeling and simulation Information Retrieval Machine Learning
  • 8. DS RC Embedding and relations Center for content, creation and technology Department of mathematics Network Institute CWI Center for Digital Humanities ILLC HvA Informatics Institute UvA DSRC Department of Computer Science VU E-science center SURFsara CLHC Forensic Science Amsterdam Business School
  • 9. DS RC Organization Daily management team Marcel Worring Paul Groth Sanne Veenenbos Management board Max Welling Henri Bal Bert Bredeweg Ger Koole Leading researchers Cees de Laat Sander Klous Jacopo Urbani Frank van Harmelen Peter Boncz Maarten de Rijke Arnold Smeulders
  • 10. DS RC Realization: four aims Research: a platform for research in data science connecting people and methodologies. Infrastructure: a data-driven infrastructure for experimenting with realistic complex data sets. Valorization: a channel between scientific research and third party applications. Education: data-science curricula with realistic data experimentation throughout the program.
  • 11. DS RC Research • Focus on research with a holistic view on data science – Connecting the different disciplines – From data to domain impact • Start – Seed projects: small projects bringing together two or more ICT disciplines • Workshops – Domain workshops: with all stakeholders define research topics leading to data science project proposals
  • 12. DS RC Infrastructure “In a sense, the physical and technical infrastructure becomes invisible and the data themselves become the infrastructure – a valuable asset, on which science, technology, the economy and society can advance.” [“Riding the wave” EU High Level Expert Group] Shared domain driven tasks Shared large scale datasources Transparant access to distributed computing infrastructure Common tools and code bases
  • 13. DS RC Valorization • Joint full projects – Within the DSRC – With industry / govermental organizations • Small-scale projects – From data and problem to solution with quick turnaround • Competitions – From data and problem to innovative solutions worked on by a number of teams • Spin-offs and startups
  • 14. DS RC Education • Infrastructure yields platform for education in – Informatics • Information Science, Artificial Intelligence, Software Engineering, Computer Science, Business Analytics – Domain specific courses • E.g. Minor Data Science for X (your favorite discipline) – Commercial courses • The objective of DSRC – to introduce a full data science program • With hands-on experience – On real data and real problems or innovations
  • 16. DS RC • • • • • • • • Domains Digital humanities Computational social science Digital forensics and security City technology Physical sciences Life sciences Business analytics .......
  • 17. DS RC Roadmap Holistic data science Year 1 Fully functional Basic transparent infrastructure Start of new PhD projects Seed projects Infrastructure design Domain workshops Data acquisition Invited talks Year 2 Self-sustained Data science program Embedding infrastructure Start of projects Courses Domain workshops Project acquisition RDA Minor(s) data science Year 3 and 4 PIRE
  • 18. DS RC How to get involved • As a data science researcher – – – – – See what we can offer and what you can offer Define one of the seed projects Participate/propose workshops Acknowledge DSRC in publications Bring in (existing or new) projects • Contact us via – info@dsrc.nl
  • 19. DS RC How to get involved • As a data-driven application holder – – – – Participate in the workshops and events See how you can share your data See whether we can develop joint projects Link domain knowledge and data science research • Contact us via – info@dsrc.nl
  • 20. DS RC Overview of the talks See some of the demos Visual Analytics Business Analytics Decision Theory Understand and decide Henri Bal Peter Boncz Distributed Processing Large Scale Databases Store and process Software Eng. System / Network Eng. 10010001010010001010010101 01010001010101010101010101 01000101010101010101010101 00101010010101010101010101 01001010100000101010101001 01010100010100101010101000 11110101010101010101010101 01010100010101010101010101 11110101010101010101010101 00101010101010101010101010 00101010101010101010101010 Data Reasoning Knowledge representati on Analyze and model Multimedia Retrieval Modeling and simulation Information Retrieval Machine Learning Max Welling Cees de Laat Maarten de Rijke
  • 21. DS RC Thanks for you attention info@dsrc.nl www.dsrc.nl