SlideShare ist ein Scribd-Unternehmen logo
1 von 22
VTT TECHNICAL RESEARCH CENTRE OF FINLAND LTD
Analyzing trajectories of
technological knowledge
Topic modelling approach to knowledge depth and breadth
The 1st Annual International Conference of the
IEEE Technology and Engineering Management
Society
Dr. Arho Suominen
2
KNOWLEDGE – THE CORE ASSET OF CORPORATIONS
MANAGING IT REQUIRES US TO KNOW WHAT INTERNAL AND EXTERNAL KNOWLEDGE IS AVAILABLE
331/05/2017 3
INTRODUCTION
 Technology management and planning requires that we are
able to quantify knowledge embedded in and outside the
organization.
 Depth and breadth of knowledge are the main dimensions used
to make this happen.
 Knowledge depth is defined as an actors level of expertise or
sophistication.
 Knowledge breadth is defined an actors capabilities to exploit
adjacent technologies or the multi-dimensionality of its knowledge
base.
 Knowledge depth and breadth have been shown to have a
significant impact to company performance
431/05/2017 4
WHAT WE HAVE DONE BEFORE
THAT MIGHT HAVE SOME LIMITATIONS
 Patent data, admit its caveats, have been seen as the most
practical vantage point into a companies knowledge.
 Previous studies have operationalized companies knowledge
structure by looking at patent classifications:
 This approach has significant caveats, due to
 classifications errors,
 overall noisiness
 challenges related to the taxonomy of patents and
 the classification system inability represent novelty by forcing new
thing in historical classes
 Above is written with the understanding that there have been
recent studies looking at keyword and machine learning based
approaches in operationalizing patents.
TWO EXAMPLES WHY OUR APPROACH CAN ADD VALUE
OVERCOMING LIMITATIONS OF PATENT CLASSIFICATIONS AND ABSTRACT BASED ANALYSIS
6631/05/2017
CLASSIFICATION VS. MACHINE LEARNING
MACHINE LEARNED TOPICS ALIGN POORLY WITH HUMAN CLASSIFICATION
0.000
0.100
0.200
0.300
0.400
0.500
0.600
0.700
0.800
0.900
Topic1
Topic7
Topic13
Topic19
Topic25
Topic31
Topic37
Topic43
Topic49
Topic55
Topic61
Topic67
Topic73
Analysis of
biological materials
Audio-visual
technology
Basic
communication
processes
Basic materials
chemistry
Biotechnology
Chemical
engineering
731/05/2017 7
CLASSIFICATION VS. MACHINE LEARNING
MACHINE LEARNED TOPICS ALIGN POORLY WITH HUMAN CLASSIFICATION
0.000
0.100
0.200
0.300
0.400
0.500
0.600
0.700
0.800
0.900
Topic1
Topic3
Topic5
Topic7
Topic9
Topic11
Topic13
Topic15
Topic17
Topic19
Topic21
Topic23
Topic25
Topic27
Topic29
Topic31
Topic33
Topic35
Topic37
Topic39
Topic41
Topic43
Topic45
Topic47
Topic49
Topic51
Topic53
Topic55
Topic57
Topic59
Topic61
Topic63
Topic65
Topic67
Topic69
Topic71
Topic73
Topic75
Analysis of biological materials
Audio-visual technology
Basic communication processes
Basic materials chemistry
Biotechnology
Chemical engineering
Civil engineering
Computer technology
Control
Digital communication
Electrical machinery, apparatus, energy
Engines, pumps, turbines
Environmental technology
Food chemistry
Furniture, games
Handling
8831/05/2017
EXAMPLE US9185203B2
Mobile device display management
Abstract
The display of a mobile device is managed
during a voice communication session using
a proximity sensor and an accelerometer. In
one example, the display of a mobile device
is turned off during a phone call on the
mobile device when a proximity sensor
detects an object is proximate the device
and an accelerometer determines the device
is in a first orientation.
In total 62 words
EXAMPLE US9185203B2
Mobile device display management
Description
Background…
Summary…
Brief Description of Drawings…
Detailed description…
In total 8886 words
CLASSIFICATION VS. MACHINE LEARNING
MACHINE LEARNED TOPICS ALIGN POORLY WITH HUMAN CLASSIFICATION
9931/05/2017
CLASSIFICATION VS. MACHINE LEARNING
ABSTRACTS ARE POOR DESCRIPTION OF THE KNOWLEDGE CONTENT
1031/05/2017 10
Unsupervised learning
 Produces an outcome based on an input while not receiving any
feedback from the environment.
 reliance on a formal framework that enables the algorithm to find
patterns.
 Topic models " ...can extract surprisingly interpretable and useful
structure without any explicit "understanding" of the language by
computer".
 As a simplification each document in a corpus is a random
mixture over latent topics, and each latent topic is characterized
by a distribution over words.
1131/05/2017 11
DATA, PRE-PROCESSING, AND ANALYSIS
SAMPLE
 From the telecommunication industry
 Alcatel-Lucent, Apple, Google, Huawei, Microsoft, Nokia and Samsung
Electronics
 The analysis was limited to a time period from 2001 to 2014.
METHOD
 Analyzed sample companies knowledge base with unsupervised
learning using patent data as proxy.
DATA SOURCE
 full-text patent descriptions filed in the USPTO containing
approximately 6 million patents. The repository, owned by Teqmine
Analytics Ltd
 Final data contains 157 718 records.
1231/05/2017 12
DATA, PRE-PROCESSING, AND ANALYSIS
Topic 1 Topic 2 … Topic N
Patent 1 0.10 0.24 0.40
Patent 2 0.40 0.01 0.10
…
Patent N 0.01 0.80 0.01
Topic 2
Topic N
Topic 1
Patent 2
Patent 1
Patent N
131331/05/2017
DATA, PRE-PROCESSING, AND ANALYSIS
ALGORITHM: LDA
 The algorithm is based on an online
variational Bayes algorithm for LDA [9]
 Number of Topics used was set using a
trial-and-error approach to 75.
IMPLEMENTATION: Python
 Python implementation included pre-
processing
ANALYSIS: Gephi, Python, Excel
 Gephi was used to create visuals from the
soft classification created by the algorithm.
 Python was used to pivot the document
topic probability matrix by company to a
sum of probabilities by company in a given
year
 Excel was used to calculate TD defined as:
14
RESULTS
151531/05/2017
BI-PARTITE NETWORK OF KNOWLEDGE
MACHINE LEARNED TOPICS CREATE A NETWORK MAP OF KNOWLEDGE
161631/05/2017
PRACTICAL USE CASE
MACHINE LEARNED TOPICS CREATE A TEMPORAL VIEW AND ESTIMATION ON
KNOWLEDGE ASSETS
1731/05/2017 17
INSIGHT: TELECOMMUNICATION INDUSTRY
Sample telecommunication companies with a decreasing technological diversity value. X-axis is years and Y-axis is
Technological Diversity (TD), calculated for each company.
1831/05/2017 18
Insight on the telecommunication industry
Sample telecommunication companies with a increasing technological diversity value. X-axis is years and Y-axis is
Technological Diversity (TD), calculated for each company. Largest increase in TD from Google, for which the linear
trend line is given with fit values.
1931/05/2017 19
INSIGHT: TELECOMMUNICATION INDUSTRY
 Correlation between technological diversity and count of patents.
 p-value is higher than 0.05 the results of the correlation were not
statistically significant (r(109) = 0.17, p = 0.077)
 A multiple linear regression was calculated to predict the
technological diversity based on patent count and company
 A significant regression equation was found F(8, 102), 35.99, p =
.000 with an R2 = 0.73
 Google, Huawei and Microsoft were significant predictors. Patent
count, Apple, Motorola, Nokia and Samsung were not a significant
predictors.
 There is a clear trend of technological diversity.
 Patent count is not a significant predictor in explaining
technological diversity.
2031/05/2017 20
 Natural language offers an important vantage point to interesting
phenomenon not directly measurable.
 This advantage is clear in the case of patent data analysis, where
abstract are known to carry a low information value and the use of
metadata has significant limitations.
 Main finding is that, by using full-text and LDA, we can create a
Technology Diversity value independent of patent count.
 This analysis opens the possibility to utilize the approach in
more in depth studies focusing in, for example, measuring the
impact of company knowledge depth and breadth to company
performance.
INSIGHT: TELECOMMUNICATION INDUSTRY
212131/05/2017
THANK YOU
Dr. Arho Suominen
Senior Scientist, D.Sc. (Tech)
Academy of Finland Postdoctoral Researcher
stationed at
VTT TECHNICAL RESEARCH CENTRE OF
FINLAND
Innovations, Economy, and Policy
Vuorimiehentie 3, P.O. Box 1000, 02044 Espoo,
Finland
Tel. +358 50 5050 354
www.vtt.fi, arho.suominen@vtt.fi
https://www.linkedin.com/in/arhosuominen
Twitter @ArhoSuominen
TECHNOLOGY FOR BUSINESS

Weitere ähnliche Inhalte

Was ist angesagt?

European Green IT Webinar 2014 - Kaliterre (France)
European Green IT Webinar 2014 - Kaliterre (France)European Green IT Webinar 2014 - Kaliterre (France)
European Green IT Webinar 2014 - Kaliterre (France)
GreenLabCenter
 
5. Frans Volberda - Session 3: Operation, Control and Protection
5. Frans Volberda - Session 3: Operation, Control and Protection5. Frans Volberda - Session 3: Operation, Control and Protection
5. Frans Volberda - Session 3: Operation, Control and Protection
Dutch Power
 
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
ICT FOOTPRINT .eu
 
MIPLM research projekt data driven business models in healthcare
MIPLM research projekt data driven business models in healthcareMIPLM research projekt data driven business models in healthcare
MIPLM research projekt data driven business models in healthcare
MIPLM
 
Printed electronics now_whitepaper
Printed electronics now_whitepaperPrinted electronics now_whitepaper
Printed electronics now_whitepaper
Abderrahim Rodani
 
Knowles MEMS Microphones in Apple iPhone 7 Plus Patent-to-Product Mapping Sample
Knowles MEMS Microphones in Apple iPhone 7 Plus Patent-to-Product Mapping SampleKnowles MEMS Microphones in Apple iPhone 7 Plus Patent-to-Product Mapping Sample
Knowles MEMS Microphones in Apple iPhone 7 Plus Patent-to-Product Mapping Sample
Knowmade
 
Designing Next Generation Smart City Initiatives: Harnessing Findings And Les...
Designing Next Generation Smart City Initiatives:Harnessing Findings And Les...Designing Next Generation Smart City Initiatives:Harnessing Findings And Les...
Designing Next Generation Smart City Initiatives: Harnessing Findings And Les...
Edward Curry
 

Was ist angesagt? (20)

Artificial Intelligence (AI) in media applications and services
Artificial Intelligence (AI) in media applications and servicesArtificial Intelligence (AI) in media applications and services
Artificial Intelligence (AI) in media applications and services
 
CREST- GMN Workshop Deck 2014
CREST- GMN Workshop Deck 2014CREST- GMN Workshop Deck 2014
CREST- GMN Workshop Deck 2014
 
2013 12-05-sirris-materials-workshop-printed-electronics-cosemans-seronveaux
2013 12-05-sirris-materials-workshop-printed-electronics-cosemans-seronveaux2013 12-05-sirris-materials-workshop-printed-electronics-cosemans-seronveaux
2013 12-05-sirris-materials-workshop-printed-electronics-cosemans-seronveaux
 
European Green IT Webinar 2014 - Kaliterre (France)
European Green IT Webinar 2014 - Kaliterre (France)European Green IT Webinar 2014 - Kaliterre (France)
European Green IT Webinar 2014 - Kaliterre (France)
 
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...
 
5. Frans Volberda - Session 3: Operation, Control and Protection
5. Frans Volberda - Session 3: Operation, Control and Protection5. Frans Volberda - Session 3: Operation, Control and Protection
5. Frans Volberda - Session 3: Operation, Control and Protection
 
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
 
MIPLM research projekt data driven business models in healthcare
MIPLM research projekt data driven business models in healthcareMIPLM research projekt data driven business models in healthcare
MIPLM research projekt data driven business models in healthcare
 
bp china cas joint programs Roles of cecc eil ceftf
bp china cas joint programs Roles of cecc eil ceftfbp china cas joint programs Roles of cecc eil ceftf
bp china cas joint programs Roles of cecc eil ceftf
 
Feasibility Of Graphene Inks In Printed Electronics V5
Feasibility Of Graphene Inks In Printed Electronics V5Feasibility Of Graphene Inks In Printed Electronics V5
Feasibility Of Graphene Inks In Printed Electronics V5
 
Session 1.3 energy, smart homes & smart grids: towards interoperability...
Session 1.3   energy, smart homes & smart grids: towards interoperability...Session 1.3   energy, smart homes & smart grids: towards interoperability...
Session 1.3 energy, smart homes & smart grids: towards interoperability...
 
Printed electronics now_whitepaper
Printed electronics now_whitepaperPrinted electronics now_whitepaper
Printed electronics now_whitepaper
 
Knowles MEMS Microphones in Apple iPhone 7 Plus Patent-to-Product Mapping Sample
Knowles MEMS Microphones in Apple iPhone 7 Plus Patent-to-Product Mapping SampleKnowles MEMS Microphones in Apple iPhone 7 Plus Patent-to-Product Mapping Sample
Knowles MEMS Microphones in Apple iPhone 7 Plus Patent-to-Product Mapping Sample
 
Designing Next Generation Smart City Initiatives: Harnessing Findings And Les...
Designing Next Generation Smart City Initiatives:Harnessing Findings And Les...Designing Next Generation Smart City Initiatives:Harnessing Findings And Les...
Designing Next Generation Smart City Initiatives: Harnessing Findings And Les...
 
Printing based manufacturing and integration of flexible electronic system - ...
Printing based manufacturing and integration of flexible electronic system - ...Printing based manufacturing and integration of flexible electronic system - ...
Printing based manufacturing and integration of flexible electronic system - ...
 
Framework for Green and Smart Logistics - Frank Straube - Technische Univers...
Framework for Green and Smart  Logistics - Frank Straube - Technische Univers...Framework for Green and Smart  Logistics - Frank Straube - Technische Univers...
Framework for Green and Smart Logistics - Frank Straube - Technische Univers...
 
Pike Research Brochure
Pike Research BrochurePike Research Brochure
Pike Research Brochure
 
Owi project
 Owi project Owi project
Owi project
 
International Journal of Computer Science, Engineering and Information Techn...
International Journal of Computer Science, Engineering and  Information Techn...International Journal of Computer Science, Engineering and  Information Techn...
International Journal of Computer Science, Engineering and Information Techn...
 
Business Case For Green Product Development
Business Case For Green Product DevelopmentBusiness Case For Green Product Development
Business Case For Green Product Development
 

Ähnlich wie Analyzing trajectories of technological knowledge, Dr Arho Suominen

Sample-Landscape_Study_Digital_Twin.pptx
Sample-Landscape_Study_Digital_Twin.pptxSample-Landscape_Study_Digital_Twin.pptx
Sample-Landscape_Study_Digital_Twin.pptx
nour91922
 
State of AI Report 2022 - ONLINE.pptx
State of AI Report 2022 - ONLINE.pptxState of AI Report 2022 - ONLINE.pptx
State of AI Report 2022 - ONLINE.pptx
EithuThutun
 

Ähnlich wie Analyzing trajectories of technological knowledge, Dr Arho Suominen (20)

How Can AI and IoT Power the Chemical Industry?
How Can AI and IoT Power the Chemical Industry?How Can AI and IoT Power the Chemical Industry?
How Can AI and IoT Power the Chemical Industry?
 
IoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDILIoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDIL
 
2018 learning approach-digitaltrends
2018 learning approach-digitaltrends2018 learning approach-digitaltrends
2018 learning approach-digitaltrends
 
Technology Scouting
Technology ScoutingTechnology Scouting
Technology Scouting
 
Internet of Industrial Things Presentation - Sophie Peachey - IoT Midlands Me...
Internet of Industrial Things Presentation - Sophie Peachey - IoT Midlands Me...Internet of Industrial Things Presentation - Sophie Peachey - IoT Midlands Me...
Internet of Industrial Things Presentation - Sophie Peachey - IoT Midlands Me...
 
Sample-Landscape_Study_Digital_Twin.pptx
Sample-Landscape_Study_Digital_Twin.pptxSample-Landscape_Study_Digital_Twin.pptx
Sample-Landscape_Study_Digital_Twin.pptx
 
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
 
Building Reference Architectures for the Industrial IoT
Building Reference Architectures for the Industrial IoTBuilding Reference Architectures for the Industrial IoT
Building Reference Architectures for the Industrial IoT
 
State of AI Report 2022 - ONLINE.pptx
State of AI Report 2022 - ONLINE.pptxState of AI Report 2022 - ONLINE.pptx
State of AI Report 2022 - ONLINE.pptx
 
Meg Mude, Intel - Data Engineering Lifecycle Optimized on Intel - H2O World S...
Meg Mude, Intel - Data Engineering Lifecycle Optimized on Intel - H2O World S...Meg Mude, Intel - Data Engineering Lifecycle Optimized on Intel - H2O World S...
Meg Mude, Intel - Data Engineering Lifecycle Optimized on Intel - H2O World S...
 
AIVS - AI, Industrial Data Space, and Innovation Transformation
AIVS - AI, Industrial Data Space, and Innovation TransformationAIVS - AI, Industrial Data Space, and Innovation Transformation
AIVS - AI, Industrial Data Space, and Innovation Transformation
 
Ontology engineering ESTC2008
Ontology engineering ESTC2008Ontology engineering ESTC2008
Ontology engineering ESTC2008
 
Customer Success Story: IEEE Xplore Saves Time
Customer Success Story: IEEE Xplore Saves TimeCustomer Success Story: IEEE Xplore Saves Time
Customer Success Story: IEEE Xplore Saves Time
 
Thesis Defense MBI
Thesis Defense MBIThesis Defense MBI
Thesis Defense MBI
 
Mainflux Labs - References (1).pdf
Mainflux Labs - References (1).pdfMainflux Labs - References (1).pdf
Mainflux Labs - References (1).pdf
 
State of AI Report 2022 - ONLINE.pdf
State of AI Report 2022 - ONLINE.pdfState of AI Report 2022 - ONLINE.pdf
State of AI Report 2022 - ONLINE.pdf
 
SOCIAL DISTANCING MONITORING IN COVID-19 USING DEEP LEARNING
SOCIAL DISTANCING MONITORING IN COVID-19 USING DEEP LEARNINGSOCIAL DISTANCING MONITORING IN COVID-19 USING DEEP LEARNING
SOCIAL DISTANCING MONITORING IN COVID-19 USING DEEP LEARNING
 
HPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideHPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program Guide
 
Slides Master Copy 2009 Sr&Ed Course W Edison Linked In
Slides   Master Copy   2009 Sr&Ed Course W Edison   Linked InSlides   Master Copy   2009 Sr&Ed Course W Edison   Linked In
Slides Master Copy 2009 Sr&Ed Course W Edison Linked In
 
PROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataPROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked Data
 

Mehr von VTT Technical Research Centre of Finland Ltd

Mehr von VTT Technical Research Centre of Finland Ltd (20)

Sensory profiling of high-moisture extruded fish products from underutilized ...
Sensory profiling of high-moisture extruded fish products from underutilized ...Sensory profiling of high-moisture extruded fish products from underutilized ...
Sensory profiling of high-moisture extruded fish products from underutilized ...
 
VTT's Kyösti Pennanen: Consumers' understanding and views on dietary fibre
VTT's Kyösti Pennanen: Consumers' understanding and views on dietary fibreVTT's Kyösti Pennanen: Consumers' understanding and views on dietary fibre
VTT's Kyösti Pennanen: Consumers' understanding and views on dietary fibre
 
Tietoa ja suosituksia päättäjille: Kohti kestävää ruokapakkaamista
Tietoa ja suosituksia päättäjille: Kohti kestävää ruokapakkaamistaTietoa ja suosituksia päättäjille: Kohti kestävää ruokapakkaamista
Tietoa ja suosituksia päättäjille: Kohti kestävää ruokapakkaamista
 
VTT's Heikki Aisala: Flavour modification of gluten-free African crops
VTT's Heikki Aisala: Flavour modification of gluten-free African cropsVTT's Heikki Aisala: Flavour modification of gluten-free African crops
VTT's Heikki Aisala: Flavour modification of gluten-free African crops
 
Rantala: Redesigning food choice architecture to facilitate healthier choices
Rantala: Redesigning food choice architecture to facilitate healthier choicesRantala: Redesigning food choice architecture to facilitate healthier choices
Rantala: Redesigning food choice architecture to facilitate healthier choices
 
Healthy food environment for Finnish children
Healthy food environment for Finnish childrenHealthy food environment for Finnish children
Healthy food environment for Finnish children
 
VTT's Emilia Nordlund: Bioprocessing as a tool to improve the functionality o...
VTT's Emilia Nordlund: Bioprocessing as a tool to improve the functionality o...VTT's Emilia Nordlund: Bioprocessing as a tool to improve the functionality o...
VTT's Emilia Nordlund: Bioprocessing as a tool to improve the functionality o...
 
VTT's Nesli Sözer: Oats as an Alternative Protein Source
VTT's Nesli Sözer: Oats as an Alternative Protein SourceVTT's Nesli Sözer: Oats as an Alternative Protein Source
VTT's Nesli Sözer: Oats as an Alternative Protein Source
 
VTT's Pia Silventoinen: Dry fractionation and functionalisation of cereal sid...
VTT's Pia Silventoinen: Dry fractionation and functionalisation of cereal sid...VTT's Pia Silventoinen: Dry fractionation and functionalisation of cereal sid...
VTT's Pia Silventoinen: Dry fractionation and functionalisation of cereal sid...
 
HTM Solutions Knights of Nordics 2020
HTM Solutions Knights of Nordics 2020HTM Solutions Knights of Nordics 2020
HTM Solutions Knights of Nordics 2020
 
2019-10-02_presentations_Opportunities for SMEs in Horizon2020_Side_Event
2019-10-02_presentations_Opportunities for SMEs in Horizon2020_Side_Event2019-10-02_presentations_Opportunities for SMEs in Horizon2020_Side_Event
2019-10-02_presentations_Opportunities for SMEs in Horizon2020_Side_Event
 
ICT Proposers' Day 2019 Side Event, Visit 1
ICT Proposers' Day 2019 Side Event, Visit 1ICT Proposers' Day 2019 Side Event, Visit 1
ICT Proposers' Day 2019 Side Event, Visit 1
 
ICT Proposers' Day 2019 Side Event, Visit 4
ICT Proposers' Day 2019 Side Event, Visit 4ICT Proposers' Day 2019 Side Event, Visit 4
ICT Proposers' Day 2019 Side Event, Visit 4
 
ICT Proposers' Day 2019 Side Event, Visit 3
ICT Proposers' Day 2019 Side Event, Visit 3ICT Proposers' Day 2019 Side Event, Visit 3
ICT Proposers' Day 2019 Side Event, Visit 3
 
ICT Proposers' Day 2019 Side Event, Visit 2
ICT Proposers' Day 2019 Side Event, Visit 2ICT Proposers' Day 2019 Side Event, Visit 2
ICT Proposers' Day 2019 Side Event, Visit 2
 
Sensorit tulevat maitotiloille/ Nauta-lehti 03/19
Sensorit tulevat maitotiloille/ Nauta-lehti 03/19Sensorit tulevat maitotiloille/ Nauta-lehti 03/19
Sensorit tulevat maitotiloille/ Nauta-lehti 03/19
 
Virkki presentation VTT SmartHealth Ecosystem Event 12.6.2019
Virkki presentation VTT SmartHealth Ecosystem Event 12.6.2019Virkki presentation VTT SmartHealth Ecosystem Event 12.6.2019
Virkki presentation VTT SmartHealth Ecosystem Event 12.6.2019
 
Salaspuro presentation VTT SmartHealth Ecosystem Event 12.6.2019
Salaspuro presentation VTT SmartHealth Ecosystem Event 12.6.2019Salaspuro presentation VTT SmartHealth Ecosystem Event 12.6.2019
Salaspuro presentation VTT SmartHealth Ecosystem Event 12.6.2019
 
Vuorikallas presentation VTT SmartHealth Ecosystem Event 12.6.2019
Vuorikallas presentation VTT SmartHealth Ecosystem Event 12.6.2019Vuorikallas presentation VTT SmartHealth Ecosystem Event 12.6.2019
Vuorikallas presentation VTT SmartHealth Ecosystem Event 12.6.2019
 
Laurila presentation VTT SmartHealth Ecosystem Event 12.6.2019
Laurila presentation VTT SmartHealth Ecosystem Event 12.6.2019Laurila presentation VTT SmartHealth Ecosystem Event 12.6.2019
Laurila presentation VTT SmartHealth Ecosystem Event 12.6.2019
 

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

Analyzing trajectories of technological knowledge, Dr Arho Suominen

  • 1. VTT TECHNICAL RESEARCH CENTRE OF FINLAND LTD Analyzing trajectories of technological knowledge Topic modelling approach to knowledge depth and breadth The 1st Annual International Conference of the IEEE Technology and Engineering Management Society Dr. Arho Suominen
  • 2. 2 KNOWLEDGE – THE CORE ASSET OF CORPORATIONS MANAGING IT REQUIRES US TO KNOW WHAT INTERNAL AND EXTERNAL KNOWLEDGE IS AVAILABLE
  • 3. 331/05/2017 3 INTRODUCTION  Technology management and planning requires that we are able to quantify knowledge embedded in and outside the organization.  Depth and breadth of knowledge are the main dimensions used to make this happen.  Knowledge depth is defined as an actors level of expertise or sophistication.  Knowledge breadth is defined an actors capabilities to exploit adjacent technologies or the multi-dimensionality of its knowledge base.  Knowledge depth and breadth have been shown to have a significant impact to company performance
  • 4. 431/05/2017 4 WHAT WE HAVE DONE BEFORE THAT MIGHT HAVE SOME LIMITATIONS  Patent data, admit its caveats, have been seen as the most practical vantage point into a companies knowledge.  Previous studies have operationalized companies knowledge structure by looking at patent classifications:  This approach has significant caveats, due to  classifications errors,  overall noisiness  challenges related to the taxonomy of patents and  the classification system inability represent novelty by forcing new thing in historical classes  Above is written with the understanding that there have been recent studies looking at keyword and machine learning based approaches in operationalizing patents.
  • 5. TWO EXAMPLES WHY OUR APPROACH CAN ADD VALUE OVERCOMING LIMITATIONS OF PATENT CLASSIFICATIONS AND ABSTRACT BASED ANALYSIS
  • 6. 6631/05/2017 CLASSIFICATION VS. MACHINE LEARNING MACHINE LEARNED TOPICS ALIGN POORLY WITH HUMAN CLASSIFICATION 0.000 0.100 0.200 0.300 0.400 0.500 0.600 0.700 0.800 0.900 Topic1 Topic7 Topic13 Topic19 Topic25 Topic31 Topic37 Topic43 Topic49 Topic55 Topic61 Topic67 Topic73 Analysis of biological materials Audio-visual technology Basic communication processes Basic materials chemistry Biotechnology Chemical engineering
  • 7. 731/05/2017 7 CLASSIFICATION VS. MACHINE LEARNING MACHINE LEARNED TOPICS ALIGN POORLY WITH HUMAN CLASSIFICATION 0.000 0.100 0.200 0.300 0.400 0.500 0.600 0.700 0.800 0.900 Topic1 Topic3 Topic5 Topic7 Topic9 Topic11 Topic13 Topic15 Topic17 Topic19 Topic21 Topic23 Topic25 Topic27 Topic29 Topic31 Topic33 Topic35 Topic37 Topic39 Topic41 Topic43 Topic45 Topic47 Topic49 Topic51 Topic53 Topic55 Topic57 Topic59 Topic61 Topic63 Topic65 Topic67 Topic69 Topic71 Topic73 Topic75 Analysis of biological materials Audio-visual technology Basic communication processes Basic materials chemistry Biotechnology Chemical engineering Civil engineering Computer technology Control Digital communication Electrical machinery, apparatus, energy Engines, pumps, turbines Environmental technology Food chemistry Furniture, games Handling
  • 8. 8831/05/2017 EXAMPLE US9185203B2 Mobile device display management Abstract The display of a mobile device is managed during a voice communication session using a proximity sensor and an accelerometer. In one example, the display of a mobile device is turned off during a phone call on the mobile device when a proximity sensor detects an object is proximate the device and an accelerometer determines the device is in a first orientation. In total 62 words EXAMPLE US9185203B2 Mobile device display management Description Background… Summary… Brief Description of Drawings… Detailed description… In total 8886 words CLASSIFICATION VS. MACHINE LEARNING MACHINE LEARNED TOPICS ALIGN POORLY WITH HUMAN CLASSIFICATION
  • 9. 9931/05/2017 CLASSIFICATION VS. MACHINE LEARNING ABSTRACTS ARE POOR DESCRIPTION OF THE KNOWLEDGE CONTENT
  • 10. 1031/05/2017 10 Unsupervised learning  Produces an outcome based on an input while not receiving any feedback from the environment.  reliance on a formal framework that enables the algorithm to find patterns.  Topic models " ...can extract surprisingly interpretable and useful structure without any explicit "understanding" of the language by computer".  As a simplification each document in a corpus is a random mixture over latent topics, and each latent topic is characterized by a distribution over words.
  • 11. 1131/05/2017 11 DATA, PRE-PROCESSING, AND ANALYSIS SAMPLE  From the telecommunication industry  Alcatel-Lucent, Apple, Google, Huawei, Microsoft, Nokia and Samsung Electronics  The analysis was limited to a time period from 2001 to 2014. METHOD  Analyzed sample companies knowledge base with unsupervised learning using patent data as proxy. DATA SOURCE  full-text patent descriptions filed in the USPTO containing approximately 6 million patents. The repository, owned by Teqmine Analytics Ltd  Final data contains 157 718 records.
  • 12. 1231/05/2017 12 DATA, PRE-PROCESSING, AND ANALYSIS Topic 1 Topic 2 … Topic N Patent 1 0.10 0.24 0.40 Patent 2 0.40 0.01 0.10 … Patent N 0.01 0.80 0.01 Topic 2 Topic N Topic 1 Patent 2 Patent 1 Patent N
  • 13. 131331/05/2017 DATA, PRE-PROCESSING, AND ANALYSIS ALGORITHM: LDA  The algorithm is based on an online variational Bayes algorithm for LDA [9]  Number of Topics used was set using a trial-and-error approach to 75. IMPLEMENTATION: Python  Python implementation included pre- processing ANALYSIS: Gephi, Python, Excel  Gephi was used to create visuals from the soft classification created by the algorithm.  Python was used to pivot the document topic probability matrix by company to a sum of probabilities by company in a given year  Excel was used to calculate TD defined as:
  • 15. 151531/05/2017 BI-PARTITE NETWORK OF KNOWLEDGE MACHINE LEARNED TOPICS CREATE A NETWORK MAP OF KNOWLEDGE
  • 16. 161631/05/2017 PRACTICAL USE CASE MACHINE LEARNED TOPICS CREATE A TEMPORAL VIEW AND ESTIMATION ON KNOWLEDGE ASSETS
  • 17. 1731/05/2017 17 INSIGHT: TELECOMMUNICATION INDUSTRY Sample telecommunication companies with a decreasing technological diversity value. X-axis is years and Y-axis is Technological Diversity (TD), calculated for each company.
  • 18. 1831/05/2017 18 Insight on the telecommunication industry Sample telecommunication companies with a increasing technological diversity value. X-axis is years and Y-axis is Technological Diversity (TD), calculated for each company. Largest increase in TD from Google, for which the linear trend line is given with fit values.
  • 19. 1931/05/2017 19 INSIGHT: TELECOMMUNICATION INDUSTRY  Correlation between technological diversity and count of patents.  p-value is higher than 0.05 the results of the correlation were not statistically significant (r(109) = 0.17, p = 0.077)  A multiple linear regression was calculated to predict the technological diversity based on patent count and company  A significant regression equation was found F(8, 102), 35.99, p = .000 with an R2 = 0.73  Google, Huawei and Microsoft were significant predictors. Patent count, Apple, Motorola, Nokia and Samsung were not a significant predictors.  There is a clear trend of technological diversity.  Patent count is not a significant predictor in explaining technological diversity.
  • 20. 2031/05/2017 20  Natural language offers an important vantage point to interesting phenomenon not directly measurable.  This advantage is clear in the case of patent data analysis, where abstract are known to carry a low information value and the use of metadata has significant limitations.  Main finding is that, by using full-text and LDA, we can create a Technology Diversity value independent of patent count.  This analysis opens the possibility to utilize the approach in more in depth studies focusing in, for example, measuring the impact of company knowledge depth and breadth to company performance. INSIGHT: TELECOMMUNICATION INDUSTRY
  • 21. 212131/05/2017 THANK YOU Dr. Arho Suominen Senior Scientist, D.Sc. (Tech) Academy of Finland Postdoctoral Researcher stationed at VTT TECHNICAL RESEARCH CENTRE OF FINLAND Innovations, Economy, and Policy Vuorimiehentie 3, P.O. Box 1000, 02044 Espoo, Finland Tel. +358 50 5050 354 www.vtt.fi, arho.suominen@vtt.fi https://www.linkedin.com/in/arhosuominen Twitter @ArhoSuominen