© cortical.io inc. 2016
Semantic Folding
Language Intelligence made easy
Francisco De Sousa Webber
Co-Founder and General Manager
f.webber@cortical.io
What is Cortical.io?
We explore & expand Semantic Folding Theory
We spread & sell Semantic Folding Technology
We build & grow Cortical.io as the “Oracle” for Semantic Processing
NLP Market Problem
Natural Language Processing Technology:
• All systems are based on statistics - low differentiability
• Hard to build - high level of expertise needed
• Inaccurate compared to humans - low precision
• Have complex tuning procedures - hard to deploy
• Slow and inefficient compared to humans - hard to scale
Weaknesses Compensated with:
• Human metadata management - for differentiability
• Human specialists - for expertise
• Human correction - for precision
• Human generated gold-standards - for tuning
Business-NLP is currently very expensive.
Solution: Language Intelligence
Hierarchical Temporal Memory Theory
• By Jeff Hawkins (Silicon Valley, California)
• numenta.com technical implementation & IP
• Processing algorithm of the human brain (neocortex)
+
Semantic Folding Theory
• By Francisco De Sousa Webber (Vienna, Austria)
• cortical.io technical implementation & IP
• Processing language-data like the human brain
© cortical.io inc. 2015
Cortical Constraints
• Neocortex is a 2D sheet of repeating Modular Assemblies of neurons with binary inputs.
• Neocortex is a Memory System, not a processor.
• Neocortex stores Pattern Sequences.
• Neocortex is an Online Learning system.
• Neocortex is only Trained by Exposure to real-world data.
• All data fed into the neocortex must have Sparse Distributed Representation (SDR) format:
  • SDRs are very long Binary Vectors with max. 2% of “1”.
  • Every SDR-bit is a self-contained Semantic Feature of the world (via sensorial input).
  • Every SDR-bit is an Explicit Part of the signal.
  • Similar “things” have similar SDRs.
  • The Union of SDRs maintains all information of its members.
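The SDR properties listed here can be illustrated with a small sketch. It assumes an SDR is stored as the set of positions of its active bits. The vector length (16,384), the bit positions and the overlap measure (shared bits divided by the size of the smaller SDR) are illustrative assumptions, not values taken from the slides.

```python
# Toy model: an SDR is represented by the set of positions of its "1" bits.
# SDR_SIZE and all bit positions below are made-up illustrative values.
SDR_SIZE = 16_384
MAX_ACTIVE = int(SDR_SIZE * 0.02)  # "max. 2% of 1" -> at most 327 active bits


def is_sparse(sdr: set[int]) -> bool:
    """Check the sparsity constraint: no more than 2% of the bits are set."""
    return len(sdr) <= MAX_ACTIVE


def overlap(a: set[int], b: set[int]) -> float:
    """Share of active bits the two SDRs have in common (assumed measure)."""
    if not a or not b:
        return 0.0
    return len(a & b) / min(len(a), len(b))


def union(*sdrs: set[int]) -> set[int]:
    """Bitwise OR of several SDRs; every member's bits stay in the result."""
    result: set[int] = set()
    for sdr in sdrs:
        result |= sdr
    return result


dog = {3, 17, 256, 1024, 4000}
wolf = {3, 17, 256, 2048, 9000}
harbor = {5, 99, 700, 8000, 12000}

pack = union(dog, wolf)
print(overlap(dog, wolf))            # 0.6
print(overlap(dog, harbor))          # 0.0
print(dog <= pack and wolf <= pack)  # True
```

Because the union keeps every active bit of its members, each member can still be recognised inside it by a subset test, which is what the last line checks.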
Virtual Word Layer
[Diagram labels: hear, see, touch; word (SDR); word sensor stream; word production stream; symbol input; symbol output; muscles; motor output; virtualization into Retina]
Offline Process
RetinaDB Generation
Retina Training defines the Semantic Universe.
Training Collection specifies all vocabulary, linguistic properties and knowledge.
The Semantic Folding Engine generates a Semantic Map.
Every utterance is positioned within the Semantic Space.
Every term is defined by its distributed selection of utterances/contexts.
A topographic bit-vector is generated for each term of the corpus.
[Diagram labels: Training Collection, Preprocessing, Semantic Folding Engine, Fingerprint Generation]
Retina-API Operation
The generated topographic bit-vectors are called Semantic Fingerprints.
The Semantic Fingerprints are stored in the highly performant RetinaDB.
The RetinaDB is a complete Language Model.
The Retina API provides functions to convert, compare, dissect, classify and extract text.
The user application interacts via a REST interface.
[Diagram labels, Online Process: User Application, REST call, Retina API, RetinaDB, Functions out, Fingerprint out, Compare out]
[Diagram labels, Offline Process: Training Collection, Preprocessing, Semantic Folding Engine, Fingerprint Generation]
Tuning The Retina
“cholecystitis”
Aligning Semantic Spaces
philosophy philosophie filosofía философия فلسفة
Concepts and their representations are stable across languages.
EN FR ES RU AR ZH
The Cortical.io Retina Technology converts any text into a semantic fingerprint.
Fingerprint Generation
Example text: teens like playing good music with their mobile phones
Step 1: Word Fingerprints
[Figure: word fingerprints for organ, piano, church, liver]
Step 2: Text Fingerprints
teens like to hear music on their mobile phones
aggregation + sparsification
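One way to read "aggregation + sparsification" is sketched below: the word fingerprints are stacked by counting how often each bit position is active, and only the most frequently set positions are kept. This is a guess at the general idea, not cortical.io's actual algorithm. The word fingerprints, the `text_fingerprint` function and the `max_bits` limit are hypothetical.

```python
from collections import Counter

# Hypothetical word fingerprints: each maps a word to its active bit positions.
WORD_FINGERPRINTS = {
    "teens": {1, 2, 3, 40},
    "music": {2, 3, 5, 41},
    "mobile": {3, 5, 8, 42},
    "phones": {3, 5, 9, 43},
}


def text_fingerprint(words: list[str], max_bits: int) -> set[int]:
    """Aggregate word fingerprints, then keep only the strongest positions."""
    counts: Counter[int] = Counter()
    for word in words:
        # Aggregation: count how many word fingerprints activate each position.
        counts.update(WORD_FINGERPRINTS.get(word, set()))
    # Sparsification: keep the max_bits positions with the highest counts;
    # ties are broken by the lower position so the result is deterministic.
    ranked = sorted(counts, key=lambda pos: (-counts[pos], pos))
    return set(ranked[:max_bits])


fp = text_fingerprint(["teens", "music", "mobile", "phones"], max_bits=3)
print(sorted(fp))  # [2, 3, 5]
```

Positions shared by several words survive the cut, while positions that belong to a single word are dropped, so the text fingerprint stays as sparse as a word fingerprint.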
Similar meanings look similar
“teens like using itunes on their iphone” vs. “he consumes chart hits on his notebook”: 37% overlap
Different meanings look different
“teens like using itunes on their iphone” vs. “the fishermen are sailing out of the harbor”: 5% overlap
Evaluation
There are very few comparable algorithms: a couple of academic ones that cannot be readily used for production purposes, and Google’s Word2Vec.
The MEN Test Collection: http://clic.cimec.unitn.it/~elia.bruni/MEN.html
The RG-65 Test Collection: http://www.aclweb.org/aclwiki/index.php?title=RG-65_Test_Collection_(State_of_the_art)
The WordSimilarity-353 Test Collection: http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/
Yu & Dredze (2014): http://arxiv.org/pdf/1411.4166.pdf
Distributed representations of words and phrases: http://papers.nips.cc/paper/5021-di

                            MEN-3K   RG-65   WS-353
word2vec (Google)             55.2    44.8     54.7
Yu & Dredze (2014)            50.1    47.1     53.7
cortical.io Retina            67.4    71.3     62.2
% better than word2vec        18.1    37.2     12.1
% better than Yu & Dredze     25.7    33.9     13.7
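The "% better" rows appear to express the score gap as a share of the Retina score, that is (Retina − baseline) / Retina. That reading is inferred from the numbers rather than stated on the slide. The short check below reproduces the table values under that assumption; the dictionary keys and the function name are illustrative.

```python
# Scores copied from the table above (benchmark order: MEN-3K, RG-65, WS-353).
SCORES = {
    "word2vec": [55.2, 44.8, 54.7],
    "yu_dredze": [50.1, 47.1, 53.7],
    "retina": [67.4, 71.3, 62.2],
}


def percent_better(baseline: list[float], retina: list[float]) -> list[float]:
    """Gap to the baseline as a percentage of the Retina score (assumed formula)."""
    return [round((r - b) / r * 100, 1) for b, r in zip(baseline, retina)]


print(percent_better(SCORES["word2vec"], SCORES["retina"]))   # [18.1, 37.2, 12.1]
print(percent_better(SCORES["yu_dredze"], SCORES["retina"]))  # [25.7, 33.9, 13.7]
```

Measured relative to the baseline score instead, the gains would come out larger, for example 12.2 / 55.2 ≈ 22.1% on MEN-3K against word2vec.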
Semantic Folding Products
Enterprise Semantic Search
• Document Retrieval
• Expert Finding
• Knowledge Management
Big Text Data
• Semantic Streaming Text Filter
• (Social) Media Monitoring
• Business Intelligence & Analytics
Semantic Matching
• Natural Language based Automation
• Content Personalisation
• Semantic Profiling
[Figure: product overview diagrams. Labels: similarity engine, example document, query, document index, result set, ranking (most similar documents, ordered along the user's information need); topics of interest (#finance, #markets, #mobile, #movies, #products, #trend), Analytics; Match Making; Enterprise Application; Web App]
Cloudera Integration
Semantic Search
[Diagram labels: example document (query), similarity engine, document index, result set, ranking: most similar documents, ordered along the user's information need]
Semantic Content Filter
real-time, across languages, intelligent, meaning based
Topics of interest: #finance, #markets, #mobile, #movies, #products, #trend
Analytics
Example: Twitter Filter
The State of the Art
Desired topic: every tweet related to smart phones
200 catch words: mobile phone, iPhone, cell, Android, …, sim-card, text message, network, Verizon, Apple, Google, …
5 words per tweet
20,000 tweets/sec
Required throughput for one filter: 200 × 5 × 20,000 = 20,000,000 comparisons per second
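The throughput figure is a plain product of the three numbers on the slide, reproduced below as a short calculation. The variable names are illustrative.

```python
# Keyword-based filtering: every word of every tweet is compared
# with every catch word of the filter.
catch_words = 200           # keywords that define the topic
words_per_tweet = 5         # tweet length assumed on the slide
tweets_per_second = 20_000  # stream rate assumed on the slide

comparisons_per_tweet = catch_words * words_per_tweet
comparisons_per_second = comparisons_per_tweet * tweets_per_second

print(comparisons_per_tweet)          # 1000
print(f"{comparisons_per_second:,}")  # 20,000,000
```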
The State of the Art
Cost per Filter: $ 10,000+ per Month
Example: Twitter Filter
Semantic Fingerprinting
[Diagram labels: twitter firehose; stream of semantic fingerprints; Filter Fingerprint; match / not matching; realtime content sub-stream]
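The filtering step can be sketched as follows, assuming fingerprints are sets of active bit positions and that a tweet matches when a chosen share of the filter's bits is also active in the tweet. The tweets, the fingerprints, the `matches` helper and the 0.3 threshold are all hypothetical; the slides do not specify the matching rule.

```python
from typing import Iterable, Iterator

Fingerprint = frozenset[int]


def matches(tweet_fp: Fingerprint, filter_fp: Fingerprint, threshold: float) -> bool:
    """True if enough of the filter's active bits are also active in the tweet."""
    if not filter_fp:
        return False
    return len(tweet_fp & filter_fp) / len(filter_fp) >= threshold


def content_substream(
    stream: Iterable[tuple[str, Fingerprint]],
    filter_fp: Fingerprint,
    threshold: float = 0.3,
) -> Iterator[str]:
    """Lazily yield only the tweets whose fingerprint matches the filter."""
    for text, fp in stream:
        if matches(fp, filter_fp, threshold):
            yield text


# Made-up fingerprints standing in for the converted firehose.
smartphone_filter = frozenset({1, 2, 3, 4, 5, 6, 7, 8, 9, 10})
firehose = [
    ("new iphone battery lasts longer", frozenset({1, 2, 3, 4, 50})),
    ("fishermen sail out of the harbor", frozenset({60, 61, 62, 63})),
    ("android update rolls out today", frozenset({5, 6, 7, 8, 70})),
]

print(list(content_substream(firehose, smartphone_filter)))
# ['new iphone battery lasts longer', 'android update rolls out today']
```

In this sketch each tweet costs one set intersection per filter, regardless of how many words describe the topic.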
Cortical.io Streaming Text Filter
Cost per Filter: $10 per Month
Convert 100,000+ tweets per second (one per firehose)
+
1,000+ semantic filters (scalable with number of filters)
Dynamic Topic Pattern Analysis
Topic Monitoring
Unseen topics or sudden topic jumps are detected.
Compliance Monitoring
[Figure: ongoing e-mail conversation over time; appearance of unseen topic clusters]
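A minimal sketch of how unseen topics might be flagged, assuming each message already has a fingerprint (a set of active bit positions): a message counts as a new topic when its best overlap with everything seen so far stays below a threshold. The `TopicMonitor` class, the Jaccard overlap and the 0.2 threshold are illustrative assumptions, not the product's method.

```python
class TopicMonitor:
    """Remembers fingerprints seen so far and flags messages unlike all of them."""

    def __init__(self, threshold: float = 0.2) -> None:
        self.threshold = threshold
        self.seen: list[frozenset[int]] = []

    @staticmethod
    def jaccard(a: frozenset[int], b: frozenset[int]) -> float:
        """Shared bits divided by all bits active in either fingerprint."""
        combined = a | b
        return len(a & b) / len(combined) if combined else 0.0

    def is_unseen(self, fp: frozenset[int]) -> bool:
        """Return True for a new topic, then remember the fingerprint."""
        best = max((self.jaccard(fp, old) for old in self.seen), default=0.0)
        self.seen.append(fp)
        return best < self.threshold


monitor = TopicMonitor()
conversation = [
    frozenset({1, 2, 3, 4}),      # first message: nothing to compare with
    frozenset({1, 2, 3, 5}),      # same topic as the first message
    frozenset({90, 91, 92, 93}),  # sudden topic jump
]
print([monitor.is_unseen(fp) for fp in conversation])  # [True, False, True]
```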
Bridging the Vocabulary Gap
Special “Financial Retina”
Similar meanings “look” similar
Words: fraud
Expressions: corruption AND mafia
Idioms: “anti human trafficking”
Text: Money laundering is the process of transforming the proceeds of crime into ostensibly legitimate money or other assets.
Combine Fingerprints with AI Algorithms
Text Anomaly Detection
7. Enabling Artificial Intelligence Applications
Realtime Anomaly Detection in Text Streams
Any text stream: email, chat, message forums, blog posts, Facebook posts
Combine Fingerprints with AI Algorithms
http://www.cortical.io/demos/semantic-anomaly-detection/
website: cortical.io
product: https://aws.amazon.com/marketplace/pp/B00T5794P6/
twitter: http://twitter.com/cortical_io
video: https://www.youtube.com/watch?v=g3ZxJokDpds
demos: http://www.cortical.io/demos.html
API: http://api.cortical.io
Numenta: http://numenta.com

 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 

"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Founder of Cortical.io

  • 1. Semantic Folding: Language Intelligence made easy. Francisco De Sousa Webber, Co-Founder and General Manager, f.webber@cortical.io. © cortical.io inc. 2016
  • 2. What is Cortical.io? We explore & expand Semantic Folding Theory. We spread & sell Semantic Folding Technology. We build & grow Cortical.io as the “Oracle” for Semantic Processing.
  • 3. NLP Market Problem
      Natural Language Processing Technology:
      • All systems are based on statistics - low differentiability
      • Hard to build - high level of expertise needed
      • Inaccurate compared to humans - low precision
      • Have complex tuning procedures - hard to deploy
      • Slow and inefficient compared to humans - hard to scale
      Weakness Compensated with:
      • Human metadata management - for differentiability
      • Human specialists - for expertise
      • Human correction - for precision
      • Human generated gold-standards - for tuning
      Business-NLP is currently very expensive.
  • 4. Solution: Language Intelligence
      Hierarchical Temporal Memory Theory
      • By Jeff Hawkins (Silicon Valley, California)
      • numenta.com technical implementation & IP
      • Processing algorithm of the human brain (neo-cortex)
      +
      Semantic Folding Theory
      • By Francisco De Sousa Webber (Vienna, Austria)
      • cortical.io technical implementation & IP
      • Processing language-data like the human brain
  • 5. Cortical Constraints
      • Neocortex is a 2D sheet of repeating Modular Assemblies of neurons with binary inputs.
      • Neocortex is a Memory System, not a processor.
      • Neocortex stores Pattern Sequences.
      • Neocortex is an Online Learning system.
      • Neocortex is only Trained by Exposure to real-world data.
      • All data fed into the neocortex must have Sparse Distributed Representation (SDR) format:
          • SDRs are very long Binary Vectors with max. 2% of “1”.
          • Every SDR-bit is a self-contained Semantic Feature of the world (via sensorial input).
          • Every SDR-bit is an Explicit Part of the signal.
          • Similar “things” have similar SDRs.
          • The Union of SDRs maintains all information of its members.
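The SDR properties listed on this slide (very long binary vectors, at most 2% of bits set, unions that keep their members recoverable) can be illustrated with a minimal Python sketch. The vector length of 16384 and the helper names are assumptions made for illustration; the slide only says "very long", and this is not Cortical.io's or Numenta's implementation.

```python
import random

SDR_SIZE = 16384     # assumed length; the slide only says "very long"
MAX_DENSITY = 0.02   # "max. 2% of 1" from the slide

def random_sdr(rng, size=SDR_SIZE, max_density=MAX_DENSITY):
    """Represent an SDR as the set of positions whose bit is 1."""
    return frozenset(rng.sample(range(size), int(size * max_density)))

def density(sdr, size=SDR_SIZE):
    """Fraction of bits that are set."""
    return len(sdr) / size

def union(*sdrs):
    """Bitwise OR of several SDRs."""
    return frozenset().union(*sdrs)

def contains(union_sdr, member):
    """A member is recoverable if all of its bits are present in the union."""
    return member <= union_sdr

rng = random.Random(0)
a, b, c = (random_sdr(rng) for _ in range(3))
u = union(a, b)
print(density(a) <= MAX_DENSITY)
print(contains(u, a), contains(u, b), contains(u, c))
```

Because the vectors are sparse, an unrelated SDR such as `c` is extremely unlikely to have all of its bits covered by the union of `a` and `b`, which is why membership in a union can be tested by a simple subset check.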
  • 6. Virtual Word Layer [Diagram labels: hear, see, touch; word (SDR); word sensor stream; word production stream; symbol input; symbol output; muscles; motor output; virtualization into Retina]
  • 7. Offline Process: RetinaDB Generation. Retina Training defines the Semantic Universe. The Training Collection specifies all vocabulary, linguistic properties and knowledge. The Semantic Folding Engine generates a Semantic Map. Every utterance is positioned within the Semantic Space. Every term is defined by its distributed selection of utterances/contexts. A topographic bit-vector is generated for each term of the corpus. [Diagram labels: Training Collection, Preprocessing, Semantic Folding Engine, Fingerprint Generation]
  • 8. Retina-API Operation. The generated topographic bit-vectors are called Semantic Fingerprints. The Semantic Fingerprints are stored in the highly performant RetinaDB. The RetinaDB is a complete Language Model. The Retina-API provides functions to convert, compare, dissect, classify and extract text. The user application interacts via a REST interface. [Diagram labels: Offline Process (Training Collection, Preprocessing, Semantic Folding Engine, Fingerprint Generation); Online Process (RetinaDB, Retina API, User Application, REST call); Functions out, Fingerprint out, Compare out]
  • 10. Aligning Semantic Spaces: Concepts and their representations are stable across languages. Example terms: philosophy, philosophie, filosofía, философия, فلسفة. Languages: EN, FR, ES, RU, AR, ZH.
  • 11. The Cortical.io Retina Technology converts any text into a semantic fingerprint. Example (Fingerprint Generation): “teens like playing good music with their mobile phones”.
  • 12. Step 1: Word Fingerprints. [Figure: word fingerprint of “organ”, with labels piano, church, liver]
  • 13. Step 2: Text Fingerprints (aggregation + sparsification). Example text: “teens like to hear music on their mobile phones”.
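The slide names the two operations, aggregation and sparsification, without giving details. One plausible reading, sketched below in Python, is to count how many word fingerprints set each position and then keep only the most frequently set positions. The function name, the `max_bits` parameter and the toy fingerprints are hypothetical; the Retina's actual procedure may differ.

```python
from collections import Counter

def text_fingerprint(word_fingerprints, max_bits):
    """Aggregate word fingerprints, then sparsify to at most max_bits positions.

    Each fingerprint is a set of active bit positions (an assumed encoding).
    """
    # Aggregation: count how many words activate each position.
    counts = Counter()
    for fp in word_fingerprints:
        counts.update(fp)
    # Sparsification: keep the most frequently activated positions.
    # Ties are broken by position so the result is deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return frozenset(pos for pos, _ in ranked[:max_bits])

# Toy word fingerprints (made-up positions, for illustration only).
words = {
    "teens": {1, 2, 3},
    "music": {2, 3, 4},
    "phones": {3, 4, 5},
}
print(sorted(text_fingerprint(words.values(), max_bits=3)))
```

Keeping a fixed budget of bits means a long text and a single word yield fingerprints of comparable sparsity, so they can be compared with the same overlap measure.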
  • 14. Similar meanings look similar: 37% overlap between “teens like using itunes on their iphone” and “he consumes chart hits on his notebook”.
  • 15. Different meanings look different: 5% overlap between “teens like using itunes on their iphone” and “the fishermen are sailing out of the harbor”.
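The overlap figures on the two slides above compare fingerprints bit by bit. The slides do not state how the percentage is normalised, so the Python sketch below assumes shared bits divided by the size of the smaller fingerprint; the fingerprints are synthetic sets built to reproduce 37% and 5%, not real Retina output.

```python
def overlap_percent(fp_a, fp_b):
    """Shared active bits as a percentage of the smaller fingerprint (assumed normalisation)."""
    if not fp_a or not fp_b:
        return 0.0
    shared = len(fp_a & fp_b)
    return 100.0 * shared / min(len(fp_a), len(fp_b))

# Synthetic fingerprints with 100 active bits each.
teens_itunes = set(range(0, 100))
chart_hits = set(range(63, 163))    # shares positions 63..99 with teens_itunes
fishermen = set(range(95, 195))     # shares positions 95..99 with teens_itunes

print(overlap_percent(teens_itunes, chart_hits))  # 37.0
print(overlap_percent(teens_itunes, fishermen))   # 5.0
```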
  • 16. Evaluation
      There are very few comparable algorithms: a couple of academic ones that cannot be readily used for production purposes, and Google’s Word2Vec.

                                   MEN-3K   RG-65   WS-353
      word2vec (Google)            55.2     44.8    54.7
      Yu & Dredze (2014)           50.1     47.1    53.7
      cortical.io Retina           67.4     71.3    62.2
      % better than word2vec       18.1     37.2    12.1
      % better than Yu & Dredze    25.7     33.9    13.7

      Sources:
      • The MEN Test Collection: http://clic.cimec.unitn.it/~elia.bruni/MEN.html
      • The RG-65 Test Collection: http://www.aclweb.org/aclwiki/index.php?title=RG-65_Test_Collection_(State_of_the_art)
      • The WordSimilarity-353 Test Collection: http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/
      • Yu & Dredze 2014: http://arxiv.org/pdf/1411.4166.pdf
      • Distributed representations of words and phrases: http://papers.nips.cc/paper/5021-di
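The slide does not say how the "% better" rows are computed. Checking the arithmetic shows that they match the score gap divided by the Retina score, not by the baseline score. The short Python check below reproduces all six figures under that reading; the function name is illustrative.

```python
scores = {
    "word2vec":    {"MEN-3K": 55.2, "RG-65": 44.8, "WS-353": 54.7},
    "Yu & Dredze": {"MEN-3K": 50.1, "RG-65": 47.1, "WS-353": 53.7},
    "Retina":      {"MEN-3K": 67.4, "RG-65": 71.3, "WS-353": 62.2},
}

def percent_better(retina, baseline):
    """Score gap as a percentage of the Retina score, rounded to one decimal."""
    return round(100.0 * (retina - baseline) / retina, 1)

for baseline in ("word2vec", "Yu & Dredze"):
    row = [percent_better(scores["Retina"][t], scores[baseline][t])
           for t in ("MEN-3K", "RG-65", "WS-353")]
    print(baseline, row)
# word2vec [18.1, 37.2, 12.1]
# Yu & Dredze [25.7, 33.9, 13.7]
```

Measured against the baseline instead, the gaps would be larger; for example, 67.4 versus 55.2 on MEN-3K is about 22.1% above word2vec.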
  • 17. Semantic Folding Products
      Enterprise Semantic Search
      • Document Retrieval
      • Expert Finding
      • Knowledge Management
      Big Text Data
      • Semantic Streaming Text Filter
      • (Social) Media Monitoring
      • Business Intelligence & Analytics
      Semantic Matching
      • Natural Language based Automation
      • Content Personalisation
      • Semantic Profiling
      [Diagram labels: similarity engine; example document; query; document index; result set; ranking; most similar documents ordered along the user's information need; Topic of interest: #finance #markets #mobile #movies #products #trend; Analytics; Match Making; Enterprise Application; Web App]
  • 19. Semantic Search [Diagram labels: similarity engine; example document; query; document index; result set; ranking; most similar documents ordered along the user's information need]
  • 20. Semantic Content Filter: real-time, across languages, intelligent, meaning based. [Diagram labels: Topic of interest: #finance #markets #mobile #movies #products #trend; Analytics]
  • 21. Example: Twitter Filter - The State of the Art
      Desired topic: every tweet related to smart phones.
      200 catch words: mobile phone Iphone cell Android … sim-card text message network Verizon Apple Google …
      Input: 20,000 tweets/sec, 5 words per tweet.
      Required throughput for one filter: 200 × 5 × 20,000 = 20,000,000 comparisons per second.
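The throughput figure on this slide follows from comparing every word of every tweet against every catch word. A small Python sketch of that arithmetic, using the slide's numbers (the function name is illustrative):

```python
def comparisons_per_second(catch_words, words_per_tweet, tweets_per_second):
    """Keyword filter cost: each tweet word is compared with each catch word."""
    return catch_words * words_per_tweet * tweets_per_second

print(comparisons_per_second(200, 5, 20_000))  # 20000000
```

The cost grows linearly with the length of the catch-word list, so widening a topic's vocabulary makes each keyword filter proportionally more expensive.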
  • 22. The State of the Art: Cost per Filter: $10,000+ per Month
  • 23. Example: Twitter Filter - Semantic Fingerprinting [Diagram labels: twitter firehose; stream of semantic fingerprints; Filter Fingerprint; match / not matching; realtime content sub-stream]
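The diagram on this slide can be read as: each incoming tweet is converted to a fingerprint once, then compared with a single filter fingerprint, and only matching tweets reach the sub-stream. The Python sketch below illustrates that reading with precomputed toy fingerprints. The match rule (a minimum number of shared bits), the threshold and all names are assumptions, not the product's documented behaviour.

```python
def matches(filter_fp, tweet_fp, min_shared_bits):
    """Assumed match rule: enough active bits in common with the filter."""
    return len(filter_fp & tweet_fp) >= min_shared_bits

def filter_stream(stream, filter_fp, min_shared_bits):
    """Yield the text of every (text, fingerprint) pair that matches the filter."""
    for text, tweet_fp in stream:
        if matches(filter_fp, tweet_fp, min_shared_bits):
            yield text

# Toy data: fingerprints are made-up sets of active bit positions.
smartphone_filter = frozenset({1, 2, 3, 4})
firehose = [
    ("new phone released", frozenset({2, 3, 4, 9})),
    ("fishing boats leave harbor", frozenset({7, 8, 9})),
    ("android update", frozenset({1, 2, 8})),
]
print(list(filter_stream(firehose, smartphone_filter, min_shared_bits=2)))
```

Under this reading each tweet costs one set intersection per filter, independent of how many catch words a keyword list for the same topic would need.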
  • 24. Cortical.io Streaming Text Filter: Cost per Filter: $10 per Month. [Diagram labels: convert 100,000+ tweets per second; 1,000+ semantic filters; + one per firehose; scalable with number of Filters]
  • 25. Dynamic Topic Pattern Analysis. Topic Monitoring: unseen topics or sudden topic jumps are detected. Compliance Monitoring. [Figure labels: ongoing e-mail conversation; Time; appearance of unseen topic clusters]
  • 26. Bridging the Vocabulary Gap: similar meanings “look” similar. Special “Financial Retina”. Words: fraud. Expressions: corruption AND mafia. Idioms: “anti human trafficking”. Text: Money laundering is the process of transforming the proceeds of crime into ostensibly legitimate money or other assets.
  • 27. Combine Fingerprints with AI Algorithms: Text Anomaly Detection. Enabling Artificial Intelligence Applications: Realtime Anomaly Detection in Text Streams. Any text stream: email, chat, message forums, blog posts, Facebook posts.
  • 28. Combine Fingerprints with AI Algorithms. Demo: http://www.cortical.io/demos/semantic-anomaly-detection/
  • 29. Links
      • website: cortical.io
      • product: https://aws.amazon.com/marketplace/pp/B00T5794P6/
      • twitter: http://twitter.com/cortical_io
      • video: https://www.youtube.com/watch?v=g3ZxJokDpds
      • demos: http://www.cortical.io/demos.html
      • API: http://api.cortical.io
      • Numenta: http://numenta.com