3. Same term has many different meanings: organize content by meaning rather than keywords. Unique challenges & opportunities. Example: Twilight, the 2008 film (over 50 meanings in Wikipedia, 8 of them films).
4. Same meaning has different names: organize content by meaning rather than keywords. Unique challenges & opportunities. Example: "Bridge to Nowhere" (disambiguation), matched to the Knik Arm Bridge, Alaska.
Hello, my name is Tal Muskal and I am the CTO and founder of SemantiNet. We develop solutions for organizing web content using semantic technologies. We have been around since 2006, and we have been using Amazon Web Services for about two years now. I would like to present some of the challenges that we tackle and how Amazon helps us overcome them.
So, as I said, we develop a platform for organizing web content, and we do this by utilizing semantic content analysis. What do I mean by semantic analysis or semantic querying? And how does it help with organizing content? I will give you a few examples.
When analyzing this blog post, for example, the term "twilight" may refer to many different things: one of the 10 music albums or dozens of songs with that name, the place in Pennsylvania, Twilight the game developer, or the Star Trek episode with that name. Or one of the 8 films with that name, or maybe the original comics this movie was based on. It is important to recognize the right entity, because later we want to use its properties for indexing. If we are wrong about the specific entity, we will also use the wrong properties, which amplifies the problem.
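The kind of disambiguation described above can be sketched as picking the candidate sense whose related terms overlap the surrounding context the most. This is a minimal illustration, not SemantiNet's actual algorithm, and the candidate names and term sets are made up for the example:

```python
# Hypothetical disambiguation by context overlap. Each candidate sense
# of "twilight" gets a set of terms that tend to co-occur with it
# (illustrative data, not a real knowledge base).
CANDIDATES = {
    "Twilight (2008 film)": {"vampire", "bella", "edward", "movie"},
    "Twilight (Star Trek episode)": {"enterprise", "archer", "starship"},
    "Twilight, Pennsylvania": {"borough", "pennsylvania", "county"},
}

def disambiguate(context_terms):
    """Return the sense whose related terms overlap the context most."""
    scores = {name: len(terms & context_terms)
              for name, terms in CANDIDATES.items()}
    return max(scores, key=scores.get)

print(disambiguate({"bella", "edward", "vampire", "romance"}))
# → Twilight (2008 film)
```

A real system would score against the knowledge graph rather than flat term sets, but the shape of the decision is the same.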
Another quick example: here the Knik Arm Bridge is referred to as the "bridge to nowhere", and that specific bridge can also be referred to as "Don Young's Way". So the same object may have many different names.
Once we understand which entities a web page contains, we can organize this content based on implicit facts: facts that a reader would usually know before reading the text, but which are not explicitly stated, like the fact that LeBron James is a basketball player, or that Bibi Netanyahu is related to the peace talks process. This allows us to create special pages that aggregate content based on object properties: for example, a page that aggregates articles about "Nobel prize winners", a page about a neighborhood (containing articles that mention landmarks in that neighborhood), or even a page that aggregates articles about "movies in theatres near me". We also generate pages about specific entities that contain contextually relevant metadata.
Here we can see a topic page for Sir Michael Caine that we generated for a UK movie blog called HeyUGuys. As you can see, we include a short abstract; films; directors and actors who worked with him on many films (and on which films, in this little popup if you hover over one of them); related topics; interview videos; an image; and friends who are fans of Michael Caine. And of course, articles from this site that are related to this topic.
Or we can generate a broader topic page, such as this "Peace talks" page we generated for The Jerusalem Post, containing all the articles around this topic (even though the words "peace talks" do not necessarily appear in the articles).
So just to summarize, these are the main challenges we face in semantic analysis. It requires prior knowledge about entities, and I want to talk about this for a minute.
This prior knowledge is what we call the world knowledge graph: a huge network of named objects from the real world (people, places, bands, movies, companies, even things like dog breeds and chemical elements), the connections between them, and their properties (such as birthdate, height, or, for a movie, the release date). Resolving ambiguity requires performing very high-speed graph operations over this huge data source, so we had to develop our own GraphDB, and we update the data on a weekly basis. For this we use EC2's elasticity: instead of generating the graph over a week (which would result in week-old data), we generate it in a day, every week. It costs the same, and the data is fresher; theoretically we could reduce it to one hour at the same cost.
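To make the "high-speed graph operations in memory" point concrete, here is a toy sketch of the kind of compact in-memory structure such a graph might use: entity names interned to integer ids, with edges as adjacency sets for fast neighborhood lookups. The class and its layout are illustrative assumptions, not SemantiNet's actual GraphDB:

```python
# Toy in-memory knowledge graph: names interned to integer ids,
# edges stored as adjacency sets so neighbor lookups are O(1) per id.
class KnowledgeGraph:
    def __init__(self):
        self.ids = {}      # entity name -> int id
        self.names = []    # int id -> entity name
        self.adj = []      # int id -> set of neighbor ids

    def intern(self, name):
        """Map a name to a compact integer id, creating it if new."""
        if name not in self.ids:
            self.ids[name] = len(self.names)
            self.names.append(name)
            self.adj.append(set())
        return self.ids[name]

    def add_edge(self, a, b):
        ia, ib = self.intern(a), self.intern(b)
        self.adj[ia].add(ib)
        self.adj[ib].add(ia)

    def neighbors(self, name):
        return {self.names[i] for i in self.adj[self.ids[name]]}

g = KnowledgeGraph()
g.add_edge("LeBron James", "basketball")
g.add_edge("LeBron James", "Cleveland Cavaliers")
print(g.neighbors("LeBron James"))
```

Interning names to integers is one common way to get a large graph "compressed enough to fit in memory"; a production store would add compressed edge lists and property tables on top.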
So, these are the basic pillars of our technology: 1) Semantic Analysis: taking web pages, extracting entities and facts, and resolving context using the knowledge in the graph. 2) Knowledge Graph: built from many sources, highly compressed, compressed enough to fit in memory. 3) Semantic Index: gives us the ability to query based on object properties or different hierarchies (we could do a page for "tall married old British actors"), and feeds facts from the index back into the knowledge graph.
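Querying by object properties rather than keywords, as the semantic index does, can be sketched roughly like this. The entities, property names, and articles below are invented for illustration:

```python
# Toy semantic index: articles are matched through the properties of
# the entities they mention, so a query never needs the literal
# keywords to appear in the text. All data here is illustrative.
ENTITIES = {
    "Michael Caine": {"type": "actor", "nationality": "British", "born": 1933},
    "LeBron James":  {"type": "basketball player", "nationality": "American", "born": 1984},
}
ARTICLES = [
    {"title": "Caine on his new film", "mentions": ["Michael Caine"]},
    {"title": "Finals recap",          "mentions": ["LeBron James"]},
]

def query(**props):
    """Return titles of articles mentioning an entity with all given properties."""
    matches = {name for name, p in ENTITIES.items()
               if all(p.get(k) == v for k, v in props.items())}
    return [a["title"] for a in ARTICLES
            if matches & set(a["mentions"])]

print(query(type="actor", nationality="British"))
# → ['Caine on his new film']
```

The article about Caine is returned for a "British actors" query even though neither word appears in its title; that is the essence of indexing by meaning.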
Let's look at a common flow in our system. There are different kinds of tasks that our system performs: crawling, modeling, analyzing, indexing, and rendering a topic page (some depend on others). The front-facing layer gets the request and is very responsive, never blocking: either there is a result ready, or we need to perform the task. If the results are not in S3, the job is queued in SQS and recorded in RDS (RDS contains the details and state for the task). The workers are spot instances. They receive different tasks from SQS, and they come in different sizes, since different tasks require different capabilities; our workers probe their own specification when booting and decide what kinds of jobs they can take. When a worker finishes, it writes the task status to RDS and renders results to S3, or indexes analysis results to SimpleDB. On the next request for that task, the data is ready in S3.
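The request path just described can be simulated in a few lines. Plain dicts and a queue stand in for S3, SQS, and RDS here; the real system talks to the AWS services, but the non-blocking logic is the same:

```python
import queue

# Stand-ins for the AWS services in the flow above (illustrative only):
s3 = {}               # S3: result cache, task id -> rendered result
sqs = queue.Queue()   # SQS: pending task messages
rds = {}              # RDS: task id -> status

def handle_request(task_id):
    """Front layer: never block. Serve from cache or enqueue the task."""
    if task_id in s3:
        return s3[task_id]
    if rds.get(task_id) != "queued":   # avoid queueing duplicates
        rds[task_id] = "queued"
        sqs.put(task_id)
    return "pending"

def worker_step():
    """Worker: take one task, do the work, write result and status back."""
    task_id = sqs.get()
    s3[task_id] = "rendered page for %s" % task_id
    rds[task_id] = "done"

print(handle_request("topic:caine"))   # first request: pending
worker_step()                          # a spot-instance worker runs
print(handle_request("topic:caine"))   # second request: served from cache
```

Keeping task state in a separate store (RDS here) is what lets the front layer answer instantly without waiting on the workers.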
Instance counts are based on different bottlenecks. When we start working with a new site, we need to index their entire archive quickly (500K+ articles), so during the initial setup with the site we may need to spin up 10 instances; when it's done, we can keep supporting all our clients with just one instance for new articles. Without elasticity that could mean downtime, or a very expensive setup just to support this period. Experimenting with different setups can also lead to design decisions, such as putting a specific portion of your data in memory. It may cost 4 times more, but can work 40 times faster, so it can be ten times cheaper overall.
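The memory-vs-disk tradeoff above is just cost-per-job arithmetic; the rates below are made-up placeholders to show the calculation:

```python
# Back-of-envelope for the tradeoff: 4x the hourly cost at 40x the
# throughput is 10x cheaper per job. Figures are illustrative.
base_cost, base_rate = 1.0, 100.0   # $/hour, jobs/hour on disk
mem_cost, mem_rate = base_cost * 4, base_rate * 40

print(base_cost / base_rate)   # cost per job, disk-bound setup
print(mem_cost / mem_rate)     # cost per job, in-memory setup
```

Measuring cost per unit of work, rather than cost per hour, is what makes the more "expensive" setup the cheaper one.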
What did we learn at SemantiNet about working with the cloud? It takes time to learn how to tweak costs. Cloud IT is very different and requires very different skill sets. You get control over your bottlenecks. Processed results: caching and serving from S3 is very cheap. Steps: where it makes sense, the cloud encourages you to break your system down into small modules, which helps make a more elastic solution. And since we have worked with over 100 APIs, our standards are high (design, docs, language support, etc.).
We are open to sharing our insights and helping others who are transitioning to Amazon Web Services.