SlideShare ist ein Scribd-Unternehmen logo
1 von 47
Downloaden Sie, um offline zu lesen
DAVID GEVORKYAN 
@davidgev 
davidgevorkyan
WHO ARE WE?
EHARMONY CREATES 
THE HAPPIEST, 
MOST PASSIONATE 
AND MOST FULFILLING 
RELATIONSHIPS* 
*ACCORDING TO A RECENT STUDY
438 
MARRIAGES PER DAY
THE DIFFERENCE?
THE DIFFERENCE? 
Compatibility Matching System® 
COMPAT I B I L I T Y 
MATCHING 
AFFINITY 
MATCHING 
MATCH 
DISTRIBUTION
THE DIFFERENCE? 
Compatibility Matching System® 
COMPAT I B I L I T Y 
MATCHING 
AFFINITY 
MATCHING 
MATCH 
DISTRIBUTION
UNIDIRECTIONAL USER DEFINED CRITERIA 
Nicolette
BIDIRECTIONAL 
UNIDIRECTIONAL USER DEFINED CRITERIA 
Leo 
Ian 
Steve 
Nicolette
BIDIRECTIONAL 
UNIDIRECTIONAL USER DEFINED CRITERIA 
Leo 
Ian 
Steve 
Nicolette
150 
ques)ons 
Personality 
Values 
A5ributes 
Beliefs
Intellect 
Energy 
Sociability 
Ambition 
Humor 
Spirituality 
Kindness 
Curiosity
COMPATIBILITY MATCHING 
USER DEFINED 
CRITERIA 
COMPAT I B I L I T Y 
MODELS 
MONGO DB 
VOLDEMORT
MONGO DB 
DATA STORE NEEDS 
POWERFUL 
INDEXING 
MODELS 
FAST MULTI - 
ATTRIBUTE 
SEARCHES 
EASY TO 
MAINTAIN 
60M+ 
QUERIES 
per day
MONGO DB 
WINS 
AUTO 
SCALING 
BUILT- I N 
SHARDING 
AUTO 
BALANCING 
MMS
VOLDEMORT ? 
THAT NAME 
SOUNDS FAMILIAR
VOLDEMORT 
DATA STORE NEEDS 
CRUD 
OPERATIONS 
VARIED 
TRANSACTION 
SIZES 
BILLION+ 
POTENTIAL 
MATCHES 
per day
VOLDEMORT 
WINS 
AUTO 
REPLICATION 
AUTO 
PARTITIONING 
PLUGGABLE 
SERIALIZATION
AFFINITY MATCHING 
Compatibility Matching System® 
COMPAT I B I L I T Y 
MATCHING 
AFFINITY 
MATCHING 
MATCH 
DISTRIBUTION
65 30 
3000 miles
Comm probability Distance in Miles 
PROB 
0 1 3 7 15 63 255 1023 4095
4 
-­‐ 
8 
in 
Comm probability Height difference in cm 
PROB 
-29 -25 -21 -17 -13 -9 -6 -3 0 3 6 9 12 16 20 24 28 32 36 40 44 48 52 56
WORDS TO USE
WORDS TO USE
SOME INSIGHT
DATA NEEDS FOR AFFINITY 
50M+ REGISTERED USERS 
103 ATTRIBUTES 
250M+ 
PHOTOS 
107 DAILY MATCHES 
4B+ QUESTIONNAIRES 
ANSWERED
COMMUNICATION AGGREGATES 
EVENT LISTENER 
SERVICE 
USER ACTIVITY 
SERVICE 
10K EVENTS 
PER SECOND 
HOURLY, DAILY 
~5MS 
RESPONSE 
TIMES 
USER 
SERVICE 
TOTAL
OFFLINE BATCH JOBS 
USER 
SERVICE 
1+GB 
Compressed 
Protocol 
MAP-SIDE JOINS 
Buffers 
( T B ) SCORING 
PAIRINGS 
SERVICE 
! 
750M 
Compressed 
Protocol 
Buffers 
BILLION+ 
POTENTIAL 
MATCHES
AMAZON 
EMR 
AWS DIRECT 
CONNECT 
IN-HOUSE 
SEAMICRO 
256 NODES 
50TB STORAGE 
LOW OPERATIONAL COST 
LOW POWER CONSUMPTION 
PREDICTABLE COMPLETION TIMES 
DATA RETRIEVAL LATENCY
MODEL RETRAINING 
distcp 
Protocol 
Buffers 
from 
Offline 
Jobs
MATCH DISTRIBUTION 
Compatibility Matching System® 
COMPAT I B I L I T Y 
MATCHING 
AFFINITY 
MATCHING 
MATCH 
DISTRIBUTION
Delivering the right matches 
at the right time to as many 
people as possible across 
the entire network
PROB 
2 2
PROB 
2 2
PROB
MONITORING
metrics.codahale.com
We Are Hiring! 
jobs.eharmony.com
THANK YOU 
QUESTIONS? 
@davidgev
CREDITS: 
Visual Elements From 
The Noun Project 
http://thenounproject.com

Weitere ähnliche Inhalte

Andere mochten auch

Apple Inc. Business Overview
Apple Inc. Business OverviewApple Inc. Business Overview
Apple Inc. Business OverviewZoe Robinson
 
Machine Learning Loves Hadoop
Machine Learning Loves HadoopMachine Learning Loves Hadoop
Machine Learning Loves HadoopCloudera, Inc.
 
Anna gray eHarmony sept28
Anna gray eHarmony sept28Anna gray eHarmony sept28
Anna gray eHarmony sept28Anna Gray
 
eHarmony Strategic Marketing Case Study
eHarmony Strategic Marketing Case StudyeHarmony Strategic Marketing Case Study
eHarmony Strategic Marketing Case StudyZoe Robinson
 
Be Like Bamboo (TEDxTokyo 2011 slides)
Be Like Bamboo (TEDxTokyo 2011 slides)Be Like Bamboo (TEDxTokyo 2011 slides)
Be Like Bamboo (TEDxTokyo 2011 slides)garr
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with DataSeth Familian
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 20173 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 2017Drift
 

Andere mochten auch (7)

Apple Inc. Business Overview
Apple Inc. Business OverviewApple Inc. Business Overview
Apple Inc. Business Overview
 
Machine Learning Loves Hadoop
Machine Learning Loves HadoopMachine Learning Loves Hadoop
Machine Learning Loves Hadoop
 
Anna gray eHarmony sept28
Anna gray eHarmony sept28Anna gray eHarmony sept28
Anna gray eHarmony sept28
 
eHarmony Strategic Marketing Case Study
eHarmony Strategic Marketing Case StudyeHarmony Strategic Marketing Case Study
eHarmony Strategic Marketing Case Study
 
Be Like Bamboo (TEDxTokyo 2011 slides)
Be Like Bamboo (TEDxTokyo 2011 slides)Be Like Bamboo (TEDxTokyo 2011 slides)
Be Like Bamboo (TEDxTokyo 2011 slides)
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with Data
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 20173 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 2017
 

Kürzlich hochgeladen

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 

Kürzlich hochgeladen (20)

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 

Mapping Love with Hadoop