SlideShare ist ein Scribd-Unternehmen logo
1 von 10
No, we’re not a travel agency www.tripadvisor.com/careers
Postgres at TripAdvisor By Don O’Neill & Boris Partensky 2 © 2011 TripAdvisor LLC All rights reserved.
TripAdvisor is the world’s largesttravel site 3 © 2011 TripAdvisor LLC All rights reserved.
4 40M monthly visitors 4M visitors/day 20+M registered members 45+M reviews & opinions © 2011 TripAdvisor LLC All rights reserved.
Business runs on Postgres Large Mission critical deployment Combination of 8.3 and 8.4 Site Technologies include Linux (CentOS), Apache Java, Tomcat, Velocity, and Memcached No commercial Software – All Open Source Web Scale Deployment Over 100 Frontend and Backend servers per Data Center 14 Postgres Database Servers per Data Center DR Data Center with Replication between the two Platform 5 © 2011 TripAdvisor LLC All rights reserved.
33 Postgres Servers across 3 Data Centers Not including Dev and QA servers SSD’s instead of Spinning disks for select servers Multiple Read Only DB Servers  Load Balanced using BigIP Sync’d using DB Mirror replication Just under 1 TB of data in multiple databases Sharded Pin Table  Over 1.5 billion rows Sharded to improve read and write performance  Spread over 12 databases on 2 Servers Scale out to multiple servers as needed Routing logic built into our application layer By the Numbers 6 © 2011 TripAdvisor LLC All rights reserved.
All Servers configured as Active/Passive Cluster Not using Shared Disks or SAN’s DRBD for Block level replication between systems Heartbeat for automatic failover between nodes Data Replicated to DR Site TripAdvisor has 2 Data centers (MA and VA) One site “Live” at a time Equal capacity at both sites Routinely switch between sites Data Replicated between data centers Redundancy/Failover 7 © 2011 TripAdvisor LLC All rights reserved.
Replication based on DBmirror Like Slonyonly deprecated  Community project once part of the core distribution Trigger based Master/Slave Replication Written in C and Perl All replicated tables must have primary keys TripAdvisor Updates include A Rewrite in Java! Support for SQL Server Lots of Hardening for large scale use Replication 8 © 2011 TripAdvisor LLC All rights reserved.
20 TB Data Warehouse today Expecting to grow significantly in 12 months 16 Node cluster expanding to 24 Based on Hadoop and Hive Reporting and Cubes built with SQL Server End users access the data via Excel and Reporting Services Critical for the growth of the business Long term data storage in Hadoop and not our SAN! Mobile Applications Award winning travel application Supported in iPhone, iPad, Android, Nokia, Windows, Palm http://www.tripadvisor.com/MobileApps Facebook Integration More and more users logging in via Facebook Better social experience on the site Major  Initiatives 9 © 2011 TripAdvisor LLC All rights reserved.
Q&A Questions? Did I mention we’re hiring? 10 © 2011 TripAdvisor LLC All rights reserved.

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

The Stream is the Database - Revolutionizing Healthcare Data Architecture
The Stream is the Database - Revolutionizing Healthcare Data ArchitectureThe Stream is the Database - Revolutionizing Healthcare Data Architecture
The Stream is the Database - Revolutionizing Healthcare Data Architecture
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
 
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
 
Spark Summit Keynote by Seshu Adunuthula
Spark Summit Keynote by Seshu AdunuthulaSpark Summit Keynote by Seshu Adunuthula
Spark Summit Keynote by Seshu Adunuthula
 
Introduction to basic data analytics tools
Introduction to basic data analytics toolsIntroduction to basic data analytics tools
Introduction to basic data analytics tools
 
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...
 
Big and fast a quest for relevant and real-time analytics
Big and fast a quest for relevant and real-time analyticsBig and fast a quest for relevant and real-time analytics
Big and fast a quest for relevant and real-time analytics
 
Spark and the Enterprise by Tony Baer
Spark and the Enterprise by Tony BaerSpark and the Enterprise by Tony Baer
Spark and the Enterprise by Tony Baer
 
Middle Tier Scalability - Present and Future
Middle Tier Scalability - Present and FutureMiddle Tier Scalability - Present and Future
Middle Tier Scalability - Present and Future
 
MongoDB World 2019: Streaming ETL on the Shoulders of Giants
MongoDB World 2019: Streaming ETL on the Shoulders of GiantsMongoDB World 2019: Streaming ETL on the Shoulders of Giants
MongoDB World 2019: Streaming ETL on the Shoulders of Giants
 
Spark Usage in Enterprise Business Operations
Spark Usage in Enterprise Business OperationsSpark Usage in Enterprise Business Operations
Spark Usage in Enterprise Business Operations
 
Moving eBay’s Data Warehouse Over to Apache Spark – Spark as Core ETL Platfor...
Moving eBay’s Data Warehouse Over to Apache Spark – Spark as Core ETL Platfor...Moving eBay’s Data Warehouse Over to Apache Spark – Spark as Core ETL Platfor...
Moving eBay’s Data Warehouse Over to Apache Spark – Spark as Core ETL Platfor...
 
Cassandra Essentials Day Cambridge
Cassandra Essentials Day CambridgeCassandra Essentials Day Cambridge
Cassandra Essentials Day Cambridge
 
DataStax Enterprise in Practice (Field Notes)
DataStax Enterprise in Practice (Field Notes)DataStax Enterprise in Practice (Field Notes)
DataStax Enterprise in Practice (Field Notes)
 
Unlocking Value in Device Data Using Spark: Spark Summit East talk by John La...
Unlocking Value in Device Data Using Spark: Spark Summit East talk by John La...Unlocking Value in Device Data Using Spark: Spark Summit East talk by John La...
Unlocking Value in Device Data Using Spark: Spark Summit East talk by John La...
 
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardwareSomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
 
Presto summit israel 2019-04
Presto summit   israel 2019-04Presto summit   israel 2019-04
Presto summit israel 2019-04
 
Spark Summit presentation by Ken Tsai
Spark Summit presentation by Ken TsaiSpark Summit presentation by Ken Tsai
Spark Summit presentation by Ken Tsai
 
Winning the On-Demand Economy with Spark and Predictive Analytics
Winning the On-Demand Economy with Spark and Predictive AnalyticsWinning the On-Demand Economy with Spark and Predictive Analytics
Winning the On-Demand Economy with Spark and Predictive Analytics
 
Data analytics at a petabyte scale final
Data analytics at a petabyte scale   finalData analytics at a petabyte scale   final
Data analytics at a petabyte scale final
 

Ähnlich wie Postgres Sql at TripAdvisor

Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and ImplyAchieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
confluent
 
Introduction to Cloud Service Design
Introduction to Cloud Service DesignIntroduction to Cloud Service Design
Introduction to Cloud Service Design
evancmiller
 

Ähnlich wie Postgres Sql at TripAdvisor (20)

Senior database administrator
Senior database administratorSenior database administrator
Senior database administrator
 
Handling Data in Mega Scale Systems
Handling Data in Mega Scale SystemsHandling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
 
Full Stack Web Developer (MERN STACK Developer.pptx
Full Stack Web Developer (MERN STACK Developer.pptxFull Stack Web Developer (MERN STACK Developer.pptx
Full Stack Web Developer (MERN STACK Developer.pptx
 
Top local databases for react native app development
Top local databases for react native app developmentTop local databases for react native app development
Top local databases for react native app development
 
Neethu_Abraham
Neethu_AbrahamNeethu_Abraham
Neethu_Abraham
 
List of Top Local Databases used for react native app developement in 2022
List of Top Local Databases used for react native app developement in 2022					List of Top Local Databases used for react native app developement in 2022
List of Top Local Databases used for react native app developement in 2022
 
Introduction to NoSQL and Couchbase
Introduction to NoSQL and CouchbaseIntroduction to NoSQL and Couchbase
Introduction to NoSQL and Couchbase
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Metadata describes about data
Metadata describes about dataMetadata describes about data
Metadata describes about data
 
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and ImplyAchieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
 
Starting Your DevOps Journey – Practical Tips for Ops
Starting Your DevOps Journey – Practical Tips for OpsStarting Your DevOps Journey – Practical Tips for Ops
Starting Your DevOps Journey – Practical Tips for Ops
 
Whats new in Oracle Database 12c release 12.1.0.2
Whats new in Oracle Database 12c release 12.1.0.2Whats new in Oracle Database 12c release 12.1.0.2
Whats new in Oracle Database 12c release 12.1.0.2
 
Introduction to Cloud Service Design
Introduction to Cloud Service DesignIntroduction to Cloud Service Design
Introduction to Cloud Service Design
 
Cloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native appsCloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native apps
 
React Native Local Database For Super Smart Applications.pdf
React Native Local Database For Super Smart Applications.pdfReact Native Local Database For Super Smart Applications.pdf
React Native Local Database For Super Smart Applications.pdf
 
Rahul_Resume_310116
Rahul_Resume_310116Rahul_Resume_310116
Rahul_Resume_310116
 
Ops Jumpstart: MongoDB Administration 101
Ops Jumpstart: MongoDB Administration 101Ops Jumpstart: MongoDB Administration 101
Ops Jumpstart: MongoDB Administration 101
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar Slides
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

Postgres Sql at TripAdvisor

  • 1. No, we’re not a travel agency www.tripadvisor.com/careers
  • 2. Postgres at TripAdvisor By Don O’Neill & Boris Partensky 2 © 2011 TripAdvisor LLC All rights reserved.
  • 3. TripAdvisor is the world’s largesttravel site 3 © 2011 TripAdvisor LLC All rights reserved.
  • 4. 4 40M monthly visitors 4M visitors/day 20+M registered members 45+M reviews & opinions © 2011 TripAdvisor LLC All rights reserved.
  • 5. Business runs on Postgres Large Mission critical deployment Combination of 8.3 and 8.4 Site Technologies include Linux (CentOS), Apache Java, Tomcat, Velocity, and Memcached No commercial Software – All Open Source Web Scale Deployment Over 100 Frontend and Backend servers per Data Center 14 Postgres Database Servers per Data Center DR Data Center with Replication between the two Platform 5 © 2011 TripAdvisor LLC All rights reserved.
  • 6. 33 Postgres Servers across 3 Data Centers Not including Dev and QA servers SSD’s instead of Spinning disks for select servers Multiple Read Only DB Servers Load Balanced using BigIP Sync’d using DB Mirror replication Just under 1 TB of data in multiple databases Sharded Pin Table Over 1.5 billion rows Sharded to improve read and write performance Spread over 12 databases on 2 Servers Scale out to multiple servers as needed Routing logic built into our application layer By the Numbers 6 © 2011 TripAdvisor LLC All rights reserved.
  • 7. All Servers configured as Active/Passive Cluster Not using Shared Disks or SAN’s DRBD for Block level replication between systems Heartbeat for automatic failover between nodes Data Replicated to DR Site TripAdvisor has 2 Data centers (MA and VA) One site “Live” at a time Equal capacity at both sites Routinely switch between sites Data Replicated between data centers Redundancy/Failover 7 © 2011 TripAdvisor LLC All rights reserved.
  • 8. Replication based on DBmirror Like Slonyonly deprecated Community project once part of the core distribution Trigger based Master/Slave Replication Written in C and Perl All replicated tables must have primary keys TripAdvisor Updates include A Rewrite in Java! Support for SQL Server Lots of Hardening for large scale use Replication 8 © 2011 TripAdvisor LLC All rights reserved.
  • 9. 20 TB Data Warehouse today Expecting to grow significantly in 12 months 16 Node cluster expanding to 24 Based on Hadoop and Hive Reporting and Cubes built with SQL Server End users access the data via Excel and Reporting Services Critical for the growth of the business Long term data storage in Hadoop and not our SAN! Mobile Applications Award winning travel application Supported in iPhone, iPad, Android, Nokia, Windows, Palm http://www.tripadvisor.com/MobileApps Facebook Integration More and more users logging in via Facebook Better social experience on the site Major Initiatives 9 © 2011 TripAdvisor LLC All rights reserved.
  • 10. Q&A Questions? Did I mention we’re hiring? 10 © 2011 TripAdvisor LLC All rights reserved.