CASSANDRA MEETUP - Choosing the right cloud instances for success

•

0 gefällt mir•149 views

Erick Ramirez

MELBOURNE CASSANDRA MEETUP - Choosing the right cloud instances for success

Technologie

© DataStax, All Rights Reserved.
Apache Cassandra™
Choosing instances for success
1
Erick Ramirez
DataStax Engineering
@flightc

Welcome
• Your app in focus — reads vs writes, CPU vs RAM

• What IOPS? How much is enough?

• Are ephemeral disks evil?

• False economy — cheaper instances can cost you more

• A time to kill — they’re not your pets

© DataStax, All Rights Reserved.
https://academy.datastax.com
5

© DataStax, All Rights Reserved.
ONE SIZE DOES NOT

FIT ALL
6

Tailor to workload
• intimately understand your app

• reads vs writes

• CPU vs memory

• OLTP vs OLAP

• use case will dictate requirements

© DataStax, All Rights Reserved.
STORAGE OPTIONS
8

© DataStax, All Rights Reserved.
EBS gp2 SSDs
9
• general purpose EBS option

• persistent (durable)

• default volume for EC2 instances

• guaranteed 99% single-digit millisecond latency

• only pay for each GB (IOPS included)

• minimum 10K IOPS for production workloads
3 IOPS/GB (3K IOPS/TB)
Max 10K IOPS/vol
Max 160MB/s throughput/vol
1TB = $122/mo, $1474/yr

© DataStax, All Rights Reserved.
EBS io1 SSDs
10
• fastest available EBS option

• persistent (durable)

• for latency-sensitive OLTP workloads

• guaranteed 99.9%* single-digit millisecond latency

• provisioned IOPS are charged extra

• minimum 10K IOPS for production workloads

* read the fine print
Up to 50 IOPS/GB
Max 20K IOPS/vol
Max 320MB/s throughput/vol
1TB = $141/mo, $1695/yr
1K IOPS = $72/mo, $864/yr

© DataStax, All Rights Reserved.
#spoileralert

EPHEMERAL IS YOUR FRIEND
11

Ephemeral storage
• performance orders of magnitude better than EBS

• already included in instance costs, e.g. m3, c3, i3

• “physically” attached

• not durable across reboots but…

© DataStax, All Rights Reserved.
HELLO, CASSANDRA
13

What is Cassandra
• massively scalable NoSQL database

• fully distributed, no single-point-of-failure

• linear horizontal scaling

© DataStax, All Rights Reserved.
Why Cassandra
15
• all nodes are the same — no SPOF

• real-time, durable writes

• linear scaling on commodity servers

• real-time replication across data centres

• always on — no offline operation

• because you have a scale problem

© DataStax, All Rights Reserved.16
Replication across DCs

© DataStax, All Rights Reserved.
CHEAP INSTANCES

MAY BE COSTING YOU
17

© DataStax, All Rights Reserved.
Real example
18
• deployed on c4.4xlarge

• using EBS io1 with 3K PIOPS

• nodes dropping writes

• high read latencies
16 vCPU, 30GB RAM
Instance $ 5443
EBS io1 1TB $ 1695
PIOPS 3K $ 2592
————————
Annual cost $ 9730

© DataStax, All Rights Reserved.
Recommendation
19
• swap to i3.2xlarge

• 1.9TB NVMe SSDs included

• 3M IOPS, 16GB/s

• 60-70% cheaper than replaced i2.2xlarge
8 vCPU, 61GB RAM
Instance $ 4174
————————
Annual cost $ 4174

© DataStax, All Rights Reserved.
HORSES FOR COURSES
20

© DataStax, All Rights Reserved.
Use case - dev, light prod
21
• m3.large suitable

• entry-level load, testing-the-waters

• minimum 3 C* nodes with RF=3

• use CMS GC with 2GB heap
2 vCPU
7.5GB RAM
1 x 32GB SSD
$ 962/yr

© DataStax, All Rights Reserved.
Use case -

low prod volume
22
• m3.xlarge suitable

• JVM will perform better with the extra RAM

• min 3 C* nodes with RF=3

• use CMS GC with 8GB heap
4 vCPU
15GB RAM
1 x 40GB SSD
$ 1924/yr

© DataStax, All Rights Reserved.
Use case -

moderate prod volume
23
• c3.2xlarge recommended

• more diskspace, extra cores a bonus

• costs 50% more for 2x CPU and 4x diskspace

• min 3 C* nodes with RF=3

• use CMS GC with 8GB heap
8 vCPU
15GB RAM
2 x 80GB SSD
$ 2916/yr

© DataStax, All Rights Reserved.
Use case -

real prod volume
24
• i3.2xlarge recommended

• will handle all kinds of workloads including Analytics,
Graph and Search (Solr)

• min 3 C* nodes with RF=3

• use G1 GC with 24GB heap (32GB for Search nodes)
8 vCPU
61GB RAM
1.9TB NVMe SSD
$ 4174/yr

© DataStax, All Rights Reserved.
https://datastaxacademy.slack.com
25

© DataStax, All Rights Reserved.
Thank you
26

Weitere ähnliche Inhalte

Was ist angesagt?

This sessions covers diagnosing and solving common problems encountered in production, using performance profiling tools. We’ll also give a crash course to basic JVM garbage collection tuning. Attendees will leave with a better understanding of what they should look for when they encounter problems with their in-production Cassandra cluster. This talk is intended for people with a general understanding of Cassandra, but it not required to have experience running it in production.

Cassandra Day Atlanta 2015: Diagnosing Problems in Production

DataStax Academy

Seattle Cassandra Meetup - HasOffers

btoddb

RedHat built a distributed object storage solution named Ceph which first debuted ten years ago. Now we are seeing rapid developments in the industry and we want to take advantage of them. In this talk, we will briefly introduce Ceph, revisit the problems we are seeing when profiling its I/O performance with flash device, and explain why we want to embrace the future by switching to Seastar. We’ll share our experiences with the audience of how and when we are porting our software to this framework.

Scylla Summit 2018: Rebuilding the Ceph Distributed Storage Solution with Sea...

ScyllaDB

DynamoDB at HasOffers

Amazon Web Services

10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...

DevOpsDays Tel Aviv

How to size up an Apache Cassandra cluster (Training)

DataStax Academy

Many Scylla maintenance operations require significant data movement between database nodes in a cluster. It is not an easy task to make the management operations efficient while maintaining minimum impact on the workload all the time. In this talk, we will share how we made those maintenance operations easier, safer and faster with the new Scylla features and improvements, e.g., seedless, repair based node operations, smarter off-strategy compaction, io bandwidth limiter for repair and compaction, parallel repair in Scylla Manger and more.

How We Made Scylla Maintenance Easier, Safer and Faster

ScyllaDB

Scylla Summit 2016: Why Kenshoo is about to displace Cassandra with Scylla

ScyllaDB

OLTP and Analytics are very different. One is characterized by many concurrent small requests, with a high sensitivity to latency, while the other typically processes large streams of data with more emphasis on throughput. The talk will cover: - the different requirements of the two workloads - how ScyllaDB optimizes for both - performance isolation of different workloads within ScyllaDB - how ScyllaDB supports concurrent OLTP and Analytics without sacrificing either latency or throughput - measurements

Scylla Summit 2018: OLAP or OLTP? Why Not Both?

ScyllaDB

Scylla’s Journey Towards Being an Elastic Cloud Native Database

ScyllaDB

How do you handle the continuous transformation and refinement of billions of entities with some sort of reliability and performance? In this talk, Henrik will describe how Scylla enabled him and his team to create a pipelined solution using a series of microservices written in Go communicating with each other using Nats. You’ll hear about the mistakes and learnings they had along the way as they built the services that led to the great performance and stability they are experiencing today.

Scylla Summit 2016: Using ScyllaDB for a Microservice-based Pipeline in Go

ScyllaDB

Scylla Summit 2016: Compose on Containing the Database

ScyllaDB

Building Scalable, Real Time Applications for Financial Services with DataStax

DataStax

Clara Xiong (Flurry/Yahoo!) With petabytes of data on thousands of nodes replicated across multiple data centers, growing at an accelerating rate, we have been running a workload at scale with a bottleneck of IO bandwidth. This talk covers a new compaction policy to improve efficiency for time-range scans of various look-back windows by structuring and maintaining a date-tiered store file layout for time-series data with infrequent updates and deletes.

Date-tiered Compaction Policy for Time-series Data

HBaseCon

Scylla Summit 2018: Keeping Your Latency SLAs No Matter What!

ScyllaDB

The advent of non-volatile memory (NVM) will fundamentally change the dichotomy between memory and durable storage in database management systems (DBMSs). These new NVM devices are almost as fast as DRAM, but all writes to it are potentially persistent even after power loss. Existing DBMSs are unable to take full advantage of this technology because their internal architectures are predicated on the assumption that memory is volatile. That means when NVM finally arrives, just like when you finally passed that kidney stone after three weeks, everyone will be relieved but the transition will be painful. Many of the components of legacy DBMSs will become unnecessary and will degrade the performance of data intensive applications.

IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...

In-Memory Computing Summit

Scylla Summit 2019 Keynote - Avi Kivity

ScyllaDB

Scaling Cassandra for Big Data

DataStax Academy

In last few years, technology has seen a major drift in the dominance of traditional / RDMBS databases across different domains. Expeditious adoption of NoSQL databases especially Cassandra in the industry opens up a lot more discussions on what are the major challenges that are faced during implementation of Cassandra and how to mitigate it. Many a times we conclude that migration or POC (proof of concept) is not successful; however the real flaw might be in the data modeling, identifying the right hardware configurations, database parameters, right consistency level and so on. There's no one good model or configuration which fits all use cases and all applications. Performance tuning an application is truly an art and requires perseverance. This paper delve into different performance tuning considerations and anti-patterns that need to be considered during Cassandra migration / implementation to make sure we are able to reap the benefits of Cassandra, what makes it a ‘Visionary’ in 2014 Gartner’s Magic Quadrant for Operational Database Management Systems.

Performance tuning - A key to successful cassandra migration

Ramkumar Nottath

TechTalk v2.0 - Performance tuning Cassandra + AWS

Pythian

Was ist angesagt? (20)

Cassandra Day Atlanta 2015: Diagnosing Problems in Production

Seattle Cassandra Meetup - HasOffers

Scylla Summit 2018: Rebuilding the Ceph Distributed Storage Solution with Sea...

DynamoDB at HasOffers

10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...

How to size up an Apache Cassandra cluster (Training)

How We Made Scylla Maintenance Easier, Safer and Faster

Scylla Summit 2016: Why Kenshoo is about to displace Cassandra with Scylla

Scylla Summit 2018: OLAP or OLTP? Why Not Both?

Scylla’s Journey Towards Being an Elastic Cloud Native Database

Scylla Summit 2016: Using ScyllaDB for a Microservice-based Pipeline in Go

Scylla Summit 2016: Compose on Containing the Database

Building Scalable, Real Time Applications for Financial Services with DataStax

Date-tiered Compaction Policy for Time-series Data

Scylla Summit 2018: Keeping Your Latency SLAs No Matter What!

IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...

Scylla Summit 2019 Keynote - Avi Kivity

Scaling Cassandra for Big Data

Performance tuning - A key to successful cassandra migration

TechTalk v2.0 - Performance tuning Cassandra + AWS

Ähnlich wie CASSANDRA MEETUP - Choosing the right cloud instances for success

M6d cassandrapresentation

Edward Capriolo

Ceph on All Flash Storage -- Breaking Performance Barriers

Ceph Community

Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...

DataStax

Large scale data processing for Extract Transform and Loading (ETL) jobs is a very common practice. The stackArmor DevOps team developed a Chef based automation solution to automate the AWS environment provisioning, code deployment and data ingestion processing to ingest and process over 2 TB of Data. This presentation covers the technologies used, the planning phase, AWS instance selection and optimizing the ETL processing for not only performance but also cost. The target was to process 500 million rows within 72 hours with a processing rate of 5 million transactions per hour. The presentation also provides pitfalls and automation optimizations performed to accomplish the targeted processing rates. The presentation was delivered at the DevOpsDC Meetup on May 17, 2016

DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef

Gaurav "GP" Pal

stackArmor presentation for DevOpsDC ver 4

Gaurav "GP" Pal

Sizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-Final

Vigyan Jain

Amazon RDS makes it easy to set up, operate, and scale relational databases in the cloud. The service offers a variety of options for optimizing the performance level delivered, as well as optimizing your spending. In this webinar, we will show a variety of techniques for implementing the right performance level for your application. Learning Objectives: • Understand the Amazon RDS options that change database performance and cost • Select the appropriate performance and cost level for your specific application Who Should Attend: • Technical Amazon RDS customers and prospective customers

AWS Webcast - Cost and Performance Optimization in Amazon RDS

Amazon Web Services

Colvin exadata mistakes_ioug_2014

marvin herrera

Accelerating hbase with nvme and bucket cache

David Grier

PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed

Equnix Business Solutions

This presentation will show how create truly elastic Cassandra deployments on AWS allowing you to scale and shrink your large Cassandra deployments multiple times a day. Leveraging a combination of EBS backed disks, JBOD, token pinning and our previous work on bootstrapping from backups you will be able to dramatically reduce costs per cluster by scaling to match your daily workloads. Warning: This presentation will probably contain some references to late 2000's pop group LMFAO About the Speaker Ben Bromhead CTO, Instaclustr Ben Bromhead is the CTO of Instaclustr where he is responsible for working closely with his engineering team and customers to build highly available, scalable applications on top of Cassandra. Instaclustr is the only multi-cloud, self service Cassandra as a Service provider in the world and is dedicated to provider world class support.

Everyday I'm Scaling... Cassandra (Ben Bromhead, Instaclustr) | C* Summit 2016

DataStax

Co-Founder and CTO of Instaclustr, Ben Bromhead's presentation at the Cassandra Summit 2016, in San Jose. This presentation will show how create truly elastic Cassandra deployments on AWS allowing you to scale and shrink your large Cassandra deployments multiple times a day. Leveraging a combination of EBS backed disks, JBOD, token pinning and our previous work on bootstrapping from backups you will be able to dramatically reduce costs per cluster by scaling to match your daily workloads.

Everyday I’m scaling... Cassandra

Instaclustr

Running & Scaling Large Elasticsearch Clusters

Fred de Villamil

Building a low latency (sub millisecond), high throughput database that can handle big data AND linearly scale is not easy - but we did it anyway... In this session we will get to know Aerospike, an enterprise distributed primary key database solution. - We will do an introduction to Aerospike - basic terms, how it works and why is it widely used in mission critical systems deployments. - We will understand the 'magic' behind Aerospike ability to handle small, medium and even Petabyte scale data, and still guarantee predictable performance of sub-millisecond latency - We will learn how Aerospike devops is different than other solutions in the market, and see how easy it is to run it on cloud environments as well as on premise. We will also run a demo - showing a live example of the performance and self-healing technologies the database have to offer.

Aerospike meetup july 2019 | Big Data Demystified

Omid Vahdaty

Speedment SQL Reflector is a software solution that allows applications to get automatically updated data in real time. The SQL Reflector loads data from your existing SQL database and feeds it into an in-memory data grid e.g. GridGain. When started, the SQL reflector will load your selected existing relational data into your map cluster. Also, any subsequent changes that are made to the relational database (regardless how, via your application, script, SQL commands or even stored procedures) are then continuously fed to your GridGain nodes. Even SQL-transactions are preserved so that your maps will always reflect a valid state of the underlying SQL database.

IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...

In-Memory Computing Summit

AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)

Amazon Web Services

Storage and performance- Batch processing, Whiptail

Internet World

505 kobal exadata

Kam Chan

on-Volatile-Memory express (NVMe) standard promises and order of magnitude faster storage than regular SSDs, while at the same time being more economical than regular RAM on TB/$. This talk evaluates the use cases and benefits of NVMe drives for its use in Big Data clusters with HBase and Hadoop HDFS. First, we benchmark the different drives using system level tools (FIO) to get maximum expected values for each different device type and set expectations. Second, we explore the different options and use cases of HBase storage and benchmark the different setups. And finally, we evaluate the speedups obtained by the NVMe technology for the different Big Data use cases from the YCSB benchmark. In summary, while the NVMe drives show up to 8x speedup in best case scenarios, testing the cost-efficiency of new device technologies is not straightforward in Big Data, where we need to overcome system level caching to measure the maximum benefits.

Accelerating HBase with NVMe and Bucket Cache

Nicolas Poggi

Presentation database on flash

xKinAnx

Ähnlich wie CASSANDRA MEETUP - Choosing the right cloud instances for success (20)

M6d cassandrapresentation

Ceph on All Flash Storage -- Breaking Performance Barriers

Building Data Pipelines with SMACK: Designing Storage Strategies for Scale an...

DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef

stackArmor presentation for DevOpsDC ver 4

Sizing MongoDB on AWS with Wired Tiger-Patrick and Vigyan-Final

AWS Webcast - Cost and Performance Optimization in Amazon RDS

Colvin exadata mistakes_ioug_2014

Accelerating hbase with nvme and bucket cache

PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed

Everyday I'm Scaling... Cassandra (Ben Bromhead, Instaclustr) | C* Summit 2016

Everyday I’m scaling... Cassandra

Running & Scaling Large Elasticsearch Clusters

Aerospike meetup july 2019 | Big Data Demystified

IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...

AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)

Storage and performance- Batch processing, Whiptail

505 kobal exadata

Accelerating HBase with NVMe and Bucket Cache

Presentation database on flash

Kürzlich hochgeladen

Created by Mozilla Research in 2012 and now part of Linux Foundation Europe, the Servo project is an experimental rendering engine written in Rust. It combines memory safety and concurrency to create an independent, modular, and embeddable rendering engine that adheres to web standards. Stewardship of Servo moved from Mozilla Research to the Linux Foundation in 2020, where its mission remains unchanged. After some slow years, in 2023 there has been renewed activity on the project, with a roadmap now focused on improving the engine’s CSS 2 conformance, exploring Android support, and making Servo a practical embeddable rendering engine. In this presentation, Rakhi Sharma reviews the status of the project, our recent developments in 2023, our collaboration with Tauri to make Servo an easy-to-use embeddable rendering engine, and our plans for the future to make Servo an alternative web rendering engine for the embedded devices industry. (c) Embedded Open Source Summit 2024 April 16-18, 2024 Seattle, Washington (US) https://events.linuxfoundation.org/embedded-open-source-summit/ https://ossna2024.sched.com/event/1aBNF/a-year-of-servo-reboot-where-are-we-now-rakhi-sharma-igalia

A Year of the Servo Reboot: Where Are We Now?

Igalia

GenCyber Cyber Security Day Presentation

Michael W. Hawkins

In this session, we will delve into strategic approaches for optimizing knowledge management within Microsoft 365, amidst the evolving landscape of Copilot. From leveraging automatic metadata classification and permission governance with SharePoint Premium, to unlocking Viva Engage for the cultivation of knowledge and communities, you will gain actionable insights to bolster your organization's knowledge-sharing initiatives. In this session, we will also explore how to facilitate solutions to enable your employees to find answers and expertise within Microsoft 365. You will leave equipped with practical techniques and a deeper understanding of how there is more to effective knowledge management than just enabling Copilot, but building actual solutions to prepare the knowledge that Copilot and your employees can use.

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Drew Madelung

Enterprise Knowledge’s Urmi Majumder, Principal Data Architecture Consultant, and Fernando Aguilar Islas, Senior Data Science Consultant, presented "Driving Behavioral Change for Information Management through Data-Driven Green Strategy" on March 27, 2024 at Enterprise Data World (EDW) in Orlando, Florida. In this presentation, Urmi and Fernando discussed a case study describing how the information management division in a large supply chain organization drove user behavior change through awareness of the carbon footprint of their duplicated and near-duplicated content, identified via advanced data analytics. Check out their presentation to gain valuable perspectives on utilizing data-driven strategies to influence positive behavioral shifts and support sustainability initiatives within your organization. In this session, participants gained answers to the following questions: - What is a Green Information Management (IM) Strategy, and why should you have one? - How can Artificial Intelligence (AI) and Machine Learning (ML) support your Green IM Strategy through content deduplication? - How can an organization use insights into their data to influence employee behavior for IM? - How can you reap additional benefits from content reduction that go beyond Green IM?

Driving Behavioral Change for Information Management through Data-Driven Gree...

Enterprise Knowledge

Imagine a world where information flows as swiftly as thought itself, making decision-making as fluid as the data driving it. Every moment is critical, and the right tools can significantly boost your organization’s performance. The power of real-time data automation through FME can turn this vision into reality. Aimed at professionals eager to leverage real-time data for enhanced decision-making and efficiency, this webinar will cover the essentials of real-time data and its significance. We’ll explore: FME’s role in real-time event processing, from data intake and analysis to transformation and reporting An overview of leveraging streams vs. automations FME’s impact across various industries highlighted by real-life case studies Live demonstrations on setting up FME workflows for real-time data Practical advice on getting started, best practices, and tips for effective implementation Join us to enhance your skills in real-time data automation with FME, and take your operational capabilities to the next level.

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Safe Software

BooK Now Call us at +918448380779 to hire a gorgeous and seductive call girl for sex. Take a Delhi Escort Service. The help of our escort agency is mostly meant for men who want sexual Indian Escorts In Delhi NCR. It should be noted that any impersonator will get 100 attention from our Young Girls Escorts in Delhi. They will assume the position of reliable allies. VIP Call Girl With Original Photos Book Tonight +918448380779 Our Cheap Price 1 Hour not available 2 Hours 5000 Full Night 8000 TAG: Call Girls in Delhi, Noida, Gurgaon, Ghaziabad, Connaught Place, Greater Kailash Delhi, Lajpat Nagar Delhi, Mayur Vihar Delhi, Chanakyapuri Delhi, New Friends Colony Delhi, Majnu Ka Tilla, Karol Bagh, Malviya Nagar, Saket, Khan Market, Noida Sector 18, Noida Sector 76, Noida Sector 51, Gurgaon Mg Road, Iffco Chowk Gurgaon, Rajiv Chowk Gurgaon All Delhi Ncr Free Home Deliver

08448380779 Call Girls In Civil Lines Women Seeking Men

Delhi Call girls

Axa Assurance Maroc - Insurer Innovation Award 2024

The Digital Insurer

Abhishek Deb(1), Mr Abdul Kalam(2) M. Des (UX) , School of Design, DIT University , Dehradun. This paper explores the future potential of AI-enabled smartphone processors, aiming to investigate the advancements, capabilities, and implications of integrating artificial intelligence (AI) into smartphone technology. The research study goals consist of evaluating the development of AI in mobile phone processors, analyzing the existing state as well as abilities of AI-enabled cpus determining future patterns as well as chances together with reviewing obstacles as well as factors to consider for more growth.

Exploring the Future Potential of AI-Enabled Smartphone Processors

debabhi2

Choosing the right accounts payable services provider is a strategic decision that can significantly impact your business's financial performance and operational efficiency. By considering factors such as expertise, range of services, technology infrastructure, scalability, cost, and reputation, businesses can make informed decisions and select a provider that aligns with their unique needs and objectives. Partnering with the right provider can streamline accounts payable processes, drive cost savings, and position your business for long-term success. https://katprotech.com/accounts-payable-and-purchase-order-automation/

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

Katpro Technologies

Finology Group – Insurtech Innovation Award 2024

The Digital Insurer

Building Digital Trust in a Digital Economy Veronica Tan, Director - Cyber Security Agency of Singapore Apidays Singapore 2024: Connecting Customers, Business and Technology (April 17 & 18, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

apidays

The presentation explores the development and application of artificial intelligence (AI) from its inception to its current status in the modern world. The term "artificial intelligence" was first coined by John McCarthy in 1956 to describe efforts to develop computer programs capable of performing tasks that typically require human intelligence. This concept was first introduced at a conference held at Dartmouth College, where programs demonstrated capabilities such as playing chess, proving theorems, and interpreting texts. In the early stages, Alan Turing contributed to the field by defining intelligence as the ability of a being to respond to certain questions intelligently, proposing what is now known as the Turing Test to evaluate the presence of intelligent behavior in machines. As the decades progressed, AI evolved significantly. The 1980s focused on machine learning, teaching computers to learn from data, leading to the development of models that could improve their performance based on their experiences. The 1990s and 2000s saw further advances in algorithms and computational power, which allowed for more sophisticated data analysis techniques, including data mining. By the 2010s, the proliferation of big data and the refinement of deep learning techniques enabled AI to become mainstream. Notable milestones included the success of Google's AlphaGo and advancements in autonomous vehicles by companies like Tesla and Waymo. A major theme of the presentation is the application of generative AI, which has been used for tasks such as natural language text generation, translation, and question answering. Generative AI uses large datasets to train models that can then produce new, coherent pieces of text or other media. The presentation also discusses the ethical implications and the need for regulation in AI, highlighting issues such as privacy, bias, and the potential for misuse. These concerns have prompted calls for comprehensive regulations to ensure the safe and equitable use of AI technologies. Artificial intelligence has also played a significant role in healthcare, particularly highlighted during the COVID-19 pandemic, where it was used in drug discovery, vaccine development, and analyzing the spread of the virus. The capabilities of AI in healthcare are vast, ranging from medical diagnostics to personalized medicine, demonstrating the technology's potential to revolutionize fields beyond just technical or consumer applications. In conclusion, AI continues to be a rapidly evolving field with significant implications for various aspects of society. The development from theoretical concepts to real-world applications illustrates both the potential benefits and the challenges that come with integrating advanced technologies into everyday life. The ongoing discussion about AI ethics and regulation underscores the importance of managing these technologies responsibly to maximize their their benefits while minimizing potential harms.

Artificial Intelligence: Facts and Myths

Joaquim Jorge

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Neo4j

Advantages of Hiring UIUX Design Service Providers for Your Business

Pixlogix Infotech

08448380779 Call Girls In Friends Colony Women Seeking Men

Delhi Call girls

As privacy and data protection regulations evolve rapidly, organizations operating in multiple jurisdictions face mounting challenges to ensure compliance and safeguard customer data. With state-specific privacy laws coming up in multiple states this year, it is essential to understand what their unique data protection regulations will require clearly. How will data privacy evolve in the US in 2024? How to stay compliant? Our panellists will guide you through the intricacies of these states' specific data privacy laws, clarifying complex legal frameworks and compliance requirements. This webinar will review: - The essential aspects of each state's privacy landscape and the latest updates - Common compliance challenges faced by organizations operating in multiple states and best practices to achieve regulatory adherence - Valuable insights into potential changes to existing regulations and prepare your organization for the evolving landscape

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

TrustArc

A Domino Admins Adventures (Engage 2024)

Gabriella Davis

Presentation on how to chat with PDF using ChatGPT code interpreter

naman860154

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

Data Cloud, More than a CDP by Matt Robison

Anna Loughnan Colquhoun

Kürzlich hochgeladen (20)

A Year of the Servo Reboot: Where Are We Now?

GenCyber Cyber Security Day Presentation

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Driving Behavioral Change for Information Management through Data-Driven Gree...

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

08448380779 Call Girls In Civil Lines Women Seeking Men

Axa Assurance Maroc - Insurer Innovation Award 2024

Exploring the Future Potential of AI-Enabled Smartphone Processors

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

Finology Group – Insurtech Innovation Award 2024

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Artificial Intelligence: Facts and Myths

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Advantages of Hiring UIUX Design Service Providers for Your Business

08448380779 Call Girls In Friends Colony Women Seeking Men

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

A Domino Admins Adventures (Engage 2024)

Presentation on how to chat with PDF using ChatGPT code interpreter

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

Data Cloud, More than a CDP by Matt Robison

CASSANDRA MEETUP - Choosing the right cloud instances for success

2. Welcome • Your app in focus — reads vs writes, CPU vs RAM • What IOPS? How much is enough? • Are ephemeral disks evil? • False economy — cheaper instances can cost you more • A time to kill — they’re not your pets

7. Tailor to workload • intimately understand your app • reads vs writes • CPU vs memory • OLTP vs OLAP • use case will dictate requirements

9. © DataStax, All Rights Reserved. EBS gp2 SSDs 9 • general purpose EBS option • persistent (durable) • default volume for EC2 instances • guaranteed 99% single-digit millisecond latency • only pay for each GB (IOPS included) • minimum 10K IOPS for production workloads 3 IOPS/GB (3K IOPS/TB) Max 10K IOPS/vol Max 160MB/s throughput/vol 1TB = $122/mo, $1474/yr

10. © DataStax, All Rights Reserved. EBS io1 SSDs 10 • fastest available EBS option • persistent (durable) • for latency-sensitive OLTP workloads • guaranteed 99.9%* single-digit millisecond latency • provisioned IOPS are charged extra • minimum 10K IOPS for production workloads * read the fine print Up to 50 IOPS/GB Max 20K IOPS/vol Max 320MB/s throughput/vol 1TB = $141/mo, $1695/yr 1K IOPS = $72/mo, $864/yr

12. Ephemeral storage • performance orders of magnitude better than EBS • already included in instance costs, e.g. m3, c3, i3 • “physically” attached • not durable across reboots but…

14. What is Cassandra • massively scalable NoSQL database • fully distributed, no single-point-of-failure • linear horizontal scaling

15. © DataStax, All Rights Reserved. Why Cassandra 15 • all nodes are the same — no SPOF • real-time, durable writes • linear scaling on commodity servers • real-time replication across data centres • always on — no offline operation • because you have a scale problem

18. © DataStax, All Rights Reserved. Real example 18 • deployed on c4.4xlarge • using EBS io1 with 3K PIOPS • nodes dropping writes • high read latencies 16 vCPU, 30GB RAM Instance $ 5443 EBS io1 1TB $ 1695 PIOPS 3K $ 2592 ———————— Annual cost $ 9730

19. © DataStax, All Rights Reserved. Recommendation 19 • swap to i3.2xlarge • 1.9TB NVMe SSDs included • 3M IOPS, 16GB/s • 60-70% cheaper than replaced i2.2xlarge 8 vCPU, 61GB RAM Instance $ 4174 ———————— Annual cost $ 4174

21. © DataStax, All Rights Reserved. Use case - dev, light prod 21 • m3.large suitable • entry-level load, testing-the-waters • minimum 3 C* nodes with RF=3 • use CMS GC with 2GB heap 2 vCPU 7.5GB RAM 1 x 32GB SSD $ 962/yr

22. © DataStax, All Rights Reserved. Use case - low prod volume 22 • m3.xlarge suitable • JVM will perform better with the extra RAM • min 3 C* nodes with RF=3 • use CMS GC with 8GB heap 4 vCPU 15GB RAM 1 x 40GB SSD $ 1924/yr

23. © DataStax, All Rights Reserved. Use case - moderate prod volume 23 • c3.2xlarge recommended • more diskspace, extra cores a bonus • costs 50% more for 2x CPU and 4x diskspace • min 3 C* nodes with RF=3 • use CMS GC with 8GB heap 8 vCPU 15GB RAM 2 x 80GB SSD $ 2916/yr

24. © DataStax, All Rights Reserved. Use case - real prod volume 24 • i3.2xlarge recommended • will handle all kinds of workloads including Analytics, Graph and Search (Solr) • min 3 C* nodes with RF=3 • use G1 GC with 24GB heap (32GB for Search nodes) 8 vCPU 61GB RAM 1.9TB NVMe SSD $ 4174/yr

CASSANDRA MEETUP - Choosing the right cloud instances for success

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie CASSANDRA MEETUP - Choosing the right cloud instances for success

Ähnlich wie CASSANDRA MEETUP - Choosing the right cloud instances for success (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

CASSANDRA MEETUP - Choosing the right cloud instances for success