SlideShare ist ein Scribd-Unternehmen logo
1 von 25
1© Cloudera, Inc. All rights reserved.
GDPR: Getting your data ready for heavy, new EU privacy
regulations
Steve Ross, Director, Product Management
Mark Donsky, Director, Product Management
2© Cloudera, Inc. All rights reserved.
Disclaimer
GDPR is a complex and detailed regulation.
There’s no single method or solution that will make all organizations compliant.
This presentation is intended to help organizations understand how Big Data platforms such as Cloudera software and
services can be used to help comply with certain aspects of EU General Data Protection Regulation (GDPR)
requirements. Applicability of any of these capabilities depends on each organization’s own requirements specific to their
business. Every organization should determine its own needs with regard to GDPR and then evaluate solutions for
suitability to those needs. The information contained in this presentation is not intended to be and should not be
construed to be legal advice. Organizations must not rely on the information herein and they should obtain legal advice
from their own legal counsel or other professional legal services provider.
3© Cloudera, Inc. All rights reserved.
General Data Protection Regulation (GDPR)
Rights of
the
consumer
Enforced
from May 25
2018
Substantial
penalties
Obligations
of the
organization
Applicable
worldwide
Personal Data
• Across people, process and technology
• Impacts how personal data is collected
and used
• Heavy fines for violations
• Up to 20M Euros or 4% of the annual
global turnover for the preceding
financial year
• Any organization with any users in the EU needs
to be compliant.
• Includes companies based outside the EU,
processing personal data from EU residents in
connection with the offering any goods or
services or monitoring user behavior.
• Includes data processor and data controller
• Right to be forgotten/erasure
• Right to access information
• Right to data portability
• Right for processing to be restricted
4© Cloudera, Inc. All rights reserved.
Storage limitation: How do I
erase individual data records
upon request when the file
systems are immutable?
Accuracy: What is a
low-overhead way to
fix data ?
Data minimization:
How can I
anonymize personal
data? How do I
prevent unlawful data
transfers?
Integrity and
confidentiality: How do I
apply IT controls to
prevent unauthorized
access?
Accountability: How can
I demonstrate
compliance? How do I
report breaches in 72
hrs?
Lawfulness, fairness,
transparency: How do I
track personal data?
Purpose limitation:
How do I track consent
while using data
science tool choice?
Seven principles of GDPR
5© Cloudera, Inc. All rights reserved.
The path to GDPR compliance includes…
Source: Cognizant:
Where are you along this path?
Big data solutions can help here
6© Cloudera, Inc. All rights reserved.
How big data solutions can help
7© Cloudera, Inc. All rights reserved.
How big data solutions can help
GDPR Principle Typical Customer Challenges Enterprise Data Management Capabilities
Integrity and
Confidentiality
Applying industry standard IT security controls
to prevent unauthorised access.
Comprehensive encryption and key management.
Strong authentication, fine-grained authorization.
Accountability Demonstrating compliance. Detecting and
analyzing breaches within 72 hours.
Breach detection (cybersecurity solutions).
Comprehensive audit trail.
8© Cloudera, Inc. All rights reserved.
How big data solutions can help
GDPR Principle Typical Customer Challenges Enterprise Data Management Capabilities
Integrity and
Confidentiality
Applying industry standard IT security controls
to prevent unauthorised access.
Comprehensive encryption and key management.
Strong authentication, fine-grained authorization.
Accountability Demonstrating compliance. Detecting and
analyzing breaches within 72 hours.
Breach detection (cybersecurity solutions).
Comprehensive audit trail.
Lawfulness, fairness
and transparency
Implementing a way to keep track of personal
data.
Track classification and lineage of personal data elements.
Purpose limitation Track consent and data usage while allowing
data scientists to mine it using tools of choice.
Data protection officer (DPO) can audit exactly how data was used.
Data scientists work with flexibility while keep data governed.
Data Minimization Removing or anonymising data where possible.
Preventing unlawful data transfers outside the
EU while still enabling outsourcing.
Classification tags can indicate allowed purpose.
Redacted views limit what certain people can access.
9© Cloudera, Inc. All rights reserved.
How big data solutions can help
GDPR Principle Typical Customer Challenges Enterprise Data Management Capabilities
Integrity and
Confidentiality
Applying industry standard IT security controls
to prevent unauthorised access.
Comprehensive encryption and key management.
Strong authentication, fine-grained authorization.
Accountability Demonstrating compliance. Detecting and
analyzing breaches within 72 hours.
Breach detection (cybersecurity solutions).
Comprehensive audit trail.
Lawfulness, fairness
and transparency
Implementing a way to keep track of personal
data.
Track classification and lineage of personal data elements.
Purpose limitation Track consent and data usage while allowing
data scientists to mine it using tools of choice.
Data protection officer (DPO) can audit exactly how data was used.
Data scientists work with flexibility while keep data governed.
Data Minimization Removing or anonymising data where possible.
Preventing unlawful data transfers outside the
EU while still enabling outsourcing.
Classification tags can indicate allowed purpose.
Redacted views limit what certain people can access.
Accuracy Finding a low-overhead way to fix data. Fast updates of individual records.
Storage Limitation Deleting individual personal data records in
Hadoop and Cloud storage, since those file
systems are immutable.
Erasure of individual records.
HDFS and Cloud storage options.
10© Cloudera, Inc. All rights reserved.
Which data is in scope for GDPR? Where Cloudera can help?
Batch or Real-time
Data Streams
When all your data is stored on a single platform, you have a single place to secure and govern for
GDPR compliance, across all analytic workloads
11© Cloudera, Inc. All rights reserved.
How Cloudera can help
GDPR Principle Typical Customer Challenges Cloudera Enterprise Capabilities
Integrity and
Confidentiality
Applying industry standard IT security controls
to prevent unauthorised access.
Cloudera Navigator Encrypt and Key Trustee: strong encryption
Apache Sentry: Fine-grained authorization.
Accountability Demonstrating compliance. Detecting and
analyzing breaches within 72 hours.
Cloudera Navigator: Comprehensive, inescapable audit trail
Cloudera Cybersecurity solutions
12© Cloudera, Inc. All rights reserved.
How Cloudera can help
GDPR Principle Typical Customer Challenges Cloudera Enterprise Capabilities
Integrity and
Confidentiality
Applying industry standard IT security controls
to prevent unauthorised access.
Cloudera Navigator Encrypt and Key Trustee: strong encryption
Apache Sentry: Fine-grained authorization.
Accountability Demonstrating compliance. Detecting and
analyzing breaches within 72 hours.
Cloudera Navigator: Comprehensive, inescapable audit trail
Cloudera Cybersecurity solutions
Lawfulness, fairness
and transparency
Implementing a way to keep track of personal
data.
Cloudera Navigator: Classify/tag and track lineage of personal data
elements
Purpose limitation Track consent and data usage while allowing
data scientists to mine it using tools of choice.
Cloudera Navigator: DPO can audit exactly how data was used
Cloudera Data Science Workbench: keep data governed
Data Minimization Removing or anonymising data where possible.
Preventing unlawful data transfers outside the
EU while still enabling outsourcing.
Cloudera Navigator: tags can indicate allowed purpose, time limit
Apache Sentry: Redacted views
13© Cloudera, Inc. All rights reserved.
How Cloudera can help
GDPR Principle Typical Customer Challenges Cloudera Enterprise Capabilities
Integrity and
Confidentiality
Applying industry standard IT security controls
to prevent unauthorised access.
Cloudera Navigator Encrypt and Key Trustee: strong encryption
Apache Sentry: Fine-grained authorization.
Accountability Demonstrating compliance. Detecting and
analyzing breaches within 72 hours.
Cloudera Navigator: Comprehensive, inescapable audit trail
Cloudera Cybersecurity solutions
Lawfulness, fairness
and transparency
Implementing a way to keep track of personal
data.
Cloudera Navigator: Classify/tag and track lineage of personal data
elements
Purpose limitation Track consent and data usage while allowing
data scientists to mine it using tools of choice.
Cloudera Navigator: DPO can audit exactly how data was used
Cloudera Data Science Workbench: keep data governed
Data Minimization Removing or anonymising data where possible.
Preventing unlawful data transfers outside the
EU while still enabling outsourcing.
Cloudera Navigator: tags can indicate allowed purpose, time limit
Apache Sentry: Redacted views
Accuracy Finding a low-overhead way to fix data. Apache Kudu: Fast updates of individual records
Storage Limitation Deleting individual personal data records in
Hadoop and Cloud storage, since those file
systems are immutable.
Apache Kudu: Fast erasure of individual records
HDFS and Cloud storage options
14© Cloudera, Inc. All rights reserved.
Fine-grained access control and data masking
Apache Sentry for column-level permissions and views with masking and row filtering
(optional) Cloudera partners enable dynamic masking and tokenization
15© Cloudera, Inc. All rights reserved.
Governance: auditing, lineage, metadata
Capabilities:
● Inescapable, detailed audit – enabling
forensics
● Full tracking of personal data
● Lineage tracking and visualization
16© Cloudera, Inc. All rights reserved.
Enterprise-grade key management
Manager Navigator
Impala Hive
HDFS Kudu & HBase
Sentry
Navigator Key Trustee
Log Files
Metadata Store
Encrypted Data
Encryption Key
Legend
Ingest
Paths,
Temp/Spill
files
HSM (optional)
• ALL data at rest: HDFS, HBase, metadata
databases, temp files, ingest paths
• ALL data on the wire
• Automated key replication & backup
• HSM backed key protection
• Sensitive data in logs
Encryption
Key Management
Redaction
17© Cloudera, Inc. All rights reserved.
Enterprise-wide breach detection
Detect advanced threat leveraging
machine learning models
RespondDetect
Search across Apache Spot’s user,
endpoint, and network open data
models for full context and
accelerated investigation
Investigate
Use the Apache Spot open data
models and Cloudera Navigator to
see if the threat is widespread
18© Cloudera, Inc. All rights reserved.
Erasing individual records on HDFS and cloud storage
● Concentrate personal data in a small number of “lookup tables”
● Replace personal data in most locations with anonymized or
pseudonymised data
● Instead of deleting records upon request, add them to a “to be deleted”
list
● Execute a periodic batch job to remove “to be deleted” records by re-
writing entire files/tables
Challenge: concentrate the personal data enough so that the overhead of the re-write
step doesn’t render the cluster unusable for a significant period of time
19© Cloudera, Inc. All rights reserved.
Erasing individual records on Kudu
20© Cloudera, Inc. All rights reserved.
Laptop vs Centralized Data Science
Data scientists pull data to their laptops so they can run their own tools
• Copy personal data to laptops
• Fails GDPR compliance audit
• Potential data breach
“Laptop Data Science”
● Personal data remains governed
● Purpose limitation enforced
Centralized Data Science
Typical Big Data
Environment Data Science Workbench
21© Cloudera, Inc. All rights reserved.
Cloudera GDPR approach
CRM
BI
Staff
Core
Data Subject
360
Rights &
Consent
Manager
Governance and audit
Enterprise search
Right to forget
Cyber intelligence Data Science Workbench
Web logs
Sources
Security - encryption, authentication,
fine-grained access control
22© Cloudera, Inc. All rights reserved.
Professional services: Cloudera GDPR architecture review
This engagement offering focuses on providing technical expertise to help
customers advance their GDPR compliance by optimising their Cloudera platform
and advising on best-of-breed data architecture.
The goal is to help customers maximise the value of their data while ensuring the
impacts and risks of compliance are minimised.
This engagement is for current Cloudera subscription customers
The engagement is on a Time and Materials basis. Duration: approximately 2
weeks.
The offering includes architecture design work for up to three data sources and
implementation work for at least a single source. Cloudera will deliver a full set of
documentation.
23© Cloudera, Inc. All rights reserved.
What’s Next
● May 25, 2018 is 233 days away
● If you haven’t yet started the foundational activities, start them now!
● Get in touch with your Cloudera account team to learn more.
24© Cloudera, Inc. All rights reserved.
Questions?
25© Cloudera, Inc. All rights reserved.
Thank you
Steve Ross: sross@cloudera.com
Mark Donsky: md@cloudera.com

Weitere ähnliche Inhalte

Mehr von Cloudera, Inc.

Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Cloudera, Inc.
 
Get started with Cloudera's cyber solution
Get started with Cloudera's cyber solutionGet started with Cloudera's cyber solution
Get started with Cloudera's cyber solutionCloudera, Inc.
 
Spark and Deep Learning Frameworks at Scale 7.19.18
Spark and Deep Learning Frameworks at Scale 7.19.18Spark and Deep Learning Frameworks at Scale 7.19.18
Spark and Deep Learning Frameworks at Scale 7.19.18Cloudera, Inc.
 
Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18Cloudera, Inc.
 

Mehr von Cloudera, Inc. (20)

Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18
 
Get started with Cloudera's cyber solution
Get started with Cloudera's cyber solutionGet started with Cloudera's cyber solution
Get started with Cloudera's cyber solution
 
Spark and Deep Learning Frameworks at Scale 7.19.18
Spark and Deep Learning Frameworks at Scale 7.19.18Spark and Deep Learning Frameworks at Scale 7.19.18
Spark and Deep Learning Frameworks at Scale 7.19.18
 
Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18
 

Kürzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 

Kürzlich hochgeladen (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

Simplify Your Response to GDPR

  • 1. 1© Cloudera, Inc. All rights reserved. GDPR: Getting your data ready for heavy, new EU privacy regulations Steve Ross, Director, Product Management Mark Donsky, Director, Product Management
  • 2. 2© Cloudera, Inc. All rights reserved. Disclaimer GDPR is a complex and detailed regulation. There’s no single method or solution that will make all organizations compliant. This presentation is intended to help organizations understand how Big Data platforms such as Cloudera software and services can be used to help comply with certain aspects of EU General Data Protection Regulation (GDPR) requirements. Applicability of any of these capabilities depends on each organization’s own requirements specific to their business. Every organization should determine its own needs with regard to GDPR and then evaluate solutions for suitability to those needs. The information contained in this presentation is not intended to be and should not be construed to be legal advice. Organizations must not rely on the information herein and they should obtain legal advice from their own legal counsel or other professional legal services provider.
  • 3. 3© Cloudera, Inc. All rights reserved. General Data Protection Regulation (GDPR) Rights of the consumer Enforced from May 25 2018 Substantial penalties Obligations of the organization Applicable worldwide Personal Data • Across people, process and technology • Impacts how personal data is collected and used • Heavy fines for violations • Up to 20M Euros or 4% of the annual global turnover for the preceding financial year • Any organization with any users in the EU needs to be compliant. • Includes companies based outside the EU, processing personal data from EU residents in connection with the offering any goods or services or monitoring user behavior. • Includes data processor and data controller • Right to be forgotten/erasure • Right to access information • Right to data portability • Right for processing to be restricted
  • 4. 4© Cloudera, Inc. All rights reserved. Storage limitation: How do I erase individual data records upon request when the file systems are immutable? Accuracy: What is a low-overhead way to fix data ? Data minimization: How can I anonymize personal data? How do I prevent unlawful data transfers? Integrity and confidentiality: How do I apply IT controls to prevent unauthorized access? Accountability: How can I demonstrate compliance? How do I report breaches in 72 hrs? Lawfulness, fairness, transparency: How do I track personal data? Purpose limitation: How do I track consent while using data science tool choice? Seven principles of GDPR
  • 5. 5© Cloudera, Inc. All rights reserved. The path to GDPR compliance includes… Source: Cognizant: Where are you along this path? Big data solutions can help here
  • 6. 6© Cloudera, Inc. All rights reserved. How big data solutions can help
  • 7. 7© Cloudera, Inc. All rights reserved. How big data solutions can help GDPR Principle Typical Customer Challenges Enterprise Data Management Capabilities Integrity and Confidentiality Applying industry standard IT security controls to prevent unauthorised access. Comprehensive encryption and key management. Strong authentication, fine-grained authorization. Accountability Demonstrating compliance. Detecting and analyzing breaches within 72 hours. Breach detection (cybersecurity solutions). Comprehensive audit trail.
  • 8. 8© Cloudera, Inc. All rights reserved. How big data solutions can help GDPR Principle Typical Customer Challenges Enterprise Data Management Capabilities Integrity and Confidentiality Applying industry standard IT security controls to prevent unauthorised access. Comprehensive encryption and key management. Strong authentication, fine-grained authorization. Accountability Demonstrating compliance. Detecting and analyzing breaches within 72 hours. Breach detection (cybersecurity solutions). Comprehensive audit trail. Lawfulness, fairness and transparency Implementing a way to keep track of personal data. Track classification and lineage of personal data elements. Purpose limitation Track consent and data usage while allowing data scientists to mine it using tools of choice. Data protection officer (DPO) can audit exactly how data was used. Data scientists work with flexibility while keep data governed. Data Minimization Removing or anonymising data where possible. Preventing unlawful data transfers outside the EU while still enabling outsourcing. Classification tags can indicate allowed purpose. Redacted views limit what certain people can access.
  • 9. 9© Cloudera, Inc. All rights reserved. How big data solutions can help GDPR Principle Typical Customer Challenges Enterprise Data Management Capabilities Integrity and Confidentiality Applying industry standard IT security controls to prevent unauthorised access. Comprehensive encryption and key management. Strong authentication, fine-grained authorization. Accountability Demonstrating compliance. Detecting and analyzing breaches within 72 hours. Breach detection (cybersecurity solutions). Comprehensive audit trail. Lawfulness, fairness and transparency Implementing a way to keep track of personal data. Track classification and lineage of personal data elements. Purpose limitation Track consent and data usage while allowing data scientists to mine it using tools of choice. Data protection officer (DPO) can audit exactly how data was used. Data scientists work with flexibility while keep data governed. Data Minimization Removing or anonymising data where possible. Preventing unlawful data transfers outside the EU while still enabling outsourcing. Classification tags can indicate allowed purpose. Redacted views limit what certain people can access. Accuracy Finding a low-overhead way to fix data. Fast updates of individual records. Storage Limitation Deleting individual personal data records in Hadoop and Cloud storage, since those file systems are immutable. Erasure of individual records. HDFS and Cloud storage options.
  • 10. 10© Cloudera, Inc. All rights reserved. Which data is in scope for GDPR? Where Cloudera can help? Batch or Real-time Data Streams When all your data is stored on a single platform, you have a single place to secure and govern for GDPR compliance, across all analytic workloads
  • 11. 11© Cloudera, Inc. All rights reserved. How Cloudera can help GDPR Principle Typical Customer Challenges Cloudera Enterprise Capabilities Integrity and Confidentiality Applying industry standard IT security controls to prevent unauthorised access. Cloudera Navigator Encrypt and Key Trustee: strong encryption Apache Sentry: Fine-grained authorization. Accountability Demonstrating compliance. Detecting and analyzing breaches within 72 hours. Cloudera Navigator: Comprehensive, inescapable audit trail Cloudera Cybersecurity solutions
  • 12. 12© Cloudera, Inc. All rights reserved. How Cloudera can help GDPR Principle Typical Customer Challenges Cloudera Enterprise Capabilities Integrity and Confidentiality Applying industry standard IT security controls to prevent unauthorised access. Cloudera Navigator Encrypt and Key Trustee: strong encryption Apache Sentry: Fine-grained authorization. Accountability Demonstrating compliance. Detecting and analyzing breaches within 72 hours. Cloudera Navigator: Comprehensive, inescapable audit trail Cloudera Cybersecurity solutions Lawfulness, fairness and transparency Implementing a way to keep track of personal data. Cloudera Navigator: Classify/tag and track lineage of personal data elements Purpose limitation Track consent and data usage while allowing data scientists to mine it using tools of choice. Cloudera Navigator: DPO can audit exactly how data was used Cloudera Data Science Workbench: keep data governed Data Minimization Removing or anonymising data where possible. Preventing unlawful data transfers outside the EU while still enabling outsourcing. Cloudera Navigator: tags can indicate allowed purpose, time limit Apache Sentry: Redacted views
  • 13. 13© Cloudera, Inc. All rights reserved. How Cloudera can help GDPR Principle Typical Customer Challenges Cloudera Enterprise Capabilities Integrity and Confidentiality Applying industry standard IT security controls to prevent unauthorised access. Cloudera Navigator Encrypt and Key Trustee: strong encryption Apache Sentry: Fine-grained authorization. Accountability Demonstrating compliance. Detecting and analyzing breaches within 72 hours. Cloudera Navigator: Comprehensive, inescapable audit trail Cloudera Cybersecurity solutions Lawfulness, fairness and transparency Implementing a way to keep track of personal data. Cloudera Navigator: Classify/tag and track lineage of personal data elements Purpose limitation Track consent and data usage while allowing data scientists to mine it using tools of choice. Cloudera Navigator: DPO can audit exactly how data was used Cloudera Data Science Workbench: keep data governed Data Minimization Removing or anonymising data where possible. Preventing unlawful data transfers outside the EU while still enabling outsourcing. Cloudera Navigator: tags can indicate allowed purpose, time limit Apache Sentry: Redacted views Accuracy Finding a low-overhead way to fix data. Apache Kudu: Fast updates of individual records Storage Limitation Deleting individual personal data records in Hadoop and Cloud storage, since those file systems are immutable. Apache Kudu: Fast erasure of individual records HDFS and Cloud storage options
  • 14. 14© Cloudera, Inc. All rights reserved. Fine-grained access control and data masking Apache Sentry for column-level permissions and views with masking and row filtering (optional) Cloudera partners enable dynamic masking and tokenization
  • 15. 15© Cloudera, Inc. All rights reserved. Governance: auditing, lineage, metadata Capabilities: ● Inescapable, detailed audit – enabling forensics ● Full tracking of personal data ● Lineage tracking and visualization
  • 16. 16© Cloudera, Inc. All rights reserved. Enterprise-grade key management Manager Navigator Impala Hive HDFS Kudu & HBase Sentry Navigator Key Trustee Log Files Metadata Store Encrypted Data Encryption Key Legend Ingest Paths, Temp/Spill files HSM (optional) • ALL data at rest: HDFS, HBase, metadata databases, temp files, ingest paths • ALL data on the wire • Automated key replication & backup • HSM backed key protection • Sensitive data in logs Encryption Key Management Redaction
  • 17. 17© Cloudera, Inc. All rights reserved. Enterprise-wide breach detection Detect advanced threat leveraging machine learning models RespondDetect Search across Apache Spot’s user, endpoint, and network open data models for full context and accelerated investigation Investigate Use the Apache Spot open data models and Cloudera Navigator to see if the threat is widespread
  • 18. 18© Cloudera, Inc. All rights reserved. Erasing individual records on HDFS and cloud storage ● Concentrate personal data in a small number of “lookup tables” ● Replace personal data in most locations with anonymized or pseudonymised data ● Instead of deleting records upon request, add them to a “to be deleted” list ● Execute a periodic batch job to remove “to be deleted” records by re- writing entire files/tables Challenge: concentrate the personal data enough so that the overhead of the re-write step doesn’t render the cluster unusable for a significant period of time
  • 19. 19© Cloudera, Inc. All rights reserved. Erasing individual records on Kudu
  • 20. 20© Cloudera, Inc. All rights reserved. Laptop vs Centralized Data Science Data scientists pull data to their laptops so they can run their own tools • Copy personal data to laptops • Fails GDPR compliance audit • Potential data breach “Laptop Data Science” ● Personal data remains governed ● Purpose limitation enforced Centralized Data Science Typical Big Data Environment Data Science Workbench
  • 21. 21© Cloudera, Inc. All rights reserved. Cloudera GDPR approach CRM BI Staff Core Data Subject 360 Rights & Consent Manager Governance and audit Enterprise search Right to forget Cyber intelligence Data Science Workbench Web logs Sources Security - encryption, authentication, fine-grained access control
  • 22. 22© Cloudera, Inc. All rights reserved. Professional services: Cloudera GDPR architecture review This engagement offering focuses on providing technical expertise to help customers advance their GDPR compliance by optimising their Cloudera platform and advising on best-of-breed data architecture. The goal is to help customers maximise the value of their data while ensuring the impacts and risks of compliance are minimised. This engagement is for current Cloudera subscription customers The engagement is on a Time and Materials basis. Duration: approximately 2 weeks. The offering includes architecture design work for up to three data sources and implementation work for at least a single source. Cloudera will deliver a full set of documentation.
  • 23. 23© Cloudera, Inc. All rights reserved. What’s Next ● May 25, 2018 is 233 days away ● If you haven’t yet started the foundational activities, start them now! ● Get in touch with your Cloudera account team to learn more.
  • 24. 24© Cloudera, Inc. All rights reserved. Questions?
  • 25. 25© Cloudera, Inc. All rights reserved. Thank you Steve Ross: sross@cloudera.com Mark Donsky: md@cloudera.com