Scaling ML-Based Threat Detection For Production Cyber Attacks


Vulnerabilities such as Spectre and Meltdown continue to plague many production servers based on Intel CPUs. Our solution monitors hardware counters in software and sends that data to Apache Spark clusters for threat detection. We leverage Spark's support for support vector machine (SVM) inference. Our machine learning models are trained offline by a data scientist in a Jupyter notebook environment. As new models are validated, they can be deployed to the Spark cluster directly from the notebook. We have standardized model export and import on ONNX, the open file format for machine learning models. In our presentation, we will demo the full pipeline, from model training to deployment, and discuss the challenges of deploying ML-based cyber-threat detection at scale with Apache Spark. For example, we found that gaps in detection can occur when Spark models are updated. We will describe a novel data ingestion architecture, based on Apache Kafka, that we developed to deal with this issue.


George Williams, GSI Technology
Scaling ML-based Cyber Threat Detection for Production Systems
#UnifiedAnalytics #SparkAISummit

Agenda
● Cybersecurity Trends
● ML + Cybersecurity + Production Systems
● GSI Technology
● Architecture
● Code
Director of Data Science, GSI Technology

Scary Trends
A hacker attack now occurs every 39 seconds
Denial-of-service attacks up 140%
Cost of a breach at $150 million

Opportunities
Cybersecurity spending tops $100 billion
Funding of cybersecurity startups up 25%
Shortfall of 3 million cybersecurity workers
Hottest tech jobs: Security DevOps Engineer and Data Engineer/Scientist

ML + Cybersecurity + Production

ML + Cybersecurity
"When ML and Data Science are the death of a good company: A cautionary tale."
● AI “Washing”
● Model Transparency
● Model Bias
● Adversarial Attacks

Adversarial Attack
Image from OpenAI

ML + Cybersecurity
ML is just another tool and not a silver bullet for cybersecurity!

Cybersecurity + Production

ML + Production Systems
Image from Brendan Gregg

Putting It All Together...

Why?
Custom Chip, PCIe Boards, Super Cluster

Chip SuperCluster Cloud
● Lots and lots of custom chips
● Linux OS
● Managed Hosting
○ Drug Discovery
○ Government
○ Aerospace

Defense In Depth
NETWORK, FILE SYSTEM, HOST
● Perimeter Protection
● Malware/AV Detection
● Host Is Still Vulnerable
○ Credential Theft, Session Hijacking
○ Insider Threats
○ Active Monitoring!

Anomaly Detection
● Cache Side Channel
○ Spectre/Meltdown ← CPU Performance Counters
● Control Flow Attacks
○ ROP ← Call Stack Sampling
● Anomalous User Behavior
○ Exfiltration ← File System Monitoring
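
One way to turn raw CPU performance counters into a cache side-channel signal is to normalise them into rates. The sketch below is illustrative only: the field names mirror the common Linux perf events `cache-references` and `cache-misses`, but the `CounterSample` schema, the `miss_rate` helper, and the sample values are assumptions and are not taken from the talk.

```python
from dataclasses import dataclass


@dataclass
class CounterSample:
    """One sampling interval of hardware counters for a process (hypothetical schema)."""
    pid: int
    cache_references: int
    cache_misses: int


def miss_rate(sample: CounterSample) -> float:
    """Fraction of cache references that missed; 0.0 when nothing was counted."""
    if sample.cache_references == 0:
        return 0.0
    return sample.cache_misses / sample.cache_references


# Made-up values. Cache-timing attacks are often reported to push the miss rate up,
# which is why this ratio is a plausible input feature for a detector.
samples = [
    CounterSample(pid=101, cache_references=200_000, cache_misses=4_000),
    CounterSample(pid=202, cache_references=180_000, cache_misses=90_000),
]
rates = {s.pid: miss_rate(s) for s in samples}
```

A ratio is used instead of the raw miss count so that busy and idle processes can be compared on the same scale.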

Supervised and Unsupervised
[Figure: scatter plots over features X1 and X2, labelled "Boundary" and "Clusters"]
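
The contrast between a learned boundary and discovered clusters can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the models used in the talk: a nearest-centroid classifier stands in for the supervised "boundary" case, a tiny 2-means loop stands in for the unsupervised "clusters" case, and the points, labels, and function names are all hypothetical.

```python
from math import dist
from statistics import fmean


def centroid(points):
    """Mean position of a list of (x1, x2) points."""
    return (fmean(p[0] for p in points), fmean(p[1] for p in points))


def nearest(point, centroids):
    """Index of the centroid closest to the point."""
    return min(range(len(centroids)), key=lambda i: dist(point, centroids[i]))


def fit_supervised(points, labels):
    """Supervised: labels are given, so compute one centroid per label.
    The implied decision boundary is the set of points equidistant from both."""
    classes = sorted(set(labels))
    return [centroid([p for p, y in zip(points, labels) if y == c]) for c in classes]


def fit_kmeans(points, init, iterations=10):
    """Unsupervised: no labels, so alternate between assigning points to the
    nearest centroid and recomputing each centroid (plain k-means)."""
    centroids = list(init)
    for _ in range(iterations):
        groups = [[] for _ in centroids]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        centroids = [centroid(g) if g else c for g, c in zip(groups, centroids)]
    return centroids


# Hypothetical (X1, X2) feature vectors: three "normal" and three "attack" samples.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
labels = [0, 0, 0, 1, 1, 1]

supervised = fit_supervised(points, labels)
clusters = fit_kmeans(points, init=[points[0], points[-1]])
```

On well-separated data like this, both approaches end up with the same two groups; they differ in whether labelled attack samples are needed up front.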

Data Pipeline
Data Source → Spark Cluster

Raw Data Source
● perf
○ Hardware stats
● eBPF
○ Latest Linux kernels
● Python interface
○ BCC library
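
As a rough illustration of the perf side of this slide, the sketch below parses text in the shape of `perf stat -x,` CSV output into a dictionary of counters. The field order (counter value, unit, event name, then run time and percentage) follows the CSV format described in the perf-stat man page, but it is worth verifying against the perf version in use. The `SAMPLE` text and the `parse_perf_csv` helper are made up for this example and are not from the talk.

```python
import csv
import io

# Illustrative lines only, shaped like `perf stat -x,` output.
SAMPLE = """\
4000,,cache-misses,1000000,100.00,,
200000,,cache-references,1000000,100.00,,
<not supported>,,branch-misses,0,100.00,,
"""


def parse_perf_csv(text):
    """Return {event_name: integer_count} for every row with an integer value."""
    counters = {}
    for row in csv.reader(io.StringIO(text)):
        if len(row) < 3:
            continue  # blank or malformed line
        value, event = row[0], row[2]
        if not value.isdigit():
            # Skips markers such as "<not counted>" and non-integer values.
            continue
        counters[event] = int(value)
    return counters


counters = parse_perf_csv(SAMPLE)
```

Records like `counters` could then be serialised and handed to whatever transport feeds the cluster.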

Perf, eBPF → Kafka

Kafka Ingestion
● Real-Time Data
○ For ML Inference
● Historical Data
○ For ML Training
○ For Baselining (anomaly detection, AD)
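
The split between real-time and historical consumers rests on a log that retains records and lets each reader track its own offset. The sketch below models that idea with an in-memory stand-in. It does not use a Kafka client library, and `PartitionLog` and `Consumer` are hypothetical names. It also shows, as an assumption about how such a log could help and not as the architecture from the talk, why a reader that pauses for a model update can catch up without losing records.

```python
class PartitionLog:
    """In-memory stand-in for one Kafka partition: an append-only list of records."""

    def __init__(self):
        self._records = []

    def append(self, record):
        self._records.append(record)
        return len(self._records) - 1  # offset of the new record

    def read(self, offset, max_records=100):
        return self._records[offset:offset + max_records]

    def end_offset(self):
        return len(self._records)


class Consumer:
    """Reader with its own position, starting at the earliest or latest offset."""

    def __init__(self, log, start="earliest"):
        self.log = log
        self.offset = 0 if start == "earliest" else log.end_offset()

    def poll(self):
        batch = self.log.read(self.offset)
        self.offset += len(batch)
        return batch


log = PartitionLog()
for i in range(5):
    log.append({"seq": i})

trainer = Consumer(log, start="earliest")   # historical data for training/baselining
detector = Consumer(log, start="latest")    # real-time data for inference

log.append({"seq": 5})
seen_before_swap = detector.poll()

# While the detector is busy swapping models, records keep arriving...
log.append({"seq": 6})
log.append({"seq": 7})
# ...and are still available from the detector's saved offset afterwards.
seen_after_swap = detector.poll()
history = trainer.poll()
```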

Training and Inference
● Baseline
○ For AD
● ML Training
○ SVM
○ Regression
○ Auto-Encoder
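
A baseline for anomaly detection can be as simple as the mean and standard deviation of a feature observed during normal operation. The sketch below is one minimal way to do that, assuming a z-score rule with a threshold of 3.0. The threshold, the function names, and the sample miss rates are illustrative assumptions and not values from the talk.

```python
from statistics import fmean, pstdev


def fit_baseline(values):
    """Summarise normal behaviour as (mean, population standard deviation)."""
    return fmean(values), pstdev(values)


def is_anomalous(value, baseline, threshold=3.0):
    """Flag a value that lies more than `threshold` standard deviations from the mean."""
    mean, std = baseline
    if std == 0:
        return value != mean
    return abs(value - mean) / std > threshold


# Made-up cache miss rates collected while the host was known to be healthy.
normal_miss_rates = [0.020, 0.022, 0.019, 0.021, 0.018]
baseline = fit_baseline(normal_miss_rates)

typical = is_anomalous(0.021, baseline)
suspicious = is_anomalous(0.5, baseline)
```

Unlike the trained models listed above, this needs no attack samples, only a trusted window of normal data.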

Cache Side Channel Model

Scenario: Cache Side Channel Attack Detector
[Figure: data sources "normal", "POC", and "production", grouped under "TRAINING DATA" and "INFERENCE DATA"]
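
To make the scenario concrete, here is a minimal sketch of training a linear SVM on labelled traces and applying it to new data. It rests on several assumptions: that "normal" and proof-of-concept (POC) attack traces form the training set while production data is only scored, that features have already been standardised, and that plain-Python subgradient descent on the hinge loss is an acceptable stand-in for Spark's SVM support. The data points, hyperparameters, and function names are all hypothetical.

```python
def score(model, x):
    """Signed distance-like score w.x + b."""
    w, b = model
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def train_linear_svm(X, y, epochs=200, lr=0.1, lam=0.01):
    """Subgradient descent on the L2-regularised hinge loss; labels are -1 or +1."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * score((w, b), xi)
            # Shrink weights (gradient of the L2 regulariser).
            w = [wj - lr * lam * wj for wj in w]
            if margin < 1:
                # Sample is inside the margin: step along the hinge-loss subgradient.
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b


def predict(model, x):
    return 1 if score(model, x) >= 0 else -1


# Hypothetical standardised features: -1 = normal trace, +1 = POC attack trace.
X_train = [(-1.0, -1.0), (-1.2, -0.8), (-0.8, -1.1),
           (1.0, 1.0), (1.2, 0.9), (0.9, 1.2)]
y_train = [-1, -1, -1, 1, 1, 1]

model = train_linear_svm(X_train, y_train)

# "Production" samples are only scored, never used for fitting.
production = [(1.1, 1.0), (-1.0, -0.9)]
verdicts = [predict(model, x) for x in production]
```

A model trained this way is just a weight vector and a bias, which is part of why a portable export format is practical for moving it between a notebook and a cluster.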

Code: Baseline Data

Code: Training

Code: Inference

More...
● Model Validation (FP, FN), Hyperparameters?
● Automated and Continuous Learning
● Adversarial Attacks
● Persistence: Models (ONNX), Forensics (Cass)
● Structured Streaming, Pipelines
● Networking/Local Inference
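
For the model validation point, false positives (FP) and false negatives (FN) can be counted directly from labelled predictions. The sketch below is a generic illustration with made-up labels, and the helper names are hypothetical. It treats 1 as "attack" and 0 as "normal".

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary detector."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            counts["tp" if truth == positive else "fp"] += 1
        else:
            counts["fn" if truth == positive else "tn"] += 1
    return counts


def error_rates(counts):
    """False positive rate and false negative rate, guarding against empty classes."""
    negatives = counts["fp"] + counts["tn"]
    positives = counts["fn"] + counts["tp"]
    fpr = counts["fp"] / negatives if negatives else 0.0
    fnr = counts["fn"] / positives if positives else 0.0
    return fpr, fnr


# Made-up ground truth and detector output.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 1, 0, 0]

counts = confusion_counts(y_true, y_pred)
fpr, fnr = error_rates(counts)
```

In a detection setting the two errors cost different things: a false positive wastes analyst time, while a false negative is a missed attack.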

Schedule / Contact
Chip Cluster: Q4 2019
GitHub: HoneycombSecurity Project
Medium: GSI Technology Blog
Twitter: @cgeorgewilliams

