Phishing spie 2012 presentation - jsw - d2

Joshua S. White, PhD josh@securemind.org

A Method for Automated Detection of Phishing
Websites: Through Both Site Characteristics and
Image Analysis

Joshua S. White
Jeanna N. Matthews, PhD

Outline
• Problem
• Method
– Image Analysis (in detail)
• Method Verification
• Results
• Conclusion
• References

Problem
• Phishing site detection
– A largely manual process
• Requires human visual review of site to
eliminate false positives / negatives
– URLs come from actual phishing attempts
• Email and other user-reported URLs
– Analysis is reactive, not proactive

Method
• For rapid proof of concept
– Data collected using the 140Dev PHP script
and MySQL schema

• Page characteristics collected using PHP for
DOM object parsing
– Links, Images, Forms, Iframes, Meta Tags
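The characteristic collection above can be sketched in a few lines. The original tool used PHP DOM parsing; this is only an illustrative Python equivalent, and the names TRACKED_TAGS, CharacteristicCounter and page_characteristics are hypothetical, not taken from the authors' code. It simply counts the five element types listed on the slide.

```python
from collections import Counter
from html.parser import HTMLParser

# Map HTML tag names to the characteristics named on the slide.
# (Assumption: each characteristic corresponds to one tag.)
TRACKED_TAGS = {
    "a": "links",
    "img": "images",
    "form": "forms",
    "iframe": "iframes",
    "meta": "meta_tags",
}


class CharacteristicCounter(HTMLParser):
    """Counts occurrences of the tracked tags in an HTML document."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this method.
        name = TRACKED_TAGS.get(tag)
        if name:
            self.counts[name] += 1


def page_characteristics(html):
    """Return a dict with one count per tracked characteristic."""
    parser = CharacteristicCounter()
    parser.feed(html)
    parser.close()
    return {name: parser.counts[name] for name in TRACKED_TAGS.values()}
```

Two pages could then be compared by their count vectors; how the authors actually scored a characteristic match is not stated in the slides.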

Image Analysis
• Screenshots collected using a headless web browser
– CutyCapt, xvfb-run
• Hashing of resultant images
– MD5, SHA-512, pHash
• Final choice was pHash (Perceptual Hash)
– Uses the discrete cosine transform (DCT)
» Reduces sampling frequency
• Hamming Distance used to compare
each hash value
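A minimal sketch of the comparison step, assuming each hash is held as an integer. The two 64-bit values below are made up for illustration, and md5_as_int is a hypothetical helper included only to show why a cryptographic hash is a poor fit here: any change to the input gives an unrelated digest, so the bit distance says nothing about visual similarity.

```python
import hashlib


def hamming_distance(hash_a: int, hash_b: int) -> int:
    """Number of bit positions at which two equal-width hashes differ."""
    return bin(hash_a ^ hash_b).count("1")


def md5_as_int(data: bytes) -> int:
    """MD5 digest of data as a 128-bit integer (for bit comparison only)."""
    return int.from_bytes(hashlib.md5(data).digest(), "big")


# Hypothetical 64-bit perceptual hashes of two near-identical screenshots.
legit = 0xF0F0F0F0F0F0F0F0
spoof = 0xF0F0F0F0F0F0F0F7
print(hamming_distance(legit, spoof))  # 3: only the low three bits differ

# A one-byte change in the input scrambles a cryptographic digest.
print(hamming_distance(md5_as_int(b"login page"), md5_as_int(b"login page ")))
```

A small distance between perceptual hashes suggests visually similar pages; the threshold used by the authors is not given in the slides.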

Image Analysis
• Process:
– Reduce the size of the image to 32 x 32
– Reduce the color to greyscale
– Calculate the DCT (creates frequency scalars)
– Reduce the DCT to the top-left 8 x 8 block (the lowest frequencies)
– Second DCT reduction: set each bit to 1 or 0 depending on
whether the value is above or below the average DCT value
– Take the hash from the resulting 64 bits
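The process above can be sketched in pure Python. This is an illustration under stated assumptions, not the pHash library's implementation: the input is assumed to be already resized to 32 x 32 and converted to greyscale (for example by an imaging library), the DCT is an unnormalised DCT-II, and the average excludes the first (DC) coefficient, as commonly described for this algorithm. The function names are hypothetical.

```python
import math


def dct_1d(values):
    """Unnormalised DCT-II of a sequence of numbers."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * (i + 0.5) * k / n) for i, v in enumerate(values))
        for k in range(n)
    ]


def dct_2d(matrix):
    """2-D DCT: transform every row, then every column."""
    rows = [dct_1d(row) for row in matrix]
    cols = [dct_1d(col) for col in zip(*rows)]
    # cols is indexed [horizontal][vertical]; transpose back to [row][col].
    return [list(row) for row in zip(*cols)]


def phash_bits(gray, keep=8):
    """64-bit perceptual hash of a square greyscale grid (e.g. 32 x 32)."""
    coeffs = dct_2d(gray)
    # Keep only the top-left keep x keep block (lowest frequencies).
    block = [coeffs[r][c] for r in range(keep) for c in range(keep)]
    # Assumption: average over all kept values except the DC term block[0].
    mean = sum(block[1:]) / (len(block) - 1)
    bits = 0
    for value in block:
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


# Synthetic test image: a horizontal gradient, and a slightly brighter copy.
gradient = [[8 * col for col in range(32)] for _ in range(32)]
brighter = [[pixel + 5 for pixel in row] for row in gradient]
print(phash_bits(gradient) == phash_bits(brighter))
```

A uniform brightness shift only changes the DC coefficient, which is why the two synthetic images are expected to hash identically, while a byte-level hash such as MD5 would differ completely.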

Results
• After our method was verified, we concentrated
on the top 5 most spoofed sites:

• Some False Characteristic Matches:

Conclusion
• Phishing URL posting on social media networks
is a growing problem
• We have developed a tool that quickly and
effectively detects matches between legitimate
and spoofed sites
• Future work includes:
– Integration of our characteristic mapping and
image analysis technique into our social
media analytics toolkit


