Data leakage detection
ABSTRACT
A data distributor has given sensitive data to a
set of supposedly trusted agents.
Sometimes data is leaked and found in an
unauthorized place, e.g., on the web or on
somebody's laptop.
Data leakage happens every day when
confidential business information is leaked.
Once leaked, the information leaves the company
unprotected and passes outside the control of
the corporation.
Motivation
In the past few years, there has been a
sharp increase in data leakage from many
organizations.
According to the 2006 FBI Computer
Crime and Security Survey, data leakage is
the greatest source of financial loss for
organizations.
These issues motivated me to
choose this project.
Objective
The objective of this project is to
improve the probability of identifying
leakages by using data allocation strategies
across the agents, and to identify the
guilty party who leaked the data by
injecting “realistic but fake” data records.
Problem Statement
In the course of doing business, sometimes
sensitive data must be given to trusted third
parties. Some of the data is leaked and found in
an unauthorized place.
The distributor cannot blame the agent without
any evidence. This project identifies the agent
who leaked the data and provides enough
evidence to support that claim.
Limitations of current system
The current approach can detect the hackers,
but it yields little evidence. Without a
sufficient amount of evidence, the
organization may not be able to take legal
action, so the hackers have a high chance of
escaping.
Proposed system addresses the following issues
1. An algorithm distributes the objects
to agents in a way that improves the chances
of identifying a leaker.
2. Realistic but fake objects are injected into
the distributed set.
3. Leakers cannot argue that they did not
leak the confidential data, because this
system traces leakers with a good amount
of evidence.
Block diagram
[Figure: block diagram. Components: Distributor, Agent, Database.
Labeled steps: Login/registration; Explicit data request; Request data;
View data to transfer to the agents; Add the fake objects to the original data;
Transfer data to agents, using the E-Random or E-Optimal algorithm;
Leaks the data; Find the guilty agents;
Probability distribution of data leaked by guilty agents.]
Modules
1. Data allocation module
2. Fake object module
3. Optimization module
4. Data distributor module
Data Allocation Module:
The main focus of our project is the data
allocation problem: how can the
distributor “intelligently” give data to
agents in order to improve the chances of
detecting a guilty agent?
Fake Object Module:
Fake objects are objects generated by the
distributor in order to increase the
chances of detecting agents that leak data.
The distributor may be able to add fake
objects to the distributed data in order to
improve his effectiveness in detecting
guilty agents. Our use of fake objects is
inspired by the use of “trace” records in
mailing lists.
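
The idea of tracing a leak through fake objects can be sketched as follows. This is only a minimal illustration, not the project's actual code: the class name FakeObjectTracer, the string representation of records, and the placeholder fake-record format are all assumptions made for the example. A real system would need to generate fake records that look plausible to the agent.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch: one unique fake record is added to each agent's data set,
// and the distributor remembers which agent received which fake record.
public class FakeObjectTracer {
    // Fake record -> id of the agent who received it.
    private final Map<String, String> fakeOwner = new HashMap<>();
    private int counter = 0;

    /** Returns a copy of realRecords with one unique fake record appended for this agent. */
    public List<String> allocate(String agentId, List<String> realRecords) {
        // Placeholder fake record (assumption); it only needs to be unique here.
        String fake = "fake-" + (counter++) + "@example.com";
        fakeOwner.put(fake, agentId);
        List<String> out = new ArrayList<>(realRecords);
        out.add(fake);
        return out;
    }

    /** Returns the agents whose fake records appear in the leaked set. */
    public Set<String> suspects(Collection<String> leaked) {
        Set<String> result = new TreeSet<>();
        for (String record : leaked) {
            String owner = fakeOwner.get(record);
            if (owner != null) {
                result.add(owner);
            }
        }
        return result;
    }
}
```

Because each fake record is given to exactly one agent, finding it in a leaked set points to that agent even when the real records are shared by several agents.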
Optimization Module:
In the Optimization Module, the
distributor’s data allocation to agents has
one constraint and one objective. The
distributor’s constraint is to satisfy
agents’ requests, by providing them with
the number of objects they request or
with all available objects that satisfy their
conditions. His objective is to be able to
detect an agent who leaks any portion of
his data.
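
One way to picture the constraint and the objective together is a greedy allocation that gives every agent the number of objects it asked for while preferring objects that have been handed out the fewest times so far. The sketch below is a hypothetical heuristic written for illustration only. It is not claimed to be the E-Random or E-Optimal algorithm named in the block diagram, and the class name OverlapAwareAllocator and its method signature are assumptions. It also assumes that the object list contains no duplicates.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical greedy heuristic: satisfy each request (the constraint) while
// keeping the overlap between agents' data sets small (a proxy for the objective).
public class OverlapAwareAllocator {

    /**
     * @param objects  distinct data objects owned by the distributor
     * @param requests agent id -> number of objects requested; iteration order
     *                 is the order in which agents are served
     * @return agent id -> objects allocated to that agent
     */
    public static Map<String, List<String>> allocate(List<String> objects,
                                                     Map<String, Integer> requests) {
        // How many agents already hold each object.
        Map<String, Integer> timesGiven = new HashMap<>();
        for (String o : objects) {
            timesGiven.put(o, 0);
        }

        Map<String, List<String>> result = new LinkedHashMap<>();
        for (Map.Entry<String, Integer> e : requests.entrySet()) {
            // An agent cannot receive more objects than exist, nor a negative number.
            int count = Math.max(0, Math.min(e.getValue(), objects.size()));

            // Least-shared objects first; the sort is stable, so ties keep list order.
            List<String> candidates = new ArrayList<>(objects);
            candidates.sort(Comparator.comparingInt((String o) -> timesGiven.get(o)));

            List<String> given = new ArrayList<>(candidates.subList(0, count));
            for (String o : given) {
                timesGiven.merge(o, 1, Integer::sum);
            }
            result.put(e.getKey(), given);
        }
        return result;
    }
}
```

The less two agents' data sets overlap, the easier it is to attribute a leaked object to a single agent, which is why the heuristic spreads objects out before it starts sharing them.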
Data Distributor:
A data distributor has given sensitive data
to a set of supposedly trusted agents
(third parties). Some of the data is leaked
and found in an unauthorized place (e.g.,
on the web or somebody’s laptop). The
distributor must assess the likelihood that
the leaked data came from one or more
agents, as opposed to having been
independently gathered by other means.
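
The likelihood assessment described above can be sketched with a simple guilt model. The example assumes the formulation used in Papadimitriou and Garcia-Molina's "Data Leakage Detection" paper: each leaked object was guessed independently by the finder with probability p, and otherwise was leaked by one of the agents holding it, each equally likely. Under these assumptions, the probability that agent i is guilty given the leaked set S is 1 minus the product, over every leaked object t that agent i holds, of (1 - (1 - p) / |V_t|), where V_t is the set of agents holding t. The class name GuiltEstimator, the parameter names, and the value of p are illustrative assumptions.

```java
import java.util.Map;
import java.util.Set;

// Sketch of a guilt estimate under the assumptions stated above.
public class GuiltEstimator {

    /**
     * @param leaked    objects found in the unauthorized place (S)
     * @param agentData objects that were given to the agent under examination (R_i)
     * @param holders   object -> number of agents that received it (|V_t|)
     * @param p         assumed probability that an object was obtained by other means
     * @return estimated probability that the agent leaked at least one object
     */
    public static double guiltProbability(Set<String> leaked,
                                          Set<String> agentData,
                                          Map<String, Integer> holders,
                                          double p) {
        double notGuilty = 1.0;
        for (String t : leaked) {
            if (!agentData.contains(t)) {
                continue; // The agent never had this object, so it says nothing about them.
            }
            // The agent itself holds t, so at least one agent holds it.
            int holderCount = Math.max(1, holders.getOrDefault(t, 1));
            // Probability that this agent did NOT leak t.
            notGuilty *= 1.0 - (1.0 - p) / holderCount;
        }
        return 1.0 - notGuilty;
    }
}
```

An object held by only one agent raises that agent's estimate sharply, while an object shared by many agents contributes little, which is the reason the allocation tries to keep agents' data sets distinct.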
Software & Hardware Requirements
Hardware Required:
System : Pentium IV 2.4 GHz
Hard Disk : 40 GB
Floppy Drive : 1.44 MB
RAM : 256 MB
Software Required:
O/S : Windows XP
Language : J2EE
Database : MySQL Server
