4. Hadoop
• Hadoop allows distributed processing of large
datasets across clusters of computers using simple
programming models.
• An application built on the Hadoop framework runs in an
environment that provides distributed storage and
computation across clusters of computers.
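To make "simple programming models" concrete, here is a minimal single-process sketch of the map / shuffle / reduce pattern that Hadoop MapReduce popularized. It is plain Python, not the Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. On a real cluster the framework would run the map and reduce steps on many machines and do the shuffle over the network.

```python
from collections import defaultdict


def map_phase(line):
    # Emit a (word, 1) pair for every word in one input line.
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    # Group all values by key, as the framework does between map and reduce.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    # Collapse the grouped values for one key into a single count.
    return key, sum(values)


def word_count(lines):
    # Run the three phases in sequence inside one process (illustration only).
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data big cluster", "data node"]))
```

The programmer writes only the map and reduce functions; splitting the input, distributing the work, and grouping by key are left to the framework.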
6. Hadoop Cluster
A Hadoop cluster is a special type of
computational cluster designed specifically
for storing and analyzing huge amounts
of unstructured data in a distributed
computing environment.
10. Security
A form of protection in which a separation is created between
the assets and the threat
Security in IT:
• Application Security
• Computing Security
• Data Security
• Information Security
• Network Security
11. Why Secure Hadoop
Client operating system is trusted to identify the user (weak
authentication)
• If I can compromise the client, I can run jobs or access HDFS as
anyone
• Think about virtual machines with root access
Hadoop servers trust anyone who can reach them on
the network
Intruders can see and modify all network traffic
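The weakness of trusting the client to identify the user can be sketched with a toy model. Everything here is hypothetical (ToyNameNode, client_identity, the file path, and the user names are made up for illustration, and this is not Hadoop code); it only shows why a server that accepts a client-asserted user name offers no real protection.

```python
class ToyNameNode:
    """Hypothetical stand-in for a server that trusts client-asserted identity."""

    def __init__(self):
        # Made-up file ownership table.
        self.owners = {"/user/alice/salaries.csv": "alice"}

    def read(self, path, claimed_user):
        # No proof of identity is requested: the claim is taken at face value.
        if self.owners.get(path) != claimed_user:
            raise PermissionError(f"{claimed_user} may not read {path}")
        return f"contents of {path}"


def client_identity(os_user, override=None):
    # The client machine decides which name to send. Someone with root on
    # their own virtual machine can make this return any name they like.
    return override or os_user


if __name__ == "__main__":
    server = ToyNameNode()
    # Mallory simply claims to be alice and the server believes it.
    name = client_identity("mallory", override="alice")
    print(server.read("/user/alice/salaries.csv", name))
```

The permission check itself is correct; the problem is that its input, the user name, comes from a machine the attacker controls.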
17. Kerberos
Kerberos is a network authentication protocol that
uses secret-key cryptography to
authenticate client-server applications.
A Kerberos client requests an encrypted ticket
through an authenticated server sequence and
presents that ticket to use services.
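The ticket idea can be illustrated with a heavily simplified sketch. This is not the Kerberos protocol: real Kerberos encrypts tickets and uses separate authentication and ticket-granting exchanges, while this toy collapses them into one step and uses an HMAC seal in place of encryption. All names (ToyKDC, seal, unseal, the keys and principals) are hypothetical. What it does show is the core property: the service can verify a ticket with its own secret key, without trusting whatever name the client claims.

```python
import hashlib
import hmac
import json
import time


def seal(key: bytes, payload: dict) -> dict:
    # Stand-in for ticket encryption: bind the payload to the service's key.
    body = json.dumps(payload, sort_keys=True)
    tag = hmac.new(key, body.encode(), hashlib.sha256).hexdigest()
    return {"body": body, "tag": tag}


def unseal(key: bytes, ticket: dict) -> dict:
    # The service checks the seal with its own key, then the expiry time.
    expected = hmac.new(key, ticket["body"].encode(), hashlib.sha256).hexdigest()
    if not hmac.compare_digest(expected, ticket["tag"]):
        raise PermissionError("ticket was not issued by the KDC")
    payload = json.loads(ticket["body"])
    if payload["expires"] < time.time():
        raise PermissionError("ticket expired")
    return payload


class ToyKDC:
    """Hypothetical key distribution center sharing a secret with each party."""

    def __init__(self, user_keys, service_keys):
        self.user_keys = user_keys
        self.service_keys = service_keys

    def issue_ticket(self, user, nonce, proof, service):
        # The client proves it knows its secret key by HMAC-ing a nonce.
        expected = hmac.new(self.user_keys[user], nonce, hashlib.sha256).digest()
        if not hmac.compare_digest(expected, proof):
            raise PermissionError("authentication failed")
        payload = {"user": user, "service": service, "expires": time.time() + 300}
        return seal(self.service_keys[service], payload)


if __name__ == "__main__":
    kdc = ToyKDC({"alice": b"alice-secret"}, {"hdfs": b"hdfs-secret"})
    nonce = b"n-1"
    proof = hmac.new(b"alice-secret", nonce, hashlib.sha256).digest()
    ticket = kdc.issue_ticket("alice", nonce, proof, "hdfs")
    print(unseal(b"hdfs-secret", ticket)["user"])
```

Because the seal depends on a key known only to the KDC and the service, a client that edits the user name inside the ticket invalidates it.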