SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Downloaden Sie, um offline zu lesen
GRAU Data Space 2.0 –
The Secure Communication Platform for
Businesses and Organizations
YOUR DATA. YOUR CONTROL

7. Dez 2013
Architectural Overview
●

●

●

The GDS is based on a very robust core which is available since years
The architecture scales from SMB (<100 user) to large enterprises and
service providers (>100.000 users)
The key features for scalability are:
–
–
–
–
–
–
–
–
–
–
–

Separation between data and meta data (optional)
Transactional scalable storage backend
Versioning of all file objects (UUID)
Chunking of large objects (chunksize can be different for each object)
Hashing of chunked objects (offloading to object store is possible)
Chunk level deduplication based on hash (under development)
Bidirectional master/master replication of all data and meta data on folder level
Session director allows redirection of sessions to another node
RESTful APIs
CMIS (getContentChanges)
Distributable in-memory cache for meta data
Open interfaces
●

Open standard interfaces
–
–

JSON/SOAP core API

–
●

WebDAV
CIFS

Gateways
–
–

●

OwnCloud
CMIS 1.1 (SOAP, AtomPub, JSON)

Identity Management
–

Provisioning Gateway (LDAP, AD,SQL)

–

Authentication Gateway (LDAP, AD, RADIUS)
Architecture
Admin
GUI

WebGUI

ownCloudGW

Adm GW
GDS2 API (JSON)

CMIS GW
CIFS

WebDAV
GDS core

Storage Backend
Object-Store

Caringo
S3
SWIFT

FS/CIFS

NAS
GAM

Metadata
SQL

DB/2
Oracle
MySQL
Postgres

SQL

DB/2
Oracle
MySQL
Postgres
Storage Backend (1)
●

Storage backends:
– Filesystem (ext4, XFS)
–
–

RDBMS (MySQL, Oracle, Postgres, MSSQL, DB2)

–
●

NAS / CIFS
Object stores (Caringo, S3, SWIFT)

Plugins:
–

Object chunking (size definable on object level, 512k default)

–

Hashing (MD5, SHA-1, SHA-256)

–

Dedup on chunk-level [under development]

–

Mirroring (one or many backends) [planed]

–

Crypto (symmetrical) [planed]

–

HSM [planed]
Storage Backend (2)
GDS core
Storage Backend
Hashing
(optional)

Chunking
(512kB)

Crypto
(sym.)

Mirroring

Object store

Filesystem

CIFS

SQL

NAS
GAM/Archive

DB/2
Oracle
MySQL
Postgres

Caringo
RADOS
SWIFT/S3

ext4
XFS
Storage Backend (3)
GDS2 API (JSON)

GDS2 API (JSON)

GDS core

GDS core

Metadata

Object Store

Replication
Metadata

Object Store

SWIFT

SWIFT

RADOS GW

RADOS GW

librados

librados

RADOS
OSD

RADOS
OSD

RADOS
OSD

Metadata
Scalability / High availibility
●

Master/master replication on folder level
–
–

Users, groups

–
●

Data, metadata
Access lists

Shared nothing architecture
–
–

High availability

–

Users that share a lot of folders can be relocated to the same node

–

Adding or removing nodes dynamically

–
●

Horizontal scalability

Software updates on deactivated nodes

Distributed metadata cache
–

●

CMIS gateway allows session and metadata caching

Session redirector (reverse proxy)
–

Redirects session to the home node of the user

–

If the home node is down, one of the backup nodes will be used
High availibility
Load Balancer

Load Balancer

GDS (Session) Director

GDS (Session) Director

GDS2 API (JSON)

GDS2 API (JSON)

GDS core

GDS core

Storage

Metadata

Replication
Data
Metadata

Storage

Metadata
Scalability (1)
Load Balancer

Load Balancer

GDS (Session) Director

GDS (Session) Director

GDS2 API (JSON)

GDS2 API (JSON)

GDS core

GDS core

Metadata

Data

Master/Master
Replication
Metadata

Objectstore / Cluster filesystem

Data

Metadata
Scalability (2)
Load Balancer

Load Balancer

GDS (Session) Director

GDS (Session) Director
CMIS Cache

CMIS Cache

CMIS Cache

CMIS Cache

GDS2 API (JSON)

GDS2 API (JSON)

GDS2 API (JSON)

GDS core

GDS core

GDS core

MD

Data

Metadata
Replication

MD

Data

Metadata
Replication

Objectstore / Cluster filesystem

MD

Data
Multiple Sites - Roaming (1)
●
●
●
●
●
●
●

Every user has a home node which is stored in the account data
Redundancy of file objects is provided by objects store at each site
Users, groups and ACLs are synchronized between all sites
File objects are not synchronized between sites
Synchronization takes place asynchronously
Load balancer directs client request to session director
Session director redirects request based on user account to
–
–

Node which hosts shared data room [shared]

–
●

Home node of the user [my]
Any node [global]

Session director analyzes the request and forwards to
–

CMIS caching layer

–

JSON API layer
Multiple Sites - Roaming (2)
CMIS

JSON
LB

LB

LB

LB

GDS Director

GDS Director

GDS Director

GDS Director

CMIS Cache

CMIS Cache

CMIS Cache

CMIS Cache

GDS2 API

GDS2 API

GDS2 API

GDS2 API

GDS core

GDS core

GDS core

GDS core

MD

Data

Data

Site A

MD

MD

Data

Data

Site B

MD
Identity Management (1)
●
●
●

●

Separation between user provisioning and authentication
Multiple instances of gateways are possible
Multiple directories can be connected in parallel
Provisioning gateway
–

LDAP/AD/SQL crawler

–

Users that match a regular expression are created in the GDS

–

Users that got deleted in the directory get deactivated in the GDS

–

SCIM/SAML module [planed]
Identity Management (2)
●

Authentication gateway
–
–

Multilevel authentication

–

Google authenticator [planed]

–

RADIUS module [planned]

–
●

LDAP/AD/SQL module

MTAN/OTP module [planed]

Single Sign-On [planned]
–

Kerberos module

–

OAUTH2 module
Identity Management (3)
WebGUI

Admin GUI

Admin GW
GDS2 API (JSON)

Provisioning
Gateway

LDAP/AD
SQL
SAML

GDS core
Storage Backend

Metadata

Authentication
Gateway

LDAP/AD
SAML
RADIUS
Multi Tenancy
●

Dedicated Hardware
–
–

●

Highest level of separation and security
No performance impact of virtualization layer

Full virtualization (KVM, HyperV, Vmware, XEN)
–
–

Similar static memory pages can be shared between instances

–
●

Highest level of separation and security in virtualized environment
GDS version can be different for each tenant

Linux Containers (LXC)
–
–

●

Lightweight virtualization
Memory and program files on disk can be shared between instances

Single instance
–

Same GDS version for all tenants

–

Everything gets shared

–

Software bugs or operational problems affect all tenants
Distributed Data Space
GDS
CIFS

FW

Site B

GDS

CIFS JSON

LAN

Site A

CIFS JSON

HT
T

S
TP
HT

CIFS

FW

LAN

FW

LAN

GDS

CIFS

PS

Internet
HT
T

LAN

CIFS

FW

GDS
CIFS JSON

PS

Site C

HT

Site D

PS
T

CIFS JSON
CMIS

Site B1

Site B2

WebDAV

GDS

HT
T

Site B
CIFS

GDS

CMIS

GDS

WebDAV
PS

OS

OS

Site C
CIFS
CMIS
WebDAV

GDS

HT
T

PS

CMIS Cache

HT

S
TP

SD

WebDAV

CMIS

GDS

GDS

CMIS

CIFS

HT

WebDAV

PS
T

HTTPS

CIFS

GDS

SD

CIFS

CMIS Cache

Site A

CMIS Cache

Corporate CDN

OS
GDS
Cloud attached Data Space
Site A

GDS
CIFS

LAN

CIFS JSON

GDS

FW
HT

LB

PS
T

GDS
Internet

HT
T

LAN

CIFS

FW

GDS
CIFS JSON

FW
GDS

PS

Site B

LB
GDS
YOUR DATA. YOUR CONTROL.

WWW:

HTTP://WWW.GRAUDATA.COM/DATASPACE

E-MAIL:

THOMAS.UHL@GRAUDATA.COM

CEL:

+49 151 54354373

TWITTER:

@graudataspace

Weitere ähnliche Inhalte

Was ist angesagt?

Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015Atin Mukherjee
 
NIIF Grid Development portfolio
NIIF Grid Development portfolioNIIF Grid Development portfolio
NIIF Grid Development portfolioFerenc Szalai
 
GlusterD 2.0 - Managing Distributed File System Using a Centralized Store
GlusterD 2.0 - Managing Distributed File System Using a Centralized StoreGlusterD 2.0 - Managing Distributed File System Using a Centralized Store
GlusterD 2.0 - Managing Distributed File System Using a Centralized StoreAtin Mukherjee
 
Cache Tiering and Erasure Coding
Cache Tiering and Erasure CodingCache Tiering and Erasure Coding
Cache Tiering and Erasure CodingShinobu Kinjo
 
Lisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introductionLisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introductionGluster.org
 
Red Hat Gluster Storage, Container Storage and CephFS Plans
Red Hat Gluster Storage, Container Storage and CephFS PlansRed Hat Gluster Storage, Container Storage and CephFS Plans
Red Hat Gluster Storage, Container Storage and CephFS PlansRed_Hat_Storage
 
Tiering barcelona
Tiering barcelonaTiering barcelona
Tiering barcelonaGluster.org
 
Scalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudScalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudRed_Hat_Storage
 
Postgres-XC Write Scalable PostgreSQL Cluster
Postgres-XC Write Scalable PostgreSQL ClusterPostgres-XC Write Scalable PostgreSQL Cluster
Postgres-XC Write Scalable PostgreSQL ClusterMason Sharp
 
Introducing the ultimate MariaDB cloud, SkySQL
Introducing the ultimate MariaDB cloud, SkySQLIntroducing the ultimate MariaDB cloud, SkySQL
Introducing the ultimate MariaDB cloud, SkySQLMariaDB plc
 
SSDs: A New Generation of Storage Devices
SSDs: A New Generation of Storage DevicesSSDs: A New Generation of Storage Devices
SSDs: A New Generation of Storage DevicesHTS Hosting
 
Red Hat Ceph Storage Roadmap: January 2016
Red Hat Ceph Storage Roadmap: January 2016Red Hat Ceph Storage Roadmap: January 2016
Red Hat Ceph Storage Roadmap: January 2016Red_Hat_Storage
 
Sdc challenges-2012
Sdc challenges-2012Sdc challenges-2012
Sdc challenges-2012Gluster.org
 
The role of databases in modern application development
The role of databases in modern application developmentThe role of databases in modern application development
The role of databases in modern application developmentMariaDB plc
 
Cloud economics design, capacity and operational concerns
Cloud economics  design, capacity and operational concernsCloud economics  design, capacity and operational concerns
Cloud economics design, capacity and operational concernsMarcos García
 
Red Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - OpenStack + Ceph StorageRed Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - OpenStack + Ceph StorageRed_Hat_Storage
 

Was ist angesagt? (20)

Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015Gluster fs architecture_&_roadmap_atin_punemeetup_2015
Gluster fs architecture_&_roadmap_atin_punemeetup_2015
 
NIIF Grid Development portfolio
NIIF Grid Development portfolioNIIF Grid Development portfolio
NIIF Grid Development portfolio
 
GlusterD 2.0 - Managing Distributed File System Using a Centralized Store
GlusterD 2.0 - Managing Distributed File System Using a Centralized StoreGlusterD 2.0 - Managing Distributed File System Using a Centralized Store
GlusterD 2.0 - Managing Distributed File System Using a Centralized Store
 
Cache Tiering and Erasure Coding
Cache Tiering and Erasure CodingCache Tiering and Erasure Coding
Cache Tiering and Erasure Coding
 
Ceph meetup montreal
Ceph meetup montrealCeph meetup montreal
Ceph meetup montreal
 
Lisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introductionLisa 2015-gluster fs-introduction
Lisa 2015-gluster fs-introduction
 
YDAL Barcelona
YDAL BarcelonaYDAL Barcelona
YDAL Barcelona
 
Red Hat Gluster Storage, Container Storage and CephFS Plans
Red Hat Gluster Storage, Container Storage and CephFS PlansRed Hat Gluster Storage, Container Storage and CephFS Plans
Red Hat Gluster Storage, Container Storage and CephFS Plans
 
Tiering barcelona
Tiering barcelonaTiering barcelona
Tiering barcelona
 
Scalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the CloudScalable POSIX File Systems in the Cloud
Scalable POSIX File Systems in the Cloud
 
Postgres-XC Write Scalable PostgreSQL Cluster
Postgres-XC Write Scalable PostgreSQL ClusterPostgres-XC Write Scalable PostgreSQL Cluster
Postgres-XC Write Scalable PostgreSQL Cluster
 
Gluster d2
Gluster d2Gluster d2
Gluster d2
 
GlusterFS And Big Data
GlusterFS And Big DataGlusterFS And Big Data
GlusterFS And Big Data
 
Introducing the ultimate MariaDB cloud, SkySQL
Introducing the ultimate MariaDB cloud, SkySQLIntroducing the ultimate MariaDB cloud, SkySQL
Introducing the ultimate MariaDB cloud, SkySQL
 
SSDs: A New Generation of Storage Devices
SSDs: A New Generation of Storage DevicesSSDs: A New Generation of Storage Devices
SSDs: A New Generation of Storage Devices
 
Red Hat Ceph Storage Roadmap: January 2016
Red Hat Ceph Storage Roadmap: January 2016Red Hat Ceph Storage Roadmap: January 2016
Red Hat Ceph Storage Roadmap: January 2016
 
Sdc challenges-2012
Sdc challenges-2012Sdc challenges-2012
Sdc challenges-2012
 
The role of databases in modern application development
The role of databases in modern application developmentThe role of databases in modern application development
The role of databases in modern application development
 
Cloud economics design, capacity and operational concerns
Cloud economics  design, capacity and operational concernsCloud economics  design, capacity and operational concerns
Cloud economics design, capacity and operational concerns
 
Red Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - OpenStack + Ceph StorageRed Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - OpenStack + Ceph Storage
 

Ähnlich wie GRAU DataSpace Architecture

Advanced Globus System Administration Topics
Advanced Globus System Administration TopicsAdvanced Globus System Administration Topics
Advanced Globus System Administration TopicsGlobus
 
Community Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonCommunity Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonSage Weil
 
Managing 100s of PetaBytes of data in Cloud
Managing 100s of PetaBytes of data in CloudManaging 100s of PetaBytes of data in Cloud
Managing 100s of PetaBytes of data in Cloudlohitvijayarenu
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo SeidelOSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo SeidelNETWAYS
 
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...IRJET Journal
 
Architecting Big Data Applications with HDInsight
Architecting Big Data Applications with HDInsightArchitecting Big Data Applications with HDInsight
Architecting Big Data Applications with HDInsightAshish Thapliyal
 
Webinar: Utilisations courantes de MongoDB
Webinar: Utilisations courantes de MongoDBWebinar: Utilisations courantes de MongoDB
Webinar: Utilisations courantes de MongoDBMongoDB
 
Bootcamp 2017 - SQL Server on Linux
Bootcamp 2017 - SQL Server on LinuxBootcamp 2017 - SQL Server on Linux
Bootcamp 2017 - SQL Server on LinuxMaximiliano Accotto
 
Hungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsHungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsFerenc Szalai
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed_Hat_Storage
 
Map Reduce
Map ReduceMap Reduce
Map Reduceopenak
 
Backup multi-cloud solution based on named pipes
Backup multi-cloud solution based on named pipesBackup multi-cloud solution based on named pipes
Backup multi-cloud solution based on named pipesLeandro Totino Pereira
 
Strategies for Context Data Persistence
Strategies for Context Data PersistenceStrategies for Context Data Persistence
Strategies for Context Data PersistenceFIWARE
 
Microsoft Azure News - February 2018
Microsoft Azure News - February 2018Microsoft Azure News - February 2018
Microsoft Azure News - February 2018Daniel Toomey
 
SQL Server 2017 on Linux Introduction
SQL Server 2017 on Linux IntroductionSQL Server 2017 on Linux Introduction
SQL Server 2017 on Linux IntroductionTravis Wright
 

Ähnlich wie GRAU DataSpace Architecture (20)

Advanced Globus System Administration Topics
Advanced Globus System Administration TopicsAdvanced Globus System Administration Topics
Advanced Globus System Administration Topics
 
Community Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonCommunity Update at OpenStack Summit Boston
Community Update at OpenStack Summit Boston
 
mongodb tutorial
mongodb tutorialmongodb tutorial
mongodb tutorial
 
Managing 100s of PetaBytes of data in Cloud
Managing 100s of PetaBytes of data in CloudManaging 100s of PetaBytes of data in Cloud
Managing 100s of PetaBytes of data in Cloud
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo SeidelOSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
 
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
 
SQL on linux
SQL on linuxSQL on linux
SQL on linux
 
Architecting Big Data Applications with HDInsight
Architecting Big Data Applications with HDInsightArchitecting Big Data Applications with HDInsight
Architecting Big Data Applications with HDInsight
 
Webinar: Utilisations courantes de MongoDB
Webinar: Utilisations courantes de MongoDBWebinar: Utilisations courantes de MongoDB
Webinar: Utilisations courantes de MongoDB
 
Training
TrainingTraining
Training
 
Bootcamp 2017 - SQL Server on Linux
Bootcamp 2017 - SQL Server on LinuxBootcamp 2017 - SQL Server on Linux
Bootcamp 2017 - SQL Server on Linux
 
Azure Data Storage
Azure Data StorageAzure Data Storage
Azure Data Storage
 
Hungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applicationsHungarian ClusterGrid and its applications
Hungarian ClusterGrid and its applications
 
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructureRed Hat Enterprise Linux: Open, hyperconverged infrastructure
Red Hat Enterprise Linux: Open, hyperconverged infrastructure
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Backup multi-cloud solution based on named pipes
Backup multi-cloud solution based on named pipesBackup multi-cloud solution based on named pipes
Backup multi-cloud solution based on named pipes
 
Strategies for Context Data Persistence
Strategies for Context Data PersistenceStrategies for Context Data Persistence
Strategies for Context Data Persistence
 
Microsoft Azure News - February 2018
Microsoft Azure News - February 2018Microsoft Azure News - February 2018
Microsoft Azure News - February 2018
 
SQL Server 2017 on Linux Introduction
SQL Server 2017 on Linux IntroductionSQL Server 2017 on Linux Introduction
SQL Server 2017 on Linux Introduction
 

Kürzlich hochgeladen

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 

Kürzlich hochgeladen (20)

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 

GRAU DataSpace Architecture

  • 1. GRAU Data Space 2.0 – The Secure Communication Platform for Businesses and Organizations YOUR DATA. YOUR CONTROL 7. Dez 2013
  • 2. Architectural Overview ● ● ● The GDS is based on a very robust core which is available since years The architecture scales from SMB (<100 user) to large enterprises and service providers (>100.000 users) The key features for scalability are: – – – – – – – – – – – Separation between data and meta data (optional) Transactional scalable storage backend Versioning of all file objects (UUID) Chunking of large objects (chunksize can be different for each object) Hashing of chunked objects (offloading to object store is possible) Chunk level deduplication based on hash (under development) Bidirectional master/master replication of all data and meta data on folder level Session director allows redirection of sessions to another node RESTful APIs CMIS (getContentChanges) Distributable in-memory cache for meta data
  • 3. Open interfaces ● Open standard interfaces – – JSON/SOAP core API – ● WebDAV CIFS Gateways – – ● OwnCloud CMIS 1.1 (SOAP, AtomPub, JSON) Identity Management – Provisioning Gateway (LDAP, AD,SQL) – Authentication Gateway (LDAP, AD, RADIUS)
  • 4. Architecture Admin GUI WebGUI ownCloudGW Adm GW GDS2 API (JSON) CMIS GW CIFS WebDAV GDS core Storage Backend Object-Store Caringo S3 SWIFT FS/CIFS NAS GAM Metadata SQL DB/2 Oracle MySQL Postgres SQL DB/2 Oracle MySQL Postgres
  • 5. Storage Backend (1) ● Storage backends: – Filesystem (ext4, XFS) – – RDBMS (MySQL, Oracle, Postgres, MSSQL, DB2) – ● NAS / CIFS Object stores (Caringo, S3, SWIFT) Plugins: – Object chunking (size definable on object level, 512k default) – Hashing (MD5, SHA-1, SHA-256) – Dedup on chunk-level [under development] – Mirroring (one or many backends) [planed] – Crypto (symmetrical) [planed] – HSM [planed]
  • 6. Storage Backend (2) GDS core Storage Backend Hashing (optional) Chunking (512kB) Crypto (sym.) Mirroring Object store Filesystem CIFS SQL NAS GAM/Archive DB/2 Oracle MySQL Postgres Caringo RADOS SWIFT/S3 ext4 XFS
  • 7. Storage Backend (3) GDS2 API (JSON) GDS2 API (JSON) GDS core GDS core Metadata Object Store Replication Metadata Object Store SWIFT SWIFT RADOS GW RADOS GW librados librados RADOS OSD RADOS OSD RADOS OSD Metadata
  • 8. Scalability / High availibility ● Master/master replication on folder level – – Users, groups – ● Data, metadata Access lists Shared nothing architecture – – High availability – Users that share a lot of folders can be relocated to the same node – Adding or removing nodes dynamically – ● Horizontal scalability Software updates on deactivated nodes Distributed metadata cache – ● CMIS gateway allows session and metadata caching Session redirector (reverse proxy) – Redirects session to the home node of the user – If the home node is down, one of the backup nodes will be used
  • 9. High availibility Load Balancer Load Balancer GDS (Session) Director GDS (Session) Director GDS2 API (JSON) GDS2 API (JSON) GDS core GDS core Storage Metadata Replication Data Metadata Storage Metadata
  • 10. Scalability (1) Load Balancer Load Balancer GDS (Session) Director GDS (Session) Director GDS2 API (JSON) GDS2 API (JSON) GDS core GDS core Metadata Data Master/Master Replication Metadata Objectstore / Cluster filesystem Data Metadata
  • 11. Scalability (2) Load Balancer Load Balancer GDS (Session) Director GDS (Session) Director CMIS Cache CMIS Cache CMIS Cache CMIS Cache GDS2 API (JSON) GDS2 API (JSON) GDS2 API (JSON) GDS core GDS core GDS core MD Data Metadata Replication MD Data Metadata Replication Objectstore / Cluster filesystem MD Data
  • 12. Multiple Sites - Roaming (1) ● ● ● ● ● ● ● Every user has a home node which is stored in the account data Redundancy of file objects is provided by objects store at each site Users, groups and ACLs are synchronized between all sites File objects are not synchronized between sites Synchronization takes place asynchronously Load balancer directs client request to session director Session director redirects request based on user account to – – Node which hosts shared data room [shared] – ● Home node of the user [my] Any node [global] Session director analyzes the request and forwards to – CMIS caching layer – JSON API layer
  • 13. Multiple Sites - Roaming (2) CMIS JSON LB LB LB LB GDS Director GDS Director GDS Director GDS Director CMIS Cache CMIS Cache CMIS Cache CMIS Cache GDS2 API GDS2 API GDS2 API GDS2 API GDS core GDS core GDS core GDS core MD Data Data Site A MD MD Data Data Site B MD
  • 14. Identity Management (1) ● ● ● ● Separation between user provisioning and authentication Multiple instances of gateways are possible Multiple directories can be connected in parallel Provisioning gateway – LDAP/AD/SQL crawler – Users that match a regular expression are created in the GDS – Users that got deleted in the directory get deactivated in the GDS – SCIM/SAML module [planed]
  • 15. Identity Management (2) ● Authentication gateway – – Multilevel authentication – Google authenticator [planed] – RADIUS module [planned] – ● LDAP/AD/SQL module MTAN/OTP module [planed] Single Sign-On [planned] – Kerberos module – OAUTH2 module
  • 16. Identity Management (3) WebGUI Admin GUI Admin GW GDS2 API (JSON) Provisioning Gateway LDAP/AD SQL SAML GDS core Storage Backend Metadata Authentication Gateway LDAP/AD SAML RADIUS
  • 17. Multi Tenancy ● Dedicated Hardware – – ● Highest level of separation and security No performance impact of virtualization layer Full virtualization (KVM, HyperV, Vmware, XEN) – – Similar static memory pages can be shared between instances – ● Highest level of separation and security in virtualized environment GDS version can be different for each tenant Linux Containers (LXC) – – ● Lightweight virtualization Memory and program files on disk can be shared between instances Single instance – Same GDS version for all tenants – Everything gets shared – Software bugs or operational problems affect all tenants
  • 18. Distributed Data Space GDS CIFS FW Site B GDS CIFS JSON LAN Site A CIFS JSON HT T S TP HT CIFS FW LAN FW LAN GDS CIFS PS Internet HT T LAN CIFS FW GDS CIFS JSON PS Site C HT Site D PS T CIFS JSON
  • 19. CMIS Site B1 Site B2 WebDAV GDS HT T Site B CIFS GDS CMIS GDS WebDAV PS OS OS Site C CIFS CMIS WebDAV GDS HT T PS CMIS Cache HT S TP SD WebDAV CMIS GDS GDS CMIS CIFS HT WebDAV PS T HTTPS CIFS GDS SD CIFS CMIS Cache Site A CMIS Cache Corporate CDN OS GDS
  • 20. Cloud attached Data Space Site A GDS CIFS LAN CIFS JSON GDS FW HT LB PS T GDS Internet HT T LAN CIFS FW GDS CIFS JSON FW GDS PS Site B LB GDS
  • 21. YOUR DATA. YOUR CONTROL. WWW: HTTP://WWW.GRAUDATA.COM/DATASPACE E-MAIL: THOMAS.UHL@GRAUDATA.COM CEL: +49 151 54354373 TWITTER: @graudataspace