IPFS: A New Distributed World

IPFS: a whole new world
Brought to you by Tyr Chen
1

Collaboration & O ine support 5

And something horrible…WTF on resiliency? 6

More horrible…WTF data security? 7

And the ultimate tragedy…in the history of mankind 8

What rst?
• web first
• mobile first
• data first
• AI first
• …
• …
• distributed, offline first?
9

Entering InterPlanetary File System world!
10

The big things before IPFS
• DHT: Ditributed Hash Table
• Kademlia DHT: query is . Used widely by Gnutella and BitTorrent
• Coral DSHT: make the storage and bandwidth usage more efficient than Kademlia
• s/kademlia DHT: add PoW to prevent attack (PoW on node id gen, ).
• BitTorrent: incentified by bit-for-tat and prioritized with rare block
• Git: Merkle DAG
O(log2N )
11

What is DHT?
A distributed hash table (DHT) is a class of a decentralized distributed system
that provides a lookup service similar to a hash table: (key, value) pairs are
stored in a DHT, and any participating node can efficiently retrieve the value
associated with a given key. Keys are unique identifiers which map to particular
values, which in turn can be anything from addresses, to documents, to
arbitrary data.[1] Responsibility for maintaining the mapping from keys to
values is distributed among the nodes, in such a way that a change in the set of
participants causes a minimal amount of disruption.
“
13

IPFS & BitTorrent
• Similarity:
• exchange of data (blocks) in IPFS is inspired by BitTorrent
• tit-for-tat strategy (if you don’t share, you won’t get)
• get rare pieces first
• Difference:
• separate swarm for each file (BitTorrent), one swarm for all (BitSwap in IPFS)
17

IPFS & Git (copied from white paper)
1. Immutable objects represent Files (blob), Directories (tree), and Changes (commit).
2. Objects are content-addressed, by the cryptographic hash of their contents.
3. Links to other objects are embedded, forming a Merkle DAG. This provides many
useful integrity and workflow properties.
4. Most versioning metadata (branches, tags, etc.) are simply pointer references, and
thus inexpensive to create and update.
5. Version changes only update references or add objects.
6. Distributing version changes to other users is simply transferring objects and
updating remote references.
18

What are the use cases for Merkle DAG? 20

IPFS Core Parts
• Identities: node identity generation & verification
• Network: p2p
• Routing: DHT
• Exchange: BitSwap
• Objects: Merkle DAG
• Files: versioned file system like Git
• Naming: self-certifying mutable name system
22

Exchange: BitSwap
• peers exchange which blocks they have (have_list) and which blocks they are looking
for (want_list) upon connecting
• to decide if a node will actually share data, it will apply its BitSwap Strategy
• based on previous data exchanges between these two peers
• peers keep track of the amount of data they share (builds credit) and the amount of
data they receive (builds debt)
• kept track of in the BitSwap Ledger
• if a peer has credit (shared more than received)
• our node will send the requested block
• if a peer has debt, our node will share or not share
• depending on a deterministic function where the chance of sharing becomes smaller when the
debt is bigger
• a data exchange always starts with the exchange of the ledger, if it is not identical our
node disconnects
23

BitSwap Ledger
type Ledger struct {
owner NodeId
partner NodeId
bytes_sent int
bytes_recv int
timestamp Timestamp
}
24

BitSwap Spec
// Additional state kept
type BitSwap struct {
ledgers map[NodeId]Ledger
// Ledgers known to this node, inc inactive
active map[NodeId]Peer
// currently open connections to other nodes
need_list []Multihash
// checksums of blocks this node needs
have_list []Multihash
// checksums of blocks this node has
}
type Peer struct {
nodeid NodeId
ledger Ledger
// Ledger between the node and this peer
last_seen Timestamp
// timestamp of last received message
want_list []Multihash
// checksums of all blocks wanted by peer
// includes blocks wanted by peer’s peers
}
// Protocol interface:
interface Peer {
open (nodeid :NodeId, ledger :Ledger);
send_want_list (want_list :WantList);
send_block (block :Block) -> (complete :Bool);
25

Files: unixfs
syntax = "proto2";
package unixfs.pb;
message Data {
enum DataType {
Raw = 0;
Directory = 1;
File = 2;
Metadata = 3;
Symlink = 4;
HAMTShard = 5;
}
required DataType Type = 1;
optional bytes Data = 2;
optional uint64 filesize = 3;
repeated uint64 blocksizes = 4;
optional uint64 hashType = 5;
optional uint64 fanout = 6;
}
message Metadata {
optional string MimeType = 1;
}
26

Naming: add mutability
• The root address of a node is /ipns/
• The content it points to can be changed by publishing an IPFS object to this address
• By publishing, the owner of the node (the person who knows the secret key that was
generated with ipfs init) cryptographically signs this “pointer”.
• This enables other users to verify the authenticity of the object published by the
owner.
• Just like IPFS paths, IPNS paths also start with a hash, followed by a Unix-like path.
• IPNS records are announced and resolved via the DHT.
27

IPFS stack
• Moving the data easily and efficiently: libp2p
• Defining the data: IPLD, IPNS
• Using the data: IPFS app
29

Concepts
• CID: content identifier. Based on the content’s cryptographic hash.
• DNS link: use DNS TXT records to map a domain name (e.g. ipfs.io) to an IPFS address.
• IPNS: Inter-Planetary Name System is a system for creating and updating mutable
links to IPFS content. IPFS address changes everytime the content changes. A name in
IPNS is the hash of a public key.
• MFS: Mutible File System allows to treat files like a normal file system. It takes care of
all the work of updating links and hashes upon change of file.
• Pinning: IPFS nodes treads data like a cache so if you want something to be retained
long-term you can pin it.
• UinxFS: UnixFS is a data format to respresent files and all their links and metadata,
loosely based on how files work in Unix.
30

Start IPFS
$ ipfs init
initializing IPFS node at /Users/tchen/.ipfs
generating 2048-bit RSA keypair...done
peer identity: QmYu24HbZC3FTMxfKPJFNFM16tNdbMSJYtvfT2Kixe9Qo6
to get started, enter:
ipfs cat /ipfs/QmS4ustL54uo8FzR9455qaxZwuMiUhyvMcX9Ba8nUH4uVv/readme
$ brew services start ipfs
==> Successfully started `ipfs` (label: homebrew.mxcl.ipfs)
34

Add a le
$ echo 'hello world' | ipfs add
added QmT78zSuBmuS4z925WZfrqQ1qHaJ56DQaTfyMUF7F8ff5o QmT78zSuBmuS4z925WZfrqQ1qHaJ56DQaTfyMUF7F8ff5o
12 B / ? [--------------------------------------------------------------------------------------------------------------------------=------
$ ipfs cat QmT78zSuBmuS4z925WZfrqQ1qHaJ56DQaTfyMUF7F8ff5o
hello world
35

IPFS peers
$ ipfs swarm peers
/ip4/100.6.104.240/tcp/4001/ipfs/Qmb8unXAJNurpDkXKbJ33pTV8Yukdm6AXj2ReWVLFvFAvt
/ip4/103.26.76.33/tcp/4001/ipfs/QmTmFmwHfBQVb7xFRzxp2BsXPF1ouK8yY2tc6Wjb79BraZ
/ip4/103.60.164.126/tcp/4001/ipfs/QmaLvFXd1b5GcdbdPTEspnSkDMDojfCsSfwYmRotSovjas
/ip4/103.94.185.80/tcp/4001/ipfs/QmP3CNi3z2c3JivxQMrmegZWx8n7r2LQWAD5yGiUT6nPX1
/ip4/104.131.131.82/tcp/4001/ipfs/QmaCpDMGvV2BGHeYERUEnRQAwe3N8SzbUtfsmvsqQLuvuJ
/ip4/104.190.120.58/tcp/58352/ipfs/QmQ3kYdbLMLEyF2sMgFb4eAygRmiPeNnFrE6xCQ2n4ifVe
/ip4/104.196.41.154/tcp/4001/ipfs/QmXJVVR32yC1ehjQhp5ZBiq7EymxZF9ZBB1F6FodW2JytF
/ip4/104.214.73.60/tcp/4001/ipfs/QmNe5CK9EWEZYttijNJ4gaGuHLsjNbjHbj9MdM6XAJR9es
36

IPFS id
$ ipfs id
{
"ID": "QmYu24HbZC3FTMxfKPJFNFM16tNdbMSJYtvfT2Kixe9Qo6",
"PublicKey": "CAASpgIwggEiMA0GCSqGSIb3DQEBAQUAA4IBDwAwggEKAoIBAQC0VFzN0Dv90LqJXTvRoS0G1nhHi6S0mONQ1jftl9QUUv8hTucf1XpWu+VfkSKcoWwr4MZZi5
"Addresses": [
"/ip4/127.0.0.1/tcp/4001/ipfs/QmYu24HbZC3FTMxfKPJFNFM16tNdbMSJYtvfT2Kixe9Qo6",
"/ip6/::1/tcp/4001/ipfs/QmYu24HbZC3FTMxfKPJFNFM16tNdbMSJYtvfT2Kixe9Qo6",
"/ip4/192.168.0.16/tcp/27891/ipfs/QmYu24HbZC3FTMxfKPJFNFM16tNdbMSJYtvfT2Kixe9Qo6"
],
"AgentVersion": "go-ipfs/0.4.18/aefc746",
"ProtocolVersion": "ipfs/0.1.0"
}
37

Providers
$ ipfs dht findprovs QmT78zSuBmuS4z925WZfrqQ1qHaJ56DQaTfyMUF7F8ff5o
QmYu24HbZC3FTMxfKPJFNFM16tNdbMSJYtvfT2Kixe9Qo6
Qmbut9Ywz9YEDrz8ySBSgWyJk41Uvm2QJPhwDJzJyGFsD6
QmceZ5vcFtpcaPeAa66xnL79xo6fRrAmx6Gp6UZAFdkrDA
QmfFj5Am7Jw2DZm7s1aVdR5Ti5baT6a8B1ifcxCbya1vw5
QmVGkGSV25o3AMjcjjnPVb1PqJzrA1PvvhMiV57cMEuExb
QmaAbZQVFavUub1cvsXP8tfbk7p5i2cRVvimWv5La2E9U8
QmPDcnTLF5HftAhteRGVhAHnxwNfLzm541W7LL1rpaChy7
QmPDpBw1xsGvnNmkt9z9NsgNpezKzFNwVPcLcPVh2Weuwv
QmPMT3ZUATZWqZEiHd4Kjfe6H9UTTYjRJN4kqLvgWhrJpU
QmPR6Ggp5BTaKtvGJ9rwn3e86dMtzPRGRNmdLG8Yp7bhkP
QmNRZdPPtYycST9dPUPLQEqMoFq43fRx5pkKYBjYKuw1Fa
QmNSYDHzei4vn91k1sdF7oEcBHQWVTbibEBtXUpYSdkapX
QmNTXQyssJ5vMynxdW1jo5EcQq9XARS9nrGJykcYGKYbNH
QmNdtfrpDhP4yaRnPTUdqynnewBF4p5tUoq6Ngw1XFdmdj
QmNfFvrEoBThgW7dcPTkWjqAsicH1eq2GHsHrmCx8bMrxv
QmNjm16wkUUsUoRmr3b8QoQAPZYBwfBHcygkjGbSauTSWu
QmNkZvJVtf1AfkudwvkSwfV3Ru5hHFvitUugzvYBxtuPT7
QmNn2QFMrNcHstJVWnaT8XK31xe1e6HLvz7qg29yE2BGkS
QmNnnkCTY1ZbtusjMAtJ9Rn5arVtaDBjwRkFTf2WdPGmRq
QmNoBE4qVq7vuNtAMTqLhMYryL2YiWcrVB7eEbw4nYkjpW
38

Wait a moment, why everything starts with Qm ?
• sha2-256
• base58
• multihash
39

IPFS use cases
1. As a mounted global filesystem, under /ipfs and /ipns.
2. As a mounted personal sync folder that automatically versions, publishes, and backs
up any writes.
3. As an encrypted file or data sharing system.
4. As a versioned package manager for all software.
5. As the root filesystem of a Virtual Machine.
6. As the boot filesystem of a VM (under a hypervisor).
7. As a database: applications can write directly to the Merkle DAG data model and get
all the versioning, caching, and distribution IPFS provides.
8. As a linked (and encrypted) communications platform.
9. As an integrity checked CDN for large files (without SSL).
10. As an encrypted CDN.
11. On webpages, as a web CDN.
40

Problems in IPFS
• Data is not automatically replicated by default
• you may lose your data if nobody is using or pinning it, see this discussion
• at the moment it serves as a filesystem cache
• ipfs cluster allows files to be pinned across a cluster
• IPFS cluster is not efficient on replication
• at the moment, either accept it
• or build your own with eraser code like Reed-Solomon algo
41

IPFS: A New Distributed World

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie IPFS: A New Distributed World

Ähnlich wie IPFS: A New Distributed World (20)

Mehr von ArcBlock

Mehr von ArcBlock (18)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

IPFS: A New Distributed World