Suche senden
Hochladen
bup backup system (2011-04)
âą
Als ODP, PDF herunterladen
âą
0 gefÀllt mir
âą
1,506 views
A
apenwarr
Folgen
Avery's presentation about bup at LinuxFest Northwest 2011 in Bellingham, WA.
Weniger lesen
Mehr lesen
Technologie
Melden
Teilen
Melden
Teilen
1 von 22
Jetzt herunterladen
Empfohlen
SFScon19 - Davide Montesin - Why you should consider using btrfs
SFScon19 - Davide Montesin - Why you should consider using btrfs
South Tyrol Free Software Conference
Â
GCE NFS HA
GCE NFS HA
Simon Su
Â
Bulk Exporting from Cassandra - Carlo Cabanilla
Bulk Exporting from Cassandra - Carlo Cabanilla
Datadog
Â
Fragging Rights: A Tale of a Pathological Storage Workload
Fragging Rights: A Tale of a Pathological Storage Workload
Eric Sproul
Â
Case Study - DR on Demand
Case Study - DR on Demand
CTRLS
Â
Some analysis of BlueStore and RocksDB
Some analysis of BlueStore and RocksDB
Xiao Yan Li
Â
ulimit
ulimit
hiyelata
Â
Your data isn't that big @ Big Things Meetup 2016-05-16
Your data isn't that big @ Big Things Meetup 2016-05-16
Boaz Menuhin
Â
Empfohlen
SFScon19 - Davide Montesin - Why you should consider using btrfs
SFScon19 - Davide Montesin - Why you should consider using btrfs
South Tyrol Free Software Conference
Â
GCE NFS HA
GCE NFS HA
Simon Su
Â
Bulk Exporting from Cassandra - Carlo Cabanilla
Bulk Exporting from Cassandra - Carlo Cabanilla
Datadog
Â
Fragging Rights: A Tale of a Pathological Storage Workload
Fragging Rights: A Tale of a Pathological Storage Workload
Eric Sproul
Â
Case Study - DR on Demand
Case Study - DR on Demand
CTRLS
Â
Some analysis of BlueStore and RocksDB
Some analysis of BlueStore and RocksDB
Xiao Yan Li
Â
ulimit
ulimit
hiyelata
Â
Your data isn't that big @ Big Things Meetup 2016-05-16
Your data isn't that big @ Big Things Meetup 2016-05-16
Boaz Menuhin
Â
Performance
Performance
Christophe Marchal
Â
Syslog-ng Performance
Syslog-ng Performance
Chakrit Phain
Â
(BDT307) Running NoSQL on Amazon EC2 | AWS re:Invent 2014
(BDT307) Running NoSQL on Amazon EC2 | AWS re:Invent 2014
Amazon Web Services
Â
Bsdtw17: brooks davis: is it time to replace mmap?
Bsdtw17: brooks davis: is it time to replace mmap?
Scott Tsai
Â
FOSDEM 2019: M3, Prometheus and Graphite with metrics and monitoring in an in...
FOSDEM 2019: M3, Prometheus and Graphite with metrics and monitoring in an in...
Rob Skillington
Â
ĐŃĐ”ĐČĐŸĐ»ĐŸĐŽ ĐĐŸĐ»ŃĐșĐŸĐČ (DevOps Team Lead ĐČ Grammarly)
ĐŃĐ”ĐČĐŸĐ»ĐŸĐŽ ĐĐŸĐ»ŃĐșĐŸĐČ (DevOps Team Lead ĐČ Grammarly)
Provectus
Â
Đ Đ°ŃĐżŃĐ”ĐŽĐ”Đ»Đ”ĐœĐœŃĐ” ŃĐžŃŃĐ”ĐŒŃ Ń ŃĐ°ĐœĐ”ĐœĐžŃ ĐŽĐ°ĐœĐœŃŃ , ĐŸŃĐŸĐ±Đ”ĐœĐœĐŸŃŃĐž ŃДалОзаŃОО DHT ĐČ ĐżŃĐŸĐ”ĐșŃĐ” ...
Đ Đ°ŃĐżŃĐ”ĐŽĐ”Đ»Đ”ĐœĐœŃĐ” ŃĐžŃŃĐ”ĐŒŃ Ń ŃĐ°ĐœĐ”ĐœĐžŃ ĐŽĐ°ĐœĐœŃŃ , ĐŸŃĐŸĐ±Đ”ĐœĐœĐŸŃŃĐž ŃДалОзаŃОО DHT ĐČ ĐżŃĐŸĐ”ĐșŃĐ” ...
yaevents
Â
Parallel computing with GPars
Parallel computing with GPars
Pablo Molnar
Â
Is It Faster to Go with Redpanda Transactions than Without Them?!
Is It Faster to Go with Redpanda Transactions than Without Them?!
ScyllaDB
Â
OS
OS
MathavanKrishnan2
Â
Scaling metrics
Scaling metrics
Vladimir Varfolomeev
Â
[COSCUP 2018] uTensor C++ Code Generator
[COSCUP 2018] uTensor C++ Code Generator
Yin-Chen Liao
Â
Customizing Your Shell With Dotfiles
Customizing Your Shell With Dotfiles
Clayton Parker
Â
High Performance OSM Data Manipulation With Osmium - State of the Map 2013
High Performance OSM Data Manipulation With Osmium - State of the Map 2013
OSMFstateofthemap
Â
"Metrics: Where and How", Vsevolod Polyakov
"Metrics: Where and How", Vsevolod Polyakov
Yulia Shcherbachova
Â
Example the aam
Example the aam
ThpHnhHNi
Â
Go on!
Go on!
Vadim Petrov
Â
FOSDEM 2020: Querying over millions and billions of metrics with M3DB's index
FOSDEM 2020: Querying over millions and billions of metrics with M3DB's index
Rob Skillington
Â
Autentia OS - 20180210 - Docker y las peliÌculas de chinos
Autentia OS - 20180210 - Docker y las peliÌculas de chinos
Alejandro PĂ©rez GarcĂa
Â
Move from C to Go
Move from C to Go
Yu-Shuan Hsieh
Â
HPDC'23 Rapidgzip
HPDC'23 Rapidgzip
MaximilianKnespel1
Â
Lrz kurs: big data analysis
Lrz kurs: big data analysis
Ferdinand Jamitzky
Â
Weitere Àhnliche Inhalte
Was ist angesagt?
Performance
Performance
Christophe Marchal
Â
Syslog-ng Performance
Syslog-ng Performance
Chakrit Phain
Â
(BDT307) Running NoSQL on Amazon EC2 | AWS re:Invent 2014
(BDT307) Running NoSQL on Amazon EC2 | AWS re:Invent 2014
Amazon Web Services
Â
Bsdtw17: brooks davis: is it time to replace mmap?
Bsdtw17: brooks davis: is it time to replace mmap?
Scott Tsai
Â
FOSDEM 2019: M3, Prometheus and Graphite with metrics and monitoring in an in...
FOSDEM 2019: M3, Prometheus and Graphite with metrics and monitoring in an in...
Rob Skillington
Â
ĐŃĐ”ĐČĐŸĐ»ĐŸĐŽ ĐĐŸĐ»ŃĐșĐŸĐČ (DevOps Team Lead ĐČ Grammarly)
ĐŃĐ”ĐČĐŸĐ»ĐŸĐŽ ĐĐŸĐ»ŃĐșĐŸĐČ (DevOps Team Lead ĐČ Grammarly)
Provectus
Â
Đ Đ°ŃĐżŃĐ”ĐŽĐ”Đ»Đ”ĐœĐœŃĐ” ŃĐžŃŃĐ”ĐŒŃ Ń ŃĐ°ĐœĐ”ĐœĐžŃ ĐŽĐ°ĐœĐœŃŃ , ĐŸŃĐŸĐ±Đ”ĐœĐœĐŸŃŃĐž ŃДалОзаŃОО DHT ĐČ ĐżŃĐŸĐ”ĐșŃĐ” ...
Đ Đ°ŃĐżŃĐ”ĐŽĐ”Đ»Đ”ĐœĐœŃĐ” ŃĐžŃŃĐ”ĐŒŃ Ń ŃĐ°ĐœĐ”ĐœĐžŃ ĐŽĐ°ĐœĐœŃŃ , ĐŸŃĐŸĐ±Đ”ĐœĐœĐŸŃŃĐž ŃДалОзаŃОО DHT ĐČ ĐżŃĐŸĐ”ĐșŃĐ” ...
yaevents
Â
Parallel computing with GPars
Parallel computing with GPars
Pablo Molnar
Â
Is It Faster to Go with Redpanda Transactions than Without Them?!
Is It Faster to Go with Redpanda Transactions than Without Them?!
ScyllaDB
Â
OS
OS
MathavanKrishnan2
Â
Scaling metrics
Scaling metrics
Vladimir Varfolomeev
Â
[COSCUP 2018] uTensor C++ Code Generator
[COSCUP 2018] uTensor C++ Code Generator
Yin-Chen Liao
Â
Customizing Your Shell With Dotfiles
Customizing Your Shell With Dotfiles
Clayton Parker
Â
High Performance OSM Data Manipulation With Osmium - State of the Map 2013
High Performance OSM Data Manipulation With Osmium - State of the Map 2013
OSMFstateofthemap
Â
"Metrics: Where and How", Vsevolod Polyakov
"Metrics: Where and How", Vsevolod Polyakov
Yulia Shcherbachova
Â
Example the aam
Example the aam
ThpHnhHNi
Â
Go on!
Go on!
Vadim Petrov
Â
FOSDEM 2020: Querying over millions and billions of metrics with M3DB's index
FOSDEM 2020: Querying over millions and billions of metrics with M3DB's index
Rob Skillington
Â
Autentia OS - 20180210 - Docker y las peliÌculas de chinos
Autentia OS - 20180210 - Docker y las peliÌculas de chinos
Alejandro PĂ©rez GarcĂa
Â
Move from C to Go
Move from C to Go
Yu-Shuan Hsieh
Â
Was ist angesagt?
(20)
Performance
Performance
Â
Syslog-ng Performance
Syslog-ng Performance
Â
(BDT307) Running NoSQL on Amazon EC2 | AWS re:Invent 2014
(BDT307) Running NoSQL on Amazon EC2 | AWS re:Invent 2014
Â
Bsdtw17: brooks davis: is it time to replace mmap?
Bsdtw17: brooks davis: is it time to replace mmap?
Â
FOSDEM 2019: M3, Prometheus and Graphite with metrics and monitoring in an in...
FOSDEM 2019: M3, Prometheus and Graphite with metrics and monitoring in an in...
Â
ĐŃĐ”ĐČĐŸĐ»ĐŸĐŽ ĐĐŸĐ»ŃĐșĐŸĐČ (DevOps Team Lead ĐČ Grammarly)
ĐŃĐ”ĐČĐŸĐ»ĐŸĐŽ ĐĐŸĐ»ŃĐșĐŸĐČ (DevOps Team Lead ĐČ Grammarly)
Â
Đ Đ°ŃĐżŃĐ”ĐŽĐ”Đ»Đ”ĐœĐœŃĐ” ŃĐžŃŃĐ”ĐŒŃ Ń ŃĐ°ĐœĐ”ĐœĐžŃ ĐŽĐ°ĐœĐœŃŃ , ĐŸŃĐŸĐ±Đ”ĐœĐœĐŸŃŃĐž ŃДалОзаŃОО DHT ĐČ ĐżŃĐŸĐ”ĐșŃĐ” ...
Đ Đ°ŃĐżŃĐ”ĐŽĐ”Đ»Đ”ĐœĐœŃĐ” ŃĐžŃŃĐ”ĐŒŃ Ń ŃĐ°ĐœĐ”ĐœĐžŃ ĐŽĐ°ĐœĐœŃŃ , ĐŸŃĐŸĐ±Đ”ĐœĐœĐŸŃŃĐž ŃДалОзаŃОО DHT ĐČ ĐżŃĐŸĐ”ĐșŃĐ” ...
Â
Parallel computing with GPars
Parallel computing with GPars
Â
Is It Faster to Go with Redpanda Transactions than Without Them?!
Is It Faster to Go with Redpanda Transactions than Without Them?!
Â
OS
OS
Â
Scaling metrics
Scaling metrics
Â
[COSCUP 2018] uTensor C++ Code Generator
[COSCUP 2018] uTensor C++ Code Generator
Â
Customizing Your Shell With Dotfiles
Customizing Your Shell With Dotfiles
Â
High Performance OSM Data Manipulation With Osmium - State of the Map 2013
High Performance OSM Data Manipulation With Osmium - State of the Map 2013
Â
"Metrics: Where and How", Vsevolod Polyakov
"Metrics: Where and How", Vsevolod Polyakov
Â
Example the aam
Example the aam
Â
Go on!
Go on!
Â
FOSDEM 2020: Querying over millions and billions of metrics with M3DB's index
FOSDEM 2020: Querying over millions and billions of metrics with M3DB's index
Â
Autentia OS - 20180210 - Docker y las peliÌculas de chinos
Autentia OS - 20180210 - Docker y las peliÌculas de chinos
Â
Move from C to Go
Move from C to Go
Â
Ăhnlich wie bup backup system (2011-04)
HPDC'23 Rapidgzip
HPDC'23 Rapidgzip
MaximilianKnespel1
Â
Lrz kurs: big data analysis
Lrz kurs: big data analysis
Ferdinand Jamitzky
Â
Designs, Lessons and Advice from Building Large Distributed Systems
Designs, Lessons and Advice from Building Large Distributed Systems
Daehyeok Kim
Â
Hadoop-Introduction
Hadoop-Introduction
Sandeep Deshmukh
Â
Introduction to Hadoop
Introduction to Hadoop
Apache Apex
Â
Advanced off heap ipc
Advanced off heap ipc
Peter Lawrey
Â
Docker Online Meetup #27: Raspberry Pi DockerCon Challenge
Docker Online Meetup #27: Raspberry Pi DockerCon Challenge
Docker, Inc.
Â
Hadoop trainting in hyderabad@kelly technologies
Hadoop trainting in hyderabad@kelly technologies
Kelly Technologies
Â
Introduction to Big Data
Introduction to Big Data
Albert Bifet
Â
Google File System
Google File System
DreamJobs1
Â
Wuala, P2P Online Storage
Wuala, P2P Online Storage
adunne
Â
PyData Paris 2015 - Closing keynote Francesc Alted
PyData Paris 2015 - Closing keynote Francesc Alted
PĂŽle Systematic Paris-Region
Â
âShow Me the Garbage!â, Garbage Collection a Friend or a Foe
âShow Me the Garbage!â, Garbage Collection a Friend or a Foe
Haim Yadid
Â
SEMLA_logging_infra
SEMLA_logging_infra
swy351
Â
Hadoop trainting-in-hyderabad@kelly technologies
Hadoop trainting-in-hyderabad@kelly technologies
Kelly Technologies
Â
Scaling an ELK stack at bol.com
Scaling an ELK stack at bol.com
Renzo TomĂ
Â
Hadoop - Simple. Scalable.
Hadoop - Simple. Scalable.
elliando dias
Â
Vips 4mar09e
Vips 4mar09e
guest0f52728
Â
The Google file system
The Google file system
Sergio Shevchenko
Â
Python VS GO
Python VS GO
Ofir Nir
Â
Ăhnlich wie bup backup system (2011-04)
(20)
HPDC'23 Rapidgzip
HPDC'23 Rapidgzip
Â
Lrz kurs: big data analysis
Lrz kurs: big data analysis
Â
Designs, Lessons and Advice from Building Large Distributed Systems
Designs, Lessons and Advice from Building Large Distributed Systems
Â
Hadoop-Introduction
Hadoop-Introduction
Â
Introduction to Hadoop
Introduction to Hadoop
Â
Advanced off heap ipc
Advanced off heap ipc
Â
Docker Online Meetup #27: Raspberry Pi DockerCon Challenge
Docker Online Meetup #27: Raspberry Pi DockerCon Challenge
Â
Hadoop trainting in hyderabad@kelly technologies
Hadoop trainting in hyderabad@kelly technologies
Â
Introduction to Big Data
Introduction to Big Data
Â
Google File System
Google File System
Â
Wuala, P2P Online Storage
Wuala, P2P Online Storage
Â
PyData Paris 2015 - Closing keynote Francesc Alted
PyData Paris 2015 - Closing keynote Francesc Alted
Â
âShow Me the Garbage!â, Garbage Collection a Friend or a Foe
âShow Me the Garbage!â, Garbage Collection a Friend or a Foe
Â
SEMLA_logging_infra
SEMLA_logging_infra
Â
Hadoop trainting-in-hyderabad@kelly technologies
Hadoop trainting-in-hyderabad@kelly technologies
Â
Scaling an ELK stack at bol.com
Scaling an ELK stack at bol.com
Â
Hadoop - Simple. Scalable.
Hadoop - Simple. Scalable.
Â
Vips 4mar09e
Vips 4mar09e
Â
The Google file system
The Google file system
Â
Python VS GO
Python VS GO
Â
KĂŒrzlich hochgeladen
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(âïž+971_581248768%)**%*]'#abortion pills for sale in dubai@
Â
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
Maria Levchenko
Â
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
Product Anonymous
Â
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Michael W. Hawkins
Â
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
The Digital Insurer
Â
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
V3cube
Â
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
DianaGray10
Â
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
wesley chun
Â
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
Khem
Â
Scaling API-first â The story of a global engineering organization
Scaling API-first â The story of a global engineering organization
Radu Cotescu
Â
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
apidays
Â
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
Boston Institute of Analytics
Â
đŹ The future of MySQL is Postgres đ
đŹ The future of MySQL is Postgres đ
RTylerCroy
Â
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
The Digital Insurer
Â
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
Â
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
Safe Software
Â
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
lior mazor
Â
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
Antenna Manufacturer Coco
Â
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc
Â
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
Pixlogix Infotech
Â
KĂŒrzlich hochgeladen
(20)
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
Â
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
Â
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
Â
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Â
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
Â
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
Â
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Â
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
Â
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
Â
Scaling API-first â The story of a global engineering organization
Scaling API-first â The story of a global engineering organization
Â
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Â
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
Â
đŹ The future of MySQL is Postgres đ
đŹ The future of MySQL is Postgres đ
Â
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
Â
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
Â
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
Â
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
Â
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
Â
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
Â
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
Â
bup backup system (2011-04)
1.
bup: the git-based
backup system Avery Pennarun 2011 04 30
2.
3.
Including huge VM
disk images (files >100GB)
4.
Lots of separate
files (500k or more)
5.
Calculate/store incrementals efficiently
6.
Create backups in
O(n), where n = number of changed bytes
7.
Incremental backup direct
to a remote computer (no local copy)
8.
...and don't forget
metadata 5
9.
10.
11.
Dedupe between
different client machines
12.
O(log N) seek
times to any part of any file
13.
You can mount
your backup history as a filesystem or browse it as a web page 8
14.
The Design 8
15.
16.
Easily deduplicates between
identical files and trees
17.
Debug my code
using git commands
18.
Someone already thought
about the repo format (packs, idxes)
19.
Three-level âwork treeâ
vs âindexâ vs âcommitâ 11
20.
21.
git bigfiles fork
âsolvesâ the problem by just never deltifying large objects: lame
22.
zlib window size
is very small: lousy compression on VM images -- 13
23.
24.
25.
Duplicated data >64k
apart can't be compressed
26.
27.
28.
29.
He wrote a
(readable) Ph.D. thesis about it
30.
bup uses a
variant of the rsync algorithm to -- 19
31.
32.
33.
But any
n-byte subsequence might be the match
34.
Searching naively is
about O(n^2) ... ouch.
35.
So we use
a rolling checksum instead 21
36.
37.
38.
Requires a special
ârollableâ checksum (adler32) 23
39.
40.
Changing a byte
early in a file changes the compression dictionary, so the rest of the file is different
41.
--rsyncable resets the
compression dictionary whenever low bits of adler32 == 0
42.
Fraction of a
percent overhead on file size
43.
But now your
gzip files are rsyncable! 26
44.
45.
46.
47.
Inserting/deleting bytes changes
at most two chunks! 28
48.
49.
50.
If low 21
bits are 1, end a superduperchunk
51.
52.
Inserting/deleting bytes changes
at most 2*log(n) chunks! 30
53.
54.
Compresses most VM
images more (and faster) than gzip
55.
Works well on
binary and text files
56.
Don't need to
teach it about file formats
57.
Diff huge files
in about O(log n) time
58.
Seek to any
offset in a file in O(log n) time 32
59.
60.
x 20 bytes
per SHA1: 2.4GB
61.
Divided into 2GB
packs: 500 .idx files of 5MB each
62.
63.
64.
Larger initial lookup
table to immediately narrow search to the last 7 bits
65.
log(2.4GB) = 31
bits
66.
24-bit lookup table
* 4 bytes per entry: 64 MB
67.
Adding a new
chunk always only touches two pages 35
68.
69.
Storing 20 bytes
per hash is wasteful; even 48 bits would be unique
70.
71.
Create a big
hash table
72.
Hash each block
several ways
73.
Gives false positives,
never false negatives
74.
Can update incrementally
36
75.
76.
77.
bup ftp (command-line
browsing)
78.
bup web
79.
bup fuse (mount
backups as a filesystem) 36
80.
81.
82.
'bup restore' that
updates the index like 'git checkout' does
83.
Distributed filesystem: bittorrent
with a smarter data structure 39
84.
Questions? 40
Jetzt herunterladen