ISCC – a solution to some challenges
presented by generative AI?
Sebastian Posth, ISCC Foundation
Musical Works Data and Rights Standards Implementation
Seminar
17th November 2023, Arlington
DDEX 2023
© 2023 CC-BY-SA Sebastian Posth
● Co-initiator of the International Standard
Content Code (ISCC)
● Co-founder and member of the board of directors
of ISCC Foundation (NL)
● Convenor of ISO/DIS 24138 on ISCC
● Background in publishing, digital distribution
and data analytics (Bertelsmann, et al.)
● Entrepreneur and consultant on digital
innovation projects in the media industries
● Building Liccium (liccium.com) and
CreatorCredentials.com
SEBASTIAN POSTH
ISCC – INTRODUCTION
● ISCC originated in the German book market – addressing
a number of inefficiencies of the digital supply chain:
○ Manual identifier management, file naming
conventions, missing metadata, issues with updates,
versioning, duplicate content, etc.
● Growing relevance of a decentralised media environment
(web, platforms, user generated content)
● The ISCC is an open system for the decentralised
identification of digital media content of all media
types and formats (text, image, video, and audio)
ISO STAGE CODES
Goal: publication in Q1/2024
[Figure: ISO stage code chart with a "We are here" marker]
ISO/DIS 24138
© 2023 CC-BY-SA Titusz Pan
THE DNA OF YOUR DIGITAL CONTENT
ISCC:KADV5PDFXBL7HGBXFFW64KVNP6UGTUZC2CJTDBKMFYTTZPLQQVX22FI
● Meta-Code (AAAV5PDFXBL7HGBX): metadata similarity
● Content-Code (EAASS3POFKWX7KDJ): content similarity
● Data-Code (GAA5GIWQSMYYKTBO): data similarity
● Instance-Code (IAASOPF5OCCW7LIV): data integrity
Meta-Code, Content-Code and Data-Code are similarity-preserving hashes (SIM hash).
The Instance-Code is an integrity-verifying checksum (crypto hash).
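The difference between an integrity checksum and a similarity-preserving hash can be sketched as follows. This is a generic illustration and not the ISCC algorithm: SHA-256 stands in for the cryptographic hash, and `simhash64` is a hypothetical toy SimHash over whitespace-separated words.

```python
import hashlib


def crypto_digest(data: bytes) -> str:
    # Cryptographic hash: any change to the input yields an unrelated digest.
    return hashlib.sha256(data).hexdigest()


def simhash64(text: str) -> int:
    # Toy 64-bit SimHash (assumption, not the ISCC method): every word votes
    # on each of the 64 bit positions, and the majority decides the bit.
    counts = [0] * 64
    for token in text.lower().split():
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        for bit in range(64):
            counts[bit] += 1 if (h >> bit) & 1 else -1
    return sum(1 << bit for bit in range(64) if counts[bit] > 0)


def hamming(a: int, b: int) -> int:
    # Number of differing bits between two hashes.
    return bin(a ^ b).count("1")


text_a = "The ISCC is an open system for the decentralised identification of digital media content"
text_b = "The ISCC is an open system for the decentralized identification of digital media content"

# The checksums differ completely, so they only answer "is this the same file?".
print(crypto_digest(text_a.encode("utf-8")) == crypto_digest(text_b.encode("utf-8")))  # False

# The similarity hashes differ in a number of bits that can be measured.
print(hamming(simhash64(text_a), simhash64(text_b)))
```

With a checksum, equality is the only available comparison. With a similarity-preserving hash, the bit distance between two hashes serves as a measure of how close the inputs are.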
CONTENT MATCHING
© 2023 CC-BY-SA Sebastian Posth/Titusz Pan
WHAT IS THE ISCC (NOT)
ISCC compared with ACR:
● ISCC: An open identifier standard proposal
  ACR: Services require proprietary software
● ISCC: Short identifier string
  ACR: Not an identifier but a content identification system
● ISCC: Can be generated by anyone with access to content for all media types and formats
  ACR: Often optimised for specific media types
● ISCC: Has near-duplicate content-matching capabilities (lightweight fingerprints)
  ACR: Developed for content matching, can match and compare assets and small chunks in great detail
● ISCC: Can be easily implemented in existing applications, which ensures interoperability across entities
  ACR: Detailed fingerprints not interoperable
WHAT IS THE ISCC (NOT)
● ISWC (Work): T-034.524.680-1
● ISRC (Recording): US-S1Z-20-00001, US-S1Z-20-00002, US-S1Z-20-00003
● ISCC (File): RMMXPR2HGBYNXE5T…, RMM362DSFHZPYS2S…, RMM6DDMSNP5NFDY2…
The ISWC identifies the work, the ISRC the recording event, and the ISCC the media asset.
EXAMPLE FAKE NEWS
ORIGINAL (JPG file)
KEC2LO3CU6XL6ZVXS3G5LUWUZDEJNBWUBDZPXNR542DDKYVPG372D2A
Source: https://www.iptc.org/std-dev/photometadata/examples/google-licensable/example-page1.html
FAKE 😉 (JPG file)
KECX4FAQRQG2VJ3YS3G5JUWUZDMJNE7GIT67RT27H3AAFVV4NDGABUI
https://twitter.com/spsth/status/1718591647087251519
Components and similarity (original declaration, fake declaration, similarity):
● Meta-Code: AAA2LO3CU6XL6ZVX, AAAX4FAQRQG2VJ3Y, 41%
● Content-Code: EEAZNTOV2LKMRSEW, EEAZNTOU2LKMRWEW, 94% (2 bits of difference of the 64 bit hash)
● Data-Code: GAAYNVAI6L53MPPG, GAAZHZSE7X4M6XZ6, 27%
● Instance-Code: IAAYMNLCV43P7IPI, IAA4AAWWXRUMYAGR, Unique
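The "2 bits of difference" between the two Content-Codes can be checked directly from the code strings. The sketch below assumes that each 16-character unit is Base32 text that decodes to a 2-byte header followed by the 64-bit hash; `HEADER_BYTES`, `unit_body` and `bit_difference` are illustrative names, not part of any ISCC library.

```python
import base64

# Assumption: a 16-character unit decodes to 10 bytes,
# a 2-byte header followed by the 64-bit hash body.
HEADER_BYTES = 2


def unit_body(unit: str) -> int:
    # Decode the Base32 text and return the hash body as an integer.
    raw = base64.b32decode(unit)
    return int.from_bytes(raw[HEADER_BYTES:], "big")


def bit_difference(unit_a: str, unit_b: str) -> int:
    # Hamming distance between the two hash bodies.
    return bin(unit_body(unit_a) ^ unit_body(unit_b)).count("1")


# Unit pairs taken from the slide: (original declaration, fake declaration).
units = {
    "Meta-Code": ("AAA2LO3CU6XL6ZVX", "AAAX4FAQRQG2VJ3Y"),
    "Content-Code": ("EEAZNTOV2LKMRSEW", "EEAZNTOU2LKMRWEW"),
    "Data-Code": ("GAAYNVAI6L53MPPG", "GAAZHZSE7X4M6XZ6"),
}

for name, (original, fake) in units.items():
    print(name, bit_difference(original, fake))

content_bits = bit_difference(*units["Content-Code"])  # 2
```

Only the bit counts are computed here. How the slide derives its similarity percentages from them is not stated in the source, so no percentage is reproduced.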
EXAMPLE FAKE NEWS
The clustering and matching of near-duplicate
content is possible by having
access only to the ISCC codes!
KEC37M6L6YX645BORDR3KWVUJSLJ5AU7SVSE46E56YV33YW4HQKU5PY
KEC3XJ7PYYXUM5KORG2ZMGVUNSL6DSI6GT2UZMTBMYOUULIGBBFXHYQ
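A minimal sketch of clustering from codes alone, with no access to the media files. It assumes that a 16-character Content-Code decodes from Base32 to a 2-byte header plus a 64-bit body, and it uses a simple greedy grouping with a hypothetical threshold `max_bits`. The third code is synthetic, built by inverting every body bit of the first one, purely to have a dissimilar item.

```python
import base64

HEADER_BYTES = 2  # assumption: 2-byte header, then the 64-bit hash body


def body_bits(unit: str) -> int:
    raw = base64.b32decode(unit)
    return int.from_bytes(raw[HEADER_BYTES:], "big")


def hamming(unit_a: str, unit_b: str) -> int:
    return bin(body_bits(unit_a) ^ body_bits(unit_b)).count("1")


def cluster(codes, max_bits=8):
    # Greedy grouping: a code joins the first cluster whose first member
    # is within max_bits, otherwise it starts a new cluster.
    clusters = []
    for code in codes:
        for group in clusters:
            if hamming(code, group[0]) <= max_bits:
                group.append(code)
                break
        else:
            clusters.append([code])
    return clusters


original = "EEAZNTOV2LKMRSEW"  # Content-Code of the original image
fake = "EEAZNTOU2LKMRWEW"      # Content-Code of the manipulated image

# Synthetic, clearly unrelated code: same header, every body bit inverted.
raw = base64.b32decode(original)
inverted = raw[:HEADER_BYTES] + bytes(b ^ 0xFF for b in raw[HEADER_BYTES:])
unrelated = base64.b32encode(inverted).decode("ascii")

groups = cluster([original, fake, unrelated])
print(groups)
```

The original and the manipulated image end up in the same group, while the synthetic code forms its own. A production system would likely use an index structure instead of pairwise comparison, which is outside the scope of this sketch.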
SUPPORTED MEDIA TYPES/FORMATS
● TEXT
doc, docx, xls, xlsx, pptx, epub, mobi, ibooks,
html, xhtml, odt, pdf, rtf, txt, xml, json, md
● IMAGE
gif, jpg, png, tif, bmp, psd, eps, webp
● AUDIO
aif, flac, mp3, opus, ogg, wav
● VIDEO
3gp, 3g2, asf, avi, drc, flv, f4v, flu, gif, h264, mpg,
mp4, mkv, mov, ogv, rm, swf, webm, wmv
[Figure: Digital Media Asset → Algorithm </> → Content Codes]
MAIN INNOVATIONS
● All users or machines with access to the
content can generate the ISCC
from the media file – without the need for
centralised databases or registries
● With the ISCC, users or machines can confirm
the integrity of a media file or recognise and
match near-duplicate content
● Recognition is possible even when content
has been altered or manipulated, or when embedded
metadata, watermarks or steganographic data
have been stripped from the content!
PUBLIC DECLARATIONS
Public ISCC declarations allow for the
persistent binding of metadata, rights and
other information to the media asset:
● Sector-specific product and title metadata,
e.g. IPTC photo metadata, ONIX, DDEX, etc.
● Rights and licensing offerings
● Usage statistics, reporting data
● Opt-out for TDM and the use of content
as AI training data
(1) ISCC + OPT-OUT
● Creators and rightsholders can inseparably
bind a machine-readable opt-out
declaration to prevent their content
from being used as AI training data
(Article 4, EU DSM Directive on Copyright)
● Providers of AI applications can derive the
legal restrictions from the ISCC, and thus
respect the requirements set out by the
rightsholders
AI Training
Opt-out
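How a provider might honour such declarations can be sketched as a lookup before training. Everything here is hypothetical: the `Declaration` record, its field names and the declarer value are illustrative and do not describe an actual ISCC declaration format or registry API. The two codes are reused from the slides as opaque identifiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Declaration:
    """Hypothetical public declaration bound to an ISCC."""
    iscc: str
    declarer: str
    tdm_opt_out: bool


def allowed_for_training(candidate_isccs, declarations):
    # Keep only candidates whose ISCC has no opt-out declaration.
    opted_out = {d.iscc for d in declarations if d.tdm_opt_out}
    return [code for code in candidate_isccs if code not in opted_out]


declarations = [
    Declaration(
        iscc="KEC37M6L6YX645BORDR3KWVUJSLJ5AU7SVSE46E56YV33YW4HQKU5PY",
        declarer="example rightsholder",  # illustrative value
        tdm_opt_out=True,
    ),
]

candidates = [
    "KEC37M6L6YX645BORDR3KWVUJSLJ5AU7SVSE46E56YV33YW4HQKU5PY",
    "KEC3XJ7PYYXUM5KORG2ZMGVUNSL6DSI6GT2UZMTBMYOUULIGBBFXHYQ",
]

usable = allowed_for_training(candidates, declarations)
print(usable)
```

This sketch matches exact codes only. Catching altered copies of opted-out content would additionally require comparing the similarity units of each candidate against the declared codes.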
MACHINE-READABLE TDM·AI POLICY
© 2023 CC-BY-SA Sebastian Posth/Sabine Richly
(2) ISCC FOR INPUT TRANSPARENCY
● With the help of the ISCC, providers of AI
systems can provide lists of copyright
protected works that are/were used for
training their models
● This will allow future EU regulatory
requirements to be met
(An obligation may arise under Art. 28b 4c of
the revised EU AI Act)
(3) ISCC FOR OUTPUT TRANSPARENCY
● AI system providers can publicly declare
AI-generated content
● This will increase trustworthiness of the
digital media landscape
● At the same time, AI system providers can
prevent AI-generated output from being used
to train the LLM/base models
(Model Collapse, AI Entropy)
Gonzalo Martinez Ruiz De Arcaute, via
https://spectrum.ieee.org/ai-collapse
DISCUSSION OF USE CASES
● AI opt-out
● Anti-piracy
● Anti-counterfeit
● Reporting of sales or shares
● Using MRT + ISCC
● Using ISCC in DDEX metadata
● etc.
ISCC FOUNDATION
Your support will make a difference!
ISCC Foundation is a purpose-driven non-profit
organisation, dedicated to developing and
promoting open source technology for
decentralised digital content identification.
You can support our goals by:
● donating
● sponsoring development
● testing the ISCC
● promoting the ISCC system
https://core.iscc.codes
Please contact:
https://iscc.foundation
Sebastian Posth
posth@iscc.foundation
Titusz Pan
tp@iscc.foundation

ISCC Foundation Presentation from DDEX MRT Summit 16 Nov 2023
