SlideShare ist ein Scribd-Unternehmen logo
1 von 9
Downloaden Sie, um offline zu lesen
The sad saga of XHTML;
or,
what happens when markup geeks
get arrogant
LIS 652
Dorothea Salo
HTML’s early days
•Tim Berners-Lee: great humanitarian, LOUSY
document analyst/content modeler.
•HTML was supposedly designed for journal articles in physics.
•Based on your experience modeling articles... was it EVER gonna work for that?

•Early HTML was extremely crude markup.
•Crude in “structure.”
•Crude in appearance (as implemented in web browsers).
•Practically nonexistent interactivity. Documents just sat there; you couldn’t DO
anything on the web except read documents and (every once in a while) click
buttons.
Result: tag soup!
•“Tag soup:” markup-geek’s disrespectful
term for lousy markup
•“Tag abuse:” markup-geek’s disrespectful
term for using a tag for a reason other
than its structural appropriateness
•Early HTML pages: lots and LOTS of both!
•“Who put <font> tags in my nice neat structural markup?!”
•“What the heck are you doing with table markup?! Stop that!”
•“OMG VALIDATE YOUR HTML, WILL YOU?” “Validation? What’s that?”
Funny, but...
•Absolutely terrible for accessibility
•Even tag-abusing HTML won’t make
pretty web pages. Or interactive ones.
•Browser incompatibilities/differences
•partly due to browsers needing to consume near-completely incompatible,
bizarre, or just plain WRONG markup
•partly due to some browser implementors (MICROSOFT) trying to take
over the brand-new Web
So the W3C said “Stop.”
•And the W3C said “Use the CSS which we have made
for thee to separate structure from presentation.”
•And there was much (well, some) rejoicing!

•And the W3C said “Make not tag soup, but use the
stricter XHTML syntax, and validate thy documents.”
•And web designers said “NOPE.”
•(Adoption of XHTML on the web was essentially zero, except for those who were starting from XML
to begin with.)

•And after a long time, the W3C said “... okay, FINE.
Here’s HTML5, then. You want to be sloppy with tags?
Be sloppy.”
Life lessons
•Human beings are very bad at:
•Precision
•Consistency
•Following instructions
•Checking their work

•This means that human beings are VERY
BAD AT MAKING XML.
•We have a whole course on this at SLIS for a reason! It’s pretty hard for
most people to learn on their own!

•Demanding XML from most human
beings is a loser’s game!!!!!!!!!!!!!
Why am I telling you this?
•Because Very Smart People keep making
the Very Stupid Mistake of demanding
XML from human beings.
•W3C: XHTML
•National Science Digital Library: OAI-PMH
•Many, many server-based software packages
•Library supply chain/Impelsys: ONIX from indie/self-publishers (http://
www.americanlibrariesmagazine.org/blog/acquisition-dashboardimpelsys-results-experiment)

•This never, ever works out well!
•GO YE AND DO NOT DO LIKEWISE.
Postel’s law: a better way
•Be conservative in what you do, be liberal in
what you accept from others.
•Often reworded as “Be conservative in what you send, be liberal in what you
accept.”

•If you want XML:
•Get the data in a way the other people are comfortable with.
•Plan on having to clean it up. (Automate that as best you can!)
•Turn it into XML yourself.
•Seems like a hassle because it is... but it’s the ONLY THING THAT ALWAYS WORKS.
Trust me.
Thank you!
This presentation is available under a
Creative Commons Attribution 4.0
United States license.

Weitere ähnliche Inhalte

Andere mochten auch

"Serez-vous des nôtres ?" - Un hommage à Aaron Swartz
"Serez-vous des nôtres ?" - Un hommage à Aaron Swartz"Serez-vous des nôtres ?" - Un hommage à Aaron Swartz
"Serez-vous des nôtres ?" - Un hommage à Aaron SwartzCalimaq S.I.Lex
 
Le statut des données de la recherche : entre droit des bases de données et d...
Le statut des données de la recherche : entre droit des bases de données et d...Le statut des données de la recherche : entre droit des bases de données et d...
Le statut des données de la recherche : entre droit des bases de données et d...Calimaq S.I.Lex
 
Créer à l'heure du numérique : propriété intellectuelle, droit d'auteur, évol...
Créer à l'heure du numérique : propriété intellectuelle, droit d'auteur, évol...Créer à l'heure du numérique : propriété intellectuelle, droit d'auteur, évol...
Créer à l'heure du numérique : propriété intellectuelle, droit d'auteur, évol...Calimaq S.I.Lex
 
Quelles réciprocités pour les communs ?
Quelles réciprocités pour les communs ?Quelles réciprocités pour les communs ?
Quelles réciprocités pour les communs ?Calimaq S.I.Lex
 
Que manque-t-il pour avoir des licences Open Hardware qui fonctionnent ?
Que manque-t-il pour avoir des licences Open Hardware qui fonctionnent ? Que manque-t-il pour avoir des licences Open Hardware qui fonctionnent ?
Que manque-t-il pour avoir des licences Open Hardware qui fonctionnent ? Calimaq S.I.Lex
 
L'exception TDM dans la loi numérique : mérites, limites et perspectives
L'exception TDM dans la loi numérique : mérites, limites et perspectivesL'exception TDM dans la loi numérique : mérites, limites et perspectives
L'exception TDM dans la loi numérique : mérites, limites et perspectivesCalimaq S.I.Lex
 
Les universités françaises et l'Open Data après la loi "République numérique"
Les universités françaises et l'Open Data après la loi "République numérique"Les universités françaises et l'Open Data après la loi "République numérique"
Les universités françaises et l'Open Data après la loi "République numérique"Calimaq S.I.Lex
 
Digitizing Out of Commerce and Orphan Works : a (critical) french perspective
Digitizing Out of Commerce and Orphan Works : a (critical) french perspectiveDigitizing Out of Commerce and Orphan Works : a (critical) french perspective
Digitizing Out of Commerce and Orphan Works : a (critical) french perspectiveCalimaq S.I.Lex
 
Libre Accès aux publications scientifiques : quelles incidences de la loi num...
Libre Accès aux publications scientifiques : quelles incidences de la loi num...Libre Accès aux publications scientifiques : quelles incidences de la loi num...
Libre Accès aux publications scientifiques : quelles incidences de la loi num...Calimaq S.I.Lex
 
Les licences à réciprocité : une piste pour le développement des Communs ?
Les licences à réciprocité : une piste pour le développement des Communs ? Les licences à réciprocité : une piste pour le développement des Communs ?
Les licences à réciprocité : une piste pour le développement des Communs ? Calimaq S.I.Lex
 

Andere mochten auch (10)

"Serez-vous des nôtres ?" - Un hommage à Aaron Swartz
"Serez-vous des nôtres ?" - Un hommage à Aaron Swartz"Serez-vous des nôtres ?" - Un hommage à Aaron Swartz
"Serez-vous des nôtres ?" - Un hommage à Aaron Swartz
 
Le statut des données de la recherche : entre droit des bases de données et d...
Le statut des données de la recherche : entre droit des bases de données et d...Le statut des données de la recherche : entre droit des bases de données et d...
Le statut des données de la recherche : entre droit des bases de données et d...
 
Créer à l'heure du numérique : propriété intellectuelle, droit d'auteur, évol...
Créer à l'heure du numérique : propriété intellectuelle, droit d'auteur, évol...Créer à l'heure du numérique : propriété intellectuelle, droit d'auteur, évol...
Créer à l'heure du numérique : propriété intellectuelle, droit d'auteur, évol...
 
Quelles réciprocités pour les communs ?
Quelles réciprocités pour les communs ?Quelles réciprocités pour les communs ?
Quelles réciprocités pour les communs ?
 
Que manque-t-il pour avoir des licences Open Hardware qui fonctionnent ?
Que manque-t-il pour avoir des licences Open Hardware qui fonctionnent ? Que manque-t-il pour avoir des licences Open Hardware qui fonctionnent ?
Que manque-t-il pour avoir des licences Open Hardware qui fonctionnent ?
 
L'exception TDM dans la loi numérique : mérites, limites et perspectives
L'exception TDM dans la loi numérique : mérites, limites et perspectivesL'exception TDM dans la loi numérique : mérites, limites et perspectives
L'exception TDM dans la loi numérique : mérites, limites et perspectives
 
Les universités françaises et l'Open Data après la loi "République numérique"
Les universités françaises et l'Open Data après la loi "République numérique"Les universités françaises et l'Open Data après la loi "République numérique"
Les universités françaises et l'Open Data après la loi "République numérique"
 
Digitizing Out of Commerce and Orphan Works : a (critical) french perspective
Digitizing Out of Commerce and Orphan Works : a (critical) french perspectiveDigitizing Out of Commerce and Orphan Works : a (critical) french perspective
Digitizing Out of Commerce and Orphan Works : a (critical) french perspective
 
Libre Accès aux publications scientifiques : quelles incidences de la loi num...
Libre Accès aux publications scientifiques : quelles incidences de la loi num...Libre Accès aux publications scientifiques : quelles incidences de la loi num...
Libre Accès aux publications scientifiques : quelles incidences de la loi num...
 
Les licences à réciprocité : une piste pour le développement des Communs ?
Les licences à réciprocité : une piste pour le développement des Communs ? Les licences à réciprocité : une piste pour le développement des Communs ?
Les licences à réciprocité : une piste pour le développement des Communs ?
 

Mehr von Dorothea Salo

Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)Dorothea Salo
 
Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!Dorothea Salo
 
Privacy and libraries
Privacy and librariesPrivacy and libraries
Privacy and librariesDorothea Salo
 
Risk management and auditing
Risk management and auditingRisk management and auditing
Risk management and auditingDorothea Salo
 
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)Dorothea Salo
 
Preservation and institutional repositories for the digital arts and humanities
Preservation and institutional repositories for the digital arts and humanitiesPreservation and institutional repositories for the digital arts and humanities
Preservation and institutional repositories for the digital arts and humanitiesDorothea Salo
 
Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Dorothea Salo
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesDorothea Salo
 
Research Data and Scholarly Communication
Research Data and Scholarly CommunicationResearch Data and Scholarly Communication
Research Data and Scholarly CommunicationDorothea Salo
 
Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)Dorothea Salo
 
Manufacturing Serendipity
Manufacturing SerendipityManufacturing Serendipity
Manufacturing SerendipityDorothea Salo
 
RDF, RDA, and other TLAs
RDF, RDA, and other TLAsRDF, RDA, and other TLAs
RDF, RDA, and other TLAsDorothea Salo
 
Librarians love data!
Librarians love data!Librarians love data!
Librarians love data!Dorothea Salo
 
Taming the Monster: Digital Preservation Planning and Implementation Tools
Taming the Monster: Digital Preservation Planning and Implementation ToolsTaming the Monster: Digital Preservation Planning and Implementation Tools
Taming the Monster: Digital Preservation Planning and Implementation ToolsDorothea Salo
 

Mehr von Dorothea Salo (20)

Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)Soylent Semantic Web Is People! (with notes)
Soylent Semantic Web Is People! (with notes)
 
Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!Soylent SemanticWeb Is People!
Soylent SemanticWeb Is People!
 
Encryption
EncryptionEncryption
Encryption
 
Privacy and libraries
Privacy and librariesPrivacy and libraries
Privacy and libraries
 
Paying for it
Paying for itPaying for it
Paying for it
 
Risk management and auditing
Risk management and auditingRisk management and auditing
Risk management and auditing
 
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
The Canonically Bad (Digital) Humanities Proposal (and how to avoid it)
 
Preservation and institutional repositories for the digital arts and humanities
Preservation and institutional repositories for the digital arts and humanitiesPreservation and institutional repositories for the digital arts and humanities
Preservation and institutional repositories for the digital arts and humanities
 
Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?
 
MARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archivesMARC and BIBFRAME; Linking libraries and archives
MARC and BIBFRAME; Linking libraries and archives
 
Library Linked Data
Library Linked DataLibrary Linked Data
Library Linked Data
 
FRBR and RDA
FRBR and RDAFRBR and RDA
FRBR and RDA
 
Research Data and Scholarly Communication
Research Data and Scholarly CommunicationResearch Data and Scholarly Communication
Research Data and Scholarly Communication
 
Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)Research Data and Scholarly Communication (with notes)
Research Data and Scholarly Communication (with notes)
 
Manufacturing Serendipity
Manufacturing SerendipityManufacturing Serendipity
Manufacturing Serendipity
 
What We Organize
What We OrganizeWhat We Organize
What We Organize
 
Occupy Copyright!
Occupy Copyright!Occupy Copyright!
Occupy Copyright!
 
RDF, RDA, and other TLAs
RDF, RDA, and other TLAsRDF, RDA, and other TLAs
RDF, RDA, and other TLAs
 
Librarians love data!
Librarians love data!Librarians love data!
Librarians love data!
 
Taming the Monster: Digital Preservation Planning and Implementation Tools
Taming the Monster: Digital Preservation Planning and Implementation ToolsTaming the Monster: Digital Preservation Planning and Implementation Tools
Taming the Monster: Digital Preservation Planning and Implementation Tools
 

Kürzlich hochgeladen

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 

Kürzlich hochgeladen (20)

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 

The sad saga of XHTML; or, what happens when markup geeks get arrogant

  • 1. The sad saga of XHTML; or, what happens when markup geeks get arrogant LIS 652 Dorothea Salo
  • 2. HTML’s early days •Tim Berners-Lee: great humanitarian, LOUSY document analyst/content modeler. •HTML was supposedly designed for journal articles in physics. •Based on your experience modeling articles... was it EVER gonna work for that? •Early HTML was extremely crude markup. •Crude in “structure.” •Crude in appearance (as implemented in web browsers). •Practically nonexistent interactivity. Documents just sat there; you couldn’t DO anything on the web except read documents and (every once in a while) click buttons.
  • 3. Result: tag soup! •“Tag soup:” markup-geek’s disrespectful term for lousy markup •“Tag abuse:” markup-geek’s disrespectful term for using a tag for a reason other than its structural appropriateness •Early HTML pages: lots and LOTS of both! •“Who put <font> tags in my nice neat structural markup?!” •“What the heck are you doing with table markup?! Stop that!” •“OMG VALIDATE YOUR HTML, WILL YOU?” “Validation? What’s that?”
  • 4. Funny, but... •Absolutely terrible for accessibility •Even tag-abusing HTML won’t make pretty web pages. Or interactive ones. •Browser incompatibilities/differences •partly due to browsers needing to consume near-completely incompatible, bizarre, or just plain WRONG markup •partly due to some browser implementors (MICROSOFT) trying to take over the brand-new Web
  • 5. So the W3C said “Stop.” •And the W3C said “Use the CSS which we have made for thee to separate structure from presentation.” •And there was much (well, some) rejoicing! •And the W3C said “Make not tag soup, but use the stricter XHTML syntax, and validate thy documents.” •And web designers said “NOPE.” •(Adoption of XHTML on the web was essentially zero, except for those who were starting from XML to begin with.) •And after a long time, the W3C said “... okay, FINE. Here’s HTML5, then. You want to be sloppy with tags? Be sloppy.”
  • 6. Life lessons •Human beings are very bad at: •Precision •Consistency •Following instructions •Checking their work •This means that human beings are VERY BAD AT MAKING XML. •We have a whole course on this at SLIS for a reason! It’s pretty hard for most people to learn on their own! •Demanding XML from most human beings is a loser’s game!!!!!!!!!!!!!
  • 7. Why am I telling you this? •Because Very Smart People keep making the Very Stupid Mistake of demanding XML from human beings. •W3C: XHTML •National Science Digital Library: OAI-PMH •Many, many server-based software packages •Library supply chain/Impelsys: ONIX from indie/self-publishers (http:// www.americanlibrariesmagazine.org/blog/acquisition-dashboardimpelsys-results-experiment) •This never, ever works out well! •GO YE AND DO NOT DO LIKEWISE.
  • 8. Postel’s law: a better way •Be conservative in what you do, be liberal in what you accept from others. •Often reworded as “Be conservative in what you send, be liberal in what you accept.” •If you want XML: •Get the data in a way the other people are comfortable with. •Plan on having to clean it up. (Automate that as best you can!) •Turn it into XML yourself. •Seems like a hassle because it is... but it’s the ONLY THING THAT ALWAYS WORKS. Trust me.
  • 9. Thank you! This presentation is available under a Creative Commons Attribution 4.0 United States license.