SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Downloaden Sie, um offline zu lesen
Web Not For All
A Large Scale Study of Web Accessibility

     Rui Lopes1, Daniel Gomes2, Luís Carriço1

          1   LaSIGE, University of Lisbon
                      2 FCCN
Context

• The Web is the biggest information source
  for Mankind. Decentralised architecture
  made it blossom.
• Humans (and computers!) contribute to
  information production and consumption,
  leading to ~45B Web pages.
Context
• Growth of users contributing and
  interacting with the Web leads to significant
  diversity of users, including people with
  disabilities.
• The openness and decentralisation of the
  Web leads to an uncontrolled quality check
  of Websites’ usability (and accessibility).
What is the state of accessibility on the Web?
• It is known that Web accessibility adequacy
  is often far worse than desired.
• Studies tend to focus on a restricted (small)
  set of Web sites.


• Do macroscopic properties of Web
  accessibility emerge from analysing at a
  large scale?
Experiment
                   background




• The Portuguese Web Archive initiative
  periodically crawls contents from the
  Portuguese Web (.pt and others) for future
  preservation.
• Services are built on top of crawled
  collections: search (end users) & analysis
  framework (researchers).
Methodology
      data acquisition - obtaining the document collection



• Collect a sufficiently large portion of the
  Web, yet representative (e.g., national
  Webs)
• Spider traps handled gracefully
• Boostraped with 200,000 Website
  addresses from the .pt TLD
• Collected March/May 2008
Methodology
              data acquisition - evaluation process




• Implementation of 39 WCAG 1.0
  checkpoints yield pass, fail, warn.
  (collection previous to WCAG 2.0 TR)


• Overcome computational effort with
  Hadoop cluster, streams, caching, etc.
Methodology
                  data analysis




• Failure rate, 3 criteria:
Results
                    general



• 28M Web pages were evaluated.              (58%)



• 21GB evaluation data collected for analysis.
• 40B HTML elements evaluated.              (~1500/page)



   • 1.5B elements passed.     (56/page, 3.89%)



   • 2.9B elements failed.    (103/page, 7.15%)



   • 36B elements warned.        (1291/page, 89%)
Results
               rates versus page count distribution




conservative                 optimistic               strict
Results
               rates versus page complexity (# HTML elements)




conservative                      optimistic                    strict
Discussion
                   on the results



• Large scale confirms predictions of small
  scale studies - the Web is still not for all.
• Smaller Web pages tend to have greater
  accessibility quality.
• Nature of warnings is more striking than
  expected, completely different
  interpretations.
• Automated evaluation is just the
  beginning.
Discussion
                                  on the limitations of the experiment




• HTML structure vs. content rhetorics.
  (CSS & Javascript can change it all)



• Collecting the Web is hard.
  (deep Web - AJAX & forms -, infinite generation, robots.txt, etc.)



• Scaling evaluation & analysis processes is hard.
  (evaluation streamability, resource inter-dependencies, billion node graphs, etc.)
Conclusions
• Large scale accessibility evaluation of the
  Portuguese Web.
• Re-confirmed studies at the large.
• Educating developers & designers about
  warnings is crucial for accessibility success!
• Automated evaluation is just the start.
  Always need for expert & users evaluations.
Ongoing Work
                   we’re still at the tip of the iceberg



• Linking properties               (ranking vs. accessibility)


• Evolution of accessibility compliance in
  time (different document collections)
• Cross-cuts: gov, e-com, personalisation, etc.

• Developing countries
  countries)
                                        (Portuguese speaking African
Ongoing Work
              help wanted from community!




• Making available evaluation datasets (e.g.,
  Linked Data). Ours and yours!
• Larger document collections.

• Transforming warnings into failures with
  machine learning.
Thank you!
 rlopes@di.fc.ul.pt

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (16)

292 daniel dollar ssp yale_28_may2008
292 daniel dollar ssp yale_28_may2008292 daniel dollar ssp yale_28_may2008
292 daniel dollar ssp yale_28_may2008
 
Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration
 
What Libraries Still Need from Discovery Layers
What Libraries Still Need from Discovery LayersWhat Libraries Still Need from Discovery Layers
What Libraries Still Need from Discovery Layers
 
Peer Council 2016 Keynote Address with John Chapman
Peer Council 2016 Keynote Address with John ChapmanPeer Council 2016 Keynote Address with John Chapman
Peer Council 2016 Keynote Address with John Chapman
 
Transforming University Research - Mar 2006
Transforming University Research - Mar 2006Transforming University Research - Mar 2006
Transforming University Research - Mar 2006
 
Lightning talk on MARC records for the Contemporary Composers Web Archive pre...
Lightning talk on MARC records for the Contemporary Composers Web Archive pre...Lightning talk on MARC records for the Contemporary Composers Web Archive pre...
Lightning talk on MARC records for the Contemporary Composers Web Archive pre...
 
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
 
Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
 
Discovery Service Implementation: What We Wish We Had Known, or Known to Ask
Discovery Service Implementation: What We Wish We Had Known, or Known to AskDiscovery Service Implementation: What We Wish We Had Known, or Known to Ask
Discovery Service Implementation: What We Wish We Had Known, or Known to Ask
 
Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collec...
Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collec...Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collec...
Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collec...
 
Siegman "Creating Accessible Content"
Siegman "Creating Accessible Content"Siegman "Creating Accessible Content"
Siegman "Creating Accessible Content"
 
PESC-Kirchhoff-ALA Annual 2015 NISO Update
PESC-Kirchhoff-ALA Annual 2015 NISO UpdatePESC-Kirchhoff-ALA Annual 2015 NISO Update
PESC-Kirchhoff-ALA Annual 2015 NISO Update
 
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolExposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
 

Andere mochten auch (7)

Some notes on UX
Some notes on UXSome notes on UX
Some notes on UX
 
Assistive technology
Assistive technologyAssistive technology
Assistive technology
 
Networking
NetworkingNetworking
Networking
 
On Web Accessibility Environments
On Web Accessibility EnvironmentsOn Web Accessibility Environments
On Web Accessibility Environments
 
Luottamus digitaalisessa turvallisuudessa yleisöluento jarno limnéll_08032016
Luottamus digitaalisessa turvallisuudessa yleisöluento jarno limnéll_08032016Luottamus digitaalisessa turvallisuudessa yleisöluento jarno limnéll_08032016
Luottamus digitaalisessa turvallisuudessa yleisöluento jarno limnéll_08032016
 
Assistive technology
Assistive technologyAssistive technology
Assistive technology
 
Mahdollistava turvallisuus Jarno Limnéll Rytminmuutos 13062016
Mahdollistava turvallisuus Jarno Limnéll Rytminmuutos 13062016Mahdollistava turvallisuus Jarno Limnéll Rytminmuutos 13062016
Mahdollistava turvallisuus Jarno Limnéll Rytminmuutos 13062016
 

Ähnlich wie W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility

Spca2014 practical large scale migration guidance v1.0 andries den haan
Spca2014 practical large scale migration guidance v1.0 andries den haanSpca2014 practical large scale migration guidance v1.0 andries den haan
Spca2014 practical large scale migration guidance v1.0 andries den haan
NCCOMMS
 
Scalability andefficiencypres
Scalability andefficiencypresScalability andefficiencypres
Scalability andefficiencypres
NekoGato
 

Ähnlich wie W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility (20)

Web-Scale Discovery: Post Implementation
Web-Scale Discovery: Post ImplementationWeb-Scale Discovery: Post Implementation
Web-Scale Discovery: Post Implementation
 
Spca2014 practical large scale migration guidance v1.0 andries den haan
Spca2014 practical large scale migration guidance v1.0 andries den haanSpca2014 practical large scale migration guidance v1.0 andries den haan
Spca2014 practical large scale migration guidance v1.0 andries den haan
 
Practical large scale migration guidance
Practical large scale migration guidancePractical large scale migration guidance
Practical large scale migration guidance
 
Web Mining
Web MiningWeb Mining
Web Mining
 
Web mining
Web miningWeb mining
Web mining
 
IRT Unit_4.pptx
IRT Unit_4.pptxIRT Unit_4.pptx
IRT Unit_4.pptx
 
Ir1
Ir1Ir1
Ir1
 
Measuring impact
Measuring impactMeasuring impact
Measuring impact
 
Web Archiving – Lessons and Potential
 Web Archiving – Lessons and Potential Web Archiving – Lessons and Potential
Web Archiving – Lessons and Potential
 
Archiving the French Web: the BnF web archiving workflow. Sara Aubry
Archiving the French Web: the BnF web archiving workflow. Sara AubryArchiving the French Web: the BnF web archiving workflow. Sara Aubry
Archiving the French Web: the BnF web archiving workflow. Sara Aubry
 
Scalability andefficiencypres
Scalability andefficiencypresScalability andefficiencypres
Scalability andefficiencypres
 
IWMW 2005: Lies, Damn Lies, and Web Statistics (1)
IWMW 2005:  Lies, Damn Lies, and Web Statistics (1)IWMW 2005:  Lies, Damn Lies, and Web Statistics (1)
IWMW 2005: Lies, Damn Lies, and Web Statistics (1)
 
Introduction to Web Technology by Mahesh Sharma
Introduction to Web Technology by Mahesh SharmaIntroduction to Web Technology by Mahesh Sharma
Introduction to Web Technology by Mahesh Sharma
 
introduction to web engineering.pdf
introduction to web engineering.pdfintroduction to web engineering.pdf
introduction to web engineering.pdf
 
Tools. Techniques. Trouble?
Tools. Techniques. Trouble?Tools. Techniques. Trouble?
Tools. Techniques. Trouble?
 
introduction to web engineering.pptx
introduction to web engineering.pptxintroduction to web engineering.pptx
introduction to web engineering.pptx
 
Subject gateway knowledge organisation
Subject gateway knowledge organisationSubject gateway knowledge organisation
Subject gateway knowledge organisation
 
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
 
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
 
WELecture01.pptx
WELecture01.pptxWELecture01.pptx
WELecture01.pptx
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 

W4A 2010 - Web Not For All: A Large Scale Study of Web Accessibility

  • 1. Web Not For All A Large Scale Study of Web Accessibility Rui Lopes1, Daniel Gomes2, Luís Carriço1 1 LaSIGE, University of Lisbon 2 FCCN
  • 2. Context • The Web is the biggest information source for Mankind. Decentralised architecture made it blossom. • Humans (and computers!) contribute to information production and consumption, leading to ~45B Web pages.
  • 3. Context • Growth of users contributing and interacting with the Web leads to significant diversity of users, including people with disabilities. • The openness and decentralisation of the Web leads to an uncontrolled quality check of Websites’ usability (and accessibility).
  • 4. What is the state of accessibility on the Web?
  • 5. • It is known that Web accessibility adequacy is often far worse than desired. • Studies tend to focus on a restricted (small) set of Web sites. • Do macroscopic properties of Web accessibility emerge from analysing at a large scale?
  • 6. Experiment background • The Portuguese Web Archive initiative periodically crawls contents from the Portuguese Web (.pt and others) for future preservation. • Services are built on top of crawled collections: search (end users) & analysis framework (researchers).
  • 7. Methodology data acquisition - obtaining the document collection • Collect a sufficiently large portion of the Web, yet representative (e.g., national Webs) • Spider traps handled gracefully • Boostraped with 200,000 Website addresses from the .pt TLD • Collected March/May 2008
  • 8. Methodology data acquisition - evaluation process • Implementation of 39 WCAG 1.0 checkpoints yield pass, fail, warn. (collection previous to WCAG 2.0 TR) • Overcome computational effort with Hadoop cluster, streams, caching, etc.
  • 9. Methodology data analysis • Failure rate, 3 criteria:
  • 10. Results general • 28M Web pages were evaluated. (58%) • 21GB evaluation data collected for analysis. • 40B HTML elements evaluated. (~1500/page) • 1.5B elements passed. (56/page, 3.89%) • 2.9B elements failed. (103/page, 7.15%) • 36B elements warned. (1291/page, 89%)
  • 11. Results rates versus page count distribution conservative optimistic strict
  • 12. Results rates versus page complexity (# HTML elements) conservative optimistic strict
  • 13. Discussion on the results • Large scale confirms predictions of small scale studies - the Web is still not for all. • Smaller Web pages tend to have greater accessibility quality. • Nature of warnings is more striking than expected, completely different interpretations. • Automated evaluation is just the beginning.
  • 14. Discussion on the limitations of the experiment • HTML structure vs. content rhetorics. (CSS & Javascript can change it all) • Collecting the Web is hard. (deep Web - AJAX & forms -, infinite generation, robots.txt, etc.) • Scaling evaluation & analysis processes is hard. (evaluation streamability, resource inter-dependencies, billion node graphs, etc.)
  • 15. Conclusions • Large scale accessibility evaluation of the Portuguese Web. • Re-confirmed studies at the large. • Educating developers & designers about warnings is crucial for accessibility success! • Automated evaluation is just the start. Always need for expert & users evaluations.
  • 16. Ongoing Work we’re still at the tip of the iceberg • Linking properties (ranking vs. accessibility) • Evolution of accessibility compliance in time (different document collections) • Cross-cuts: gov, e-com, personalisation, etc. • Developing countries countries) (Portuguese speaking African
  • 17. Ongoing Work help wanted from community! • Making available evaluation datasets (e.g., Linked Data). Ours and yours! • Larger document collections. • Transforming warnings into failures with machine learning.