SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Color naming 65,274,705,768 pixels

Nathan Moroney and Giordano Beretta

HP Labs




Electronic Imaging 2013: Color Imaging XVIII
Outline
 Motivation
       More (pixel) data
 Finding and processing 65 billion pixels
       Hint: Wikipedia & a dual core Open MP color namer
 What did you learn?
       The most frequent non-achromatic color term is…
 What’s next?
       Other than a trillion pixels




Electronic Imaging 2013: Color Imaging XVIII
Motivation
 Previous work in crowd-sourcing color training data
  and experimental efforts
 Related work in the area of big (image) data
       A. Torralba, R. Fergus, W. T. Freeman, "80 million tiny images: a
        large dataset for non-parametric object and scene recognition",
        IEEE Transactions on Pattern Analysis and Machine Intelligence,
        vol.30(11), pp. 1958-1970, 2008.
       Ben Shneiderman, "Extreme Visualization: Squeezing a Billion
        Records into a Million Pixels", SIGMOD Conference, pp. 3-12,
        (2008).
       Steven Seitz, “A Trillion Photos”, EI’13 Keynote (2013).



Electronic Imaging 2013: Color Imaging XVIII
Motivation




            0             1            2            3        4        5   6
                                               Log Number of Images




Electronic Imaging 2013: Color Imaging XVIII
Source Data
 ImageClef 2010 snapshot
       Adrian Popescu, Theodora Tsikrika and Jana Kludas, "Overview
        of the wikipedia retrieval task at ImageCLEF 2010", In the
        Working Notes for the CLEF 2010 Workshop, 20-23 September,
        Padova, Italy, 2010.
       250,000 images plus associated wikipedia data
       20 gigabytes
       65,000,000,000 pixels uncompressed




Electronic Imaging 2013: Color Imaging XVIII
Source Data: At 200 PPI




Electronic Imaging 2013: Color Imaging XVIII
Processing
 Basic single dual-core (but Open MP threaded) script
  to process over all image files
 Simple stuff like getting image dimensions can be
  done over lunch
 Uncompressing all the JPEG files to memory can
  take hours
 Goal was a color naming algorithm that could be run
  in less than a day




Electronic Imaging 2013: Color Imaging XVIII
Processing
 Some testing done using HP Cloud Services and
  compute clusters
 But majority of focus on single computing device
       Antony Rowstron, Dushyanth Narayanan, Austin Donnelly, Greg
        O'Shea, and Andrew Douglas. "Nobody ever got fired for using
        hadoop on a cluster", In HotCDP 2012 - 1st International
        Workshop on Hot Topics in Cloud Data Processing, (2012).




Electronic Imaging 2013: Color Imaging XVIII
Processing
 Won’t describe the specifics of the color naming
    algorithm (throw produce if you have it) but generally
       Input single RGB pixel and output is a single color term
       Size of vocabulary or number of color terms is a parameter
       Relative range of chroma values corresponding to an achromatic
        values is also a parameter
 Also currently testing a completely revised model
 Finally, in the Future directions section note that the
    best option for formal publication is to make use of
    currently available open source machine learning
    toolboxes.

Electronic Imaging 2013: Color Imaging XVIII
Results: Aspect Ratios
 Wide range of
  image types
 Most basic test
  of processing
  scripts




Electronic Imaging 2013: Color Imaging XVIII
Results: Median
 Additional test and
  visualization of
  basic color
  properties of images
 Large enough data
  set was worthwhile
  to write custom
  HTML5 2d canvas
  renderer



Electronic Imaging 2013: Color Imaging XVIII
Results: Median
 So much data, that
  as noted by
  Shneiderman the
  density plot "uses a
  spatial substrate
  organizing
  principle, but shows
  concentrations of
  markers” is maybe a
  better idea
 Data, alpha=0.05

Electronic Imaging 2013: Color Imaging XVIII
Results: Max
 Max of R+G+B for
  the images
 Final test of basic
  scripting code




Electronic Imaging 2013: Color Imaging XVIII
Results
 Color terms
  across all images
 Majority pixels
  achromatic
 Top chromatic
  colors are
  arguably natural
  tones
 Higher chroma
  terms relatively
  infrequent
Electronic Imaging 2013: Color Imaging XVIII
Results
                                                                                  Color Terms for 200,000+ images
                                                                  60000


 Color terms per
  image                                                           50000




 Peak at 5 are all                                               40000


  achromatic terms
                                               Number of Images
                                                                  30000
  or images
 Gradual then                                                    20000



  rapid usage of                                                  10000

  chromatic terms
                                                                     0
                                                                          0   5       10            15             20             25    30   35
                                                                                      Number of Color Terms. Maximum Vocabulary of 30




Electronic Imaging 2013: Color Imaging XVIII
Results
                                                                                  Color Terms for 200,000+ images
                                                                  60000


 Sudden drop off
  at 30 is a model                                                50000




  failure                                                         40000


 Term added to
                                               Number of Images
                                                                  30000
  vocabulary based
  on previous                                                     20000



  limited
                                                                  10000

  optimization
                                                                     0
                                                                          0   5       10            15             20             25    30   35
                                                                                      Number of Color Terms. Maximum Vocabulary of 30




Electronic Imaging 2013: Color Imaging XVIII
Current Work
 Repeated entire process adjusting the model
  parameters
 Processing to fill SQL databases
 Query the database to validate all of the steps and
  explore specific




Electronic Imaging 2013: Color Imaging XVIII
Current Work
 SELECT * from
    cntable order by
    skyblue desc limit 40




Electronic Imaging 2013: Color Imaging XVIII
Future Directions
 Image collections as “pixel
    corpora” for algorithm
    design, testing and optimization.
       Similar to the role that written and spoken
        corpora fill for NLP and corpus linguistics
       Useful to formalize for citation and
        repeatability
 Additional analysis features
 Testing with more public domain
    machine learning algorithms for
    repeatability

Electronic Imaging 2013: Color Imaging XVIII
Summary
 Algorithm optimization, like machine color
  naming, with 200,000 images is different than with
  200.
 Based on Wikipedia, majority of visual content or
  pixels are achromatic
 Based on Wikipedia, higher chroma named pixels are
  less frequent
 Based on Wikipedia, there is a gradual then sudden
  transition in color term usage



Electronic Imaging 2013: Color Imaging XVIII

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (10)

Lectures 1 3 final (4)
Lectures 1 3 final (4)Lectures 1 3 final (4)
Lectures 1 3 final (4)
 
Comprehensive Infrared Image Edge detection Algorithm
Comprehensive Infrared Image Edge detection AlgorithmComprehensive Infrared Image Edge detection Algorithm
Comprehensive Infrared Image Edge detection Algorithm
 
Coin recognition using matlab
Coin recognition using matlabCoin recognition using matlab
Coin recognition using matlab
 
Digital Image Fundamentals
Digital Image FundamentalsDigital Image Fundamentals
Digital Image Fundamentals
 
Lec15 graph laplacian embedding
Lec15 graph laplacian embeddingLec15 graph laplacian embedding
Lec15 graph laplacian embedding
 
Lec12 review-part-i
Lec12 review-part-iLec12 review-part-i
Lec12 review-part-i
 
Ec section
Ec section Ec section
Ec section
 
Lec07 aggregation-and-retrieval-system
Lec07 aggregation-and-retrieval-systemLec07 aggregation-and-retrieval-system
Lec07 aggregation-and-retrieval-system
 
Seema dip
Seema dipSeema dip
Seema dip
 
Image colorization
Image colorizationImage colorization
Image colorization
 

Andere mochten auch

How to Play Well with Others (A Program on Dealing with Difficult People)
How to Play Well with Others (A Program on Dealing with Difficult People)How to Play Well with Others (A Program on Dealing with Difficult People)
How to Play Well with Others (A Program on Dealing with Difficult People)Marian Madonia, CSP
 
Scientific Applications of The Data Distribution Service
Scientific Applications of The Data Distribution ServiceScientific Applications of The Data Distribution Service
Scientific Applications of The Data Distribution ServiceAngelo Corsaro
 
Rupert.Reading.Jan 2015
Rupert.Reading.Jan 2015 Rupert.Reading.Jan 2015
Rupert.Reading.Jan 2015 Faye Brownlie
 
Office 365 + Windows Azure (del 2)
Office 365 + Windows Azure (del 2)Office 365 + Windows Azure (del 2)
Office 365 + Windows Azure (del 2)Wictor Wilén
 
Land and Home
Land and HomeLand and Home
Land and Homeoiwan
 
Carols Presentation53
Carols  Presentation53Carols  Presentation53
Carols Presentation53guest576d5
 
Sunshine coast admin
Sunshine coast adminSunshine coast admin
Sunshine coast adminFaye Brownlie
 
Redshift Company Credentials
Redshift Company CredentialsRedshift Company Credentials
Redshift Company CredentialsLisa_Bella
 
Make a Wave - Branding Intro webinar - PatchworkPresent
Make a Wave - Branding Intro webinar - PatchworkPresentMake a Wave - Branding Intro webinar - PatchworkPresent
Make a Wave - Branding Intro webinar - PatchworkPresentOgunte CIC
 
Compramos la merienda 2º ciclo 2016
Compramos la merienda 2º ciclo 2016Compramos la merienda 2º ciclo 2016
Compramos la merienda 2º ciclo 2016XXX XXX
 
Raspberry PiとActiveMQで作るセンサーライト
Raspberry PiとActiveMQで作るセンサーライトRaspberry PiとActiveMQで作るセンサーライト
Raspberry PiとActiveMQで作るセンサーライトTakayuki Konishi
 
Social Realism
Social RealismSocial Realism
Social Realismp102
 
La Governance e le risorse finanziarie
La Governance e le risorse finanziarieLa Governance e le risorse finanziarie
La Governance e le risorse finanziarieJacopo Zurlo
 
Baile alumnado 2º ciclo 2013
Baile alumnado 2º ciclo 2013Baile alumnado 2º ciclo 2013
Baile alumnado 2º ciclo 2013XXX XXX
 

Andere mochten auch (20)

HR Assignment
HR AssignmentHR Assignment
HR Assignment
 
How to Play Well with Others (A Program on Dealing with Difficult People)
How to Play Well with Others (A Program on Dealing with Difficult People)How to Play Well with Others (A Program on Dealing with Difficult People)
How to Play Well with Others (A Program on Dealing with Difficult People)
 
Scientific Applications of The Data Distribution Service
Scientific Applications of The Data Distribution ServiceScientific Applications of The Data Distribution Service
Scientific Applications of The Data Distribution Service
 
Camel
CamelCamel
Camel
 
Rupert.Reading.Jan 2015
Rupert.Reading.Jan 2015 Rupert.Reading.Jan 2015
Rupert.Reading.Jan 2015
 
Office 365 + Windows Azure (del 2)
Office 365 + Windows Azure (del 2)Office 365 + Windows Azure (del 2)
Office 365 + Windows Azure (del 2)
 
Land and Home
Land and HomeLand and Home
Land and Home
 
Carols Presentation53
Carols  Presentation53Carols  Presentation53
Carols Presentation53
 
Pintura 2 Eso
Pintura 2 EsoPintura 2 Eso
Pintura 2 Eso
 
Sunshine coast admin
Sunshine coast adminSunshine coast admin
Sunshine coast admin
 
Destiny Overview
Destiny OverviewDestiny Overview
Destiny Overview
 
Redshift Company Credentials
Redshift Company CredentialsRedshift Company Credentials
Redshift Company Credentials
 
Make a Wave - Branding Intro webinar - PatchworkPresent
Make a Wave - Branding Intro webinar - PatchworkPresentMake a Wave - Branding Intro webinar - PatchworkPresent
Make a Wave - Branding Intro webinar - PatchworkPresent
 
Compramos la merienda 2º ciclo 2016
Compramos la merienda 2º ciclo 2016Compramos la merienda 2º ciclo 2016
Compramos la merienda 2º ciclo 2016
 
soal-pemrograman-b
soal-pemrograman-bsoal-pemrograman-b
soal-pemrograman-b
 
Raspberry PiとActiveMQで作るセンサーライト
Raspberry PiとActiveMQで作るセンサーライトRaspberry PiとActiveMQで作るセンサーライト
Raspberry PiとActiveMQで作るセンサーライト
 
Social Realism
Social RealismSocial Realism
Social Realism
 
La Governance e le risorse finanziarie
La Governance e le risorse finanziarieLa Governance e le risorse finanziarie
La Governance e le risorse finanziarie
 
ikp321-05
ikp321-05ikp321-05
ikp321-05
 
Baile alumnado 2º ciclo 2013
Baile alumnado 2º ciclo 2013Baile alumnado 2º ciclo 2013
Baile alumnado 2º ciclo 2013
 

Ähnlich wie Color naming 65,274,705,768 pixels

IHC 2011 - Widgets Internship
IHC 2011 - Widgets InternshipIHC 2011 - Widgets Internship
IHC 2011 - Widgets InternshipEduardo Oliveira
 
Vladimir Surin and Alexander Tyrsin - Research of properties of digital nois...
Vladimir Surin and  Alexander Tyrsin - Research of properties of digital nois...Vladimir Surin and  Alexander Tyrsin - Research of properties of digital nois...
Vladimir Surin and Alexander Tyrsin - Research of properties of digital nois...AIST
 
Projects on Digital Image Processing Research Thesis Topics
Projects on Digital Image Processing Research Thesis TopicsProjects on Digital Image Processing Research Thesis Topics
Projects on Digital Image Processing Research Thesis TopicsMatlab Simulation
 
Image Processing using Matlab . Useful for beginners to learn Image Processing
Image Processing using Matlab . Useful for beginners to learn Image ProcessingImage Processing using Matlab . Useful for beginners to learn Image Processing
Image Processing using Matlab . Useful for beginners to learn Image ProcessingAshok Kumar
 
3.point operation and histogram based image enhancement
3.point operation and histogram based image enhancement3.point operation and histogram based image enhancement
3.point operation and histogram based image enhancementmukesh bhardwaj
 
Digital image processing using matlab
Digital image processing using matlab Digital image processing using matlab
Digital image processing using matlab Amr Rashed
 
Digital Image Processing_ ch1 introduction-2003
Digital Image Processing_ ch1 introduction-2003Digital Image Processing_ ch1 introduction-2003
Digital Image Processing_ ch1 introduction-2003Malik obeisat
 
Basics of image processing & analysis
Basics of image processing & analysisBasics of image processing & analysis
Basics of image processing & analysisMohsin Siddique
 
Laureate Online Education Internet and Multimedia Technolog.docx
Laureate Online Education    Internet and Multimedia Technolog.docxLaureate Online Education    Internet and Multimedia Technolog.docx
Laureate Online Education Internet and Multimedia Technolog.docxDIPESH30
 
What Color is Solid State Lighting - Panel Discussion
What Color is Solid State Lighting - Panel DiscussionWhat Color is Solid State Lighting - Panel Discussion
What Color is Solid State Lighting - Panel DiscussionCindy Foster-Warthen
 
IT6005 digital image processing question bank
IT6005   digital image processing question bankIT6005   digital image processing question bank
IT6005 digital image processing question bankGayathri Krishnamoorthy
 
Color Imaging Lab Research Interests 2010
Color Imaging Lab Research Interests 2010Color Imaging Lab Research Interests 2010
Color Imaging Lab Research Interests 2010Juan Luis Nieves
 
introduction to Digital Image Processing
introduction to Digital Image Processingintroduction to Digital Image Processing
introduction to Digital Image Processingnikesh gadare
 

Ähnlich wie Color naming 65,274,705,768 pixels (14)

IHC 2011 - Widgets Internship
IHC 2011 - Widgets InternshipIHC 2011 - Widgets Internship
IHC 2011 - Widgets Internship
 
Vladimir Surin and Alexander Tyrsin - Research of properties of digital nois...
Vladimir Surin and  Alexander Tyrsin - Research of properties of digital nois...Vladimir Surin and  Alexander Tyrsin - Research of properties of digital nois...
Vladimir Surin and Alexander Tyrsin - Research of properties of digital nois...
 
4 image enhancement in spatial domain
4 image enhancement in spatial domain4 image enhancement in spatial domain
4 image enhancement in spatial domain
 
Projects on Digital Image Processing Research Thesis Topics
Projects on Digital Image Processing Research Thesis TopicsProjects on Digital Image Processing Research Thesis Topics
Projects on Digital Image Processing Research Thesis Topics
 
Image Processing using Matlab . Useful for beginners to learn Image Processing
Image Processing using Matlab . Useful for beginners to learn Image ProcessingImage Processing using Matlab . Useful for beginners to learn Image Processing
Image Processing using Matlab . Useful for beginners to learn Image Processing
 
3.point operation and histogram based image enhancement
3.point operation and histogram based image enhancement3.point operation and histogram based image enhancement
3.point operation and histogram based image enhancement
 
Digital image processing using matlab
Digital image processing using matlab Digital image processing using matlab
Digital image processing using matlab
 
Digital Image Processing_ ch1 introduction-2003
Digital Image Processing_ ch1 introduction-2003Digital Image Processing_ ch1 introduction-2003
Digital Image Processing_ ch1 introduction-2003
 
Basics of image processing & analysis
Basics of image processing & analysisBasics of image processing & analysis
Basics of image processing & analysis
 
Laureate Online Education Internet and Multimedia Technolog.docx
Laureate Online Education    Internet and Multimedia Technolog.docxLaureate Online Education    Internet and Multimedia Technolog.docx
Laureate Online Education Internet and Multimedia Technolog.docx
 
What Color is Solid State Lighting - Panel Discussion
What Color is Solid State Lighting - Panel DiscussionWhat Color is Solid State Lighting - Panel Discussion
What Color is Solid State Lighting - Panel Discussion
 
IT6005 digital image processing question bank
IT6005   digital image processing question bankIT6005   digital image processing question bank
IT6005 digital image processing question bank
 
Color Imaging Lab Research Interests 2010
Color Imaging Lab Research Interests 2010Color Imaging Lab Research Interests 2010
Color Imaging Lab Research Interests 2010
 
introduction to Digital Image Processing
introduction to Digital Image Processingintroduction to Digital Image Processing
introduction to Digital Image Processing
 

Kürzlich hochgeladen

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Color naming 65,274,705,768 pixels

  • 1. Color naming 65,274,705,768 pixels Nathan Moroney and Giordano Beretta HP Labs Electronic Imaging 2013: Color Imaging XVIII
  • 2. Outline  Motivation  More (pixel) data  Finding and processing 65 billion pixels  Hint: Wikipedia & a dual core Open MP color namer  What did you learn?  The most frequent non-achromatic color term is…  What’s next?  Other than a trillion pixels Electronic Imaging 2013: Color Imaging XVIII
  • 3. Motivation  Previous work in crowd-sourcing color training data and experimental efforts  Related work in the area of big (image) data  A. Torralba, R. Fergus, W. T. Freeman, "80 million tiny images: a large dataset for non-parametric object and scene recognition", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30(11), pp. 1958-1970, 2008.  Ben Shneiderman, "Extreme Visualization: Squeezing a Billion Records into a Million Pixels", SIGMOD Conference, pp. 3-12, (2008).  Steven Seitz, “A Trillion Photos”, EI’13 Keynote (2013). Electronic Imaging 2013: Color Imaging XVIII
  • 4. Motivation 0 1 2 3 4 5 6 Log Number of Images Electronic Imaging 2013: Color Imaging XVIII
  • 5. Source Data  ImageClef 2010 snapshot  Adrian Popescu, Theodora Tsikrika and Jana Kludas, "Overview of the wikipedia retrieval task at ImageCLEF 2010", In the Working Notes for the CLEF 2010 Workshop, 20-23 September, Padova, Italy, 2010.  250,000 images plus associated wikipedia data  20 gigabytes  65,000,000,000 pixels uncompressed Electronic Imaging 2013: Color Imaging XVIII
  • 6. Source Data: At 200 PPI Electronic Imaging 2013: Color Imaging XVIII
  • 7. Processing  Basic single dual-core (but Open MP threaded) script to process over all image files  Simple stuff like getting image dimensions can be done over lunch  Uncompressing all the JPEG files to memory can take hours  Goal was a color naming algorithm that could be run in less than a day Electronic Imaging 2013: Color Imaging XVIII
  • 8. Processing  Some testing done using HP Cloud Services and compute clusters  But majority of focus on single computing device  Antony Rowstron, Dushyanth Narayanan, Austin Donnelly, Greg O'Shea, and Andrew Douglas. "Nobody ever got fired for using hadoop on a cluster", In HotCDP 2012 - 1st International Workshop on Hot Topics in Cloud Data Processing, (2012). Electronic Imaging 2013: Color Imaging XVIII
  • 9. Processing  Won’t describe the specifics of the color naming algorithm (throw produce if you have it) but generally  Input single RGB pixel and output is a single color term  Size of vocabulary or number of color terms is a parameter  Relative range of chroma values corresponding to an achromatic values is also a parameter  Also currently testing a completely revised model  Finally, in the Future directions section note that the best option for formal publication is to make use of currently available open source machine learning toolboxes. Electronic Imaging 2013: Color Imaging XVIII
  • 10. Results: Aspect Ratios  Wide range of image types  Most basic test of processing scripts Electronic Imaging 2013: Color Imaging XVIII
  • 11. Results: Median  Additional test and visualization of basic color properties of images  Large enough data set was worthwhile to write custom HTML5 2d canvas renderer Electronic Imaging 2013: Color Imaging XVIII
  • 12. Results: Median  So much data, that as noted by Shneiderman the density plot "uses a spatial substrate organizing principle, but shows concentrations of markers” is maybe a better idea  Data, alpha=0.05 Electronic Imaging 2013: Color Imaging XVIII
  • 13. Results: Max  Max of R+G+B for the images  Final test of basic scripting code Electronic Imaging 2013: Color Imaging XVIII
  • 14. Results  Color terms across all images  Majority pixels achromatic  Top chromatic colors are arguably natural tones  Higher chroma terms relatively infrequent Electronic Imaging 2013: Color Imaging XVIII
  • 15. Results Color Terms for 200,000+ images 60000  Color terms per image 50000  Peak at 5 are all 40000 achromatic terms Number of Images 30000 or images  Gradual then 20000 rapid usage of 10000 chromatic terms 0 0 5 10 15 20 25 30 35 Number of Color Terms. Maximum Vocabulary of 30 Electronic Imaging 2013: Color Imaging XVIII
  • 16. Results Color Terms for 200,000+ images 60000  Sudden drop off at 30 is a model 50000 failure 40000  Term added to Number of Images 30000 vocabulary based on previous 20000 limited 10000 optimization 0 0 5 10 15 20 25 30 35 Number of Color Terms. Maximum Vocabulary of 30 Electronic Imaging 2013: Color Imaging XVIII
  • 17. Current Work  Repeated entire process adjusting the model parameters  Processing to fill SQL databases  Query the database to validate all of the steps and explore specific Electronic Imaging 2013: Color Imaging XVIII
  • 18. Current Work  SELECT * from cntable order by skyblue desc limit 40 Electronic Imaging 2013: Color Imaging XVIII
  • 19. Future Directions  Image collections as “pixel corpora” for algorithm design, testing and optimization.  Similar to the role that written and spoken corpora fill for NLP and corpus linguistics  Useful to formalize for citation and repeatability  Additional analysis features  Testing with more public domain machine learning algorithms for repeatability Electronic Imaging 2013: Color Imaging XVIII
  • 20. Summary  Algorithm optimization, like machine color naming, with 200,000 images is different than with 200.  Based on Wikipedia, majority of visual content or pixels are achromatic  Based on Wikipedia, higher chroma named pixels are less frequent  Based on Wikipedia, there is a gradual then sudden transition in color term usage Electronic Imaging 2013: Color Imaging XVIII