SlideShare ist ein Scribd-Unternehmen logo
1 von 40
Collaborative 3D Modeling by the Crowd
The University of Tokyo
Ryohei Suzuki Takeo Igarashi
GI'17
3D modeling by crowdsourced sketching
Purpose
Synthesizing a 3D model from a single reference image.
Our approach
Crowdsourcing 2D sketching from multiple viewing angles,
then automatically integrating them into a 3D geometry.
Reference image
(photo/illustration) 3D modelSketches projections
Background
3D modeling is difficult for novice users
6DOF object operation
Local coordinate?
Global coordinate?
Many operation modes
Object mode? Edit mode?
Sculpt mode?
3D view rotation
Complex mouse operation
Many setting items
What is the easiest way?
Simplified 3D CAD tools
(e.g., Sketchup, Tinkercad) Sketch-based modeling
[Igarashi et al., 1999]
[Nealen et al., 2007]
[Chen et al., 2013]
Image processing
+ user interaction
Crowdsourcing!
Macrotask vs. microtask crowdsourcing
Macrotask
Outsourcing complex tasks to a small
number of professional workers
Microtask
Outsourcing simple tasks to a large
number of non-professional workers
Pros: skilled work
Cons: small worker pool, high cost
Pros: large worker pool
Cons: low-quality, unskilled work
Human computation for creative purposes
Human Computation (HC) [von Ahn, 2006]
“a paradigm for utilizing human processing power to
solve problems that computers cannot yet solve.”
microtask
skilled work
[Gingold et al., 2011]
Normal vector annotation
[Koyama et al., 2014]
Optimizing photo color correction
Applications of HC to content enhancement
Our approach
• Decomposing 3D modeling process into microtasks to enable
3D shape synthesis by HC.
• Proposing algorithms to integrate many inconsistent sketches to
extract geometrical information.
• Proposing novel crowdsourcing workflow for improving the
quality of submitted sketches.
→ Show the possibilities of HC for content creation
System overview
3D modeling workflow
Crowd workers
Reference image
+ three directions
+ parts number
2D sketches
Orthogonal
projections
3D model
(output)
Iterative
refinement
Peer reviewing
Sketching
Integrate Synthesis
User (customer)
Evaluate
Continue/stop
“7 parts”
Sketching task
• Draw a sketch of the object seen from a specified view
• 1 sketch / 1 worker, $0.36 basic reward
• Partly/entirely occluded parts should also be drawn overlapped
3D synthesis algorithms
1. Extraction of valid sketches
Problem: existence of invalid sketches in submissions
• Sketches drawn from wrong viewing angles
• Completely meaningless submissions
Reference image
1. Extraction of valid sketches
Observation
Strategy
Modified Hausdorff Distance Matrix
[Dubuisson 1994]
Clustering by Medoidshifts
[Sheikh et al. 2007]
Cluster 1
Cluster 2
Cluster 4
Cluster 3 Cluster 7
Cluster 6
Cluster 5
Reference
image
valid sketches are similar to each other
clustering sketches, then use the largest cluster
2. Integrating sketches into a projection
Analyzing the correspondence between individual sketches
1. Clustering all the parts contained in the valid sketches
• Same strategy as sketch clustering
2. Calculate the average shape for every cluster
3. Synthesizing 3D primitives from multi-
view projections
1. Inferring the correspondence between parts from multi-view
projections to extract triplets by cost calculation
2. Generate a 3D primitive for each triplet
Iterative refinement
What are the problems with sketches?
Small proportion of valid sketches
• Only ~40% of submissions are valid
• Most invalid sketches are caused by misunderstanding the task
Most valid sketches are incomplete
• Imperfect coverage of parts in the reference image
• Poor precision of parts arrangements
• Lack of motivation?
How can we help/encourage workers to draw better sketches?
1. Example-sharing
• Providing satisfactory submissions from previous workers [Little et al., 2010]
• Workers can avoid misunderstanding by referring to the examples
Previously submitted
distinguished sketches
2. Introducing competition
• Provide extra rewards ($0.18) for excellent submitters
• Motivating workers to draw better sketches than minimum requirements
• Peer-review based evaluation of sketches
Peer-reviewing interface
7-stage evaluation
Iterative workflow for sketch refinement
1st iteration 2nd iteration
Sketch
workers
Sketch
workers
Submitted sketches
Review
workers
Outstanding sketches
Extra
rewards
examples
Example of refinement results
Top sketches from the 1st iteration
Generation result from the 20×3 sketches
Example of refinement results
Top sketches from the 3rd iteration
Generation result from the 20×3 sketches
Example of refinement results
Top sketches from the 5th iteration
Generation result from the 20×3 sketches
Valid sketch ratio: 40% → 80% improvement
Modeling results
Reflection of real world knowledge
Parts not explicitly present in the reference image were created.
Synthesis from scribbles
Required Time 15mins (1 iteration, without review)
(Reference Image)
Evaluation
Difficulty of the tasks
Required timer for task completion
• Sketching 8.0 mins (median)
• Reviewing 3.8 mins (median)
Survey results from crowd workers (5 is best)
Acceptable as “microtasks”
Overall
satisfaction
Clarity of task
instruction
Ease of the task Payment
Sketching 4.7 4.5 4.1 4.1
Reviewing 4.6 4.5 4.1 4.3
Monetary costs / time consumption
Paid fees per an iteration
• Sketching $0.36 × 20 sketches × 3 views
• Reviewing $0.24 × 20 sketches × 3 views
• Bonus $0.18 × 4 workers × 3 views
Total $45.78/iteration (including transaction fee of CrowdFlower)
Required time for completion
• 45 mins (1 iteration) ~ 3.5 hours (5 iterations)
Fees were decided observing Dynamo
payment guidelines for research on
Mturk*
*http://wiki.wearedynamo.org/index.php?title=Guidelines_for_Academic_Requesters
Comparison with professional outsourcing
Model by
professional
Monetary cost $45 (vs. $46/iter)
Time consumption a whole day (vs. ~3.5h)
Extra cost ~10 email writing
Quality precise, with chamfer
Tested macrotask crowdsourcing using a freelancer platform*
*http://www.lancers.jp/
Model by
crowd
Advantages / disadvantages of our approach
Pros
• Small time consumption and communication cost
• High availability and scalability thanks to vast worker pool
Cons
• Lower quality than professional work
• Larger monetary cost
Limitations and Future work
Supported 3D primitives / operations
Current algorithm supports:
• Primitives: cuboid / cylinder / ellipsoids
• Rotation: about one of X-Y-Z axes
view 1 view 2 view 3 3D primitive
rectangle rectangle rectangle cuboid
cylinder
ellipsoid
rectangle rectangle ellipse
ellipseellipseellipse
Ambiguity in 3D synthesis from projections
Confusion occurs when multiple parts overlap from a certain view
Overlapping
Future work
Applying HC for diverse 3D modeling processes
• Voting for resolving ambiguity
• Fillet / chamfer design of edges
• Alignment of objects
• etc.
Conclusion
Conclusion
• We proposed a crowd-powered approach for 3D modeling
from a single reference image
• We designed 3D synthesis algorithms as well as
iterative crowdsourcing workflow for quality improvement
• We showed the practicability of the approach by evaluation
Thank you!

Weitere ähnliche Inhalte

Ähnlich wie Collaborative 3D Modeling by the Crowd

Design pattern in android
Design pattern in androidDesign pattern in android
Design pattern in android
Jay Kumarr
 

Ähnlich wie Collaborative 3D Modeling by the Crowd (20)

Design engineering
Design engineeringDesign engineering
Design engineering
 
Navigating Help - Testing Information Architecture with Treejack
Navigating Help - Testing Information Architecture with TreejackNavigating Help - Testing Information Architecture with Treejack
Navigating Help - Testing Information Architecture with Treejack
 
M sc thesis proposal v4
M sc thesis proposal v4M sc thesis proposal v4
M sc thesis proposal v4
 
Integration of Virtual Labs into science  e-learning.
Integration of Virtual Labs into science  e-learning.Integration of Virtual Labs into science  e-learning.
Integration of Virtual Labs into science  e-learning.
 
Evolutionary Architecture And Design
Evolutionary Architecture And DesignEvolutionary Architecture And Design
Evolutionary Architecture And Design
 
3d technology rahul
3d technology rahul3d technology rahul
3d technology rahul
 
Personalized Job Recommendation System at LinkedIn: Practical Challenges and ...
Personalized Job Recommendation System at LinkedIn: Practical Challenges and ...Personalized Job Recommendation System at LinkedIn: Practical Challenges and ...
Personalized Job Recommendation System at LinkedIn: Practical Challenges and ...
 
Design Patterns - General Introduction
Design Patterns - General IntroductionDesign Patterns - General Introduction
Design Patterns - General Introduction
 
3D printing ppt.pptx
3D printing ppt.pptx3D printing ppt.pptx
3D printing ppt.pptx
 
Software Design principales
Software Design principalesSoftware Design principales
Software Design principales
 
Ch 9-design-engineering
Ch 9-design-engineeringCh 9-design-engineering
Ch 9-design-engineering
 
Design pattern in android
Design pattern in androidDesign pattern in android
Design pattern in android
 
Effort estimation
Effort estimationEffort estimation
Effort estimation
 
UX Design process, #UX, #Design Process, #Agile UX
UX Design process, #UX, #Design Process, #Agile UX UX Design process, #UX, #Design Process, #Agile UX
UX Design process, #UX, #Design Process, #Agile UX
 
virtual interior engineer
virtual interior engineer virtual interior engineer
virtual interior engineer
 
Parents
ParentsParents
Parents
 
3 d printing
3 d printing 3 d printing
3 d printing
 
Intro to PM.ppt
Intro to PM.pptIntro to PM.ppt
Intro to PM.ppt
 
From Experimentation to Production: The Future of WebGL
From Experimentation to Production: The Future of WebGLFrom Experimentation to Production: The Future of WebGL
From Experimentation to Production: The Future of WebGL
 
Machine learning workshop @DYP Pune
Machine learning workshop @DYP PuneMachine learning workshop @DYP Pune
Machine learning workshop @DYP Pune
 

Mehr von Ryohei Suzuki

アナログとはなんだろう。―古くて新しい、もう一つの計算―
アナログとはなんだろう。―古くて新しい、もう一つの計算―アナログとはなんだろう。―古くて新しい、もう一つの計算―
アナログとはなんだろう。―古くて新しい、もう一つの計算―
Ryohei Suzuki
 
Overview of User Interfaces
Overview of User InterfacesOverview of User Interfaces
Overview of User Interfaces
Ryohei Suzuki
 

Mehr von Ryohei Suzuki (20)

Transformer based approaches for visual representation learning
Transformer based approaches for visual representation learningTransformer based approaches for visual representation learning
Transformer based approaches for visual representation learning
 
Paper memo: persistent homology on biological problems
Paper memo: persistent homology on biological problemsPaper memo: persistent homology on biological problems
Paper memo: persistent homology on biological problems
 
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
Paper memo: Optimal-Transport Analysis of Single-Cell Gene Expression Identif...
 
Basic Concepts of Entanglement Measures
Basic Concepts of Entanglement MeasuresBasic Concepts of Entanglement Measures
Basic Concepts of Entanglement Measures
 
Disentangled Representation Learning of Deep Generative Models
Disentangled Representation Learning of Deep Generative ModelsDisentangled Representation Learning of Deep Generative Models
Disentangled Representation Learning of Deep Generative Models
 
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
論文紹介: "MolGAN: An implicit generative model for small molecular graphs"
 
Report: "MolGAN: An implicit generative model for small molecular graphs"
Report: "MolGAN: An implicit generative model for small molecular graphs"Report: "MolGAN: An implicit generative model for small molecular graphs"
Report: "MolGAN: An implicit generative model for small molecular graphs"
 
等号と不等号の物理学
等号と不等号の物理学等号と不等号の物理学
等号と不等号の物理学
 
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
Wolf et al. "Graph abstraction reconciles clustering with trajectory inferen...
 
コンピュータは知恵熱を出すか?
コンピュータは知恵熱を出すか?コンピュータは知恵熱を出すか?
コンピュータは知恵熱を出すか?
 
身体の中の小宇宙:免疫研究の最前線
身体の中の小宇宙:免疫研究の最前線身体の中の小宇宙:免疫研究の最前線
身体の中の小宇宙:免疫研究の最前線
 
Single-cell pseudo-temporal ordering 近年の技術動向
Single-cell pseudo-temporal ordering 近年の技術動向Single-cell pseudo-temporal ordering 近年の技術動向
Single-cell pseudo-temporal ordering 近年の技術動向
 
汝は計算機なりや?
汝は計算機なりや?汝は計算機なりや?
汝は計算機なりや?
 
アナログとはなんだろう。―古くて新しい、もう一つの計算―
アナログとはなんだろう。―古くて新しい、もう一つの計算―アナログとはなんだろう。―古くて新しい、もう一つの計算―
アナログとはなんだろう。―古くて新しい、もう一つの計算―
 
AnnoTone (CHI 2015)
AnnoTone (CHI 2015)AnnoTone (CHI 2015)
AnnoTone (CHI 2015)
 
色字共感覚と書記素学習
色字共感覚と書記素学習色字共感覚と書記素学習
色字共感覚と書記素学習
 
AnnoTone: 高周波音の映像収録時 埋め込みによる編集支援
AnnoTone: 高周波音の映像収録時埋め込みによる編集支援AnnoTone: 高周波音の映像収録時埋め込みによる編集支援
AnnoTone: 高周波音の映像収録時 埋め込みによる編集支援
 
立体音響とインタラクション
立体音響とインタラクション立体音響とインタラクション
立体音響とインタラクション
 
SIGGRAPH 2014 Preview -"Shape Collection" Session
SIGGRAPH 2014 Preview -"Shape Collection" SessionSIGGRAPH 2014 Preview -"Shape Collection" Session
SIGGRAPH 2014 Preview -"Shape Collection" Session
 
Overview of User Interfaces
Overview of User InterfacesOverview of User Interfaces
Overview of User Interfaces
 

Kürzlich hochgeladen

biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
Bhagirath Gogikar
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 

Kürzlich hochgeladen (20)

biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 

Collaborative 3D Modeling by the Crowd

  • 1. Collaborative 3D Modeling by the Crowd The University of Tokyo Ryohei Suzuki Takeo Igarashi GI'17
  • 2. 3D modeling by crowdsourced sketching Purpose Synthesizing a 3D model from a single reference image. Our approach Crowdsourcing 2D sketching from multiple viewing angles, then automatically integrating them into a 3D geometry. Reference image (photo/illustration) 3D modelSketches projections
  • 4. 3D modeling is difficult for novice users 6DOF object operation Local coordinate? Global coordinate? Many operation modes Object mode? Edit mode? Sculpt mode? 3D view rotation Complex mouse operation Many setting items
  • 5. What is the easiest way? Simplified 3D CAD tools (e.g., Sketchup, Tinkercad) Sketch-based modeling [Igarashi et al., 1999] [Nealen et al., 2007] [Chen et al., 2013] Image processing + user interaction
  • 7. Macrotask vs. microtask crowdsourcing Macrotask Outsourcing complex tasks to a small number of professional workers Microtask Outsourcing simple tasks to a large number of non-professional workers Pros: skilled work Cons: small worker pool, high cost Pros: large worker pool Cons: low-quality, unskilled work
  • 8. Human computation for creative purposes Human Computation (HC) [von Ahn, 2006] “a paradigm for utilizing human processing power to solve problems that computers cannot yet solve.” microtask skilled work [Gingold et al., 2011] Normal vector annotation [Koyama et al., 2014] Optimizing photo color correction Applications of HC to content enhancement
  • 9. Our approach • Decomposing 3D modeling process into microtasks to enable 3D shape synthesis by HC. • Proposing algorithms to integrate many inconsistent sketches to extract geometrical information. • Proposing novel crowdsourcing workflow for improving the quality of submitted sketches. → Show the possibilities of HC for content creation
  • 11. 3D modeling workflow Crowd workers Reference image + three directions + parts number 2D sketches Orthogonal projections 3D model (output) Iterative refinement Peer reviewing Sketching Integrate Synthesis User (customer) Evaluate Continue/stop “7 parts”
  • 12. Sketching task • Draw a sketch of the object seen from a specified view • 1 sketch / 1 worker, $0.36 basic reward • Partly/entirely occluded parts should also be drawn overlapped
  • 14. 1. Extraction of valid sketches Problem: existence of invalid sketches in submissions • Sketches drawn from wrong viewing angles • Completely meaningless submissions Reference image
  • 15. 1. Extraction of valid sketches Observation Strategy Modified Hausdorff Distance Matrix [Dubuisson 1994] Clustering by Medoidshifts [Sheikh et al. 2007] Cluster 1 Cluster 2 Cluster 4 Cluster 3 Cluster 7 Cluster 6 Cluster 5 Reference image valid sketches are similar to each other clustering sketches, then use the largest cluster
  • 16. 2. Integrating sketches into a projection Analyzing the correspondence between individual sketches 1. Clustering all the parts contained in the valid sketches • Same strategy as sketch clustering 2. Calculate the average shape for every cluster
  • 17. 3. Synthesizing 3D primitives from multi- view projections 1. Inferring the correspondence between parts from multi-view projections to extract triplets by cost calculation 2. Generate a 3D primitive for each triplet
  • 19. What are the problems with sketches? Small proportion of valid sketches • Only ~40% of submissions are valid • Most invalid sketches are caused by misunderstanding the task Most valid sketches are incomplete • Imperfect coverage of parts in the reference image • Poor precision of parts arrangements • Lack of motivation? How can we help/encourage workers to draw better sketches?
  • 20. 1. Example-sharing • Providing satisfactory submissions from previous workers [Little et al., 2010] • Workers can avoid misunderstanding by referring to the examples Previously submitted distinguished sketches
  • 21. 2. Introducing competition • Provide extra rewards ($0.18) for excellent submitters • Motivating workers to draw better sketches than minimum requirements • Peer-review based evaluation of sketches Peer-reviewing interface 7-stage evaluation
  • 22. Iterative workflow for sketch refinement 1st iteration 2nd iteration Sketch workers Sketch workers Submitted sketches Review workers Outstanding sketches Extra rewards examples
  • 23. Example of refinement results Top sketches from the 1st iteration Generation result from the 20×3 sketches
  • 24. Example of refinement results Top sketches from the 3rd iteration Generation result from the 20×3 sketches
  • 25. Example of refinement results Top sketches from the 5th iteration Generation result from the 20×3 sketches Valid sketch ratio: 40% → 80% improvement
  • 27.
  • 28. Reflection of real world knowledge Parts not explicitly present in the reference image were created.
  • 29. Synthesis from scribbles Required Time 15mins (1 iteration, without review) (Reference Image)
  • 31. Difficulty of the tasks Required timer for task completion • Sketching 8.0 mins (median) • Reviewing 3.8 mins (median) Survey results from crowd workers (5 is best) Acceptable as “microtasks” Overall satisfaction Clarity of task instruction Ease of the task Payment Sketching 4.7 4.5 4.1 4.1 Reviewing 4.6 4.5 4.1 4.3
  • 32. Monetary costs / time consumption Paid fees per an iteration • Sketching $0.36 × 20 sketches × 3 views • Reviewing $0.24 × 20 sketches × 3 views • Bonus $0.18 × 4 workers × 3 views Total $45.78/iteration (including transaction fee of CrowdFlower) Required time for completion • 45 mins (1 iteration) ~ 3.5 hours (5 iterations) Fees were decided observing Dynamo payment guidelines for research on Mturk* *http://wiki.wearedynamo.org/index.php?title=Guidelines_for_Academic_Requesters
  • 33. Comparison with professional outsourcing Model by professional Monetary cost $45 (vs. $46/iter) Time consumption a whole day (vs. ~3.5h) Extra cost ~10 email writing Quality precise, with chamfer Tested macrotask crowdsourcing using a freelancer platform* *http://www.lancers.jp/ Model by crowd
  • 34. Advantages / disadvantages of our approach Pros • Small time consumption and communication cost • High availability and scalability thanks to vast worker pool Cons • Lower quality than professional work • Larger monetary cost
  • 36. Supported 3D primitives / operations Current algorithm supports: • Primitives: cuboid / cylinder / ellipsoids • Rotation: about one of X-Y-Z axes view 1 view 2 view 3 3D primitive rectangle rectangle rectangle cuboid cylinder ellipsoid rectangle rectangle ellipse ellipseellipseellipse
  • 37. Ambiguity in 3D synthesis from projections Confusion occurs when multiple parts overlap from a certain view Overlapping
  • 38. Future work Applying HC for diverse 3D modeling processes • Voting for resolving ambiguity • Fillet / chamfer design of edges • Alignment of objects • etc.
  • 40. Conclusion • We proposed a crowd-powered approach for 3D modeling from a single reference image • We designed 3D synthesis algorithms as well as iterative crowdsourcing workflow for quality improvement • We showed the practicability of the approach by evaluation Thank you!

Hinweis der Redaktion

  1. Hello everyone. I am Ryohei Suzuki, an ex-master student in the user interface research group at the University of Tokyo. Today I'm going to talk about our work "collaborative 3D modeling by the crowd." This paper was authored by me and Takeo Igarashi.
  2. I would like to start from briefly introducing the problem what we want to tackle and our approach to that. Our purpose is to synthesize a complete 3D model from a single reference image, such as a picture or an illustration, as the input. But this is one of the long-standing problems in computer graphics, and currently there is no straightforward computational solution to this. In this work, we propose an approach that takes advantage of human cognitive functions utilizing a crowdsourcing system. That is, we gather 2D sketching of the target object drawn from multiple viewing angles by human workers, then automatically integrate them into a 3D geometry. This approach enables synthesis of 3D models without complicated image processing or fine-tuned machine learning system.
  3. Then, let me explain the background of our research.
  4. Since this work is about 3D modeling, let's see the existing 3D modeling methods briefly. Recently more and more consumers have interests in creating their own 3D objects as the penetration of digital fabrication. However, 3D modeling using conventional authoring software designed for professionals, such as Maya, Blender, Cinema4D is quite difficult for novice users. It involves 3D view rotation with complex mouse operation, 6DOF object operation with multiple coordinate systems, transition between many operation modes, and many many setting items.
  5. So, we are interested in what is the easiest way to create a new 3D model for such users. Firstly, we have a number of simplified 3D CAD software such as Sketchup, Tinkercad, Fusion360.The interface designs of these modern software are sophisticated and the user can create arbitrary object with small efforts. However, these software still take several tens of minutes to couple of hours for learning the usage, and sometimes require complex 3D operation using multi-button mouse. As another option, we have some sketch-based modeling methods which only require 2D operations to create pretty 3D models. But, as you may know, using these methods is not as easy as it may looks in a demo video prepared by the authors. And we have more modern techniques like 3-Sweep that combine image processing and user interaction for semi-automatically extracting geometry from inputs, such as images. Such methods give us great ease of modeling, but do not always work, and still require the users to remember new interaction methods like sweeping. These three ways each have advantages and are useful in certain situations, but we have another option that should be simplest.
  6. Yes, that is crowdsourcing. We can entirely outsource the task of 3D modeling to another person and just wait for the result.
  7. There is largely two distinct categories of crowdsourcing, macrotask crowdsourcing and microtask crowdsourcing. The former is outsourcing of complex tasks to one or several workers with professional skills. It can take advantage of skilled work, so 3D modeling by macrotask crowdsourcing is straightforward. But it has a weak point in the availability because of the small skilled worker pool. In contrast, the latter simultaneously outsources very simple tasks that can be processed in several minutes to a large number of workers without special skills. It can utilize the virtually infinite worker pool, but basically it can only produce low-quality and unskilled work results. Obtaining complex fruits like that of macrotask crowdsourcing from microtask crowdsourcing is a non-trivial and challenging problem. We would like to explore such possibility in 3D modeling.
  8. Such idea was firstly formulated by von Ahn and named "human computation.” His original definition of human computation was "a paradigm for utilizing human processing power to solve problems that computers cannot yet solve.” In this passage, "human processing power" corresponds to microtask and the "problems" corresponds to skilled work of professional workers in our context. There has been some work applying human computation to creative purposes, such as normal vector annotation of images for re-lighting and optimization of photo color correction. These work can be seen as the applications of human computation to content enhancement. We consider that application of HC to content creation from scratch, not enhancement, should be a challenging frontier of HCI research.
  9. Then, let me introduce the summary our approach. Basically, we decompose 3D modeling process into microtasks, 2D sketching, to enable 3D shape synthesis by human computation. To do so, we propose algorithms for integrating many sketches to extract geometrical information. We also propose a novel crowdsourcing workflow that is needed for improving the submission quality. And, ultimately, we would like to show the possibilities of human computation for content creation.
  10. Let me move onto the system overview.
  11. 3D modeling workflow in our system is as follows. Firstly, the user uploads a reference image and annotate it with orthogonal viewing directions using a web interface. The user also provide the number of parts consisting the target object. Then, crowd workers are recruited using CrowdFlower platform and they draw 2D sketches of the target object viewed from one of the orthogonal angles. Gathered sketches are integrated to reconstruct an orthogonal projections, then the resulting 3D model is synthesized from the projections. In order to refine the output quality, sketches are iteratively gathered with peer reviewing process by the crowd workers. The user evaluate the quality of output at the end of each iteration, then decides to continue or stop the iteration.
  12. Sketching task is executed in a web interface like this, each worker is directed to draw a single sketch of the object seen from a specified view, and given 36 cents as the basic reward. They are requested to draw occluded parts as well.
  13. Then, I would like to present the 3D synthesis algorithms.
  14. The process starts from the extraction of valid sketches. Some of the submitted sketches are drawn from wrong viewing angles, and the others are completely meaningless. We should exclude them and extract only the valid sketches to generate a clean projection.
  15. From the observation that valid sketches are similar to each other in contrast to the diverse appearance of invalid ones, we take a strategy that firstly cluster the sketches based on their similarities, then adopt the largest cluster as the valid one. We defined the similarity matrix by modified Hausdorff distance, then calculate the clusters by Medoidshifts method.
  16. After extracting the valid sketches, we integrate them into a projection. To do so, we should analyze the correspondence between elements contained in different sketches. We take the clustering-based strategy same as the previous process to obtain the sets of 2D elements representing a same part in the target object. We calculate the average shape, that is size, position, and rotation, for every cluster, then obtain a clean projection.
  17. Finally, we synthesize 3D primitives from multi-view projections. We infer the correspondence between parts contained in each projection, then extract triplets that have small costs. Each triplet is converted to a 3D primitive according to the combination of the composing 2D parts. The cost of a triplet is calculated as the square-sum of the mismatch between the endpoints of the parts along the three axes. Please see the paper for the detail.
  18. Then, I would like to present our iterative refinement mechanism.
  19. In the pilot research, we realized that there are two major problems with the gathered sketches. The first was the small proportion of the valid sketches. It was only about 40%, and considerable proportion of the invalid sketches are caused by misunderstanding of the task, such as the specification of viewing direction. The second problem was the incompleteness of the valid sketches. The coverage of the parts drawn in the sketches was far from 100%, and the precision of parts arrangements was also poor. This might be naturally caused by the lack of motivation for more than satisfying the minimum requirements. Hence, we consider the way to help and encourage workers to draw better sketches.
  20. The first element is example-sharing. We can provide satisfactory submission from previous workers to help the successive workers to comprehend the task instruction. It can be seen as an implicit collaboration between workers. To do so, we evaluate the submitted sketches in reasonable manner.
  21. Then, we introduce the second element, competition. We recruit additional workers from the crowdsourcing platform to evaluate the sketches, then we provide extra rewards to sketch workers whose submissions are rated in top 20%. Highly-rated sketches are used in the example-sharing mechanism explained before.
  22. We integrate these two concepts, collaboration and competition into an iterative workflow like this. In the first iteration, submitted sketches from the workers are reviewed by other workers, then outstanding sketches are selected. Their authors receive rewards, and they are used as the examples for the next iteration. And the same process continues for several times.
  23. Here we show the example results of iterative refinement. In the first iteration, only three parts could be synthesized from the 60 submitted sketches.
  24. After three iterations, most of the major parts become able to be synthesized,
  25. then all the parts contained in the target object was synthesized from only the submissions of the 5th iteration. The ratio of valid sketches increased from 40% to 80% after five iterations.
  26. Then, let me show you some modeling results.
  27. Interestingly, the chair model contains the back apron part. This part is not present explicitly in the input image, but its existence can be inferred from the other visible parts. This reproduction may be done by the utilization of real world knowledge of the human workers. It shows that the workers intensively use their cognitive functions for dealing with sketching.
  28. Our system can accept not only pictures, but also a rough drawing as long as its spatial structure can be interpreted by human workers uniformly. This drawer model was generated from 60 sketches gathered in 15 minutes.
  29. Let's move on to the evaluation of the method.
  30. In order to say that our system successfully works as a microtask crowdsourcing system, the involved tasks should be easy and light-weight enough. The required time for task completion was 8 minutes for sketching and 4 minutes for reviewing. The survey responses from the recruited workers indicate they consider that the task instruction was clear and easy enough, and the payment was satisfactory. These results show that the tasks were acceptable as microtasks.
  31. Monetary cost can be directly calculated by the rewards for tasks and the platform transaction fee rate, that is 46 dollars per iteration. This is somewhat expensive and should be dealt to use the system in practical situations. The required time for completion was less than a few hours.
  32. To show the advantages and shortcomings of our approach against the straightforward macrotask crowdsourcing, we employed a professional modeler using a freelancer platform and compared the cost and the result. The paid fee was about 45 dollars, which is equivalent to the cost for a single iteration in our system. The time consumption was about a whole day, including recruiting, negotiation, and the modeling time. It also required us to write about 10 short messages for negotiation and task instruction, which was quite cumbersome. The quality of the resulting model was more precise than ours, and it included beautiful chamfers and fillets.
  33. In summary, our approach has its advantages in the small time consumption, low communication cost, and the high availability and scalability thanks to the vast worker pool. On the other hand, ours has its shortcoming in the lesser output quality than professional work, and also ours costs more than simple outsourcing at present.
  34. We would like to mention the limitations and possible future work for our research.
  35. The most serious limitation is the small number of supported 3D primitives and operations. We support cuboids, cylinders, and ellipsoids rotated about one of X-Y-Z axes. Existing techniques such as silhouette-based modeling could be applied to extend the complexity of geometries that can be made.
  36. And the current orthogonal projection-based modeling inevitably involves ambiguity in some situations. For example, a simple input image shown here produces an overlap in the top-view of the projections, and it gives a wrong synthesis result. We should introduce any mechanism to select the correct geometry from ambiguous candidate to solve the problem.
  37. As the future work, we are thinking of utilizing human computation for solving such problems. For example, selection of correct geometry from candidates can be processed by voting using microtask crowdsourcing. More detailed modeling elements such as fillet and chamfer design can be processed by dedicated microtask that involves 2D operations. Alignments or distribution of elements in a model can also be inferred and specified by human workers to improve the modeling quality. Future work on such attempts will reveal the potential of microtask crowdsourcing for creative tasks in greater depth.
  38. Then, let me conclude the talk.
  39. We proposed a crowd-powered approach for 3D modeling from a single reference image. For that, we designed a set of algorithms or 3D synthesis, as well as an iterative crowdsourcing workflow for quality improvement. We showed the advantages and disadvantages of our approach by evaluation. Thank you!