Computational photography and inverse problems

Computational Light Transport and Computational Photography: Inverse problems Camera Culture Ramesh Raskar Ramesh Raskar http://raskar.info MIT Media Lab raskar@mit.edu

How to Invent? After X, what is neXt Full Presentation at http://www.slideshare.net/cameraculture/raskar-ideahexagonapr2010 Ramesh Raskar, MIT Media Lab

Ramesh Raskar, http://raskar.info X+Y X neXt Xd X X++ X Full Presentation at http://www.slideshare.net/cameraculture/raskar-ideahexagonapr2010

Simple Exercise .. Image Compression Save Bandwidth and storage What is neXt

Strategy #1: Xd Extend it to next (or some other) dimension ..

X = Idea you just heard Concept Patent New Product/Best project/invention award Product feature Design Art Algorithm

Research .. http://raskar.info How to come up w ideas: Idea Hexagon How to write a paper How to give a talk Open research problems How to decide merit of a project How to attend a conference, brainstorm Facebook.com/ rRaskar Tips Get on Seminar/Talks mailing lists worldwide http://www.cs.virginia.edu/~robins/YouAndYourResearch.html Why do so few scientists make significant contributions and so many are forgotten in the long run? Highly recommended Hamming talk at Bell Labs

Is project worthwhile? Heilmeier's Questions http://en.wikipedia.org/wiki/George_H._Heilmeier#Heilmeier.27s_Catechism What What are you trying to do? Articulate your objectives using absolutely no jargon. Related work How is it done today, and what are the limits of current practice? Contribution What's new in your approach and why do you think it will be successful? Motivation Who cares? If you're successful, what difference will it make? Challenges What are the risks and the payoffs? How much will it cost? How long will it take? Evaluation What are the midterm and final "exams" to check for success? Raskar additions: Why now? (why not before, what’s new that makes possible) Why us? (wrong answers: I am smart, I can work harder than others)

Great Research: Strive for Five Before Five teams Be first, often let others do details Beyond Five years What no one is thinking about Within Five layers of ‘Human’ Impact Relevance Beyond Five minutes of description Deep, iterative, participatory Fusing Five+ Expertise Multi-disciplinary, proactive Ramesh Raskar, http://raskar.info

MIT Media Lab raskar@mit.edu http://cameraculture.info fb.com/rraskar Inverse Problems How to do Research in Imaging ,[object Object],Co-design of Optics and Computation Photons not just pixels Mid-level cues Computational Photography Open research problems Compressive Sensing for High Speed Events Limits of CS for general imaging Computational Light Transport Looking Around Corners, trillion fps Lightfields: 3D Displays and Holograms

Tools for Visual Computing Shadow Refractive Reflective Fernald, Science [Sept 2006]

Computational Photography Camera Culture Ramesh Raskar

Traditional Photography Detector Lens Pixels Mimics Human Eye for a Single Snapshot: Single View, Single Instant, Fixed Dynamic range and Depth of field for given Illumination in a Static world Image Courtesy: Shree Nayar

Picture Computational Camera + Photography: Optics, Sensors and Computations GeneralizedSensor Generalized Optics Computations Ray Reconstruction 4D Ray Bender Upto 4D Ray Sampler Merged Views, Programmable focus and dynamic range, Closed-loop Controlled Illumination, Coded exposure/apertures

Computational Photography Novel Illumination Light Sources Modulators Computational Cameras Generalized Optics GeneralizedSensor Generalized Optics Processing 4D Incident Lighting 4D Ray Bender Ray Reconstruction Upto 4D Ray Sampler 4D Light Field Display Scene: 8D Ray Modulator Recreate 4D Lightfield

Computational Photography [Raskar and Tumblin] captures a machine-readable representation of our world to hyper-realistically synthesize the essence of our visual experience. Resources ICCP 2012, Seattle Apr 2012 Papers due Dec 2nd, 2011 http://wikipedia.org/computational_photography http://raskar.info/photo

Computational Photography Computational Photography aims to make progress on both axis Phototourism Comprehensive Essence Scene completion from photos Augmented Human Experience Looking Around Corners Priors Capture Human Stereo Vision Metadata Coded Depth fg/bg Non-visual Data, GPS Virtual Object Insertion Spectrum Decomposition problems 8D reflectance field Direct/Global LightFields Relighting Epsilon Angle, spectrum aware Camera Array HDR, FoV Focal stack Resolution Material editing from single photo Digital Motion Magnification Raw Low Level Mid Level HighLevel Hyper realism Synthesis/Analysis

Co-designing Optical and Digital Processing Computational Light Transport Optics Displays Sensors Computational Photography Photon Hacking Illumination Signal Processing Computer Vision Machine Learning Bit Hacking

Take home points Co-design of hw/sw Avoid computational or optical chauvinism in imaging (Camera flash/Kinect) Hardware cost going to zero, Parallel technology trends Computer vision not just mimicking human vision/perception Borrow ideas from other fields: astronomy, scientific imaging, audio, communications Photons not just Pixels Change the rules of the game Optics, Sensors, Illum, Priors, Sparsity, Transforms Meta-data, Internet collection, Crowdsourcing

Computational Photography Wish List: Open Research Problems Camera Culture Ramesh Raskar

Wish #1 Ultimate Post-capture Control Camera Culture Ramesh Raskar

Digital Refocusing using Light Field Camera 125μ square-sided microlenses [Ng et al 2005]

Traditional Blurred Photo Deblurred Image

Fluttered Shutter Camera Raskar, Agrawal, Tumblin Siggraph2006 Ferroelectric shutter in front of the lens is turnedopaque or transparent in a rapid binary sequence

Preserves High Spatial Frequencies Fourier Transform Sharp Photo Blurred Photo PSF == Broadband Function Flutter Shutter: Shutter is OPEN and CLOSED

Coded Exposure Traditional Deblurred Image Deblurred Image Image of Static Object

Fast periodic phenomena Vocal folds flapping at 40.4 Hz Bottling line 4000 fps hi-speed camera 500 fps hi-speed camera

Compressive Sensing Single Pixel Camera image compressive image measurement matrix

Periodic signals -fP -2fP -4fP 3fP -3fP 0 fMax - fMax 2fP fP=1/P 4fP Periodic signal x(t) with period P t P = 16ms Periodic signal with period P and band-limited to fMax = 500 Hz. Fourier transform is non-zero only at multiples of fP=1/P ~ 63Hz.

High speed camera P = 16ms Ts = 1/(2 fMax) -fP -2fP -4fP -3fP 4fP 3fP 2fP 0 fMax - fMax fP=1/P Nyquist Sampling of x(t) Periodic signal has regularly spaced, sparse Fourier coefficients. Is it necessary to use a high-speed video camera? Why waste bandwidth?

Traditional Strobing Use low frame-rate camera and generate beat frequencies. P t Low exposure to avoid blurring. Low light throughput. Period known apriori. Strobing animation credit Wikipedia

t P Random Projections Per Frame of Camera using Coded Strobing Photography In every exposure duration observe different linear combinations of the periodic signal. Advantage of the design ,[object Object]

On an average, light throughput is 50%Coded Strobing Photography. Reddy, D., Veeraraghavan, A., Raskar, R. IEEE PAMI 2011

Observation Model x at 2000fps y at 25fps

Signal Model x at 2000fps y at 25fps

Signal & Observation Model Ais M x N, M<<N x at 2000fps y at 25fps N / M = 2000 / 25 = 80

Recovery: Sparsity Very few non-zero elements y = A s Observed values Mixing matrix Structured Sparse Coefficients Basis Pursuit De-noising

Simulation on hi-speed toothbrush 25fps normal camera 25fps coded strobing camera Reconstructed frames 2000fps hi-speed camera ~100X speedup

Rotating mill tool Mill tool rotating at 50Hz Reconstructed Video at 2000fps Normal Video: 25fps Coded Strobing Video: 25fps Blur increases as rotational velocity increases rotating at 200Hz rotating at 150Hz rotating at 100Hz increasing blur

Compressive Sensing for Images .. A good idea? Single Pixel Camera image compressive image measurement matrix

Is Randomized Projection-based Captureapt for Natural Images ? Periodic Signals Progressive Projections Randomized Projections Compression Ratio [Pandharkar, Veeraraghavan, Raskar 2009]

Wish #1 Ultimate Post-capture Control ,[object Object]

Emulate studio light from compact flashCamera Culture Ramesh Raskar

Wish #2 Freedom from Form ,[object Object]

Flat camera: Bidirectional screen (BiDi) ,[object Object],Camera Culture Ramesh Raskar

Wish #3 Understand the World Camera Culture Ramesh Raskar

Convert single 2D photo into 3D ? Snavely, Seitz, Szeliski U of Washington/Microsoft: Photosynth

Exploit Community Photo Collections U of Washington/Microsoft: Photosynth

Wish #3 Understand the World ,[object Object]

Interact with informationCamera Culture Ramesh Raskar

Wish #4 Sharing Visual Experience ,[object Object]

Privacy in public and authentication

Print ‘material’ Camera Culture Ramesh Raskar

Wish #5 Capturing Essence Camera Culture Ramesh Raskar

What are the problems with ‘real’ photo in conveying information ? Why do we hire artists to draw what can be photographed ?

Shadows Clutter Many Colors Highlight Shape Edges Mark moving parts Basic colors

Depth Edges with MultiFlash Raskar, Tan, Feris, Jingyi Yu, Turk – ACM SIGGRAPH 2004

Depth Discontinuities Internal and externalShape boundaries, Occluding contour, Silhouettes

Result Photo Canny Intensity Edge Detection Our Method

Questions What will a camera look like in 10,20 years? How will a billion networked and portable cameras change the social culture? How will online photo collections transform visual social computing? How will movie making/new reporting change?

Photos of tomorrow: computed not recorded http://scalarmotion.wordpress.com/2009/03/15/propeller-image-aliasing/

Camera Culture Group, MIT Media Lab Ramesh Raskar http://raskar.info Sensor Computational Photography Wish List ,[object Object]

Emulate studio lights with compact flash

Flat camera, large LCDs as cameras

Image destabilization for larger aperture

Delta-camera and Blind-camera,[object Object]

Can you look around the corner ?

Multi-path Analysis 2nd Bounce 1st Bounce 3rd Bounce

Femto-Photography (Transient Imaging) FemtoFlash Trillion FPS camera With M Bawendi, MIT Chemistry Serious Sync Computational Optics ,[object Object]

2009: Marr PrizeHonorable Mention (Kirmani, Hutchinson, Davis, Raskar, ICCV’2009)

2008: Transient Light Transport (Raskar, Davis, March 2008),[object Object]

Multi-Dimensional Light Transport 5-D Transport Gigapan

Collision avoidance, robot navigation, …

z x S L R s Occluder Streak-camera 3rd bounce C Laser beam B Echoes of Light

Steady State 4D Impulse Response, 5D

Scene with Ultra fast illumination and camera hidden elements Raw 5D Capture Time profiles Signal Proc. Photo, geometry, reflectance beyond line of sight Novel light transport models and inference algorithms ® t 3D Time images Femto-PhotographyTime Resolved Multi-path Imaging

Team Moungi G. Bawendi, Professor, Dept of Chemistry, MITJames Davis, UC Santa CruzAndreas Velten, Postdoctoral Associate, MIT Media LabRohitPandharkar, RA, MIT Media Lab Otkrist Gupta, RA, MIT Media LabAndrew Matthew Bardagjy, RA, MIT Media Lab Nikhil Naik, RA, MIT Media LabTyler Hutchison, RA, MIT Media LabEverett Lawson, MIT Media Lab Ramesh Raskar, Asso. Prof., MIT Media Lab Camera Culture Ramesh Raskar

Photos from Streak Camera Capture Setup Hidden Scene

Photos from Streak Camera Capture Setup Hidden Scene Overlay Reconstruction

Motion beyond line of sight Pandharkar, Velten, Bardagjy, Lawson, Bawendi, Raskar, CVPR 2011

…, bronchoscopies, … Participating Media

Photo First Bounce Later Bounces + Direct Global [Nayar, Krishnan, Grossberg, Raskar 2006]

Each frame = ~2ps = 0.6 mm of Light Travel

View Dependent Appearance and Iridescent color Cross section through a single M. rhetenor scale

Two Layer Displays barrier lenslet sensor/display sensor/display PB = dim displays Lenslets = fixed spatial and angular resolution Dynamic Masks = Brighter, High spatial resolution

Limitations of 3D Display Parallaxbarrier LCD display Front Back Lanman, Hirsch, Kim, RaskarSiggraph Asia 2010

Light Field Analysis of Barriers k L[i,k] i ` k g[k] i L[i,k] f[i] light box

Content-Adaptive Parallax Barriers L[i,k] ` k g[k] i f[i] light box

Implementation Components ,[object Object],[object Object]

Content-Adaptive Parallax Barriers ` =

Lanman, Hirsch, Kim, Raskar Siggraph Asia 2010 Rank-Constrained Displays and LF Adaptation ` Content-Adaptive Parallax Barriers = All dual layer display = rank-1 constraint Light field display is a matrix approximation problem Exploit content-adaptive parallax barriers

Optimization: Iteration 1 rear mask: f1[i,j] front mask: g1[k,l] reconstruction (central view) Daniel Lee and Sebastian Seung. Non-negative Matrix Factorization. 1999. Vincent Blondel et al. Weighted Non-negative Matrix Factorization. 2008.

Computational photography and inverse problems

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (11)

Similar to Computational photography and inverse problems

Similar to Computational photography and inverse problems (20)

More from Camera Culture Group, MIT Media Lab

More from Camera Culture Group, MIT Media Lab (20)

Recently uploaded

Recently uploaded (20)

Computational photography and inverse problems

Editor's Notes