This document summarizes a method for single-view 3D reconstruction based on differentiable ray sampling. It reviews prior work that relies on 3D or 2D supervision and the limitations of each approach. The proposed method uses a neural 3D representation that maps 3D coordinates to occupancy, and it introduces differentiable ray sampling so that the model can be trained end-to-end using only 2D images. Results on cars and chairs show accuracy similar to or better than prior work, with constant memory usage at high resolutions.
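
To make the idea of ray sampling over an occupancy function concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the method's actual implementation: `soft_sphere_occupancy` is a hypothetical stand-in for the learned network, the even spacing of samples is an arbitrary choice, and the aggregation rule (one minus the product of per-sample miss probabilities) is a common choice assumed here, since the summary does not specify one. Every step is a smooth function of the occupancy values, so in an autodiff framework a 2D silhouette loss on the result could propagate gradients back to the occupancy function.

```python
import math

def soft_sphere_occupancy(point, radius=0.5, sharpness=20.0):
    """Hypothetical stand-in for a learned network: maps (x, y, z) to occupancy in (0, 1)."""
    dist = math.sqrt(sum(c * c for c in point))
    return 1.0 / (1.0 + math.exp(-sharpness * (radius - dist)))

def sample_ray(origin, direction, near, far, num_samples):
    """Return evenly spaced 3D points along the ray between depths near and far."""
    norm = math.sqrt(sum(d * d for d in direction))
    unit = [d / norm for d in direction]
    points = []
    for i in range(num_samples):
        # Midpoint of the i-th of num_samples equal depth intervals.
        t = near + (far - near) * (i + 0.5) / num_samples
        points.append(tuple(o + t * u for o, u in zip(origin, unit)))
    return points

def ray_silhouette(occupancy_fn, origin, direction, near=0.0, far=4.0, num_samples=64):
    """Assumed aggregation: probability that at least one sample on the ray is occupied."""
    miss_probability = 1.0
    for p in sample_ray(origin, direction, near, far, num_samples):
        miss_probability *= 1.0 - occupancy_fn(p)
    return 1.0 - miss_probability

# A ray through the sphere's center versus a ray that passes well outside it.
hit_value = ray_silhouette(soft_sphere_occupancy, (0.0, 0.0, -2.0), (0.0, 0.0, 1.0))
miss_value = ray_silhouette(soft_sphere_occupancy, (2.0, 0.0, -2.0), (0.0, 0.0, 1.0))
```

In this sketch the work per ray depends only on `num_samples`, because the occupancy function is queried at sampled points and no dense voxel grid is stored. That is one plausible reading of the constant memory usage mentioned above, not a confirmed detail of the method.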