DeepFix: a fully convolutional neural network for predicting human fixations (UPC Reading Group)
1. DeepFix: A Fully Convolutional Neural Network for Predicting Human Fixations
Srinivas S S Kruthiventi, Kumar Ayush, and R. Venkatesh Babu
(arXiv October 2015) [URL]
Slides by Xavier Giró-i-Nieto, from the Computer Vision Reading Group. (27/10/2015)
https://imatge.upc.edu/web/teaching/computer-vision-reading-group
11. Very deep network
Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014)
● Inspired by Oxford’s VGG net (19 layers).
● 20 layers
● Small kernel sizes.
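The appeal of small kernels in a very deep network can be checked with simple arithmetic. The sketch below is an illustration, not code from the DeepFix paper: it assumes stride-1 convolutions without dilation, ignores biases, and uses a hypothetical width of 64 channels. Under those assumptions, three stacked 3x3 layers see the same 7x7 window as a single 7x7 layer while using fewer weights.

```python
def stacked_receptive_field(kernel_size, num_layers):
    """Receptive field of num_layers stacked stride-1, undilated convolutions."""
    return 1 + num_layers * (kernel_size - 1)


def conv_weights(kernel_size, channels, num_layers=1):
    """Weight count (biases ignored) when every layer maps channels -> channels."""
    return num_layers * kernel_size * kernel_size * channels * channels


channels = 64  # hypothetical layer width, chosen only for illustration

rf = stacked_receptive_field(kernel_size=3, num_layers=3)
stacked = conv_weights(kernel_size=3, channels=channels, num_layers=3)
single = conv_weights(kernel_size=7, channels=channels)

print(rf)       # 7
print(stacked)  # 110592
print(single)   # 200704
```

The stacked version also applies a non-linearity after each layer, which is part of the argument made in the VGG paper cited above.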
12. Fully convolutional network (FCN)
● Fully connected layers at the end are replaced by convolutional layers with very large receptive fields.
● They capture the global context of the scene.
● End-to-end training
Long, J., Shelhamer, E., & Darrell, T. (2015). Fully Convolutional Networks for Semantic Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3431-3440)
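A minimal sketch of the general idea behind swapping a fully connected layer for a convolution, in the spirit of Long et al. It is not the DeepFix implementation: it uses a single channel, pure-Python lists, and made-up weights. A fully connected layer over a 2x2 feature map is the same computation as a 2x2 convolution over it, and the convolution can then slide over a larger input to give one response per location.

```python
def fully_connected(feature_map, weights):
    """Dot product of a flattened 2-D feature map with a weight vector."""
    flat = [v for row in feature_map for v in row]
    return sum(v * w for v, w in zip(flat, weights))


def conv2d_valid(image, kernel):
    """Single-channel, stride-1 cross-correlation with no padding."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + a][j + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


# Hypothetical weights: the same four numbers as a vector and as a 2x2 kernel.
fc_weights = [1, 0, -1, 2]
kernel = [[1, 0], [-1, 2]]

small = [[1, 2], [3, 4]]
fc_out = fully_connected(small, fc_weights)
conv_small = conv2d_valid(small, kernel)

# On a larger input the convolution yields a spatial map instead of one value.
large = [[1, 2, 0], [3, 4, 1], [0, 1, 2]]
conv_large = conv2d_valid(large, kernel)

print(fc_out)      # 6
print(conv_small)  # [[6]]
print(conv_large)  # [[6, 0], [5, 7]]
```

Because the output keeps its spatial layout, a network built this way can emit a dense map (such as a saliency map) for the whole image in a single forward pass.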
13. Inception layers
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., ... & Rabinovich, A. (2015). Going Deeper With Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1-9)
● GoogLeNet
● Different kernel sizes operating in parallel.
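A simplified sketch of kernels of different sizes operating in parallel on the same input. It is an assumption-laden illustration, not GoogLeNet's or DeepFix's actual module: it is single-channel, uses hypothetical all-ones kernels of size 1, 3 and 5, and omits the 1x1 reductions and pooling branch of the real Inception module. Zero padding keeps every branch output the same size as the input, so the branches can be stacked as channels.

```python
def conv2d_same(image, kernel):
    """Single-channel cross-correlation with zero padding (output size = input size)."""
    k = len(kernel)  # kernel is assumed square with odd side length
    pad = k // 2
    h, w = len(image), len(image[0])
    out = []
    for i in range(h):
        row = []
        for j in range(w):
            total = 0
            for a in range(k):
                for b in range(k):
                    y, x = i + a - pad, j + b - pad
                    if 0 <= y < h and 0 <= x < w:  # outside the image counts as zero
                        total += image[y][x] * kernel[a][b]
            row.append(total)
        out.append(row)
    return out


def inception_like(image, kernels):
    """Run every kernel on the same input and stack the results as channels."""
    return [conv2d_same(image, kernel) for kernel in kernels]


def ones(k):
    """Hypothetical k x k kernel filled with ones, used only for illustration."""
    return [[1] * k for _ in range(k)]


image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
branches = inception_like(image, [ones(1), ones(3), ones(5)])

print(len(branches))      # 3
print(branches[0])        # [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(branches[1][1][1])  # 45
print(branches[2][0][0])  # 45
```

Each branch summarises the neighbourhood of a pixel at a different scale: the 1x1 branch sees only the pixel itself, while the 5x5 branch here covers the entire 3x3 toy image from every position.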