Paper review: "NISP: Pruning Networks using Neuron Importance Score Propagation"
Presented at the TensorFlow-KR paper review forum (#PR12) by Taesu Kim
Paper link: https://arxiv.org/abs/1711.05908
Video link: https://youtu.be/3KoqN_yYhmI (in Korean)
PR12-193 NISP: Pruning Networks using Neuron Importance Score Propagation
1. PR-12 presentation
NISP: Pruning Networks using Neuron Importance Score Propagation
CVPR 2018
Authors: Ruichi Yu et al.
Presented by Taesu Kim
2. Motivation
• Pruning
• Previous approaches
• Focus on the statistics of a single layer or two layers
• Greedy pruning
• But the entire network works as a whole
• Pruning errors propagate, especially when the network is deep
3. Motivation
• Entire CNN is a set of feature extractors
• The final responses are the extracted features
• We measured the importance of the neurons across the entire CNN based on
a unified goal
• Minimizing the reconstruction errors of (important) final responses
4. Approach
• Feature ranking on the final response layer
• NISP: Neuron Importance Score Propagation
• Pruning network using NISP
• Fine-tune the pruned network
5. NISP: Objective function
• s^{(F)}: importance scores of the final response layer, obtained by feature ranking
• q_l: the number of neurons to keep in the l-th layer
• a binary vector s^{(l)}: neuron prune indicator for the l-th layer (1 = keep, 0 = prune)
• f_j(x): the j-th final response of the original network
• \tilde{f}_j(x; s^{(l)}): the same response when the l-th layer is pruned according to s^{(l)}
• Objective: minimize the weighted l1 distance between the original and the pruned final responses, subject to \|s^{(l)}\|_0 = q_l
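Putting the definitions together, the pruning objective can be written roughly as follows. This is my own transcription of the formulation with slightly simplified notation, so the exact symbols may differ from the paper:

```latex
\[
\min_{s^{(l)} \in \{0,1\}^{N_l}}
  \sum_{x \in \mathcal{X}} \sum_{j} s^{(F)}_{j}
  \left| f_{j}(x) - \tilde{f}_{j}\!\left(x;\, s^{(l)}\right) \right|
\qquad \text{s.t.} \quad \left\| s^{(l)} \right\|_{0} = q_{l}
\]
```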
6. Solution
• The network pruning problem can be formulated as a binary integer program
• Finding the optimal neuron prune indicator s
• It is hard to obtain efficient analytical solutions by directly optimizing the objective
• a sub-optimal solution can be obtained by minimizing the upper bound
• Minimizing the upper bound gives a closed-form propagation rule: s^{(l)} = |W^{(l+1)}|^{\top} s^{(l+1)}
• Assume the activation function \sigma is Lipschitz continuous: identity, ReLU, sigmoid, tanh, etc.
• Lipschitz continuous if there is a constant C \ge 0 such that |\sigma(x) - \sigma(y)| \le C |x - y| for all x, y
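To make the propagation rule concrete, here is a small NumPy sketch. It is my own toy illustration (random weights, random final-response scores), not the authors' code; the 15%/25% ratios only echo the per-layer settings used later in the experiments.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy fully connected network: 16 -> 12 -> 8 (final response layer).
# Each W maps one layer to the next and has shape (out, in).
W1 = rng.standard_normal((12, 16))
W2 = rng.standard_normal((8, 12))

# Importance scores of the final response layer, normally obtained by
# feature ranking (the paper uses Inf-FS [34]); random placeholders here.
s_final = rng.random(8)

# NISP propagation: s^(l) = |W^(l+1)|^T s^(l+1).
# A neuron scores high if it feeds high-scoring neurons in the next layer,
# weighted by the absolute connection weights.
s_hidden = np.abs(W2).T @ s_final   # scores for the 12 hidden neurons
s_input = np.abs(W1).T @ s_hidden   # scores for the 16 input neurons

def keep_mask(scores, prune_ratio):
    """Binary prune indicator: keep the (1 - prune_ratio) highest-scoring neurons."""
    keep = max(1, int(round(len(scores) * (1.0 - prune_ratio))))
    order = np.argsort(scores)[::-1]
    mask = np.zeros(len(scores), dtype=bool)
    mask[order[:keep]] = True
    return mask

# e.g. pruning 15% or 25% of the neurons in the hidden layer
print(keep_mask(s_hidden, prune_ratio=0.15))
print(keep_mask(s_hidden, prune_ratio=0.25))
```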
10. Experiments
• Comparison with random pruning and training-from-scratch baseline
• randomly pruning the pre-trained CNN and then fine-tuning
• training a small CNN with the same number of neurons/filters per layer as our pruned model
11. Experiments
• Feature selection vs. Magnitude of weights
• NISP-FS: using the feature selection method in [34]
• NISP-Mag: considering only the magnitude of weights (see the sketch below)
[34] Infinite feature selection, G. Roffo et al., ICCV 2015
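For contrast with NISP-FS, the NISP-Mag baseline ranks filters purely by the magnitude of their own weights, with no signal propagated back from the final responses. Below is a minimal sketch of that kind of magnitude scoring; it is my own illustration, and the exact norm used in the paper may differ.

```python
import numpy as np

def magnitude_scores(W):
    """Score each output neuron/filter by the L1 norm of its incoming weights.
    W has shape (out, in) for a fully connected layer, or
    (out_channels, in_channels, kH, kW) for a convolutional layer."""
    return np.abs(W).reshape(W.shape[0], -1).sum(axis=1)

rng = np.random.default_rng(0)
conv_w = rng.standard_normal((8, 4, 3, 3))  # toy conv layer with 8 filters
print(magnitude_scores(conv_w))             # one score per filter
```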
13. Experiments
• Comparison with existing methods
[11] Acceleration through elimination of redundant convolutions, M. Figurnov et al., NIPS 2016
[20] Compression of deep convolutional neural networks for fast and low power mobile applications, Y. Kim et al., ICLR 2016
[36] Learning the architecture of deep neural networks, S. Srinivas et al., BMVC 2016
[25] Pruning filters for efficient convnets, H. Li et al., ICLR 2017
[29] ThiNet: A filter level pruning method for deep neural network compression, J.-H. Luo et al., ICCV 2017
NISP-A: pruning all conv layers
NISP-B: pruning all conv layers except conv5
NISP-C: pruning all conv layers except conv5, conv4
NISP-D: pruning all conv layers except conv2, conv3, FC6
NISP-x-A: prune 15% of the filters in each layer
NISP-x-B: prune 25% of the filters in each layer
14. Conclusion
• Generic framework for network compression and acceleration based on identifying
the importance levels of neurons
• Neuron importance scores in the layer of interest are obtained by feature ranking
• Formulate the network pruning problem as a binary integer program
• Obtain a closed-form solution to a relaxed version of the formulation
• The NISP algorithm propagates the importance scores to every neuron in the whole network
• It efficiently reduces CNN redundancy and achieves full-network acceleration and
compression