Neural Behavior-Based Approach for Neural Network Pruning

Koji KAMMA  Yuki ISODA  Sarimu INOUE  Toshikazu WADA  

IEICE TRANSACTIONS on Information and Systems   Vol.E103-D   No.5   pp.1135-1143
Publication Date: 2020/05/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2019EDP7177
Type of Manuscript: PAPER
Category: Biocybernetics, Neurocomputing
neuro-coding,  neuro-unification,  

Full Text: PDF(576.6KB)>>
Buy this Article

This paper presents a method for reducing the redundancy in both fully connected layers and convolutional layers of trained neural network models. The proposed method consists of two steps, 1) Neuro-Coding: to encode the behavior of each neuron by a vector composed of its outputs corresponding to actual inputs and 2) Neuro-Unification: to unify the neurons having the similar behavioral vectors. Instead of just pruning one of the similar neurons, the proposed method let the remaining neuron emulate the behavior of the pruned one. Therefore, the proposed method can reduce the number of neurons with small sacrifice of accuracy without retraining. Our method can be applied for compressing convolutional layers as well. In the convolutional layers, the behavior of each channel is encoded by its output feature maps, and channels whose behaviors can be well emulated by other channels are pruned and update the remaining weights. Through several experiments, we comfirmed that the proposed method performs better than the existing methods.