Patent attributes
The present disclosure provides a method for compressing a feature map, where the feature map is generated by passing a first input through a deep neural network (DNN). The feature map is partitioned into super-blocks, and a respective optimal index order and a respective optimal unifying method are determined for each of the super-blocks. A selective structured unification (SSU) layer is subsequently determined based on the respective optimal index order and the respective optimal unifying method for each of the super-blocks. The SSU layer is added to the DNN to form an updated DNN and is configured to perform unification operations on the feature map. Further, a first estimated output is determined by passing the first input through the updated DNN.
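The per-super-block selection described above can be illustrated with a minimal Python sketch. It is not the disclosed implementation, and the abstract does not specify any of the following, so each is an assumption made for illustration: the feature map is a 2-D list of floats, super-blocks are square tiles, the candidate index orders are row-major and column-major flattening, the candidate unifying methods are "mean" (one shared value per block) and "abs_mean" (one shared magnitude with the original signs kept), and "optimal" means lowest squared error against the original tile. The names `ssu_layer`, `best_choice`, `ORDERS`, and `METHODS` are hypothetical. The sketch covers only the selection and unification steps, not adding the layer to a DNN or computing the first estimated output.

```python
from itertools import product

# Hypothetical candidate sets; the abstract does not enumerate them.
ORDERS = ("row", "col")
METHODS = ("mean", "abs_mean")

def flatten(tile, order):
    """Flatten a 2-D tile in row-major ("row") or column-major ("col") order."""
    if order == "row":
        return [v for row in tile for v in row]
    return [tile[r][c] for c in range(len(tile[0])) for r in range(len(tile))]

def unflatten(flat, rows, cols, order):
    """Inverse of flatten for a rows x cols tile."""
    tile = [[0.0] * cols for _ in range(rows)]
    it = iter(flat)
    if order == "row":
        for r in range(rows):
            for c in range(cols):
                tile[r][c] = next(it)
    else:
        for c in range(cols):
            for r in range(rows):
                tile[r][c] = next(it)
    return tile

def unify(block, method):
    """Replace a 1-D block by a unified version of itself."""
    if method == "mean":
        m = sum(block) / len(block)
        return [m] * len(block)
    # "abs_mean": shared magnitude, original signs preserved
    m = sum(abs(v) for v in block) / len(block)
    return [m if v >= 0 else -m for v in block]

def unify_tile(tile, order, method, block_len):
    """Flatten a tile, unify consecutive blocks of block_len values, reshape back."""
    rows, cols = len(tile), len(tile[0])
    flat = flatten(tile, order)
    out = []
    for i in range(0, len(flat), block_len):
        out.extend(unify(flat[i:i + block_len], method))
    return unflatten(out, rows, cols, order)

def sq_error(a, b):
    """Sum of squared differences between two equally shaped 2-D lists."""
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def best_choice(tile, block_len):
    """Pick the (order, method) pair with the lowest squared error (assumed criterion)."""
    return min(
        product(ORDERS, METHODS),
        key=lambda om: sq_error(tile, unify_tile(tile, om[0], om[1], block_len)),
    )

def ssu_layer(feature_map, sb, block_len):
    """Unify each sb x sb super-block with its own best (order, method) pair.

    Returns the unified feature map and a dict mapping each super-block's
    top-left corner to the chosen pair.
    """
    out = [row[:] for row in feature_map]
    choices = {}
    for r0 in range(0, len(feature_map), sb):
        for c0 in range(0, len(feature_map[0]), sb):
            tile = [row[c0:c0 + sb] for row in feature_map[r0:r0 + sb]]
            order, method = best_choice(tile, block_len)
            choices[(r0, c0)] = (order, method)
            unified = unify_tile(tile, order, method, block_len)
            for dr, urow in enumerate(unified):
                out[r0 + dr][c0:c0 + len(urow)] = urow
    return out, choices
```

For example, `ssu_layer([[1.0, 5.0], [1.0, 5.0]], 2, 2)` selects `("col", "mean")` for the single super-block, because column-major ordering groups the equal values together and the unification then introduces no error. Recording the chosen pair per super-block is one plausible way to parameterize an SSU layer; the disclosure may define it differently.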