lmbxmu / hrank
PyTorch implementation of our paper accepted by CVPR 2020 (Oral) -- HRank: Filter Pruning using High-Rank Feature Map
Home Page: https://128.84.21.199/abs/2002.10179
Thanks for your excellent work! When trying to reproduce HRank on other networks, I found an interesting result:
How do you handle comparing feature maps in later layers if you don't upsample the original input images? Wouldn't the feature maps at the last convolutional layers be too small?
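Not the authors' answer, but note that ranks are only compared among filters of the same layer, so the small-map cap mostly coarsens the scores within a late layer rather than breaking the comparison. A tiny numpy illustration (shapes are illustrative, not the repo's hook):

```python
import numpy as np

# The rank of an h x w feature map can never exceed min(h, w), so a
# 4 x 4 late-layer map yields scores in {0, ..., 4} while a 32 x 32
# early-layer map can score up to 32. HRank only ranks filters
# *within* a layer, where the cap is shared by all candidates.
rng = np.random.default_rng(0)
late_map = rng.standard_normal((4, 4))     # one channel, late layer
early_map = rng.standard_normal((32, 32))  # one channel, early layer
print(np.linalg.matrix_rank(late_map), np.linalg.matrix_rank(early_map))
# random dense maps are full-rank: 4 32
```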
Thanks for sharing. How do you determine the compress rate for different models or different target FLOPs?
It has been a while since HRank was published. Let me start by thanking you for sharing this interesting piece of work and bringing a novel perspective to the pruning realm. However, while trying to replicate the HRank results for benchmarking, we noticed the following issue.
From the following lines, it looks like at the start of every new epoch, the checkpoint with the best test accuracy is automatically loaded and training then resumes from it:
Lines 232 to 244 in 33050a1
Lines 305 to 306 in 33050a1
We also confirmed this empirically by checking the accuracy and printing out a portion of a conv tensor:
While I understand that it is common and acceptable practice to report the epoch with the best test accuracy [1], resuming every epoch from the checkpoint with the best test accuracy looks like a potential data leak, since test-set information is used to decide training operations. It also looks like HRank may perform reasonably well without this setting (i.e. by simply continuing from the latest epoch). Is this behaviour intentional or accidental?
[1] Li et al. Pruning Filters for Efficient ConvNets. ICLR 2017
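To make the concern above concrete, here is a toy sketch (plain Python, all names hypothetical) contrasting the two resume policies; only the "latest" policy keeps test accuracy out of the choice of starting weights:

```python
# Toy sketch of the two resume policies. The weight "state" is a string
# recording which epochs the final weights were actually trained through.
def run(policy, accs):
    latest, best, best_acc = "init", "init", -1.0
    for epoch, acc in enumerate(accs):
        start = best if policy == "best" else latest
        latest = f"{start}->e{epoch}"        # stand-in for one epoch of SGD
        if acc > best_acc:                   # test acc decides the checkpoint
            best_acc, best = acc, latest
    return latest

# With the best-acc reload, epoch 3 silently discards epoch 2's updates:
print(run("best",   [0.6, 0.7, 0.65, 0.6]))   # init->e0->e1->e3
print(run("latest", [0.6, 0.7, 0.65, 0.6]))   # init->e0->e1->e2->e3
```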
Could you explain Eq. (5) in the paper? Why is there no 1/g factor after the approximation sign for the expectation?
Can your compression method be used to compress other networks, e.g. HRNet?
I'm very interested in your research,
but I can't quite follow your code. Which part actually performs the pruning?
Hello!
After pruning with your code, does the generated model actually keep all channels, i.e., the model structure is not changed and the weights of the channels to be pruned are simply set to zero?
If so, does the saved pruned model occupy the same disk space as before pruning? And are the reported savings in params and FLOPs computed manually from the pruning rates?
Thanks! (I'm pruning ResNet myself and want the structure to actually change, but because of the downsample branches, pruning every layer is problematic and I don't know how to handle it, so I'd like to confirm whether your code implements full pruning that changes the model structure.)
Thanks!!
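If the zero-mask behaviour described above is indeed what happens, the storage effect is easy to demonstrate; a minimal numpy sketch (shapes illustrative, standing in for a conv weight of shape [out_ch, in_ch, k, k]):

```python
import numpy as np

# Stand-in for a conv weight tensor [out_ch=64, in_ch=32, 3, 3].
w = np.ones((64, 32, 3, 3), dtype=np.float32)

# Mask-style pruning: zero half the output channels. The shape is
# unchanged, so the saved file is the same size and the FLOPs/params
# savings must be derived from the pruning rate by hand.
masked = w.copy()
masked[32:] = 0.0

# Structural pruning: actually drop the channels -> a smaller tensor.
sliced = w[:32]

print(masked.nbytes, sliced.nbytes)  # 73728 36864
```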
Hello! I have two questions:
Hi, I used your cal_flops_params.py (in both HRank and HRankPlus) to measure the FLOPs and parameters of VGG-16 before pruning and got 314.572M (FLOPs) and 14.992M (Params), but the results in your paper are 313.73M (FLOPs) and 14.98M (Params). Could you explain the reason for the gap?
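As a sanity check, summing multiply-accumulates over just the 13 conv layers of the CIFAR-10 VGG-16 lands near the paper's figure; counters that additionally include bias, BN, or the classifier report slightly more, which is one plausible source of the 314.572M vs 313.73M gap (a hand sketch, not the repo's cal_flops_params.py):

```python
# Conv-only MAC count for the CIFAR VGG-16 (32x32 input, 3x3 convs
# with padding 1, 2x2 max-pools at the "M" markers; no BN/bias/FC).
cfg = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
       512, 512, 512, "M", 512, 512, 512]
flops, in_c, hw = 0, 3, 32
for v in cfg:
    if v == "M":
        hw //= 2                         # pooling halves each spatial dim
        continue
    flops += hw * hw * v * in_c * 3 * 3  # out_hw^2 * out_c * in_c * k^2
    in_c = v
print(flops / 1e6)  # ~313.2M -- close to the paper's 313.73M
```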
Hi, @lmbxmu, which file should I run first? rank_generation.py needs pre-trained weights (obtained by training with main.py), but main.py in turn needs rank_conv*.npy (generated by rank_generation.py)?
I'd like to use your code (rank_generation.py and main.py) for pruning with ResNet-50 on ImageNet.
However, it doesn't work and I get the following error:
File "<stdin>", line 1, in <module>
File "/ssd7/skyeom/anaconda3/envs/py38/lib/python3.8/site-packages/torch/serialization.py", line 585, in load
return _legacy_load(opened_file, map_location, pickle_module, **pickle_load_args)
File "/ssd7/skyeom/anaconda3/envs/py38/lib/python3.8/site-packages/torch/serialization.py", line 740, in _legacy_load
return legacy_load(f)
File "/ssd7/skyeom/anaconda3/envs/py38/lib/python3.8/site-packages/torch/serialization.py", line 669, in legacy_load
args = pickle_module.load(f, **pickle_load_args)
_pickle.UnpicklingError: invalid load key, '\x00'.
Is the .pth file (resnet50-19c8e357.pth) that we can download from the link the correct one?
Many thanks.
Best regards,
Seul-Ki
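Not an official answer, but "invalid load key" from torch.load usually means the bytes on disk are not a checkpoint at all: a truncated download, or an HTML error page saved by a browser or a Google Drive redirect. A small sketch for inspecting the file's leading bytes (the classification strings are my own, not from torch):

```python
def sniff(head: bytes) -> str:
    """Classify a checkpoint file from its first few bytes."""
    if head[:2] == b"PK":
        return "zip-format checkpoint (torch.save default since PyTorch 1.6)"
    if head[:1] == b"\x80":
        return "legacy pickle checkpoint"
    if head[:1] == b"<":
        return "HTML page -- the download likely failed"
    return "unknown (possibly truncated or corrupted)"

# A load key of '\x00', as in the traceback above, falls in the last
# bucket: the bytes are not a pickle at all.
print(sniff(b"\x00\x00\x00\x00"))
# Usage against a real file (path illustrative):
# with open("resnet50-19c8e357.pth", "rb") as f:
#     print(sniff(f.read(4)))
```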
Hi,
I read your paper at CVPR this year and found this excellent work. Thanks!
I am curious how this method works on detection models like YOLO or Faster R-CNN. Do they achieve similar results?
By the way, could you share the model for detection?
Thanks
Hello!
Regarding Fig. 2 in your paper, is my understanding correct?
"The x-axis is the index of the feature maps output by the current convolutional layer after the BN layer and then ReLU; the y-axis is the 1st, 5th, 10th, 20th, 30th, 40th and 50th batch; and the color intensity represents the magnitude of the feature maps' rank."
Also, may I ask for the code you used to draw Fig. 2? Many thanks!!
Recently I saw this excellent work at CVPR. I would like to ask how to prune my own object-tracking model with your pruning algorithm. It would be great if there were a project for pruning a tracking algorithm. Thank you!
Hi,
Thanks for your awesome work. I was wondering: the per-layer pruning rates are hard-coded on the command line in your examples, but is there a non-manual way to determine them? Is there any code in your repo dedicated to searching for these per-layer pruning rates, or did you tune them manually?
Thanks.
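The repo question aside, one simple automated alternative can be sketched as a greedy search that keeps raising the rate of the currently most expensive layer until a FLOPs budget is met (all numbers illustrative; this is not code from the repo):

```python
# Per-layer FLOPs of an imaginary 4-layer net, and a target budget.
layer_flops = [40e6, 80e6, 60e6, 20e6]
rates = [0.0] * len(layer_flops)       # fraction pruned per layer
budget = 120e6

def total(rates):
    """Remaining FLOPs under the current per-layer pruning rates."""
    return sum(f * (1 - r) for f, r in zip(layer_flops, rates))

while total(rates) > budget:
    # Pick the layer that currently costs the most and prune it harder.
    i = max(range(len(rates)),
            key=lambda j: layer_flops[j] * (1 - rates[j]))
    rates[i] = min(rates[i] + 0.1, 0.9)  # cap the per-layer rate

print([round(r, 1) for r in rates], total(rates) / 1e6)
```

Rates found this way would still need fine-tuning runs to validate, since FLOPs alone says nothing about accuracy.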
@lmbxmu
Hi, thank you for your work, it's great!
I want to ask: can detection models (YOLO, SSD) use your code as-is? Does it work the same way?
Why is it that for a model I trained myself, when I use the hook in your code to compute ranks, the feature-map ranks of all channels come out nearly identical, rather than showing the clear high-to-low ordering seen in the .npy files you saved? What could be causing this?
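For context, the rank statistic can be reproduced outside the hook with numpy (a sketch with illustrative shapes, not the repo's code). An h x w map's rank is capped at min(h, w), so if most channels produce dense activations that saturate the cap, the per-channel averages will all look alike:

```python
import numpy as np

def channel_ranks(batch):
    """Average per-channel feature-map rank over a batch [n, c, h, w]."""
    n, c, h, w = batch.shape
    return np.array([
        np.mean([np.linalg.matrix_rank(batch[i, j]) for i in range(n)])
        for j in range(c)
    ])

rng = np.random.default_rng(0)
fmap = rng.standard_normal((8, 4, 16, 16))  # dense random activations
print(channel_ranks(fmap))  # dense maps saturate the cap: all 16.0
```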
Hi~Thanks for your great work!
In main.py, the code:
if args.arch == 'resnet_50':
    skip_list = [1, 5, 8, 11, 15, 18, 21, 24, 28, 31, 34, 37, 40, 43, 47, 50, 53]
    if cov_id + 1 not in skip_list:
        continue
    else:
        pruned_checkpoint = torch.load(
            args.job_dir + "/pruned_checkpoint/" + args.arch + "_cov" + str(53) + '.pt')
        net.load_state_dict(pruned_checkpoint['state_dict'])
Why does the model load cov53.pt?
Hello, I'd like to ask: when I specify --gpu 0,1 in the terminal, why does
checkpoint = torch.load(args.resume, map_location='cuda'+args.gpu)
throw the exception: invalid literal for int() with base 10: '0,1'?
How should I use this argument to run the file on two GPUs?
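A likely fix (a sketch; `args.gpu` and the DataParallel wiring are assumptions based on the error message, not the repo's exact code): map the checkpoint onto the first listed device and pass the full list to DataParallel. Note that 'cuda'+args.gpu also lacks the ':' separator, so it would not be a valid device string even with a single GPU:

```python
gpu = "0,1"                         # stand-in for args.gpu
first = gpu.split(",")[0]
map_location = f"cuda:{first}"      # valid single-device string
device_ids = [int(g) for g in gpu.split(",")]
print(map_location, device_ids)     # cuda:0 [0, 1]

# Then (standard torch APIs, shown as usage):
# checkpoint = torch.load(args.resume, map_location=map_location)
# net = torch.nn.DataParallel(net, device_ids=device_ids)
```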
Hello, could you upload the pre-trained models you provide to Baidu Cloud? I'm in mainland China and cannot log in to Google, so I can't download them. If that's too much trouble, could you send me the ResNet-56 model first? Thanks!
During pruning and fine-tuning, do you keep the learning-rate setting consistent with the baseline? For example, is it 0.01, as in normal VGG-16 training?
Hello!
For VGG-16, the setting given in your README is compress_rate [0.95]+[0.5]*6+[0.9]*4+[0.8]*2, and the FLOPs and params I compute with your script match the numbers in the README.
But when I verified it by hand (counting the convolutional and fully connected layers), the FLOPs I got were much lower (about 42M, roughly 60M less). I'm not sure where the discrepancy comes from; could I trouble you to check it by hand or with another tool?
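As a cross-check, a hand count of the pruned net's conv MACs can be sketched like this (assumptions: each README rate is the fraction of filters pruned in that conv layer, kept channels are rounded with int(), and FC/BN/bias terms are ignored; the repo's exact rounding and counting conventions may differ, which alone can shift the total):

```python
# CIFAR VGG-16 conv config (32x32 input) with the README's rates applied.
cfg = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
       512, 512, 512, "M", 512, 512, 512]
rates = [0.95] + [0.5] * 6 + [0.9] * 4 + [0.8] * 2  # one per conv layer

flops, in_c, hw, i = 0, 3, 32, 0
for v in cfg:
    if v == "M":
        hw //= 2                               # 2x2 max-pool
        continue
    out_c = v - int(v * rates[i])              # channels kept (assumed rounding)
    flops += hw * hw * out_c * in_c * 3 * 3    # 3x3 conv MACs
    in_c, i = out_c, i + 1
print(flops / 1e6)  # conv-only MACs of the pruned net, in millions
```

Comparing this figure against both your hand result and the script's output should show whether the gap comes from the rounding convention or from terms (FC, BN, bias) that one count includes and the other omits.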