shiqiyu / libfacedetection.train

The training program for libfacedetection for face detection and 5-landmark detection.

License: Apache License 2.0

Python 99.86% Shell 0.14%

libfacedetection.train's Issues

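Is there code for real-time detection from a streaming camera?

I see that detect.py only handles images and videos. I tried real-time detection on a camera stream, but it did not work. Could you provide code for this?
Thanks, I hope to get a reply.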
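
A minimal sketch of a frame loop for a camera or network stream, assuming OpenCV (`cv2`) is installed. `parse_source`, `run_stream`, and the `detect` callable are hypothetical names for illustration and are not part of this repository; `detect` is assumed to return `(x, y, w, h)` boxes for a BGR frame.

```python
from typing import Callable, Union


def parse_source(source: str) -> Union[int, str]:
    """Treat an all-digit string as a local camera index, anything else as a URL or path."""
    return int(source) if source.isdigit() else source


def run_stream(source: str, detect: Callable, window: str = "faces") -> None:
    """Read frames from a camera or stream and draw the boxes returned by `detect`."""
    import cv2  # imported lazily so parse_source works without OpenCV

    cap = cv2.VideoCapture(parse_source(source))
    if not cap.isOpened():
        raise RuntimeError(f"cannot open video source: {source}")
    try:
        while True:
            ok, frame = cap.read()
            if not ok:  # stream ended or a frame could not be read
                break
            # `detect` is a user-supplied (hypothetical) function: frame -> [(x, y, w, h), ...]
            for (x, y, w, h) in detect(frame):
                cv2.rectangle(frame, (int(x), int(y)),
                              (int(x + w), int(y + h)), (0, 255, 0), 2)
            cv2.imshow(window, frame)
            if cv2.waitKey(1) & 0xFF == ord("q"):  # press q to quit
                break
    finally:
        cap.release()
        cv2.destroyAllWindows()
```

Calling `run_stream("0", detect)` would open the first local camera, while passing a stream URL string would hand that URL to `cv2.VideoCapture` unchanged.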

Hello Professor, I followed the training instructions in your README but got the error below. Do you know how to fix it?
Printing net...
Loading Dataset...
Printing net...
Printing net...
Printing net...
Printing net...
Printing net...
Printing net...
Printing net...
Printing net...
Traceback (most recent call last):
  File "train.py", line 198, in <module>
    train()
  File "train.py", line 133, in train
    images, targets = next(batch_iterator)
  File "C:\ProgramData\Anaconda3\envs\ozl\lib\site-packages\torch\utils\data\dataloader.py", line 637, in __next__
    return self._process_next_batch(batch)
  File "C:\ProgramData\Anaconda3\envs\ozl\lib\site-packages\torch\utils\data\dataloader.py", line 658, in _process_next_batch
    raise batch.exc_type(batch.exc_msg)
AttributeError: Traceback (most recent call last):
  File "C:\ProgramData\Anaconda3\envs\ozl\lib\site-packages\torch\utils\data\dataloader.py", line 138, in _worker_loop
    samples = collate_fn([dataset[i] for i in batch_indices])
  File "C:\ProgramData\Anaconda3\envs\ozl\lib\site-packages\torch\utils\data\dataloader.py", line 138, in <listcomp>
    samples = collate_fn([dataset[i] for i in batch_indices])
  File "F:\ozl\facenet\libfacedetection.train\src\data.py", line 325, in __getitem__
    height, width, _ = img.shape
AttributeError: 'NoneType' object has no attribute 'shape'
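
A hedged diagnostic sketch: `cv2.imread` returns `None` instead of raising when it cannot read a file, so if `img` in `data.py` comes from `cv2.imread` (an assumption, the loader code is not shown here), a wrong dataset path would produce exactly this `AttributeError`. The helper below is hypothetical and only checks that each listed image path exists and is not empty before training starts.

```python
import os
from typing import Iterable, List


def find_unreadable_images(paths: Iterable[str]) -> List[str]:
    """Return the paths that are missing or point to empty files."""
    bad = []
    for path in paths:
        # A missing or zero-byte file cannot be decoded into an image.
        if not os.path.isfile(path) or os.path.getsize(path) == 0:
            bad.append(path)
    return bad
```

Running this over the image paths listed in the annotation file and printing the result should show quickly whether the dataset root is set correctly.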

Question about training the model

How to get the face_rect_landmark.py? Thank you!

How does this module compare with ordinary face detection?

Cannot download labelsv2

Hello!
I followed the suggested link to download "labelsv2", but after entering the suggested password I cannot download the file directly because some action with the QR code is required (I don't speak Chinese).
What should I do?

Hello, can you write this training code in C++?

https://github.com/ShiqiYu/libfacedetection.train

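The model is 640x640; why is there no error when converting to ONNX with a different input size?

The model is 640x640, but when I change the size to 641 during ONNX conversion, the conversion still succeeds. Why is that? Isn't the size fixed at 640x640?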

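The ONNX model does not work properly with onnxruntime-web in the front end

Hello Prof. Yu, after downloading your open-source project I tried to deploy the ONNX model in Vue for a demo. I am currently using Node.js 1.16.1 and the latest onnxruntime, 1.12.1. In the process I found two problems with the ONNX model shipped with the project:
1. INT64 has to be converted to INT32. This is fairly easy to handle: keeping opset_version and the other parameters unchanged, I converted the model's params and nodes to INT32.
2. After that, I found that the provided ONNX model still does not seem to be compatible with JS. The specific error is shown in the screenshot below:

I also tried rewriting the upsampling method in your code so that it performs no operation and directly returns a torch.ones() tensor of the upsampled size, but the front end still reports a shape error. Is there a good way, or any suggestion, to solve this?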

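Prof. Yu, from the source code I see that the image is converted to 27 channels before being fed to the network. I need to train on single-channel images; how should I modify this?

    //only 27 elements used for each pixel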
    create((imgHeight+1)/2, (imgWidth+1)/2, 32);
    //since the pixel assignment cannot fill all the elements in the blob.
    //some elements in the blob should be initialized to 0
    setZero();
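
A small sketch of the arithmetic behind these numbers, under the assumption (not confirmed by the snippet above) that the first layer is a 3x3, stride-2, padding-1 convolution unrolled per output pixel: 3 x 3 x 3 channels gives 27 values, stored in a 32-element slot. The function name is hypothetical.

```python
def im2col_shape(height, width, channels, kernel=3, stride=2, pad=1):
    """Output size and values per pixel when a convolution is unrolled per output pixel."""
    out_h = (height + 2 * pad - kernel) // stride + 1  # equals (height + 1) // 2 for the defaults
    out_w = (width + 2 * pad - kernel) // stride + 1
    used = kernel * kernel * channels  # values actually filled for each output pixel
    return out_h, out_w, used
```

Under the same assumption, a single-channel input would fill 3 x 3 x 1 = 9 values per pixel instead of 27, so both the unrolling code and the first-layer weights would have to change together.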

[Docs] Visualization of network architecture points to older onnx file link

Issue:

Link for visualization of YuNet architecture from the README file does not point to the correct onnx file. The current link points to the onnx file based on the previous directory structure.

Screenshot:

Possible Fix:

The README should be updated to point to the latest onnx file (one of the 3 onnx files currently present).

Can I work on it and raise a PR?

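Does it support detecting faces wearing masks?

I use this model in OpenCV, and for images of people wearing masks it reports that no face is detected.

Training error: from data import FaceRectLMDataset, detection_collate

What are the two modules imported from data, and where can I find them?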

ModuleNotFoundError: No module named 'resource'

How can I solve this problem? I'm using python3.6.5 torch 1.5.0+cuda92
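
For context, `resource` is a POSIX-only standard-library module, so this import fails on Windows. A minimal sketch of guarding it follows; what the training script actually uses `resource` for is not stated here, so the file-limit call below is only an illustrative assumption, and `raise_open_file_limit` is a hypothetical helper.

```python
try:
    import resource  # POSIX-only standard-library module; absent on Windows
except ImportError:
    resource = None


def raise_open_file_limit(target: int = 4096) -> bool:
    """Try to raise the soft open-file limit; return False where unsupported."""
    if resource is None:
        return False  # e.g. on Windows: skip silently
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    if hard != resource.RLIM_INFINITY:
        target = min(target, hard)  # the soft limit cannot exceed the hard limit
    if soft != resource.RLIM_INFINITY and soft < target:
        resource.setrlimit(resource.RLIMIT_NOFILE, (target, hard))
    return True
```

With a guard like this, the script can keep running on platforms without `resource` and simply skip the limit adjustment.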

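Is there a plan to add eval.py?

Hello. The code currently supports training for a fixed number of iterations. Without a validation set for early stopping, I think over-fitting will be hard to catch. Is there a plan to add eval.py later? I wrote my own eval.py, but I am not sure whether it is correct.
Best regards.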

How to train with custom dataset by using the pretrained model?

Hello,
I would like to ask a few questions about this github repo.

What do I need to do to train using the pretrained model?
How can I create my own custom dataset other than Wider-Face? Is there an annotation tool you recommend that does annotation in the same format?
What should I do to train with the dataset I created?
How can I convert the model to onnx after training?

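Prof. Yu, how should ConvDPUnit in the network structure be understood?

Prof. Yu, regarding the convolution part of ConvDPUnit, shown in the code below:

Why is no pointwise operation used for dimensionality reduction? What effect does this have on speed and accuracy? Thank you.

Problem when verifying PyTorch inference against ONNX inference during ONNX conversion

python tools/yunet2onnx.py ./configs/yunet_n.py ./weights/yunet_n.pth --verify true
When converting to ONNX, a problem occurs while verifying PyTorch model inference against ONNX inference.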

Can't achieve widerface val score shown in the README when training from scratch.

I trained this network using the default parameters in train.py, then used final.pth to test on the widerface val set with scales=[1.], confidence_threshold=0.3.
The scores I get are
Easy Val AP: 0.7902144778486827
Medium Val AP: 0.7513014930849016
Hard Val AP: 0.5267147634247762
Do you use some other tricks while training?

Low GPU utilization

Why is GPU utilization so low during my training? My CUDA environment is configured correctly.

Fine-tuned model license

Is the fine-tuned model covered by MIT license as well? If so, was it trained on WIDER FACE or another dataset?
Asking because your C++ code is BSD-3-clause, and int8data.cpp is a direct derivative of the fine-tuned model.

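The inference result seems wrong

Hello Prof. Yu,
I ran the default command python tools/detect_image.py ./configs/yunet_n.py ./weights/yunet_n.pth ./image.jpg to detect faces in an image, and the result seems wrong. The result is as follows: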

Training speed is very slow on multi gpus

Hi, I am trying to reproduce your experiment by running your train.py on one GPU and on four GPUs. It turns out that four GPUs take longer than one: 150+ hours versus 60+ hours. I am using 2080Ti on ubuntu 16, cuda 10.1, nvidia 430, pytorch 1.2.0. Did you encounter the same problem on your side? Thanks.

How to generate json files for NVIDIA dali?

Hi,
I manually label custom datasets following the COCO format.
May I ask whether you could publish the code to convert to the NVIDIA DALI format (which is used in this repo)?
Thanks

new annotation format

Hello, I would like to ask how the new annotation txt file is written. From my understanding, the 1st line is the file name and the 2nd line is the number of faces. However, the 3rd/4th lines have 15 numbers. I assume the last number is visibility or opacity, but I do not know what the other 14 numbers represent, because I assumed there were 5 facial landmarks and thus 10 values.

Thank you
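
One plausible reading of the 15 numbers, offered only as an assumption that the repository would need to confirm, is 4 bounding-box values followed by 5 landmark (x, y) pairs and one trailing flag (4 + 10 + 1 = 15). The hypothetical parser below follows the layout described in the question (file name, face count, then one line per face) under that assumption.

```python
from typing import Dict, List


def parse_annotations(text: str) -> Dict[str, List[dict]]:
    """Parse 'name / count / one 15-number line per face' records.

    Assumed (unconfirmed) column layout: 4 bbox values, 5 landmark pairs, 1 flag.
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    result: Dict[str, List[dict]] = {}
    i = 0
    while i < len(lines):
        name = lines[i]
        count = int(lines[i + 1])
        faces = []
        for row in lines[i + 2 : i + 2 + count]:
            vals = [float(v) for v in row.split()]
            if len(vals) != 15:
                raise ValueError(f"expected 15 numbers, got {len(vals)} in {name}")
            faces.append({
                "bbox": vals[:4],
                "landmarks": list(zip(vals[4:14:2], vals[5:14:2])),
                "flag": vals[14],
            })
        result[name] = faces
        i += 2 + count
    return result
```

If the real files use a different column order, only the three slices inside the loop would need to change.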

yunet_yunet_final_320_320_simplify.onnx not working.

Hello, I'm kijoong lee, a LG Electronics SW Developer.

We are developing a TFLite-based hardware-accelerated AI inference framework on webOS.

Recently, we judged YuNet to be the most suitable for face detection models through benchmarks. And, by converting the face_detection_yunet_2022mar.onnx model included in opencv dnn into a tflite model, a face detector with good performance was obtained. For reference, we used the xnnpack accelerated method.

However, we need a model larger than 160x120 that can be accelerated by GPU (or NPU), so we tried to convert and use the model included in https://github.com/ShiqiYu/libfacedetection.train/tree/master/onnx, but it didn't work.

The reasons we analyzed are as follows.

(face_detection_yunet_2022mar_float32.tflite)

(yunet_yunet_final_320_320_simplify_float32.tflite)

As you can see in the two figures above, the output shapes of the two models are different. A well-behaved model includes a reshaping part into a two-dimensional tensor and a Softmax operation.

How can we make a model with an input size larger than face_detection_yunet_2022mar.onnx? Or could you please fix this problem?

nonsquare input size training and exporting

I first trained a model on my custom dataset. Because I used the config yunet_n.py, every 'img_scale' and 'size' is (640, 640), but the input_shape of my exported onnx file is (1, 3, 736, 1280). I successfully converted this onnx file to a tensorrt engine and got 83% recall on my own testing set.

However, all images in my custom dataset and testing set are (height, width) = (720, 1080) or (1080, 1920). Hence I thought I should change every 'img_scale' and 'size' to (736, 1280) to get a better model for my task.

Unfortunately, the model trained with size (736, 1280) only reaches 50% recall. I also trained two models with size (1280, 1280) and (352, 640), which reach only 70% and 50% recall. (The input_shape of every exported onnx file is (1, 3, 736, 1280).)

Is there anything else that needs to be adjusted if I do not want to use the default size = (640, 640)?
The ordering of 'img_scale' and 'size' in the config and the ordering of the 'shape' flag in yunet2onnx.py are all (height, width), right?

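No response after finishing one epoch of training

After configuring everything according to readme.md, training stalls after finishing one epoch; I waited for more than an hour and no new log output was printed. Why is that?

The log is as follows: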

LM:False || Epoch:0/500 || iter: 800/803 || L: 0.58(0.57) IOU: 0.13(0.17) LM: 277.46(232.85) C: 2.30(2.46) All: 3.02(3.20) || LR: 0.01000000
LM:False || Epoch:0/500 || iter: 802/803 || L: 0.57(0.57) IOU: 0.12(0.17) LM: 64.95(232.43) C: 2.30(2.46) All: 2.99(3.20) || LR: 0.01000000
Epoch time: 11.31 minutes; Time left: 94.09 hours

Why are the mean and variance in image preprocessing [0,0,0] and [1,1,1]?

Is this done to simplify preprocessing at deployment time?
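
As a small illustration of what these values imply (not a statement of the authors' reasons), subtracting a mean of 0 and dividing by a standard deviation of 1 leaves every pixel unchanged, so the network sees raw pixel values. The `normalize` helper is a hypothetical stand-in for the usual per-channel normalization step.

```python
def normalize(pixel, mean=(0.0, 0.0, 0.0), std=(1.0, 1.0, 1.0)):
    """Per-channel (value - mean) / std, the usual image normalization."""
    return tuple((p - m) / s for p, m, s in zip(pixel, mean, std))
```

With the defaults above, `normalize((12, 200, 255))` gives back the same three values as floats, which means a deployment pipeline would not need a separate normalization step.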

Apply for priors box parameters of 'onnx' model

In file config/yufacedet.yaml

  anchor:
    min_sizes: [[10, 16, 24], [32, 48], [64, 96], [128, 192, 256]]
    steps: [8, 16, 32, 64]
    ratio: [1.]
    clip: False

These parameters are based on the weights/yunet_final.pth model.
But when I use a model with a fixed input size, e.g. onnx/yunet_yunet_final_320_320_simplify.onnx, this set of parameters does not match.
I would like a set of parameters that fits the onnx model.
Thanks
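
A hedged sketch of how SSD-style prior boxes are commonly generated from `min_sizes` and `steps` for a given input size. Whether this repository computes feature-map sizes and box centres in exactly this way is an assumption; `generate_priors` is a hypothetical name.

```python
import math
from itertools import product

MIN_SIZES = [[10, 16, 24], [32, 48], [64, 96], [128, 192, 256]]
STEPS = [8, 16, 32, 64]


def generate_priors(height, width, min_sizes=MIN_SIZES, steps=STEPS):
    """Return normalized (cx, cy, w, h) priors for every feature-map cell."""
    priors = []
    for sizes, step in zip(min_sizes, steps):
        fm_h = math.ceil(height / step)  # feature-map rows at this stride
        fm_w = math.ceil(width / step)   # feature-map columns at this stride
        for i, j in product(range(fm_h), range(fm_w)):
            cx = (j + 0.5) * step / width   # cell centre, normalized to [0, 1]
            cy = (i + 0.5) * step / height
            for s in sizes:
                priors.append((cx, cy, s / width, s / height))
    return priors
```

Under these assumptions a 320x320 input yields 40x40x3 + 20x20x2 + 10x10x2 + 5x5x3 = 5875 priors, so the prior count depends on the input size even when `min_sizes` and `steps` stay the same.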

Negative Samples

While testing the model, I found that it produces many high-confidence false positives on hands and necks. Can you tell me how to add negative sample images?
Thanks

Is there a Keras version?

Model type

Is this a one-stage detector or a two-stage detector?

Will you release train codes for v3?

Thanks for your great work!
The libfacedetection-v3 has many differences from v2, such as DSC and less width.
So will you release train codes for v3?

Do you still provide the training code for the model published in the OpenCV GitHub repo?

Currently, I have some problems converting the model trained with this repo, but I can successfully convert the model from the OpenCV GitHub repo.
So I wonder whether that training code is still available.

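Differences from the original Caffe model

Hello Prof. Yu, I previously ran your open-source Caffe model (https://github.com/ShiqiYu/libfacedetection/tree/master/models/caffe/yufacedetectnet-open-v1.caffemodel), and it worked quite well.

Now I want to retrain it, but the open-source version is in PyTorch, so I would like to ask:

This version also detects five landmarks. Does its inference time differ from the earlier Caffe model, which only detected face boxes?
If so, and I remove landmark detection from the model, train it, and convert it to Caffe, can I reproduce the speed and accuracy of the earlier open-source Caffe model?

Thank you!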

About the version update in mmdet framework

Hi. I saw the new version of libfacedetection.train, which has switched to the mmdet framework. But both yunet_n.py and yunet_s.py differ slightly from the original model structure. That means the converted libfacedetection-data.cpp will not be compatible with the libfacedetection project, in which libfacedetection-model.cpp has different inference code. Will you update the corresponding cpp code in the libfacedetection project? What should we do if we want to use libfacedetection with newly trained models?

Thanks in advance!

cannot import name '_DaliBaseIterator' from 'nvidia.dali'

I installed nvidia-dali-cuda102==1.5.0 and nvidia-dali-tf-plugin-cuda102==1.5.0.
Running python train.py
produces the following error:
Traceback (most recent call last):
  File "train.py", line 18, in <module>
    from data import get_train_loader
  File "/app/code/face_recognition/libface/libfacedetection.train/tasks/task1/../../src/data.py", line 11, in <module>
    from nvidia.dali import _DaliBaseIterator
ImportError: cannot import name '_DaliBaseIterator' from 'nvidia.dali' (/root/anaconda3/envs/libface/lib/python3.7/site-packages/nvidia/dali/__init__.py)

Some confusion about training details

1. Why set rgb_mean=(0,0,0)?
2. Why does landmark training start only after 100 epochs?
3. Why are landmarks trained every 2 epochs after epoch 100?

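Does the model support landmark output?

I see that landmark information is used in training and there is a landmark-related loss. Does model inference support landmark output?

Only 1024 is supported? I want to train on 100x100 images; what should I do?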

can't find from .bbox import

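When will the paper be published?

I used your code in my experiments and am now writing a paper. How should I cite your work?

Changing the minimum face size in pixels

Is the smallest detectable face box currently 10x10? How can I change this size?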

can not export to onnx

I tried converting to ONNX with python tools/yunet2onnx.py ./configs/yunet_n.py ./weights/yunet_n.pth, and the conversion failed. My installed mmcv version is 1.6.

how to convert pytorch model to openvino model?

how to convert pytorch model to openvino model?

the bounding boxes are not the same in dataset_rect and dataset_landmark

The bounding boxes are not the same in dataset_rect and dataset_landmark; the face count in dataset_rect is larger than in dataset_landmark. Would tiny faces in dataset_landmark be seriously suppressed during training?

How to find requirements.txt?

Hi Mr Yu,

How can I find the requirements.txt file?

training loss fluctuates

Hello, I'm currently training the model using the WIDER FACE dataset and your annotations. However, after 300 epochs the IoU and L values just fluctuate and do not go down. IoU is also extremely low. I have attached the terminal output of the first 40 epochs.
training_log.txt

where can we get datasets with key points to train Yunet?

Hi, thanks for sharing your project. Yunet detects faces faster and more accurately than other baselines.

I am trying to re-train Yunet, but I wonder where we can get datasets with the five key points to train the models.

Thank you in advance for any advice or directions that can help me re-train Yunet with key-point prediction ability.

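Reproducing model performance with the training script

Hello Prof. Yu,
The model I get from the training script provided in the project is 1 to 1.5 points worse than your released model. Was the released model trained with additional parameter tuning, or is the gap only due to a different data shuffle? Also, if we do not need landmark prediction, would removing the landmarks during training help face detection performance?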