The busyplan from createamind

pytorch 官方 Document：
http://pytorch.org/docs/master/nn.html
关键 operatoin：
3D deconvolution - torch.nn.ConvTranspose3d
3D convolution - torch.nn.Conv3d
3D maxpooling - torch.nn.MaxPool3d
3D dropout - torch.nn.Dropout3d

Keras 实现 已取消

计划注意：

计划列出的是最低时间，因为进度原因可能推迟

可能失败原因

1. 因为现有目标数据集不符合pix2pix coniditional gan 分布的原理，生成图像可能无法毫无价值
2. 3D convolution 耗费内存增大，最终模型以我们现有条件可能跑不起来
3. 技术能力不足，耦合失败
4. 公司调整方向，放弃

videogan资料

1.论文 https://arxiv.org/abs/1611.06624
代码 https://github.com/pfnet-research/tgan https://github.com/dandelin/Temporal-GAN-Pytorch
2.论文 http://carlvondrick.com/tinyvideo/paper.pdf
网站 http://web.mit.edu/vondrick/tinyvideo/
代码https://github.com/cvondrick/videogan

gan video 两周
第一周，熟悉已有论文，选用一个算法复现。使用自己的数据集，调参。
第二周，分析隐变量的语义相关信息，自动驾驶的转向角度和z的关系。

Nvidia DIGITS

复现
https://devblogs.nvidia.com/parallelforall/photo-editing-generative-adversarial-networks-2/
内容

Depth Perception from Images

http://cs231n.stanford.edu/reports/2017/pdfs/200.pdf

1.multi-scale deep network, outperformed most other meth- ods in nearly every metric. Inspection of the output maps, however, shows that the images produced are extremely blurry. So while they are able to achieve low average er- ror, their utility for practical depth mapping applications is limited.
生成的深度图模糊，原因在于优化目标是平均像素误差。
2.CycleGAN is able to best retain the image features with clear definition, but often with high error in the depth-space representation.
生成深度图比较清晰，特征重建较好，而像素级误差较大，原因在于优化目标是特征级误差。
3.改进方向
设计损失函数，使其能同时优化像素级误差和特征级误差。

Progressive-Growing-Of-GANs

https://github.com/Avhirup/Progressive-Growing-Of-GANs-Pytorch- 不完整

https://github.com/github-pengge/PyTorch-progressive_growing_of_gans 不完整

https://github.com/nashory/progressive-growing-torch 完整

RGB2Depth

目的：验证现有模型预测深度的可靠性，为是否进一步改良模型提供依据。

Quantitative Evaluation, 量化方法，与其他方式对比
重现已有模型，根据评估误差横向评估
需要输出图像达到256x256

2.- [ ] Deeper Depth Prediction with Fully Convolutional Residual Networks upsampling to 640x480
https://arxiv.org/pdf/1606.00373.pdf
https://github.com/iro-cp/FCRN-DepthPrediction 代码不完整

4.Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields https://arxiv.org/pdf/1502.07411.pdf

6.Single-Image Depth Perception in the Wild
https://arxiv.org/pdf/1604.03901.pdf
https://github.com/wfchen-umich/relative_depth
7.Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation
https://arxiv.org/pdf/1704.02157.pdf
https://github.com/danxuhk/ContinuousCRF-CNN
选用

http://perso.ensta-paristech.fr/%7Epinard/depthnet/ 1/4 downsampling，缺失训练代码，可以预测
https://hal.archives-ouvertes.fr/hal-01587652/document
https://github.com/ClementPinard/DepthNet
Depth Map Prediction from a Single Image using a Multi-Scale Deep Network 输出74x75 ,提供评估方法 https://www.cs.nyu.edu/~deigen/depth/
https://github.com/MasazI/cnn_depth_tensorflow 不完整
https://github.com/janivanecky/Depth-Estimation caffe
Qualitative Evaluation，质化方法，点云图像化展示
depth to （x,y,z)点云array 快速转换：https://codereview.stackexchange.com/questions/79032/generating-a-3d-point-cloud
点云可视化
https://github.com/daavoo/pyntcloud
https://github.com/createamind/busyplan/blob/master/zhangwei/point_cloud.ipynb

KITTI 数据集深度预测

深度预测模型预测真实数据

前期研究证明p2p模型可以利用虚拟vkitti数据集预测深度，现在决定继续深入，目的如下：

1. 替换原先用于训练模型的vkitti，改用真实数据集
2. 研究用预测深度转换为真实点云
3. (op) 尝试用预测形成的点云进行sensor fusion 或 localization 的试验

createamind / busyplan Goto Github PK

busyplan's People

Contributors

Stargazers

Watchers

Forkers

busyplan's Issues

短期目标-gazebo

隐变量分析

predict VIDEO

videogan资料

Nvidia DIGITS

Depth Perception from Images

Progressive-Growing-Of-GANs

RGB2Depth

KITTI 数据集深度预测

深度预测模型预测真实数据

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent