Coder Social home page Coder Social logo

owen-liuyuxuan / fsnet Goto Github PK

View Code? Open in Web Editor NEW
60.0 3.0 2.0 2.03 MB

Full Scale Monocular Depth Prediction. Official Implementation of "FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving" https://arxiv.org/abs/2304.10719

Home Page: https://sites.google.com/view/fsnet/

Dockerfile 0.14% Shell 0.37% Python 74.69% C++ 6.58% Cuda 7.71% Jupyter Notebook 10.50%
depth-estimation kitti-dataset kitti360 monocular-depth-estimation nuscenes

fsnet's Introduction

FSNet

This repo aims to provide a flexible and reproducible Full-Scale Unsupervised Monocular Depth Prediction from well-calibrated/tracked image sequences.

The repo share a similar design strategy with visualDet3D, but we adopt a even more flexible design for data processing and config construction, allowing easy insertion of new tasks/models while not interfere with the existing ones.

We provide cookbooks and solutions for Full-Scale Unsupervised Depth Prediction in KITTI, KITTI360, NuScenes, and KITTI-360 Fisheye. Links are directed to their corresponding cookbooks. We hope we could make training/testing/deploying/ROS demo easier.

Reference: this repo borrow codes and ideas from visualDet3D, monodepth2.

image

Setup:

Environment setup

pip3 install -r requirement.txt

or manually check dependencies.

Docker Setup

cd docker
docker build ./ -t fsnet

The image will contain necessary packages for the repo.

Training / Testing

Please check the corresponding datasets: KITTI, KITTI360, NuScenes, and KITTI-360 Fisheye. More will be available through contributions and further paper submission.

Config and Path setup

Please modify the path and other parameters in config/*.py. config/*_example files are templates.

Notice: *_examples are NOT utilized by the code and *.py under /config is ignored by .gitignore.

The content of the selected config file will be recorded in tensorboard at the beginning of training.

important paths to modify in config :

  1. cfg.path.{data}_path: Path to the data directory.
  2. cfg.path.project_path: Path to the workdirs of the projects (will have temp_outputs, log, checkpoints)

Multi-Dataset Training

We can use the ConcatDataset APIs to jointly train one depth prediction network using multiple datasets. The output of each dataset indexing methods should output dictionaries with same keys and same tensor shapes, but we could define different data-proprocessing pipelines for each dataset and align multiple datasets if needed.

Build Module Path

In the project, most of the modules are initialized by builder and find_object. The builder will import and initialize the target object/function based on the full import path. Please check the example configs.

Further Info and Bug Issues

  1. Open issues on the repo if you meet troubles or find a bug or have some suggestions.
  2. Email to [email protected]

Other Resources

fsnet's People

Contributors

owen-liuyuxuan avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

fsnet's Issues

KITTI360 fisheye experiment

Hello,

I would like to express my appreciation for your impressive work.

I am currently attempting to replicate the results of the KITTI360 fisheye experiment as described in your paper and provided in the repository. However, I am encountering some difficulties in achieving the reported results.

  1. I trained the model using the original code and settings for 20 epochs. The results I obtained were different from those reported in the paper.

6741704971937_ pic

  1. To address this, I increased the batch size and extended the training to 50 epochs. Even with these adjustments, the results remain not good.

6751704971939_ pic

Thank you for your time and assistance, again.

How to use flow mask?

Hello, in the paper, it was mentioned that using optical flow to calculate the mask of moving objects.

But in all configs of this repo, motion_ maskใ€is_precompute_flow are all False, is this feature not yet open?

Nuscenes dataset problems

Thank you for your sharing!
When I trained on Nuscenes Dataset , I followed every step in readme doc.
But I found FSNET only have 1/3 output pics compared with Nuscenes-full dataset ,also there was no pics in CAM_BACK view.
Did I do something wrong?
image
this is the result of nuscenes training set
image
this is the result of nuscenes val set

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.