Coder Social home page Coder Social logo

liuguoyou / gl3d Goto Github PK

View Code? Open in Web Editor NEW

This project forked from lzx551402/gl3d

0.0 3.0 0.0 64.77 MB

GL3D (Geometric Learning with 3D Reconstruction): a large-scale database created for 3D reconstruction and geometry-related learning problems

License: MIT License

Shell 1.83% Python 98.17%

gl3d's Introduction

GL3D: Geometric Learning with 3D Reconstruction

Example sequence

About

GL3D (Geometric Learning with 3D Reconstruction) is a large-scale database created for 3D reconstruction and geometry-related learning problems. Most images contained are captured by drones from multiple scales and perspectives with large geometric overlaps, covering urban, rural area, or scenic spots. It also includes small object reconstructions to enrich the data diversity. If you find this dataset useful for your research, please cite:

@inproceedings{shen2018mirror,
    author={Shen, Tianwei and Luo, Zixin and Zhou, Lei and Zhang, Runze and Zhu, Siyu and Fang, Tian and Quan, Long},
    title={Matchable Image Retrieval by Learning from Surface Reconstruction},
    booktitle={The Asian Conference on Computer Vision (ACCV},
    year={2018},
}

If you have used the correspondence labels, please also cite:

@inproceedings{luo2018geodesc,
    title={Geodesc: Learning local descriptors by integrating geometry constraints},
    author={Luo, Zixin and Shen, Tianwei and Zhou, Lei and Zhu, Siyu and Zhang, Runze and Yao, Yao and Fang, Tian and Quan, Long},
    booktitle={European Conference on Computer Vision (ECCV)},
    year={2018}
}

Dataset Description

GL3D contains 125,623 high-resolution images regarding 543 different scenes. Each scene data is reconstructed to generate a triangular mesh model by the state-of-the-art 3D reconstruction pipeline. Refer to [1] for details. For each scene data, we provide the complete image sequence, geometric labels and reconstruction results.

To increase the data diversity, we have also applied the same data generation pipeline on some Internet tourism datasets that are publicly available. In practice, we recommend using both GL3D and tourism datasets collaboratively in training for better generalization ability. Refer to docs/tourism_data.md for details.

Tasks

Research works below are supported by GL3D:

Task Reference
Image retrieval MIRorR, ACCV'18
Local descriptor GeoDesc, ECCV'18
Local descriptor ContextDesc, CVPR'19
Outlier rejection OA-Net, ICCV'19

Downloads

Undistorted images resized to 1000x1000 are provided.

Sources Data Name Chunk Start Chunk End Disk Descriptions
GL3D gl3d_imgs 0 125 62G 1000x1000 undistorted images of GL3D
GL3D gl3d_raw_imgs 0 106 52G raw images of test set of GL3D

Use download_data.sh script to download the tar files, by passing augments

bash download_data.sh <data_name> <chunk_start> <chunk_end>

For example, to download GL3D images, run

bash download_data.sh gl3d_imgs 0 125

To extract the files, run

cat download_data_gl3d_imgs/*.tar.* | tar -xvf -

Dataset Format

data                          
 └── <pid> 
       ├── undist_images/*
       ├── geolabel/*
       ├── img_kpts/*.bin
       ├── depths/*.pfm
       └── image_list.txt
File Name Data Name Chunk Start Chunk End Disk Task Descriptions
geolabel/cameras.txt gl3d_cams 0 0 <0.1G Common Camera intrisic/extrinsic parameters, recovered by SfM.
img_kpts/<img_idx>.bin gl3d_kpts 0 57 28G Common Image keypoints detected by SIFT.
depths/<img_idx>.pfm gl3d_depths 0 59 30G Common Depth maps from MVS algorithms.
geolabel/corr.bin gl3d_corr 0 12 6.1G Local descriptor Image correspondences that haved survived from SfM.
geolabel/mask.bin gl3d_mask 0 10 5.3G Image retrieval Overlap masks of image pairs, computed from mesh re-projections.
geolabel/common_track.txt gl3d_ct 0 0 <0.1G Image retrieval Common track ratio of image pairs, computed from SfM.
geolabel/mesh_overlap.txt gl3d_mo 0 0 <0.1G Image retrieval Mesh overlap ratio of image pairs, computed from mesh re-projections.

Again, use download_data.sh to fetch the above geometric labels or reconstruction results,

For data organization, refer to docs/data_format.md.

Python-based IO utilities are provided to parse the data, refer to utils/io.py.

Visualizations and examples of usage can be found in example/README.md.

Please feel free to inform us if you need some other intermediate results for your research.

Data Preview

The mesh reconstruction is available for preview by substituting <pid> in the following link:

https://www.altizure.com/project-model?pid=<pid>

An example is provided here. Noted that some projects are not online available, from 000000000000000000000000 to 00000000000000000000001d.

Acknowledgments

This dataset is prepared and maintained by Zixin Luo, Tianwei Shen, Jacky Tang and Tian Fang. 3D reconstructions are obtained by Altizure.

Changelog

2019-9-17 Releasing of GL3D_V2

  • Another 165 datasets are added, covering mainly landmarks and small objects.
  • Rerun SfM for all datasets with GeoDesc to obtain denser reconstruction.
  • Camera distortion parameters are provided.
  • Undistorted images are provided.
  • More helper functions to perform geometry computation.

2019-12-4 Update GL3D_V2

  • Provide depth maps to enrich geometric labels.
  • Provide helper functions to parse depth maps.

2019-12-16 Update GL3D_V2

  • Another 530 Internet tourism datasets are added to enrich the data.
  • Mesh overlapping ratio and overlapping masks are provided.

gl3d's People

Contributors

hlzz avatar lzx551402 avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.