Comments (19)

Luciferbobo commented on May 24, 2024

OK, I will look forward to it.

from scenerf.

anhquancao commented on May 24, 2024

Great! You can watch the repo, so you will be notified when I update it.

Luciferbobo commented on May 24, 2024

The reconstruction results from the updated code look much better, and I can clearly see a more reasonable FOV. Some minor details may still need improvement due to insufficient training epochs. I will try evaluating all the reconstruction metrics once I finish training. Thx for the update!
[image attached]

anhquancao commented on May 24, 2024

Awesome! I'm so glad it's working for you. Let me know if there's anything else I can do to help. 😊

anhquancao commented on May 24, 2024

Hi, which TSDF threshold did you use?
Also, when drawing, you need to remove the voxels with label 255.

Luciferbobo commented on May 24, 2024

The truncation margin was set to 10. I used the default settings in depth2tsdf.py and fusion.py. Thanks for the kind reminder. I wanted to see the overall result, so I put all the labels together :)
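For context, "truncation" here just means the signed distance is clamped to ±margin before fusion. An illustrative sketch, not the repo's code (the margin of 10 voxels at 0.2 m per voxel is an assumption):

```python
import numpy as np

# Illustrative only: a truncated SDF clamps signed distances to +-margin
# and normalizes to [-1, 1]. Margin of 10 voxels * 0.2 m/voxel assumed.
def truncate_sdf(sdf, margin=10 * 0.2):
    return np.clip(sdf, -margin, margin) / margin
```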

anhquancao commented on May 24, 2024

The label 255 denotes unknown voxels, i.e. voxels that no lidar ray passes through in the whole sequence.
Do you use this function to generate the occupancy?
https://github.com/astra-vision/SceneRF/blob/main/scenerf/scripts/evaluation/eval_sr.py#L11
The idea is to use an increasing threshold, since the depth error increases with the distance to the vehicle.
If all voxels within ±10 m around each depth value were considered occupied, then around 20/0.2 = 100 voxels would be marked occupied for every depth value.
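A minimal sketch of that idea (hypothetical names and parameter values, not the repo's exact code): widen the occupancy threshold on |TSDF| as the voxel gets farther from the vehicle.

```python
import numpy as np

# Hypothetical sketch: the occupancy threshold grows with the voxel's
# distance to the vehicle, since rendered-depth error grows with range.
def distance_adaptive_occupancy(tsdf, coords, base_th=0.25, growth=0.02):
    # tsdf: (N,) TSDF values; coords: (N, 3) voxel centers in metres,
    # with the vehicle at the origin
    dist = np.linalg.norm(coords, axis=1)
    th = base_th + growth * dist
    return np.abs(tsdf) < th
```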

Luciferbobo commented on May 24, 2024

Yes, I used this function and the default settings in eval_sr.py. When I ignore the 255 label, the corresponding no-lidar area disappears from the GT, but my prediction result still looks the same.

anhquancao commented on May 24, 2024

Is it the same for other frames? Did you try to draw the mesh? Do the rendered depth images look fine?

Luciferbobo commented on May 24, 2024

I drew some frames and they all look the same... The corresponding depth and RGB images look fine. I attached them below.

[images attached]

anhquancao commented on May 24, 2024

Thanks, let me check it!
What if you change the variable th here to a smaller number? Is it still the same?
https://github.com/astra-vision/SceneRF/blob/main/scenerf/scripts/evaluation/eval_sr.py#L12

anhquancao commented on May 24, 2024

It should look like this.
[screenshot: 2022-11-02 14-42-49]
This is what I drew months ago. I will need to check.

anhquancao commented on May 24, 2024

You can obtain the mesh from this line:
https://github.com/astra-vision/SceneRF/blob/main/scenerf/scripts/reconstruction/depth2tsdf.py#L107
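If you just want to inspect the extracted mesh offline, a minimal sketch of an ASCII .ply writer (a hypothetical helper, not part of the repo) that takes the vertex and triangle arrays produced by that step; the output opens in MeshLab or Open3D:

```python
import numpy as np

# Hypothetical helper: write vertices (N, 3) and triangle faces (M, 3)
# to an ASCII .ply file for inspection in MeshLab / Open3D.
def write_ply(path, verts, faces):
    with open(path, "w") as f:
        f.write("ply\nformat ascii 1.0\n")
        f.write("element vertex %d\n" % len(verts))
        f.write("property float x\nproperty float y\nproperty float z\n")
        f.write("element face %d\n" % len(faces))
        f.write("property list uchar int vertex_indices\n")
        f.write("end_header\n")
        for v in verts:
            f.write("%f %f %f\n" % (v[0], v[1], v[2]))
        for face in faces:
            f.write("3 %d %d %d\n" % (face[0], face[1], face[2]))
```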

Luciferbobo commented on May 24, 2024

I tried the default th=0.25 and th=0.15. They still look the same :(
[image attached]
Many thanks for your prompt response! Great work. I will check it as well.

anhquancao commented on May 24, 2024

Sorry about this; I probably changed something when cleaning up the code.
I will update it in a few weeks.

anhquancao commented on May 24, 2024

It seems like the threshold is not applied at all.

anhquancao commented on May 24, 2024

Hi @Luciferbobo,
Sorry, I am swamped. I just tried to draw the occupancy prediction using the following code:

import os

import numpy as np
import open3d as o3d


def get_grid_coords(dims, resolution):
    '''
    :param dims: the dimensions of the grid [x, y, z] (i.e. [256, 256, 32])
    :return coords_grid: the center coords of the voxels in the grid
    '''

    # The sensor is centered in X (we go to dims/2 + 1 for the histogramdd)
    g_xx = np.arange(0, dims[0] + 1)
    # The sensor is at Y=0 (we go to dims + 1 for the histogramdd)
    g_yy = np.arange(0, dims[1] + 1)
    # The sensor is at Z=1.73. The ground is two voxel levels above the grid bottom, so the Z pose is 10
    # if the bottom voxel is 0. If we want the sensor at (0, 0, 0), then the bottom in z is -10 and the top is 22
    # (we go to 22 + 1 for the histogramdd).
    # ATTENTION: it is 11 for old grids, 10 for new grids (v1.1) (https://github.com/PRBonn/semantic-kitti-api/issues/49)
    g_zz = np.arange(0, dims[2] + 1)

    # Obtaining the grid with coords...
    xx, yy, zz = np.meshgrid(g_xx[:-1], g_yy[:-1], g_zz[:-1])

    coords_grid = np.array([xx.flatten(), yy.flatten(), zz.flatten()]).T
    coords_grid = coords_grid.astype(float)  # np.float was removed in NumPy 1.24

    coords_grid = (coords_grid * resolution) + resolution / 2

    # Swap the X and Y columns
    temp = np.copy(coords_grid)
    temp[:, 0] = coords_grid[:, 1]
    temp[:, 1] = coords_grid[:, 0]
    coords_grid = np.copy(temp)

    return coords_grid, g_xx, g_yy, g_zz


def draw(
    voxels,
    cam_param_path=None,
    voxel_size=0.04):

    voxels[voxels == 255] = 0  # mask out unknown voxels
    grid_coords, _, _, _ = get_grid_coords(
        [voxels.shape[0], voxels.shape[1], voxels.shape[2]], voxel_size)

    points = np.vstack([grid_coords.T, voxels.reshape(-1)]).T

    # Keep only the occupied voxels
    points = points[points[:, 3] != 0]

    vis = o3d.visualization.Visualizer()
    vis.create_window(width=1200, height=600)
    ctr = vis.get_view_control()

    pcd = o3d.geometry.PointCloud()
    pcd.points = o3d.utility.Vector3dVector(points[:, :3])
    pcd.estimate_normals()
    vis.add_geometry(pcd)

    if cam_param_path:
        param = o3d.io.read_pinhole_camera_parameters(cam_param_path)
        ctr.convert_from_pinhole_camera_parameters(param)

    vis.run()  # the user changes the view and presses "q" to terminate
    if cam_param_path:
        param = vis.get_view_control().convert_to_pinhole_camera_parameters()
        o3d.io.write_pinhole_camera_parameters(cam_param_path, param)


path = "Your path to stored TSDF output"
frame_id = "001385.npy"
tsdf_path = os.path.join(path, frame_id)
tsdf = np.load(tsdf_path)

occ = np.zeros_like(tsdf)
occ[np.abs(tsdf) < 0.2] = 1  # occupied where the TSDF is inside the truncation band
draw(occ)

This is the output:
[image attached]

Luciferbobo commented on May 24, 2024

Hi, apologies for also being swamped with other things lately. Thank you very much for the update! I tried the function you shared, but the results still seem to have some issues... Here is a comparison between the GT (left) and my prediction (right).
[images attached]

I guess the visualization results produced by Open3D and Matplotlib Axes3D should be similar, so the issue might be in the depth2tsdf.py code. I must admit I'm not very familiar with the parameter settings of the TSDFVolume function. Could the visualization issue be due to the parameter settings in that part of the code? Thx! =v=

anhquancao commented on May 24, 2024

Hi @Luciferbobo,
Thank you for the information! I just found a bug related to the reconstruction and have updated the code. Could you please try cloning again?
