Coder Social home page Coder Social logo

Comments (9)

shrimalmadhur avatar shrimalmadhur commented on June 20, 2024

I believe that map is not that great for segmentation since the mesh of road and the side walk is the same, so when you get the segmentation and train on that, you might loose features which the side walk also provides. I guess if the map has better segmentation you could train a better model but still I think normal RGB image has more deep features than segmentation.

from autonomousdrivingcookbook.

BrianN92 avatar BrianN92 commented on June 20, 2024

Thanks! But as I mentioned, the shadow of trees is an issue when using normal RGB images as data set. So I think the segmentation might be a solution for that.

from autonomousdrivingcookbook.

shrimalmadhur avatar shrimalmadhur commented on June 20, 2024

Yup. I agree good segmentation image can be helpful. Maybe there's way we can utilize both of them, so that we get good features from RGB and use the segmentation to solve the shadow issue. But I don't think so that only segmentation can be that useful.

from autonomousdrivingcookbook.

mitchellspryn avatar mitchellspryn commented on June 20, 2024

@BrianN92 What, precisely, are you trying to do? Are you trying to train the end-to-end model on a dataset you collected from the neighborhood map?

from autonomousdrivingcookbook.

BrianN92 avatar BrianN92 commented on June 20, 2024

@mitchellspryn Yes, I am trying to train the E2E model on the neighborhood map. Sorry for the vagueness of this question.

from autonomousdrivingcookbook.

mitchellspryn avatar mitchellspryn commented on June 20, 2024

Ok. In that case, using the segmentation images as input to your model is suspicious. Consider that in a real scenario, your vehicle will not have access to the segmentation images. By requiring that as an input to your net, you'd be developing a model that only works in a simulator and would not translate to the real world.

A more generalizable approach would be to create a transformation that adds random patches of shadows on existing images. A simple way of doing this could be to select a random set of connected pixels from the RGB image, transform the pixel to HSV space, decrease the value by a set multiplier (i.e. 'value = value * 0.9'), and transforming back to RGB space. By doing this, the model will learn that the shadows don't really mean anything, and will become more robust to the noise.

from autonomousdrivingcookbook.

BrianN92 avatar BrianN92 commented on June 20, 2024

Thanks, I will try it!

from autonomousdrivingcookbook.

BrianN92 avatar BrianN92 commented on June 20, 2024

Hi @mitchellspryn , after the image transformation, like the images I attached below, I have another question.
img_0_0_341742162365
image-1

How can I make the names of images after transformation corresponding to the "ImageName" tag in every "airsim_rec.txt" file? Thank you!

from autonomousdrivingcookbook.

mitchellspryn avatar mitchellspryn commented on June 20, 2024

I assume you wrote another script that created the transformations? If so, you'll probably be better off directly generating the .h5s for training. Look in "cooking.py" and mimic that logic in your preprocessing script.

In fact, if you really wanted to get slick, you could probably add this as an additional transform inside of Generator.py. You'll see the set of transforms we've already coded inside of random_transform_with_states(); you can just add yours there.

from autonomousdrivingcookbook.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.