Coder Social home page Coder Social logo

Comments (6)

reiniscimurs avatar reiniscimurs commented on August 19, 2024

Hi,

Why are you adding the reward function to this code? Are you planning to continue training in this setting with already pre-trained model?

Be aware that there are differences in how state is represented here in GDAE and in the base DRL training repository. Mainly, the robot state.

from gdae.

Yangzuodong avatar Yangzuodong commented on August 19, 2024

Hi,
Thanks for your reply!

Firstly, I don't want to continue training, I want to make the robot move entirely through TD3's actor network, without relying on move_base. But I don't know how the agent knows local_goal. After looking at the test_velodyne_td3.py file, I guessed that the robot knew the end point because it was set up in env to receive a reward for reaching the end point.

Secondly, I modified the state representation in GDAM to be consistent with that in TD3. This includes laser_state and robot_state

Lastly,Thank you very much for your patient reply!

from gdae.

reiniscimurs avatar reiniscimurs commented on August 19, 2024

Reward is not necessary for model deployment. It is only needed to train the model. The model knows the goal as it is part of the state that is given to the model.

There is no distinction between local and global goal. At each individual step the model recieves a single target to go to and it does not know if it is a global or local goal. The step function reads the current selected node and uses it as the current target that is then passed to the TD3 model in the state. The node selection is done entirely through the heuristics function.

from gdae.

Yangzuodong avatar Yangzuodong commented on August 19, 2024

Hi,
Thanks for your reply!So does the robot know the target by the Dist_to_goal parameter in the robot_state returned from step function?
I modified the code and now it looks like the video:

freecompress-GDAM_4_9-ezgif.com-video-cutter.mp4

However, the motion path of the agent is a little strange, it seems that it always wants to move clockwise.This was not the case with test_velodeny_td3.

Thank you very much for your work and your patient response!

from gdae.

reiniscimurs avatar reiniscimurs commented on August 19, 2024

The goal is given in polar coordinates by distance and angle. This is explained in the tutorial: https://medium.com/@reinis_86651/deep-reinforcement-learning-in-mobile-robot-navigation-tutorial-part3-training-13b2875c7b51

from gdae.

Yangzuodong avatar Yangzuodong commented on August 19, 2024

Thank you for solving my doubts, I wish you a happy life!

from gdae.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.