-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
problem with GDAM.py #15
Comments
Hi, DRL-navigation is a separate repo from GDAE and is not an exact training method for training a network for this implementation, but a general DRL navigation policy training method. GDAE is a specific repo for a specific implementation of a physical robot, which means there is some engineering in the code that would not be required in a generic solution. In order to use the DRL-navigation policy here, you would have to update the GDAE code so that it aligns with it. One of these alignments is the order of the state that you pointed out. Others might be more subtle and dependent on your use case. See a bit of the discussion here: #14 |
Thank you for your reply. I understand what you mean, GDAM is an independent warehouse from DRL navigation.
and line 49-50:
the order of the states you entered before and after seems to be different, which is the main reason for my confusion. |
Feel free to change this around in GDAE to fit what is in the training. |
Okay, thank you for your reply. |
Hi, thank you for making such great work open source. But I do have some questions:
def test(env, actor):
In the GDAM.py, line 40-41:
s = np.append(s2, toGoal)
s = np.append(s, action)
the state of the input to the network is combine of " Laser ranges + Goal( dis and theta) + Action(linear and angular) ", the states are combined in the same order as you set in DRL-navigation.
However, In the GDAM.py, line 49-50:
s = np.append(s2, a[0])
s = np.append(s, toGoal)
the order of combinations of states seems to become " Laser ranges + Action(linear and angular) + Goal( dis and theta) " , this order is different from the order of states in the previous code in line 40-41.
Is there some mistake in my understanding? Thank you very much for taking the time to answer my questions!
The text was updated successfully, but these errors were encountered: