-
Notifications
You must be signed in to change notification settings - Fork 339
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reinforcejs VS ConvNetjs #8
Comments
Hi, I'm looking at and comparing the agent there with the one at http://cs.stanford.edu/people/karpathy/reinforcejs/waterworld.html They are acting in very similar environment, but have different AI implementaitons. My question is, which is the more advanced and complete AI agent between the two versions? What are the differences in the neural network implementations and which is more intelligent agent? Thanks, Mike |
Hi, both of those agents are using the same algorithm: DQN, but yes the On Fri, Oct 23, 2015 at 9:29 AM, functionsoft [email protected]
|
Hi, Thanks for getting back to me. I’m glad you said that, because that’s the library I chose out of the two to work with and understand. In the learn function of the DQNAgent there is a comment regarding replay memory, about priority sweeps, how could this be simply implemented with the current code? I assume it involves marking the experience memory with some value that represents good experience vs bad experience? So that the best memories are played back? Also, the type of neural network implemented in this agent, what is it? Is it a simple multilayer perceptron? Would the agent benefit from more hidden layers? Any ideas or suggestions greatly appreciated. Thanks and Regards, Mike From: Andrej Hi, both of those agents are using the same algorithm: DQN, but yes the On Fri, Oct 23, 2015 at 9:29 AM, functionsoft [email protected]
— |
Seen it done that way in a paper somewhere (can't find it), they added an extra property to the experience objects with a value which was then used to prune experiences. |
Hi, mryellow, I am very interesting in the prioritized sweeping with experience replay paper you talk about, can you recall anything that is related to it that I can use to google it? |
Not sure I have it saved here, think it may have been an incomplete draft, and not that interesting otherwise. They were using ReinforceJS, had modified this bit Line 1091 in 0b9315a
One bit that sticks in my head is they were using a Greek alphabet Rho or Psi or something and had in-line comments with it showing properly encoded rather than LaTex or a substitute simple character. |
On Learning Coordination Among Soccer Agents http://robocup.csu.edu.cn/web/wp-content/uploads/2012/12/data/pdfs/robio12-116.pdf |
Hangon, only result, but not it, although I've seen this paper before.... and don't think it passed in the score, but checked it before firing |
Thanks, mryellow |
There is a new paper in regards to deep reinforcements learning in continous spaces by deepmind. Continuos control with deep reinforcements learning. Is there plans to add this in code form. Many thanks Andrew. |
I'm also curious about the deepmind's learnings :D |
No description provided.
The text was updated successfully, but these errors were encountered: