Let's imagine we have an (x, y) plane where a robot can move. Now we define the middle of our world as the goal state, which means that we are going to give a reward of 100 to our robot once it reaches it.
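A minimal tabular sketch of that setup, assuming a 5x5 grid with the goal at its centre (2, 2); the grid size and the ALPHA/GAMMA hyperparameters are illustrative choices, not from the question:

```python
# Tabular Q-learning on a small (x, y) grid whose centre is the goal.
# GRID, GOAL, ALPHA, GAMMA are assumed values for the sketch.
GRID = 5
GOAL = (2, 2)
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # up, down, right, left
ALPHA, GAMMA = 0.5, 0.9

Q = {((x, y), a): 0.0
     for x in range(GRID) for y in range(GRID) for a in ACTIONS}

def step(state, action):
    # Clamp moves to the grid; reaching the goal yields reward 100.
    x = min(max(state[0] + action[0], 0), GRID - 1)
    y = min(max(state[1] + action[1], 0), GRID - 1)
    nxt = (x, y)
    return nxt, (100.0 if nxt == GOAL else 0.0)

def update(state, action):
    # One Q-learning backup: Q <- Q + alpha * (r + gamma * max Q' - Q).
    nxt, reward = step(state, action)
    best_next = max(Q[(nxt, a)] for a in ACTIONS)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])
    return nxt
```

A single `update()` from the cell just left of the goal already moves `ALPHA * 100` of the reward into the table; repeated episodes then propagate it outward.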
What difference does a big or small gamma value make to the algorithm? In my view, as long as it is neither 0 nor 1, it should work exactly the same.
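It does make a difference: gamma is the discount on future reward, so a reward received n steps in the future is worth gamma**n times its face value today. A small numeric sketch (the step count and gamma values are arbitrary):

```python
# gamma controls how far ahead the agent "sees": the present value of a
# reward received n steps in the future is gamma**n * reward, so two
# gammas strictly between 0 and 1 still rank distant rewards very differently.
def discounted_value(reward, steps, gamma):
    return gamma ** steps * reward

near_sighted = discounted_value(100, 10, 0.5)   # about 0.1
far_sighted = discounted_value(100, 10, 0.95)   # about 59.9
```

With gamma = 0.5 a reward 10 steps away is almost invisible, so the agent effectively only plans a few steps ahead; with gamma = 0.95 the same reward still dominates the value function.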
Let's assume we're in a room where our agent can move along the x and y axes. At each point it can move up, down, right, and left. So our state space can be defined by (x, y), and our actions at each state are those four moves.
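One way to encode that state and action space in code; the `Action` enum and `move()` helper are illustrative names made up for the sketch, not an established API:

```python
from enum import Enum

class Action(Enum):
    # Each action is a (dx, dy) displacement applied to the (x, y) state.
    UP = (0, 1)
    DOWN = (0, -1)
    RIGHT = (1, 0)
    LEFT = (-1, 0)

def move(state, action):
    # Apply an action to a state; boundary handling would go here.
    dx, dy = action.value
    return (state[0] + dx, state[1] + dy)
```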
I do know that feedforward multi-layer neural networks with backpropagation are used with Reinforcement Learning to help the agent generalize its actions. That is, if we have a big state space, we cannot store a separate value for every state-action pair in a table.
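As a sketch of that idea, here is a one-hidden-layer network trained with plain backpropagation to approximate a value function over continuous (x, y) inputs; the architecture, the toy target function, and every hyperparameter are assumptions made for the example:

```python
import numpy as np

rng = np.random.default_rng(0)
W1 = rng.normal(0.0, 0.5, (2, 16)); b1 = np.zeros(16)   # input -> hidden
W2 = rng.normal(0.0, 0.5, (16, 1)); b2 = np.zeros(1)    # hidden -> output

def forward(X):
    h = np.tanh(X @ W1 + b1)          # hidden activations
    return h, h @ W2 + b2             # predicted value per state

# Toy target: value decays with distance from a goal at the origin.
X = rng.uniform(-1.0, 1.0, (256, 2))
y = np.exp(-np.linalg.norm(X, axis=1, keepdims=True))

init_mse = float(np.mean((forward(X)[1] - y) ** 2))
lr = 0.05
for _ in range(2000):                 # full-batch gradient descent
    h, pred = forward(X)
    err = pred - y
    gW2 = h.T @ err / len(X); gb2 = err.mean(axis=0)
    dh = (err @ W2.T) * (1.0 - h ** 2)   # backprop through tanh
    gW1 = X.T @ dh / len(X); gb1 = dh.mean(axis=0)
    W2 -= lr * gW2; b2 -= lr * gb2
    W1 -= lr * gW1; b1 -= lr * gb1

final_mse = float(np.mean((forward(X)[1] - y) ** 2))
```

The point is that the network never stores one entry per discrete state: it produces a value for any (x, y) pair, including ones it has never been trained on, which is exactly the generalization the question is about.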
I am currently using Q-Learning to try to teach a bot how to move in a room filled with walls/obstacles. It must start in any place in the room and get to the goal state.
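A compact version of that experiment, assuming a 4x4 room with two wall cells, a step penalty, and epsilon-greedy exploration; the grid, wall positions, and hyperparameters are all invented for the sketch:

```python
import random

random.seed(0)

SIZE, GOAL = 4, (3, 3)
WALLS = {(1, 1), (2, 1)}
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]
ALPHA, GAMMA, EPS = 0.5, 0.9, 0.2

Q = {}

def q(s, a):
    return Q.get((s, a), 0.0)

def step(s, a):
    # Moves into walls or off the grid leave the agent where it is.
    nxt = (s[0] + a[0], s[1] + a[1])
    if nxt in WALLS or not (0 <= nxt[0] < SIZE and 0 <= nxt[1] < SIZE):
        nxt = s
    return nxt, (100.0 if nxt == GOAL else -1.0)

def train(episodes=2000):
    # Episodes start from any free non-goal cell, as in the question.
    cells = [(x, y) for x in range(SIZE) for y in range(SIZE)
             if (x, y) not in WALLS and (x, y) != GOAL]
    for _ in range(episodes):
        s = random.choice(cells)
        for _ in range(50):
            a = (random.choice(ACTIONS) if random.random() < EPS
                 else max(ACTIONS, key=lambda b: q(s, b)))
            nxt, r = step(s, a)
            Q[(s, a)] = q(s, a) + ALPHA * (
                r + GAMMA * max(q(nxt, b) for b in ACTIONS) - q(s, a))
            s = nxt
            if s == GOAL:
                break

def greedy_path(s, limit=20):
    # Follow the learned policy greedily from a start cell.
    path = [s]
    for _ in range(limit):
        if s == GOAL:
            break
        s, _ = step(s, max(ACTIONS, key=lambda b: q(s, b)))
        path.append(s)
    return path

train()
```

After training, following the greedy policy from any free cell should route around the walls to the goal; the -1 step reward is what pushes the agent toward short paths rather than just any path.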
I have to do some work with Q-Learning, about a guy who has to move furniture around a house (it's basically that). If the house is small enough, I can just have a matrix that represents actions/rewards.
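A quick way to see when that matrix stops being feasible is to count states: with `cells` free positions and `pieces` distinguishable furniture items, there are cells!/(cells - pieces)! possible placements. The concrete numbers below are illustrative:

```python
from math import factorial

def placements(cells, pieces):
    # Number of ways to put `pieces` distinguishable items on `cells`
    # distinct positions, at most one item per position.
    return factorial(cells) // factorial(cells - pieces)

small_house = placements(10, 3)    # 720 states: a state/action matrix is fine
big_house = placements(100, 10)    # far beyond 10**18: a table no longer fits
```

This is the usual motivation for moving from a tabular representation to function approximation once the house (state space) grows.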