DBpedia 2016-04

Matches in DBpedia 2016-04 for { <http://dbpedia.org/resource/Q-learning> ?p ?o }

Showing triples 1 to 61 of 61 with 100 triples per page.

Q-learning abstract "Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter. A policy is a rule that the agent follows in selecting actions, given the state it is in. When such an action-value function is learned, the optimal policy can be constructed by simply selecting the action with the highest value in each state. One of the strengths of Q-learning is that it is able to compare the expected utility of the available actions without requiring a model of the environment. Additionally, Q-learning can handle problems with stochastic transitions and rewards, without requiring any adaptations. It has been proven that for any finite MDP, Q-learning eventually finds an optimal policy, in the sense that the expected value of the total reward return over all successive steps, starting from the current state, is the maximum achievable.".
Q-learning wikiPageExternalLink Reinforcement%20Learning%20Maze.
Q-learning wikiPageExternalLink 352693.html.
Q-learning wikiPageExternalLink master.
Q-learning wikiPageExternalLink citation.cfm?id=1143955.
Q-learning wikiPageExternalLink piqle.
Q-learning wikiPageExternalLink toki78.github.io.
Q-learning wikiPageExternalLink thesis.html.
Q-learning wikiPageExternalLink the-book.html.
Q-learning wikiPageExternalLink node65.html.
Q-learning wikiPageExternalLink node4.html.
Q-learning wikiPageID "1281850".
Q-learning wikiPageLength "14097".
Q-learning wikiPageOutDegree "23".
Q-learning wikiPageRevisionID "704527469".
Q-learning wikiPageWikiLink Action-value_function.
Q-learning wikiPageWikiLink Artificial_neural_network.
Q-learning wikiPageWikiLink Atari_2600.
Q-learning wikiPageWikiLink Backgammon.
Q-learning wikiPageWikiLink Category:Machine_learning_algorithms.
Q-learning wikiPageWikiLink Deep_learning.
Q-learning wikiPageWikiLink Deterministic_system.
Q-learning wikiPageWikiLink Expected_value.
Q-learning wikiPageWikiLink Fitted_Q_iteration_algorithm.
Q-learning wikiPageWikiLink Function_approximation.
Q-learning wikiPageWikiLink Game_theory.
Q-learning wikiPageWikiLink Google_DeepMind.
Q-learning wikiPageWikiLink Markov_decision_process.
Q-learning wikiPageWikiLink Prisoners_dilemma.
Q-learning wikiPageWikiLink Probably_approximately_correct_learning.
Q-learning wikiPageWikiLink Reinforcement_learning.
Q-learning wikiPageWikiLink State-Action-Reward-State-Action.
Q-learning wikiPageWikiLink Stochastic_process.
Q-learning wikiPageWikiLink Temporal_difference_learning.
Q-learning wikiPageWikiLinkText "Q‑learning".
Q-learning wikiPageWikiLinkText "Q-learning".
Q-learning wikiPageWikiLinkText "q-learning".
Q-learning wikiPageUsesTemplate Template:=.
Q-learning wikiPageUsesTemplate Template:Citation_needed.
Q-learning wikiPageUsesTemplate Template:Technical.
Q-learning wikiPageUsesTemplate Template:Tmath.
Q-learning subject Category:Machine_learning_algorithms.
Q-learning hypernym Reinforcement.
Q-learning type AnatomicalStructure.
Q-learning type Algorithm.
Q-learning type Redirect.
Q-learning comment "Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter. A policy is a rule that the agent follows in selecting actions, given the state it is in.".
Q-learning label "Q-learning".
Q-learning sameAs Q2664563.
Q-learning sameAs کیو-یادگیری.
Q-learning sameAs Q-Learning.
Q-learning sameAs Q-learning.
Q-learning sameAs Q学習.
Q-learning sameAs Q-læring.
Q-learning sameAs m.04pvn7.
Q-learning sameAs Q-learning.
Q-learning sameAs Q-обучение.
Q-learning sameAs Q-навчання.
Q-learning sameAs Q2664563.
Q-learning wasDerivedFrom Q-learning?oldid=704527469.
Q-learning isPrimaryTopicOf Q-learning.