Related Tags: QCQP, SDP, SOCP, Autonomous Navigation, Path Planning, Convex Programming, Continuous Environments Optimization Minimum Volume Ellipsoids Ellipsoidal Surfaces Reinforcement Learning Policy Evaluation Improvement Iteration Value Bellman Optimality Equation Single Agent FOMDPs Static Monte Carlo Constrained On-Policy Off-Policy e-soft e-greedy Exploring Starts GPI Temporal Difference SARSA Q-Learning R-Learning Actor-Critic 1-Step TD(0) Functions Markov Property