Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)
OpenAlex
Because of safety risks and the sample inefficiency of training, controllers are often developed in simulation. However, minor differences between the simulation and the real world can cause a significant sim-to-re...
Language: en