Researchers at the University of Pennsylvania and University of Michigan have created the world's smallest fully programmable ...
Abstract: This letter proposes an algorithm for solving finite-time nonlinear optimal control problems. The proposed method employs the Gauss pseudospectral method to transform the optimal control ...
Abstract: In this article, we investigate the optimal control problem for an unknown linear time-invariant system. To solve this problem, a novel composite policy iteration algorithm based on adaptive ...
verl is a flexible, efficient and production-ready RL training library for large language models (LLMs). verl is the open-source version of HybridFlow: A Flexible and Efficient RLHF Framework paper.