The training framework is based on SDPO which in turn based on verl under Apache-2.0 License.