). Proven experience with large-scale reinforcement learning experiments, including online RL techniques such as Group Relative... is required, including state-of-the-art online RL methods and other gradient-based optimization approaches like policy gradients, actor...
RL systems. Mastery of IaC/GitOps frameworks and ability to build reusable multi-team infrastructure patterns...
-informed machine learning or mechanical simulation. Experience with reinforcement learning (RL) and safe deployment of RL...
). Experience with Ray for scalable training, tuning, and serving pipelines;RL experience a plus. Advanced proficiency in IaC...
-improving and self-evolving agent loops using evaluator-driven optimization (generate, evaluate, refine) and/or RL-style...
Lugar:
Dublin | 01/02/2026 00:02:15 AM | Salario: S/. No Especificado | Empresa:
EXL Service1