-training, and Reinforcement Learning (RL). Your work will leverage the latest LLMs and multimodal models to develop...-tuning on step-by-step trajectories and tool-use traces - RL for LLMs, Offline RL and off-policy evaluation - Agentic...
Location:
Bellevue, WA | 09/01/2026 02:01:53 AM | Salary: Not specified | Company:
Amazon
Rockstar is recruiting for a company building the AI backbone for the next generation of intelligent products..., and embeddings. Think of them as “AWS for AI models” rather than data/compute: a full-stack backend for fine-tuning, RL, inference...
improvements through: shape and extend our RL optimization platform - a pricing centric tool that automates the optimization..., please contact your Recruiting Partner. Our compensation reflects the cost of labor across several US geographic markets. The base...
Location:
Seattle, WA | 06/01/2026 00:01:49 AM | Salary: Not specified | Company:
Amazon
for RL Sales Develop, maintain and improve standard operating procedure manual for all sales processes Develop and share... - RL covers your Medical deductible through reimbursements Employer paid dental, vision, disability & life insurance...
Location:
USA | 01/01/2026 03:01:35 AM | Salary: $165,000 - $175,000 per year | Company:
ReversingLabs
compensation packages (base, bonus and equity) HRA - RL covers your Medical deductible through reimbursements Employer paid... you directly. ReversingLabs is an equal opportunity employer. Applicants only - Recruiting agencies, please do not contact....
Location:
USA | 23/12/2025 22:12:25 PM | Salary: $150,000 - $160,000 per year | Company:
ReversingLabs
Who is Recruiting from Scratch: Recruiting from Scratch is a specialized talent firm dedicated to helping companies... frameworks, benchmarks, and RL environments. Set engineering guardrails that enable rapid growth without sacrificing stability...
solutions. You will guide teams working at the intersection of large-scale foundation models, multi-agent systems, and RL-based... using machine learning and RL Collaborate with product, engineering, and business teams to translate research outcomes...
Location:
USA | 18/12/2025 02:12:19 AM | Salary: Not specified | Company:
GoDaddy
Ralph Lauren Corporation (NYSE: RL) is a global leader in the design, marketing and distribution of premium lifestyle..., Polo Ralph Lauren, Double RL, Lauren Ralph Lauren, Polo Ralph Lauren Children, Chaps, among others, constitute one of the...
areas like offline RL, reward modeling, RLHF, DPO, GRPO. Strong programming skills in Python and experience with machine...). To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting...