. Strong C/C++ and practical Python experience. Deep familiarity with TensorRT, TensorRT-LLM, ONNX, PyTorch, CUDA, Triton... out from the crowd: Hands-on experience with TensorRT internals, CUDA kernels, Triton kernels, or other compiler/runtime...
such as Charles River, Triton, and Bloomberg · Proficiency across trading channels, including high-touch, algorithmic, block pools...
), AI/ML concepts, LLMs, RAG architecture, VLLM. Ø Model Serving: NVIDIA Triton Server APIs for inference at scale. Ø Data...
NCCL, RDMA, and high-performance networking. Implement custom operators and fused kernels in PyTorch, JAX, or Triton... Experience with Triton, CUTLASS, or other GPU kernel authoring frameworks. Familiarity with TensorRT, FasterTransformer, or vLLM...
, Triton, CUDA, HIP etc. Excellent C/C++/Scripting (Python, etc.) experience Knowledge of Graphics/Compute APIs (DirectX...
platforms such as Dynamo-Triton, RAPIDS, NeMo, NIM. Skilled in developing systems and tools to measure developer sentiment...
: Da Vinci Xi, Getinge Autoclave, Amsco Autoclave, Steris V-Pro, Steris System 1e, Getinge 86 series washer, Triton 72 ultrasonic...
technical tasks and solutions involving JAX, ML systems, custom kernel workflows, Pallas, Triton, and related optimization... writing, optimizing, or evaluating custom GPU kernels using Pallas, Triton, or comparable kernel-level tools...
, Steris V-Pro, Steris System 1e, Getinge 86 series washer, and Triton 72 ultrasonic cleaner. Strong critical thinking...
and compute optimization Experience with: PyTorch (model-level workloads and execution) Triton (custom kernel development...