, stable, and scalable technology products. You will implement critical technology solutions across multiple technical areas... creative approaches to solve technical challenges. Write secure, high-quality production code and maintain algorithms...
of large-scale models, working across the software-hardware stack. THE PERSON The ideal candidate is a strong technical... (e.g., vLLM, SGLang, Triton, or similar systems). - Implement and evaluate inference optimization techniques...
, stable, and scalable technology products. As a core technical contributor, you will drive critical technology solutions... across multiple technical areas, supporting the firm's business objectives. Job Responsibilities Lead the design, development...
Location: Seattle, WA or Palo Alto, CA (Hybrid/Remote) Type: Full-time, Senior Technical Leadership About the Team... development. · Proven ability to lead ambitious technical programs and mentor junior researchers. Preferred Qualifications...
architecture, scalability, and technical direction of that platform. Build resilient, scalable AI platforms that empower startups... with cross-functional teams to translate business and customer needs into robust technical solutions. Stay up to date with the...
from development to production. You will work directly with our AI/ML engineers, the Lead Architect, and on-site client technical teams... across training and inference workloads. Configure and manage NVIDIA Triton Inference Server for multi-model serving, dynamic...
efficiency. Leads deployment and optimization using model inference servers such as Triton Inference Server and vLLM for high... response, security, and compliance, with continuous improvement. Translates highly complex technical concepts and emerging...
, and deliver world-class AI experiences to millions of users. The ideal candidate combines deep technical expertise...’s most ambitious startup founders and engineers. What you will be doing: Serve as a trusted technical advisor to the...
. ▸ Create technical documentation, reference architectures, and integration guides for enterprise and hyperscaler partners... services in containerized / cloud-native environments (e.g., vLLM, SGLang, Triton). ▸ Deep understanding of 1M+ token context...
's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding... or related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited...