AI (OpenAI) EVAL Engineer with LLMs and GenAI
OpenAI Anthropic Google AI platforms. Performance benchmarking (speed, throughput, cost). Domain knowledge Office apps...
OpenAI Anthropic Google AI platforms. Performance benchmarking (speed, throughput, cost). Domain knowledge Office apps...
Experience with Azure OpenAI, OpenAI, Anthropic, and Google AI platforms Performance benchmarking (speed, throughput, cost...
with OpenAI/Anthropic APIs and frameworks like LangChain or LlamaIndex. Product Experience: Minimum of 2+ years in product...
planning, retries, grounding, and memory Integrations with LLM APIs (e.g., Azure OpenAI/OpenAI/Anthropic/Vertex) in production...
switching between LLMs (e.g., GPT-4, Llama, Anthropic) without refactoring core application code. Unified API Design: Build... Language Models (LLMs) such as Llama 3, GPT-4, and Anthropic. LLM Observability: Deep experience building and maintaining...
thought leadership across GitHub and Microsoft. Product/Service Development Partner with OpenAI, Anthropic, Google...
, Anthropic, Google) and establish performance baselines. Automate batch evaluations and reporting with Python, integrate...
experience that reflects Anthropic’s values Build and nurture long-term relationships for future pipeline needs Collaboration...
, Anthropic, Hugging Face, etc.). Knowledge of fine-tuning or instruction-tuning AI models. Familiarity with evaluation metrics...
Development: You have used APIs (OpenAI, Anthropic, Gemini) to build applications and understand the basics of prompt engineering...