As an AI Platform Engineer embedded within our Data Science and AI tribe, you will design and maintain the foundational infrastructure that enables our AI Software Engineers to build and scale generative AI applications.
You will build our internal AI provisioning platform, focusing on seamless model access, observability, and strict governance. In this role, you will ensure our AI integrations are secure, performant, and cost-effective, providing the robust backbone required for our agentic and LLM-driven workflows.
Responsibilities:
-
Develop and maintain an internal AI provisioning platform to streamline access to various LLMs & MCP Servers (e.g., via LiteLLM).
-
Implement comprehensive LLM observability, logging, and alerting systems using tools like Datadog and Langfuse.
-
Enforce model governance, ensuring security, data privacy, rate limiting, and strict LLM API cost management.
-
Collaborate closely with embedded AI Software Engineers to support cloud-native architectures and agentic workflow integrations.
-
Build and maintain infrastructure for monitoring the reliability and performance of production AI systems.