Your mission
Own end-to-end ML workflows: data ingestion, feature engineering, training, evaluation & production deployment
Deep-dive into at least one core application domain: (1) price prediction; (2) two-sided market design (ranking / recommendations); (3) agentic workflows using state of the art LLMs; from prototype to robust production services.
Establish and enforce LLMops best practices: design reproducible prompt-engineering workflows, manage model/version tracking, automate evaluation (e.g. evals frameworks), monitor latency/cost, and refine prompts based on performance metrics.
Set up performance tracking, alerting and automated evaluation to ensure model health in production.
Collaborate across product, engineering and data science to translate business needs into technical solutions.
Spar with peers in the AI team and contribute to code reviews, best practices and tooling for reproducible, production-grade ML.