(Senior) ML Platform Engineer (w/m/d)

Billie • Berlin, Berlin, Germany • 2d ago

About the role:

Join our dynamic team in Berlin as a Senior MLOps Engineer and play a crucial role in shaping the future of our ML infrastructure. You'll be responsible for designing, building, and maintaining scalable and resilient machine learning pipelines that support real-time and batch model deployment and monitoring. Your expertise will be key in supporting various AI and machine learning initiatives across the organization, making a real-time, real-life impact on our business.

What you will be working on:

ML Platform Development and Maintenance

Design and implement a scalable and reliable ML platform using tools such as MLflow, Kubeflow, and Metaflow
Ensure timely and reliable delivery of ML model predictions, with continuous monitoring for performance, accuracy, and reliability
Maintain and extend our in-house batch and real-time feature platforms powering numerous feature pipelines

Cross-functional Collaboration

Work closely with data scientists and data engineers to understand model and feature requirements
Translate requirements into scalable and reliable pipelines
Collaborate on deploying, monitoring, and optimizing ML models in production

Continuous Improvement and Innovation

Actively contribute to the team's knowledge sharing and participate in tech talks
Drive architectural decisions as the subject matter expert (SME) in MLOps
Stay abreast of industry trends and advancements in MLOps technologies to improve our ML Platform

Who we are looking for:

Extensive experience with machine learning workflows and MLOps tools such as Metaflow and MLflow
Strong knowledge of model deployment, serving, and monitoring using tools like AWS SageMaker, KServe, Seldon Core
Proficiency in designing and implementing model deployment pipelines using Github Actions, ArgoCD, or similar tools
Experience with orchestration tools such as Airflow, ArgoCD, Prefect
Familiarity with data versioning, data governance practices, and model versioning
Understanding of reproducibility and experimentation using tools like MLflow or Weights & Biases
Knowledge of model training and inference using popular frameworks like Scikit-learn (TensorFlow or PyTorch is a plus)
Excellent collaboration skills, particularly in working with data scientists on model deployment and optimization