Job Description
Job Description - Roles and Responsibilities
As a Cloud (ML)Ops Engineer, you’ll work at the intersection of cloud infrastructure, DevOps, and machine learning operations. Together with your team, you’ll help build a reliable, scalable, and secure platform that supports data scientists and analysts throughout their entire workflow.
This includes :
- hosting a multi-user Jupyter environment and a cloud IDE;
- providing frameworks for training, storing, serving, and monitoring custom models, primarily for high throughput batch processing;
- exposing models via APIs for low latency request-response use cases;
- enabling Generative AI initiatives.
Your responsibilities include :
Designing and building cloud-native platform services for AI models and data pipelines.Collaborating with colleagues and stakeholders across countries to develop technical solutions.Managing infrastructure using tools like Terraform, Docker, and Kubernetes on AWS.Automating workflows for data processing and model lifecycle management (Airflow, Spark, and Python)Ensuring platform reliability, performance, and cost-efficiency.Supporting colleagues in using the platform, including onboarding and troubleshooting.Contributing to the evolution of our MLOps practices.What do we expect from you?
You have a strong interest in cloud, data and AI, and eager to learn about new developments in the field.
Education or experience : Master’s degree in ICT, Engineering Sciences or Business Engineering with a focus on Informatics, or equivalent experience.Technical skills :
o Proficient in Python and the broader data science ecosystem.
o Experience with cloud infrastructure (preferably AWS).
o Familiar with Docker and Kubernetes.
o Skilled in infrastructure as code (Terraform).
o Experience with CI / CD tools like Jenkins or GitHub Actions.
o Knowledge of big data tools such as Spark.