Description:
Title: Site Reliability Engineer (SRE), ML Platform
Responsibilities:
Continuous deployment using GitHub Actions, Flux, and Kustomize.
Design and implement cloud solutions; build MLOps on cloud (AWS).
Data science model containerization and deployment using Docker, vLLM, and Kubernetes.
Communicate with a team of data scientists, data engineers, and architects; document the processes.
Develop and deploy scalable tools and services for our clients to handle machine learning training and inference.
Knowledge of ML models a
Aug 6, 2025;
from:
dice.com