#JR012414
operations across mainframe, distributed, and cloud environments.
As an MLOps Engineer, you'll be the guardian of the machine learning lifecycle-making sure models built by Data Scientists and deployed by ML Engineers continue to perform, scale, and deliver business value in production. From monitoring drift to automating retraining, you'll own the workflows that ensure AI doesn't just launch, but lasts.
This role is perfect for engineers who thrive in the space between data science, DevOps, and operations. You'll build pipelines, logging frameworks, and monitoring systems that make AI reliable and explainable. Your work ensures that our incident predictions, anomaly detections, and optimization models are not just powerful, but trusted by Ops teams and cost-effective for the business.
If you're a builder who loves creating end-to-end systems that keep AI alive and useful in the real world, this is the role for you.
What You Will Do:
Lifecycle Automation - Build workflows for retraining, validation, and redeployment of models to keep them current and reliable.
Monitoring & Observability - Develop systems to track model performance, drift, cost, and reliability in production environments.
Logging & Explainability - Ensure every prediction is traceable, explainable, and auditable to build trust with Ops and clients.
Performance & Cost Optimization - Identify opportunities to improve inference performance while reducing compute/storage costs.
Collaboration with AI/ML Teams - Partner with Data Scientists and ML Engineers to translate model requirements into production-ready, maintainable workflows.
Tooling & Platform Management - Standardize and scale the use of ML platforms, frameworks, and monitoring tools across the enterprise.
We want all new Associates to succeed in their roles at Ensono. That's why we've outlined the job requirements below. To be considered for this role, it's important that you meet all Required Qualifications. If you do not meet all of the Preferred Qualifications, we still encourage you to apply.
Required Skills & Experience
Strong programming skills in Python (must-have) with knowledge of C++ or Java as a plus.
Experience with Docker, Kubernetes, or similar orchestration platforms.
Familiarity with ML frameworks (TensorFlow, PyTorch, Scikit-learn) and their production lifecycle.
Hands-on experience with MLOps platforms and tools (MLflow, Kubeflow, SageMaker, etc.).
Strong background in monitoring, logging, and performance tuning of production ML systems.
Exposure to Snowflake, ServiceNow data flows, and enterprise IT operations datasets.
Knowledge of cost optimization strategies for AI/ML workloads.
JR012414