Senior DevOps Engineer

Ensono

4.6

(7)

Pune, India

Why you should apply for a job to Ensono:

  • 4.6/5 in overall job satisfaction
  • 4.7/5 in supportive management
  • 100% say women are treated fairly and equally to men
  • 100% would recommend this company to other women
  • 100% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.

    #JR010129

    Position summary

    it's public, multi or hybrid cloud, or mainframe. And because we span across all mission-critical platforms, we can meet you wherever you are in your digital transformation journey, with 24/7 support when you need it. We are your relentless ally, flexing with you when challenges emerge so you don't feel stuck in place. With cross-platform certifications and decades of experience, our technology experts have become an extension of your team so you're continuously innovating - doing more with less while remaining secure. And that's just the beginning.

    About Role:

    We are seeking an experienced Observability SME with deep expertise in observability architectures and leading monitoring platforms. This role will be responsible for designing, implementing, and optimizing end-to-end observability solutions for applications, infrastructure, and networks. The ideal candidate should have extensive hands-on experience with BMC TrueSight and Helix, VROPS, and Entuity, ensuring seamless monitoring, alerting, and analytics to enhance IT operations and service reliability. Reporting to the Sustain Engineering, Observability Manager, you will be part of the Observability Operations team, supporting mission-critical infrastructure for Ensono's strategic clients. Using your proven communication, analytical, and problem-solving skills, you will help identify, communicate, and resolve issues to optimize our IT infrastructure using various monitoring tools.

    The Observability team is responsible for maintaining and enhancing the service we deliver to our clients by effectively:
    • Managing all tickets logged in the Monitoring queue
    • Managing client and internal communication on all assigned tickets
    • Conducting proactive and reactive incident and event management
    • Reducing the number of repeat issues through root cause analysis
    • Working with internal departments to mitigate monitoring-related issues and resolve Incident, Request, Change, and Problem tickets

    Key Responsibilities

    • Observability Strategy & Architecture: Design and implement comprehensive observability solutions to monitor applications, infrastructure, and network performance.

    • Monitoring Tool Implementation & Optimization: Deploy and fine-tune monitoring solutions such as BMC TrueSight and Helix, VROPS, and Entuity.

    • Log Management & Analysis: Establish centralized logging, log parsing, and correlation for improved event detection and troubleshooting.

    • Metrics & Performance Monitoring: Define KPIs, dashboards, and alerts for proactive IT service monitoring.

    • Incident Management & Root Cause Analysis: Collaborate with IT operations, DevOps, and SRE teams to diagnose and resolve performance issues.

    • Automation & Integration: Integrate monitoring tools with ITSM platforms such as ServiceNow, AIOps solutions, and automation frameworks for enhanced efficiency.

    • Capacity Planning & Optimization: Analyze historical trends and real-time data to optimize resource allocation and performance.

    • Stakeholder Collaboration: Work closely with client stakeholders, network engineers, system administrators, and business units to ensure observability best practices are followed.

    • Continuous Improvement: Stay updated on emerging observability technologies and recommend improvements to existing processes and tools.

    • Adherence to ITIL processes.

    Qualification and Experience:

    • Expertise in Observability & Monitoring Platforms: 8+ years of hands-on experience with BMC TrueSight, VROPS, Entuity, and similar platforms. Knowledge of

    • Infrastructure & Application Monitoring: Experience monitoring cloud, on-premises, and hybrid environments.

    • Automation & Scripting: Proficiency in scripting languages such as Python, PowerShell, or Bash for automation.

    • Cloud & DevOps Understanding: Experience with cloud platforms (AWS, Azure, GCP) and CI/CD pipelines.

    • Networking & Security Awareness: Knowledge of network monitoring, SNMP, and security monitoring practices.

    • Excellent Communication & Documentation Skills: Ability to present findings, create technical documentation, and train teams on observability best practices.

    • Technical Acumen: An understanding of infrastructure technologies including Linux, Microsoft Windows Server, storage/backup, and networking, along with protocols and tools such as TCP, UDP, ping, SNMP, and WMI.

    • Knowledge of the ITIL framework is desirable (Incident, Request, Change, and Problem management).
