#4160
n time to service restoration and impact to the business. Demonstrate end to end ownership
§ Apply Site Reliability Engineering (SRE) principles to design and implement robust tooling, proactive alerting, and automated response mechanisms that identify, mitigate, and resolve reliability risks-focusing on prevention, early detection, and continuous improvement through automation.
§ Own and evolve the architecture of Platform, PaaS, and SaaS solutions to meet current and future business need-driving innovation, scalability, and operational excellence.
§ Participate in an on-call rotation, providing hands-on technical expertise during service-impacting incidents to ensure rapid diagnosis, effective resolution, and continuous improvement of system reliability.
§ Evaluate, implement, and manage emerging data technologies with a focus on big data, analytics, data wrangling, business intelligence, and data visualization to drive innovation and efficiency.
§ Serve as a subject matter expert and technical lead for data platforms, tools, and application interfaces - driving root cause analysis, resolving complex technical issues, and ensuring platform reliability and performance.
§ Provide technical leadership and mentorship to junior and mid-level data engineers, fostering skill development and promoting engineering best practices.
§ Collaborate with and empower data engineers, data scientists, and business analysts by enabling self-service capabilities for data wrangling, exploration, and analysis
§ Develop training materials and deliver end-user training sessions to drive adoption, ensure effective use of data solutions, and enhance customer engagement.
§ Support the documentation, metadata management, and visualization of data assets to promote data discoverability, transparency, and self-service analytics.
§ Foster a culture of accountability and collaboration by building strong team commitment to shared priorities and strategic goals.
QUALIFICATIONS
§ Bachelor's degree in computer science, Computer Engineering, or a closely related field (or foreign equivalent). Relevant experience may be obtained through a qualifying post-baccalaureate academic program.
§ 7+ years extensive expertise in platform administration, big data technologies, data engineering, analytics, and operations.
§ Proficiency in a broad range of tools and concepts including Hadoop, Linux, Python, SQL, Spark, Kerberos, cloud platforms, security protocols, performance tuning, machine learning algorithms, production engineering, job scheduling, and operational support
§ At least 7+ years of progressive and diverse experience in IT, platform administration, database management, analytics or an equivalent combination of education and work experience
§ At least 3 years of hands-on engineering experience in developing platform solutions on AWS using Infrastructure as Code (CloudFormation, Terraform, Ansible) and CICD pipelines (Jenkins, Nexus, AWS CodeBuild, CodeDeploy or CodePipeline).
§ At least 3 years of experience providing architectural guidance and technical direction to developers, or platform administrators
§ Knowledge in data engineering, with exposure to designing and building data applications using database platforms.
§ Responsibilities should include data ingestion, data preparation, ETL processes, data aggregation, data mining, and the development of database and analytics applications
§ Experience in managing change through Change Management and Incident Management processes
§ Experience in implementing Reliability Engineering practices and Observability dashboards leveraging Splunk, Dynatrace, CloudWatch etc. will be a plus
§ Ability to learn new technologies quickly, and perform major job responsibilities proficiently within 6-12 months
§ Strong analytical ability, problem analysis techniques, and broad knowledge of alternatives technology
§ Strong communication skills, and the ability to work effectively with business and IT resources