#TOYOUS10279729EXTERNALENUS
The primary responsibility of this role is to help modernize and implement DevOps methodologies to fully shift from a hybrid on-premises and cloud organization today to a fully cloud-centric organization with DevSecOps, data ops, automated delivery via CICD, and automated preventive controls. Reporting to the DevOps Engineering Manager, the person in this role will support the organization's objective to ensure safe and reliable software deployments and measuring application service performance and availability using Infrastructure-as-Code, and Continuous Integration / Continuous Delivery Pipelines to handle the full application lifecycle. You will also lead the development of Intelligent Automation solutions to streamline our operations. The ideal candidate will have experience with Site Reliability Engineering (SRE) principles, AWS, Snowflake, and automation tools.
What you'll be doing
Manage day-to-day support activities, including L3 support, releases, and infrastructure provisioning.
Develop and implement DevSecOps, DevOps, and data ops best practices, including test automation.
Build and implement best-in-class CI/CD capabilities to automate data pipeline delivery.
Collaborate with platform teams to integrate tooling into existing pipelines.
Drive adoption and alignment of DevOps and cloud engineering practices across the Data Engineering team.
Partner with Risk Management and Security teams to ensure secured and compliant cloud infrastructure and services.
Lead the design, implementation, and maintenance of cloud-based infrastructure on AWS.
Drive the process and approach our Data Engineering team use to document sensitive, protected, and shared data to ensure compliance with appropriate information and data governance policies (GDPR, CCPA, SOX, etc.).
Performing site reliability engineering development efforts to improve availability and performance of software systems (debugging, triaging and identifying root cause for failure in a production environment and performing postmortem analysis).
Defining Standard Operating Procedures (SOPs) and Runbook for troubleshooting production issues; developing software operations resilience patterns for deployed software infrastructure and implementing highly available and resilient software systems
Technical Leadership
Lead the design, implementation, and maintenance of cloud-based infrastructure on AWS
Develop and implement SRE principles to ensure high availability, scalability, and security
Collaborate with cross-functional teams to identify and prioritize project requirements
Provide technical guidance and mentorship to junior team members
Intelligent Automation
Design and develop Intelligent Automation solutions to streamline operations
Implement automation tools such as Ansible, Terraform, or CloudFormation
Collaborate with stakeholders to identify areas for automation and process improvement
Snowflake and Data Engineering
Collaborate with data engineering teams to design and implement data pipelines on Snowflake
Ensure data security, governance, and compliance with regulatory requirements
Optimize data storage and query performance on Snowflake
Site Reliability Engineering (SRE)
Implement SRE principles to ensure high availability, scalability, and security
Develop and implement monitoring, logging, and alerting solutions
Collaborate with teams to identify and resolve incidents and outages
Requirements
Technical Requirements
8+ years of experience in DevOps, SRE, or a related field
Strong experience with AWS, including EC2, S3, Lambda, and CloudWatch
Experience with Snowflake and data engineering principles
Strong experience with automation tools such as Ansible, Terraform, or CloudFormation
Experience with SRE principles and practices
Strong programming skills in languages such as Python, Java, or C++
Soft Skills
Strong leadership and communication skills
Ability to collaborate with cross-functional teams
Strong problem-solving skills and attention to detail
Ability to adapt to changing priorities and requirements
Nice to Have
Experience with containerization using Docker or Kubernetes
Experience with CI/CD pipelines using Jenkins, and GitLab
Experience with monitoring and logging tools such as Prometheus, Grafana, or ELK
Experience with agile development methodologies such as Scrum or Kanban
Belonging at Toyota
Our success begins and ends with our people. We embrace diverse perspectives and value unique human experiences. Respect for all is our North Star. Toyota is proud to have 10+ different Business Partnering Groups across 100 different North American chapter locations that support team members' efforts to dream, do and grow without questioning that they belong.
Applicants for our positions are considered without regard to race, ethnicity, national origin, sex, sexual orientation, gender identity or expression, age, disability, religion, military or veteran status, or any other characteristics protected by law.
Have a question, need assistance with your application or do you require any special accommodations? Please send an email to [email protected].