#17892
hat power mission-critical applications, ensuring low-latency, high-throughput, and fault-tolerant data storage. Your work will directly impact system reliability, scalability, and cost efficiency.
You will collaborate closely with principal engineers, architects, SREs, and product teams to define technical roadmaps, improve storage efficiency, and optimize access patterns. You will drive performance tuning, data modeling, caching strategies, and cost optimization across AWS storage services like S3, DynamoDB, EBS, EFS, FSx, and Glacier. Additionally, you will contribute to infrastructure automation, security best practices, and monitoring strategies using tools like Terraform, CloudWatch, Prometheus, and OpenTelemetry.
In this role, you will also be responsible for troubleshooting and resolving production incidents related to data integrity, latency spikes, and storage failures, ensuring high availability and disaster recovery preparedness. You will mentor junior engineers, participate in design reviews and architectural discussions, and advocate for engineering best practices such as CI/CD automation, infrastructure as code, and observability-driven development. Your contributions will directly impact the organization's ability to scale its storage infrastructure efficiently while maintaining security, reliability, and compliance with industry standards.
Basic Requirements