Lead Site Reliability Engineer - Production Support Services (Flexible Hybrid)

Fannie Mae

3.9

(14)

Reston, VA

Why you should apply for a job to Fannie Mae:

  • 79% say women are treated fairly and equally to men
  • 71% would recommend this company to other women
  • 85% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Student loan repayment and employer-assisted housing program
  • Seramount (formerly “Working Mother”) 100 Best Companies 2023
  • Flexible work environment and paid leave for life’s unpredictable moments
  • #744000027466485

    Position summary

    software technology and coordinate simultaneous implementation tasks across teams.

    • May maintain or oversee the maintenance of existing software.

    Qualifications
    THE EXPERIENCE YOU BRING TO THE TEAM

    Required Experience

    • 4 plus years of experience developing enterprise applications
    • 4 plus years of engineering enterprise cloud infrastructure
    • Experience managing technical stakeholders
    • Experience mentoring and coaching junior engineers
    • Experience with Application Performance Management and Observability

    Desired Experience

    • Bachelor's degree in computer science, Management Information Systems (MIS), Systems Engineering, or related field
    • Certification in AWS Solutions Architect Associate or Developer Associate, Splunk Certification Developer, or Sun Certified Java Developer
    • Experience with application production / operations support, including incident response, problem management, runbooks, and knowledge articles
    • Experience with post-mortems, root-cause analysis (RCA), and / or AWS Correction-of-Errors (CoE)
    • Experience with Failure Mode Effect Analysis (FMEA) and Chaos testing / engineering
    • Experienced in application monitoring / observability, including building dashboards, establishing service level indicators / objectives / agreements (SLIs / SLOs / SLAs), and logging / tracing

    Skills

    • Skilled in programming in Java and / or Python with an understanding J2EE frameworks, such as Spring Boot / Spring Cloud, and REST
    • Skilled in AWS cloud applications and technologies, including containerization, virtualization, microservices, and server-less architecture in tools
    • Understanding of error budgeting and toil reduction
    • Ability to create disaster recovery plans and execute failover tests
    • Skilled in capacity planning and performance testing / engineering tools, such as JMeter and / or LoadRunner
    • Skilled in Scaled Agile Framework (SAFe) and Jira / Confluence
    • Understanding of fault tolerant / resilience architectural design patterns, such as Bulkhead, Circuit-breaker, Retry, Timeout, etc.
    • Ability to create automation solutions using tools such as BluePrism and / or Selenium
    • Excellent problem-solving skills and proactivity in resolving issues / blockers
    • Excellent verbal / written communication skills, relationship management skills, and ability to collaborate with multiple stakeholders

    Tools

    • AWS (ECS, EC2, RDS, Redshift, EMR, Lambda, Route 53, Step Functions)
    • Programming using Python/Java
    • DevOps - Infrastruture as Code, CICD - Jenkins, GitLab, Terraform
    • ServiceNow, Moogsoft, StatusHub, and / or Blameless
    • Gremlin, Chaos Monkey, Chaos Toolkit, AWS Fault Injection Service (FIS)

    Why you should apply for a job to Fannie Mae:

  • 79% say women are treated fairly and equally to men
  • 71% would recommend this company to other women
  • 85% say the CEO supports gender diversity
  • Ratings are based on anonymous reviews by Fairygodboss members.
  • Student loan repayment and employer-assisted housing program
  • Seramount (formerly “Working Mother”) 100 Best Companies 2023
  • Flexible work environment and paid leave for life’s unpredictable moments