data governance frameworks and compliance requirements.
Key Responsibilities:
Design & Develop Data Pipelines:
- Architect and implement end-to-end data pipelines using AWS S3, EMR, Glue, Step Functions, Apache NiFi, and Spark.
- Manage data ingestion from AWS S3, ensuring secure and efficient data transfer.
- Implement initial data routing, validation, and transformations using Apache NiFi processors and Spark engines.
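The routing-and-validation step above is what NiFi processors such as ValidateRecord/RouteOnAttribute perform; as a minimal sketch of the same logic (the field names here are placeholders, not the actual schema):

```python
# Assumed required fields for an incoming record; the real schema
# would come from the pipeline's data contract.
REQUIRED_FIELDS = {"id", "timestamp", "payload"}

def route_records(records):
    """Split incoming records into 'valid' and 'invalid' routes,
    mirroring a NiFi validate-then-route flow."""
    routes = {"valid": [], "invalid": []}
    for rec in records:
        # A record is valid only if all required fields are present
        # and the id is non-empty.
        if REQUIRED_FIELDS.issubset(rec) and rec.get("id"):
            routes["valid"].append(rec)
        else:
            routes["invalid"].append(rec)
    return routes
```

Invalid records would typically be quarantined to a separate S3 prefix for inspection rather than dropped.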
Data Processing & Transformation:
- Integrate AWS EMR, Apache NiFi, and Spark to perform complex data transformations and analytics.
- Optimize Spark jobs for processing large-scale datasets with a focus on performance and resource utilization.
- Handle both historical and incremental data loads, ensuring data consistency and integrity.
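A common way to keep historical and incremental loads consistent is watermark-based slicing: each run processes only records newer than the last stored watermark, so re-runs stay idempotent. A minimal sketch, assuming an `updated_at` field (a placeholder for whatever change-tracking column the source provides):

```python
def incremental_slice(records, last_watermark):
    """Return records newer than the stored watermark, plus the new
    watermark to persist for the next run."""
    fresh = [r for r in records if r["updated_at"] > last_watermark]
    # If nothing new arrived, keep the old watermark unchanged.
    new_watermark = max((r["updated_at"] for r in fresh),
                       default=last_watermark)
    return fresh, new_watermark
```

A full historical load is then just the same function with the watermark set to its minimum value.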
Data Storage & Management:
- Define and implement data storage strategies across S3, RDS, and Redshift, adhering to business requirements.
- Manage data catalog creation and schema management using AWS Glue.
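Glue catalog entries can be created either by crawlers or explicitly via the API. The helper below builds the `TableInput` payload that boto3's `glue_client.create_table(DatabaseName=..., TableInput=...)` expects for an external Parquet table on S3; it only constructs the dict (no AWS call), and the table name, location, and columns are illustrative:

```python
def glue_table_input(name, location, columns):
    """Build a Glue TableInput dict for an external Parquet table.

    columns: list of (name, hive_type) tuples, e.g. ("id", "string").
    """
    return {
        "Name": name,
        "TableType": "EXTERNAL_TABLE",
        "StorageDescriptor": {
            "Columns": [{"Name": c, "Type": t} for c, t in columns],
            "Location": location,
            # Standard Hive Parquet formats/serde used by Glue tables.
            "InputFormat": "org.apache.hadoop.hive.ql.io.parquet."
                           "MapredParquetInputFormat",
            "OutputFormat": "org.apache.hadoop.hive.ql.io.parquet."
                            "MapredParquetOutputFormat",
            "SerdeInfo": {
                "SerializationLibrary": "org.apache.hadoop.hive.ql.io."
                                        "parquet.serde.ParquetHiveSerDe",
            },
        },
    }
```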
Automation & Orchestration:
- Develop and manage workflows using Apache Airflow and AWS Step Functions to automate data processing tasks.
- Implement monitoring, error handling, and retries within the orchestration framework.
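The retry behavior described above (as configured via Airflow task retries or a Step Functions `Retry` block) boils down to re-running a task with exponential backoff. A minimal, framework-free sketch of that policy:

```python
import time

def with_retries(task, max_attempts=3, base_delay=1.0, sleep=time.sleep):
    """Run task(), retrying on failure with exponential backoff.

    sleep is injectable so tests can skip real waiting.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return task()
        except Exception:
            if attempt == max_attempts:
                # Out of attempts: surface the error to the
                # orchestrator's failure/alerting path.
                raise
            # Delays of base_delay, 2*base_delay, 4*base_delay, ...
            sleep(base_delay * 2 ** (attempt - 1))
```

In practice the orchestrator owns this loop; the sketch just shows the semantics being configured.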
Security & Compliance:
- Ensure data security with encryption (AES-256, TLS) and IAM role-based access controls.
- Implement data governance policies using AWS Glue Data Catalog to ensure compliance with regulatory requirements.
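Role-based access control is expressed as IAM policy documents. The helper below builds a least-privilege, read-only policy for a single S3 prefix; the bucket and prefix names are placeholders, and the dict is the standard IAM policy JSON that would be attached to a role:

```python
def s3_read_policy(bucket, prefix):
    """Build an IAM policy granting read-only access to one S3 prefix."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Read objects only under the given prefix.
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": [f"arn:aws:s3:::{bucket}/{prefix}/*"],
            },
            {
                # Allow listing, but only within that prefix.
                "Effect": "Allow",
                "Action": ["s3:ListBucket"],
                "Resource": [f"arn:aws:s3:::{bucket}"],
                "Condition": {
                    "StringLike": {"s3:prefix": [f"{prefix}/*"]}
                },
            },
        ],
    }
```

Encryption in transit (TLS) and at rest (AES-256 / SSE) is configured separately on the bucket and endpoints; the policy controls only who may read.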
Performance Monitoring & Optimization:
- Use AWS CloudWatch to monitor the performance of EMR clusters, NiFi flows, and data storage.
- Continuously optimize Spark job configurations and NiFi data flows for maximum throughput and minimal latency.
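Custom pipeline metrics (e.g. flow throughput) can be pushed to CloudWatch via `put_metric_data`. The sketch below only builds the argument dict for boto3's `cloudwatch.put_metric_data(**payload)`; the namespace, dimension, and metric names are illustrative, not a fixed convention:

```python
def throughput_metric(namespace, flow_name, records, seconds):
    """Build a put_metric_data payload reporting records/second
    for one NiFi flow."""
    return {
        "Namespace": namespace,
        "MetricData": [
            {
                "MetricName": "RecordsPerSecond",
                # Dimension lets CloudWatch separate metrics per flow.
                "Dimensions": [{"Name": "Flow", "Value": flow_name}],
                "Value": records / seconds,
                "Unit": "Count/Second",
            }
        ],
    }
```

Alarms on such metrics (and on built-in EMR cluster metrics) then feed the tuning loop for Spark and NiFi configurations.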