Senior Data Engineer (Informatica CDI & Legacy Migration)
KData AIFull Description
Role Summary
We are looking for a Senior Data Engineer to lead the migration and development of mission-critical data pipelines from legacy systems to Informatica Cloud Data Integration (CDI). The primary focus is building high-performance ETL patterns in a cloud-native environment while reverse-engineering existing Ab Initio logic. You will play a key role in ensuring data integrity during the transition of high-volume financial data assets.
Core Responsibilities
* CDI Development: Design, develop, and unit test complex mappings, task flows, and transformations within Informatica IDMC/CDI.
* Legacy Migration: Analyze existing Ab Initio graphs (GDE/EME) to understand business logic, data transformations, and dependencies for migration to CDI.
* Optimization: Implement Pushdown Optimization (PDO) in Informatica to leverage cloud warehouse processing power (Snowflake, Databricks, or AWS Redshift).
* Data Validation: Build robust reconciliation frameworks to ensure data parity between the legacy Ab Initio outputs and the new CDI pipelines.
* API Integration: Develop and maintain connectors using Cloud Integration Hub (CIH) and REST/SOAP transformations.
* Financial Compliance: Ensure all data movement adheres to banking security standards, including encryption at rest/transit and PII masking.
Requirements
Technical Skills Required
* Primary Expertise: 6+ years in ETL/Data Integration, with at least 3+ years of deep hands-on experience in Informatica CDI (Cloud).
* Legacy Knowledge: Functional understanding of Ab Initio (ability to navigate GDE, understand components like Reformat, Join, and Rollup, and read DML).
* Cloud Platforms: Strong experience with cloud data warehouses (e.g., Snowflake, Google BigQuery, or Azure Synapse).
* Advanced SQL: Ability to write and tune complex analytical queries.
* Scripting: Proficiency in Unix/Linux shell scripting to manage file-based triggers and automation.
Preferred Qualifications
* Migration Experience: Proven track record of successfully migrating ETL workloads from on-premise (Ab Initio, DataStage, or PowerCenter) to Informatica Cloud.
* Financial Domain: Understanding of regulatory reporting (GDPR, BCBS 239) or Capital Markets data flow.
* DevOps: Experience with version control (Git/Bitbucket) and automated deployment (CI/CD) for Informatica assets.