Back to jobsJob overview

About the role

Data Engineer I, WW FBA Central Analytics at ADCI - BLR 14 SEZ

Required Skills

pythonsqletlawsvector databasesdata pipelinesbig datadata modelingspark

About the Role

This Data Engineer I role supports the GenAI-powered insights assistant by building pipelines that process unstructured data in the S3 Data Lakehouse. The position involves managing vector databases for embeddings and working on one of Amazon's largest analytics ecosystems.

Key Responsibilities

  • Develop metadata pipelines to tag documents with freshness and ownership
  • Implement caching and multi-region replication to reduce query latency
  • Monitor data retrieval accuracy and log source citations to improve AI trustworthiness
  • Automate ingestion and embedding generation for unstructured data into vector databases

Required Skills & Qualifications

Must Have:

  • 1+ years of data engineering experience
  • Experience with data modeling, warehousing and building ETL pipelines
  • Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
  • Experience with one or more scripting language (e.g., Python, KornShell)

Nice to Have:

  • Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
  • Experience with any ETL tool like, Informatica, ODI, SSIS, BODI, Datastage, etc.
  • Strong expertise in AWS Glue, Redshift, Kinesis/MSK, Lambda
  • Hands-on with data contracts, lineage tracking, and automated QA
  • Familiarity with multi-modal data ingestion (structured + unstructured)
  • Experience operationalizing cross-region replication and caching strategies