
About Amazon
The everything store and cloud computing leader
Key Highlights
- Headquartered in South Lake Union, Seattle, WA
- Over 1.5 million employees worldwide
- Leading cloud services through Amazon Web Services (AWS)
- Acquired Whole Foods, Twitch, and Ring
Amazon, headquartered in South Lake Union, Seattle, WA, is the world's largest online retailer and a leader in cloud computing through Amazon Web Services (AWS). With over 1.5 million employees globally, Amazon operates in various sectors, including AI with its Alexa devices and a vast marketplace k...
🎁 Benefits
Amazon offers competitive salaries, stock options, generous PTO policies, and comprehensive health benefits. Employees also have access to a learning ...
🌟 Culture
Amazon's culture is driven by customer obsession and a focus on innovation. The company encourages employees to think big and move fast, fostering an ...

Data Engineer I, WW FBA Central Analytics
Amazon • Bengaluru, Karnataka, IND
Job Description
Our charter includes building the foundational pipelines, governance frameworks, and intelligent interfaces that enable internal customers to query, analyze, and act on complex datasets with natural language. This is an opportunity to work on one of the largest, complex, and critical analytics ecosystems, designing solutions that combine massive scale, high reliability, and advanced AI.
We are seeking a Data Engineer I will support the GenAI-powered insights assistant by building pipelines that process unstructured data (knowledge articles and documents) in the S3 Data Lakehouse. You'll manage vector databases that store embeddings, helping the AI retrieve relevant info quickly and accurately.
Key job responsibilities
- Develop metadata pipelines to tag documents with freshness, ownership, and other context for better filtering.
- Implement caching and multi-region replication to reduce query latency.
- Monitor data retrieval accuracy and log source citations to improve AI trustworthiness.
- Automate ingestion and embedding generation for unstructured data into vector databases like Zilliz, Pinecone, or OpenSearch.
- 1+ years of data engineering experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
- Experience with one or more scripting language (e.g., Python, KornShell)- Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
- Experience with any ETL tool like, Informatica, ODI, SSIS, BODI, Datastage, etc.
- Strong expertise in AWS Glue, Redshift, Kinesis/MSK, Lambda.
- Hands-on with data contracts, lineage tracking, and automated QA.
- Familiarity with multi-modal data ingestion (structured + unstructured).
- Experience operationalizing cross-region replication and caching strategies.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Amazon.