Data Engineer, AGI Autonomy Human Feedback

Amazon, San Francisco
Our team’s mission is to build the world’s most useful agent, and we’re looking for a Data Engineer to build the pipelines and tools for collecting and analyzing a wide range of human data. You’ll work alongside world-class AI researchers, engineers, and program managers to identify and implement the best processes for human data collection.

This role is highly cross-functional, drawing on skills across data science, machine learning engineering, and project management to ensure our team collects the most effective agentic training data in a rapidly evolving technological environment.

Key job responsibilities
  • Work closely with researchers and engineers to create robust data pipelines and data collection tools.
  • Work closely with program managers to optimize data collection processes.
  • Simplify and enhance the accessibility, clarity, and usability of large or complex datasets through the development of advanced dashboards and applications.
  • Take ownership of the design, creation, and upkeep of metrics, reports, analyses, and dashboards to inform data collection projects.
  • Develop and manage scalable, automated, and fault-tolerant data solutions using cutting-edge technologies such as Spark, EMR, Python, Redshift, Glue, and S3.
  • Continually improve ongoing reporting and analysis processes, automating or simplifying self-service support for datasets.

Basic qualifications
  • 3+ years of data engineering experience
  • 1+ years of program or project management experience
  • Proficient in SQL
  • Experience with data modeling, warehousing, and building ETL pipelines
  • Experience using data and metrics to determine and drive improvements
  • Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or Node.js

Preferred qualifications
  • Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
  • Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
  • Experience with big data technologies such as Hadoop, Hive, Spark, and EMR

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies.
Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation.

Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information.

If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $118,900/year in our lowest geographic market up to $205,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.
Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits.

For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits.
