Lead Data Engineer
Role : Lead Data Engineer Location : Scottsdale AZ (100% onsite)Hire Type : FTE and CTH Must have skill set: Spark, S3, Glue, AWS Redshift , python and stream set exp 6-8 years of IT experience focusing on enterprise data architecture and management.Experience in Conceptual/Logical/Physical Data Modelling & expertise in Relational and Dimensional Data ModellingExperience with Databricks & on Prem , Structured Streaming, Delta Lake concepts, and Delta Live Tables requiredExperience with Spark scala Data Lake concepts such as time travel and schema evolution and optimizationStructured Streaming and Delta Live Tables with Databricks a bonusExperience leading and architecting enterprise-wide initiatives specifically system integration, data migration, transformation, data warehouse build, data mart build, and data lakes implementation / supportAdvanced level understanding of streaming data pipelines and how they differ from batch systemsFormalize concepts of how to handle late data, defining windows, and data freshnessAdvanced understanding of ETL and ELT and ETL/ELT tools such as Data Migration Service etcUnderstanding of concepts and implementation strategies for different incremental data loads such as tumbling window, sliding window, high watermark, etc.Familiarity and/or expertise with Great Expectations or other data quality/data validation frameworks a bonusFamiliarity with concepts such as late data, defining windows, and how window definitions impact data freshnessAdvanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design performance optimization)Indexing and partitioning strategy experienceDebug, troubleshoot, design and implement solutions to complex technical issuesExperience with large-scale, high-performance enterprise big data application deployment and solutionArchitecture experience in AWS environment a bonusFamiliarity working with Lambda specifically with how to push and pull data, how to use AWS tools to view data for processing massive data at scale a bonusExperience with Gitlabs and CloudWatch and ability to write and maintain gitlabs for supporting CI/CD pipelinesExperience working with AWS Lambdas for configuration and optimization and experience with S3Familiarity with Schema Registry, message formats such as Avro, ORC, etc.Ability to thrive in a team-based environmentExperience briefing the benefits and constraints of technology solutions to technology partners, stakeholders, team members, and senior level of management