Data Scientist

apartmentStartekk LLC placeMalvern calendar_month 
Note:
Location: Malvern, PA (Onsite)
C2C: Yes
Duration: 6+ Months
Primary Skills: Data Warehouse

Visa: H4EAD, green card or citizens preferred

Key Responsibilities: Develop, implement, and maintain robust machine learning models using GenAI agentic framework.

Engage in prompt engineering to optimize the interaction with large language models, with other AI systems or tools.
Utilize techniques like Retrieval-Augmented Generation (RAG) to enhance AI solutions with real-time information retrieval.
Develop and design graph databases using tools such as NetworkX or AWS Neptune for relationship-oriented data modeling.
Fine-tune Large Language Models (LLMs) to tailor solutions to specific business needs and improve model efficiency.
Leverage AWS services (including S3, ECS, ECR, Lambda, SageMaker, and more) to build scalable machine learning solutions.
Utilize data engineering skills with Glue and PySpark for efficient data preparation and processing.
Conduct model evaluation and testing to ensure accuracy, reliability, and robustness.
Monitor machine learning models in production, implementing strategies for ongoing performance tracking and optimization.
Ensure the security of models and data through secure API integration, utilizing tokens and comprehensive data security principles.
Govern models in compliance with the Machine Learning Development Lifecycle (MDLC) and Machine Learning Production Lifecycle (MPLC) processes.
Collaborate with cross-functional teams, including data scientists, engineers, and business stakeholders, to deliver impactful machine learning solutions.

Stay up-to-date with the latest advancements in machine learning, generative AI technologies, and methodologies.

Qualifications
Bachelors or Masters degree in Computer Science, Engineering, Mathematics, Data Science, or a related field.
5+ years of experience in machine learning and data engineering roles.
Experience with prompt engineering and fine-tuning large AI models.
Familiarity with Retrieval-Augmented Generation (RAG) techniques.
Proficiency in designing graph databases using tools like AWS Neptune, Neo4J or NetworkX.
Proficiency in Python, with a strong understanding of libraries such as TensorFlow, PyTorch, and Scikit-learn.
Extensive experience with AWS services relevant to machine learning, including SageMaker, Glue and PySpark.
Experience with secure API integration and a solid understanding of data security practices.
Strong understanding of data preprocessing, feature engineering, and model training/evaluation.
Experience with model monitoring, performance analysis, and optimization techniques.

Familiarity with model governance frameworks, including MDLC and MPLC processes.

placePhiladelphia, 17 mi from Malvern (PA)
Overview: ForMotiv is a real-time web behavioral science company. Our SaaS platform monitors user-behavior in web applications, and makes web-speed recommendations ( We’re looking for someone to join our data science team as a data scientist...
local_fire_departmentUrgent

Data Scientist II - Retail

apartmentBurlingtonplaceEdgewater Park, 32 mi from Malvern (PA)
LOCATION** 4287 Route 130 S Edgewater Park NJ US 08010 **Overview** Come join our growing team here at Burlington Stores as a **Data Scientist II!** The Data Scientist II will support business areas including Merchandising, Allocations...
apartmentPfizerplaceCollegeville (PA), 11 mi from Malvern (PA)
Procedures (SOPs) and working practices. + Promote the use of consistent, efficient, and quality processes to meet timelines and deliverables. + Serve as Clinical Data Scientist and Trial Lead for one or more clinical trials assuming responsibility...