AI Platform Architect
Solutionit New York
AI Platform Architect for one of its clients in NYC Job Title AI Platform Architect Required Skills Responsibilities: Design and develop end-to-end applications that seamlessly integrate machine learning capabilities, including real-time inference, batch processing, and efficient data management to deliver scalable and robust solutions.
Identify bottlenecks in the model development, deployment, and monitoring process. Design and implement production-ready machine learning pipelines, including model training, validation, deployment, and monitoring (e.g., labelled data sets to check performance of prompts).
Build scalable, high-performance infrastructure to support Generative AI workflows (e.g., distributed training, inference optimization, and GPU/TPU utilization). Deploy GenAI applications into production cloud environments with performance, cost, and latency trade-offs considered (e.g., open-source vs. closed-source, quantization, prompt token length, completion caching, prompt caching).
Monitor and troubleshoot model performance, addressing issues such as performance drift and response latency. Stay at the forefront of Generative AI advancements, identifying opportunities to incorporate the latest research and techniques into production systems.
Qualifications: Bachelor's or advanced degree in computer science, engineering, or a related field. 3+ years of experience in machine learning engineering, with a focus on deploying AI systems at scale. Experience working with large-scale Generative AI applications in production environments.
Relevant experience in the legal domain is a plus. Strong proficiency in Python and machine learning frameworks (e.g., TensorFlow, PyTorch). Experience with Generative AI tools and techniques (e.g., LLMs, quantization, synthetic data generation, knowledge distillation, retrieval-augmented generation, fine-tuning).
Proficiency in commons GenAI libraries (e.g., LangChain, Autogen) and cloud-native AI services (e.g., Azure search) Knowledge of cloud infrastructure (e.g., Azure) and management tools for IT components, storage, networking, and caching. Familiarity with ML Ops principles, including CI/CD pipelines, containerization, and automated testing for AI systems.
Experience with modern container platforms (e.g., Docker, OpenShift) and tools like Jenkins, Git, and Sonar. Strong problem-solving skills with the ability to address complex technical challenges. Excellent communication skills to collaborate with cross-functional teams and explain technical concepts to non-technical stakeholders.
Eagerness to stay updated with cutting-edge AI research and apply innovative ideas to real-world problems. Organization and attention to detail, ensuring high-quality delivery. Ability to work collaboratively to create innovative and efficient solutions.
Identify bottlenecks in the model development, deployment, and monitoring process. Design and implement production-ready machine learning pipelines, including model training, validation, deployment, and monitoring (e.g., labelled data sets to check performance of prompts).
Build scalable, high-performance infrastructure to support Generative AI workflows (e.g., distributed training, inference optimization, and GPU/TPU utilization). Deploy GenAI applications into production cloud environments with performance, cost, and latency trade-offs considered (e.g., open-source vs. closed-source, quantization, prompt token length, completion caching, prompt caching).
Monitor and troubleshoot model performance, addressing issues such as performance drift and response latency. Stay at the forefront of Generative AI advancements, identifying opportunities to incorporate the latest research and techniques into production systems.
Qualifications: Bachelor's or advanced degree in computer science, engineering, or a related field. 3+ years of experience in machine learning engineering, with a focus on deploying AI systems at scale. Experience working with large-scale Generative AI applications in production environments.
Relevant experience in the legal domain is a plus. Strong proficiency in Python and machine learning frameworks (e.g., TensorFlow, PyTorch). Experience with Generative AI tools and techniques (e.g., LLMs, quantization, synthetic data generation, knowledge distillation, retrieval-augmented generation, fine-tuning).
Proficiency in commons GenAI libraries (e.g., LangChain, Autogen) and cloud-native AI services (e.g., Azure search) Knowledge of cloud infrastructure (e.g., Azure) and management tools for IT components, storage, networking, and caching. Familiarity with ML Ops principles, including CI/CD pipelines, containerization, and automated testing for AI systems.
Experience with modern container platforms (e.g., Docker, OpenShift) and tools like Jenkins, Git, and Sonar. Strong problem-solving skills with the ability to address complex technical challenges. Excellent communication skills to collaborate with cross-functional teams and explain technical concepts to non-technical stakeholders.
Eagerness to stay updated with cutting-edge AI research and apply innovative ideas to real-world problems. Organization and attention to detail, ensuring high-quality delivery. Ability to work collaboratively to create innovative and efficient solutions.
Work Site: NYC Duration 12 Months Expected Start Date: Immediate Number of Positions 1
Henningson, Durham & Richardson, PCNew York
About Us
At HDR, we specialize in engineering, architecture, environmental and construction services. While we are most well-known for adding beauty and structure to communities through high-performance buildings and smart infrastructure, we...
AmazonNew York
As a Principal Partner Solutions Architect in the Amazon Ads Global Customer Development organization, you will be responsible for designing and delivering complex advertising technology solutions for strategic partners. This role requires deep...
Intone NetworksNew York
Position: Lead/Architect Salesforce Developer Location: Remote Fulltime Description: As a Salesforce Lead with expertise in Experience Cloud, Service Cloud, and Marketing Cloud, you will play a critical role in the design, development...