Job Description

Job Description Summary – AI Optimization Engineer (Onsite, Jersey City, NJ)

We are seeking an experienced AI Optimization Engineer to support large-scale AI/ML and Generative AI workloads for an enterprise environment. This role focuses on optimizing, deploying, and managing machine learning and large language models (LLMs) on GPU-accelerated HPC infrastructure. The ideal candidate will have strong experience in Python-based machine learning, deep learning frameworks, model optimization techniques, and scalable AI infrastructure.

The engineer will work closely with AI, infrastructure, and DevOps teams to design efficient model training and inference pipelines, implement SLURM-based workload orchestration, and deploy containerized ML solutions in production environments. Responsibilities include optimizing model performance using techniques such as pruning, quantization, and knowledge distillation, managing inference workflows using Triton Inference Server, and monitoring system performance using Prometheus and Grafana.

This role requires hands-on experience with HPC environments, GPU clusters, containerization technologies, and Linux system administration, along with strong knowledge of machine learning algorithms, deep learning architectures, and modern AI development tools. Experience with cloud platforms, vector embedding, and enterprise-scale AI deployments is highly preferred.

Core Responsibilities

Design and optimize AI/ML workloads on GPU-based HPC clusters.
Deploy and manage large language models (LLMs) in scalable production environments.
Implement model optimization techniques including pruning, quantization, and knowledge distillation.
Develop and manage automated job scheduling using SLURM with REST and Flask APIs.
Deploy ML models using containerized microservices architectures.
Monitor system performance using Prometheus and Grafana.
Optimize inference pipelines using Triton Inference Server and TRTLLM.
Conduct exploratory data analysis and model performance evaluation.
Collaborate with infrastructure and ML teams to improve scalability and efficiency.

Skills Required

The AI Optimization Engineer must have strong experience in Python-based machine learning and deep learning , including NumPy, scikit-learn, TensorFlow, PyTorch, and Keras, with hands-on knowledge of supervised and unsupervised learning, neural networks, transformer-based models, NLP, CNNs, and Generative AI concepts. The role requires expertise in AI infrastructure and optimization , including HPC environments, GPU clusters, SLURM workload management, Triton Inference Server, TRTLLM, and model optimization techniques such as pruning, quantization, and distillation for scalable LLM deployment.

Candidates should also have experience with DevOps and deployment tools such as Docker, Kubernetes, MLFlow, Terraform, Jenkins, GitHub, and HuggingFace, along with strong skills in performance monitoring using Prometheus and Grafana. Additional requirements include Flask API development, Linux administration (RHEL/CentOS), container runtimes like Enroot, Pyxis, and Podman, and experience with data analysis and visualization tools such as Plotly, Seaborn, and Matplotlib.

Job Tags

Full time

Similar Jobs

Blue Star Partners LLC

Business IT Analyst (Contract) Job at Blue Star Partners LLC

...Certifications: HL7 Certification (v2, CCDA and/or FHIR) Integration Engine Certification EHR Integration Certifications (e.g., Epic Bridges, Cerner FSI) Experience with Linux-based operating systems (e.g., RedHat) Understanding of networking principles...

Centessa Pharmaceuticals, LLC

Sr. Scientist, Drug Safety (Pharmacovigilance) Job at Centessa Pharmaceuticals, LLC

...improved success rates for programs with greater speed and modest costs. Description of Role We are seeking a Senior Scientist, Drug Safety (Pharmacovigilance) to support pharmacovigilance activities across Centessa's clinical-stage development programs. Reporting to...

GO2 Delivery

Contract Courier/Driver - Richmond Job at GO2 Delivery

...more about becoming a medical courier. Independent Contract Courier Medical Deliveries... ...We have immediate opportunities for drivers to complete urgent local mdeical STAT deliveries... ...Operate as an independent contractor (1099) Use your own small to mid-size...

VDart Inc

AI Engineer Job at VDart Inc

...Role: AI Engineer Location: Toronto, CA (Remote) Type: Contract Overview: We are seeking an Senior AI Engineer to join our team. The ideal candidate will have a strong background in implementing AI models, analyzing and improving existing AI...

Sam Hill and Shoco Oil

Class B Oil Delivery Driver Job Job at Sam Hill and Shoco Oil

...working order. Drivers deliver lubricating oil to customers, window wash, antifreeze, gasoline, fuel oil, open valves or starts pumps to... ...and issues ticket to customer, may attach ground wire to truck. Ability to work flexible hours including nights weekends and overtime...

AI Optimization Engineer - ONSITE Job at Simple Solutions, Jersey City, NJ

RUp1dVRlTVFMTkhtREtMV21UVGQvU1ZkZWc9PQ==