Back to jobs

Generative AI / ML Engineer

Precision Technologies
United States
Full-time
AI tools:
TensorFlow
PyTorch
OpenAI
Hugging Face
Pinecone
Applications go directly to the hiring team

Full Description

Job Title: Gen AI / Machine Learning Engineer (LLMs / Python / NLP / Deep Learning)

Location: United States — Onsite / Hybrid / Remote

Employment Type: W2 · Full-time

Key Responsibilities:

* Design, develop, and deploy machine learning and Generative AI models for real-world applications.

* Build and fine-tune Large Language Models (LLMs) for tasks such as text generation, summarization, classification, and conversational AI.

* Develop end-to-end ML pipelines including data ingestion, preprocessing, model training, evaluation, and deployment.

* Work with frameworks like TensorFlow, PyTorch, and Scikit-learn for building ML and deep learning models.

* Implement NLP techniques such as tokenization, embeddings, semantic search, and transformer-based architectures.

* Develop and optimize prompt engineering strategies for LLM-based applications. Build and integrate AI-powered solutions using APIs (e.g., OpenAI, Azure OpenAI, Hugging Face).

* Work with vector databases (e.g., Pinecone, FAISS, Weaviate) for retrieval-augmented generation (RAG) systems.

* Deploy models using cloud platforms such as AWS, Azure, or Google Cloud. Optimize model performance, scalability, and latency for production environments.

* Collaborate with data engineers, product managers, and stakeholders to define AI use cases and solutions.

* Implement MLOps practices including CI/CD, model versioning, monitoring, and retraining pipelines.

* Ensure responsible AI practices including bias mitigation, explainability, and data privacy compliance.

* Debug, troubleshoot, and enhance model performance and system reliability. Stay current with advancements in Generative AI, LLMs, and machine learning technologies.

Contact:

Mohana Guddanti, Talent Management Team

[email protected]

Applications go to the hiring team directly