Back to jobs

Generative AI Architect

Galent
New York, United States
Full-time
AI tools:
Azure OpenAI
RAG
embeddings
Applications go directly to the hiring team

Full Description

JOB DESCRIPTION

ROLE : Gen AI Architect

LOCATION: NYC / NJ Area

Experience

* 10+ years in ML/Engineering.

* At least 5 years in architecture roles.

* Proven experience delivering production-grade solutions on Azure.

* Hands-on ownership of end-to-end application lifecycle—from design to deployment.

Responsibilities

* Lead architecture and development of enterprise web applications with integrated Generative AI.

* Define scalable, secure architectural patterns and implementation standards.

* Work closely with product, AI/ML, DevOps, security, and network teams to align business and technical goals.

* Drive full-stack development best practices across backend, frontend, and infrastructure.

* Architect and integrate LLMs, embeddings, RAG pipelines, and vector databases into production systems.

* Ensure production readiness—security, networking, monitoring, performance, and compliance.

Mandatory Technical Skills

Cloud Architecture (Azure)

* Deep experience designing cloud-native, microservices and event-driven architectures.

* Expertise with Azure App Services, AKS, Functions, API Management, Event Grid, Service Bus, Storage, and Key Vault.

* Strong understanding of subscription design, resource hierarchy, and environment isolation.

Networking & Security

* Hands-on experience with:

* VNETs, subnets, private endpoints, service endpoints.

* NSGs, ASGs, routing, firewalls, and VPN/ExpressRoute connectivity.

* Whitelisting, IP restrictions, certificates, TLS, and enterprise-grade authentication flows.

* Experience implementing Zero Trust, RBAC, managed identities, and secure secrets handling.

Performance & Scalability

* Expertise in caching (Redis/CDN), async processing (RabbitMQ/Kafka), load balancing, auto-scaling, and performance tuning.

GenAI Integration

* Hands-on experience with Azure OpenAI, RAG patterns, embeddings,

* prompt engineering, vector search (Azure AI Search, PostgreSQL

* extensions).

* Experience orchestrating LLM pipelines in real-world production

* environments.

Backend Engineering

* Strong REST API design (versioning, throttling, API gateways).

* Expert in PostgreSQL/MongoDB, data modeling, query optimization.

* Experience with OAuth2, SSO, and secure coding aligned to GDPR/SOC2.

Frontend Engineering

* Deep experience with React/Next.js, component libraries, and enterprise UI patterns.

DevOps & IaC

* Strong proficiency with:

* Azure DevOps or GitHub Actions CI/CD.

* Docker, Kubernetes (AKS), Helm.

* Terraform or Bicep for infrastructure automation.

* Experience managing production roll-outs, blue-green deployments, and canary releases.

Real-Time & Scalable UI

* Knowledge of WebSockets/SSE, state management, and high-performance UI rendering.

Testing & Observability

* Experience with automated unit/E2E testing.

* Strong knowledge of logging, tracing, monitoring (App Insights, Log Analytics), and alerting.

JOB DESCRIPTION

ROLE : Gen AI Architect

LOCATION: NYC / NJ Area

Experience

* 10+ years in ML/Engineering.

* At least 5 years in architecture roles.

* Proven experience delivering production-grade solutions on Azure.

* Hands-on ownership of end-to-end application lifecycle—from design to deployment.

Responsibilities

* Lead architecture and development of enterprise web applications with integrated Generative AI.

* Define scalable, secure architectural patterns and implementation standards.

* Work closely with product, AI/ML, DevOps, security, and network teams to align business and technical goals.

* Drive full-stack development best practices across backend, frontend, and infrastructure.

* Architect and integrate LLMs, embeddings, RAG pipelines, and vector databases into production systems.

* Ensure production readiness—security, networking, monitoring, performance, and compliance.

Mandatory Technical Skills

Cloud Architecture (Azure)

* Deep experience designing cloud-native, microservices and event-driven architectures.

* Expertise with Azure App Services, AKS, Functions, API Management, Event Grid, Service Bus, Storage, and Key Vault.

* Strong understanding of subscription design, resource hierarchy, and environment isolation.

Networking & Security

* Hands-on experience with:

* VNETs, subnets, private endpoints, service endpoints.

* NSGs, ASGs, routing, firewalls, and VPN/ExpressRoute connectivity.

* Whitelisting, IP restrictions, certificates, TLS, and enterprise-grade authentication flows.

* Experience implementing Zero Trust, RBAC, managed identities, and secure secrets handling.

Performance & Scalability

* Expertise in caching (Redis/CDN), async processing (RabbitMQ/Kafka), load balancing, auto-scaling, and performance tuning.

GenAI Integration

* Hands-on experience with Azure OpenAI, RAG patterns, embeddings,

* prompt engineering, vector search (Azure AI Search, PostgreSQL

* extensions).

* Experience orchestrating LLM pipelines in real-world production

* environments.

Backend Engineering

* Strong REST API design (versioning, throttling, API gateways).

* Expert in PostgreSQL/MongoDB, data modeling, query optimization.

* Experience with OAuth2, SSO, and secure coding aligned to GDPR/SOC2.

Frontend Engineering

* Deep experience with React/Next.js, component libraries, and enterprise UI patterns.

DevOps & IaC

* Strong proficiency with:

* Azure DevOps or GitHub Actions CI/CD.

* Docker, Kubernetes (AKS), Helm.

* Terraform or Bicep for infrastructure automation.

* Experience managing production roll-outs, blue-green deployments, and canary releases.

Real-Time & Scalable UI

* Knowledge of WebSockets/SSE, state management, and high-performance UI rendering.

Testing & Observability

* Experience with automated unit/E2E testing.

* Strong knowledge of logging, tracing, monitoring (App Insights, Log Analytics), and alerting.

Applications go to the hiring team directly