Back to jobs

Senior Conversational AI / Voice Integration Engineer

OViiE Ai Inc
Davenport, FL
Full-time
AI tools:
ElevenLabs
Applications go directly to the hiring team

Full Description

Senior Conversational AI / Voice Integration Engineer

OViiE AI Inc. | Orlando, FL (Hybrid/Remote)

About OViiE AI

OViiE AI is building the world’s first language infrastructure layer for global communication. Our flagship platform, STELLA, enables real-time, voice-to-voice translation across hospitality, travel, and enterprise environments—removing language barriers at the moment they matter most.

We are not building a feature.

We are building infrastructure.

With upcoming deployments aligned to FIFA World Cup 2026 host cities, OViiE AI is scaling rapidly and seeking engineers who want to build real-world, high-impact AI systems used by millions globally.

The Role

We are looking for a Senior Conversational AI / Voice Integration Engineer to lead the development of our real-time voice orchestration layer, integrating:

* ElevenLabs conversational agents

* OViiE’s proprietary Virtual Translation Rooms (VTR)

* STELLA’s enterprise SaaS platform

This role is critical in enabling live, multilingual, voice-to-voice communication across multiple participants, systems, and environments.

You will own the end-to-end conversational pipeline—from audio input to translated output to workflow automation.

What You’ll Do

* Architect and build real-time voice pipelines for conversational AI systems

* Integrate ElevenLabs agents (WebSocket-based streaming) with STELLA and VTR

* Design and implement tool-calling / function orchestration to trigger translation and workflows

* Build APIs and services that handle:

* audio streaming

* transcript processing

* translation routing

* session state management

* Develop multi-participant conversation logic (speaker routing, language detection, turn-taking)

* Connect conversational flows to enterprise systems (PMS, CRM, service workflows)

* Optimize for low latency, high concurrency, and reliability at scale

* Collaborate closely with product, ML, and design teams to deliver seamless user experiences

What You Bring

* 5+ years in software engineering, with focus on real-time systems or conversational AI

* Strong experience with:

* WebSockets / WebRTC / streaming architectures

* Node.js or Python backend development

* REST APIs and event-driven systems

* Experience integrating or working with:

* LLMs or conversational AI frameworks

* speech systems (ASR/TTS)

* voice APIs (e.g., ElevenLabs, Twilio, etc.)

* Understanding of:

* multi-turn dialogue systems

* tool/function calling in LLM environments

* latency optimization in real-time applications

Bonus Points

* Experience with multilingual systems or translation pipelines

* Familiarity with NVIDIA Riva / NeMo / speech AI frameworks

* Experience building multi-user or session-based platforms

* Background in hospitality, travel tech, or real-time communication platforms

Applications go to the hiring team directly