Pave Talent

www.pavetalent.com

1 Job

26 Employees

About the Company

Pave Talent partners with high-growth companies across the US to recruit exceptional people; we build executive leadership teams and the people needed to support them. In today's war for talent, having a database, posting on job boards, and using great sourcing tools is not enough. The very best people are happily employed and probably working at your competitors; do you have the process and resources in place to land them? We do. Actually, that's all we do. We place candidates across a variety of functions, including Human Resources, Sales and Marketing, Research and Development, Production/Operations, Customer Service, Finance and Accounting, Administration, and IT.

Listed Jobs

Company Name: Pave Talent
Job Title: Software Engineer, AI/ML Systems (LLMs + RAG + Agents) (Only W2)
Job Description:

**Job Title**

Software Engineer – AI/ML Systems (LLMs + RAG + Agents)

**Role Summary**

Design, build, and maintain production-grade AI/ML systems that enable autonomous vehicle operations. Work on multi-step AI agents, conversational interfaces, retrieval-augmented generation (RAG), and large language model (LLM) deployment, ensuring low latency, high reliability, and optimal cost.

**Expectations**

- Delivery of scalable, on-time AI services that meet stringent performance metrics.
- Close collaboration with cross-functional engineering teams to translate business needs into robust AI solutions.
- Continuous evaluation of quality, latency, cost, and user experience for LLMs and RAG systems.
- Full ownership of architecture, deployment, and lifecycle management of AI models and agents.

**Key Responsibilities**

- Design and implement AI agents capable of multi-step decision making.
- Develop conversational AI (chatbots, voice-based services).
- Integrate AI functionalities across platforms and services.
- Build and optimize RAG pipelines utilizing vector databases, embeddings, and advanced retrieval techniques.
- Implement machine-learning models in PyTorch; evaluate and deploy LLMs from OpenAI, Anthropic, Google, and Meta.
- Architect and maintain production-grade AI systems with a focus on scalability, reliability, and performance.
- Work with REST, gRPC, or Kafka for inter-service communication.
- Utilize Docker/Kubernetes for containerization and orchestration.
- Participate in CI/CD pipelines and DevOps practices.

**Required Skills**

- 6+ years of Python programming for AI/ML (not just general web dev).
- 6+ years of experience with PyTorch or an equivalent ML framework.
- Proven expertise in building/deploying LLMs, RAG systems, or AI agents in production.
- Deep understanding of transformer architectures and attention mechanisms.
- Knowledge of AI frameworks: LangChain, LlamaIndex, AutoGen.
- Cloud deployment on AWS, GCP, or Azure.
- Full-stack development (backend and frontend).
- Kotlin proficiency.
- Containerization (Docker) and orchestration (Kubernetes).
- REST API, gRPC, or Kafka experience.
- Familiarity with CI/CD pipelines and DevOps workflows.

**Required Education & Certifications**

- None specified.
- Must be authorized to work in the United States without sponsorship.

San Diego, United States

On site

Mid level

15-03-2026