**Company Name**
Together AI
**Job Description**
**Job Title**
LLM Inference Frameworks and Optimization Engineer
**Role Summary**
Design, develop, and optimize large‑scale, low‑latency inference engines for text, image, and multimodal models. Focus on distributed parallelism, GPU/accelerator efficiency, and software‑hardware co‑design to deliver high‑throughput, fault‑tolerant AI deployment.
**Expectations**
- Lead end‑to‑end development of inference pipelines for LLMs and vision models at scale.
- Demonstrate measurable improvements in latency, throughput, or cost per inference.
- Collaborate cross‑functionally with hardware, research, and infrastructure teams.
- Deliver production‑ready, maintainable code in Python/C++ with CUDA.
- Communicate technical trade‑offs to stakeholders.
**Key Responsibilities**
- Build fault‑tolerant, high‑concurrency distributed inference engines for multimodal generation.
- Engineer parallelism strategies (expert/Mixture‑of‑Experts, tensor, and pipeline parallelism).
- Apply CUDA graph, TensorRT/TRT‑LLM, and PyTorch compilation (torch.compile) optimizations.
- Tune KV‑cache systems (e.g., Mooncake, PagedAttention).
- Conduct performance bottleneck analysis and co‑optimize GPU/TPU/custom accelerator workloads.
- Integrate model execution plans into end‑to‑end serving pipelines.
- Maintain code quality, documentation, and automated testing.
**Required Skills**
- 3+ years of experience in deep‑learning inference, distributed systems, or HPC.
- Proficiency in Python and C++/CUDA; familiarity with GPU programming (CUDA/Triton/TensorRT).
- Deep knowledge of optimization for transformer, large language, vision, and diffusion models.
- Experience with LLM inference frameworks (TensorRT‑LLM, vLLM, SGLang, TGI).
- Knowledge of model quantization, KV cache systems, and distributed scheduling.
- Strong analytical, problem‑solving, and performance‑driven mindset.
- Excellent collaboration and communication skills.
**Nice‑to‑Have**
- RDMA/RoCE, distributed filesystems (HDFS, Ceph), Kubernetes experience.
- Contributions to open‑source inference projects.
**Required Education & Certifications**
- Bachelor’s degree (or higher) in Computer Science, Electrical Engineering, or a related field.
- Certifications in GPU programming or distributed systems are a plus.
San Francisco, United States
On‑site
Junior
14-12-2025