- Company Name
- Google
- Job Title
- Software Engineer, Google Distributed Cloud AI, Infra
- Job Description
-
Job Title: Software Engineer – Distributed Cloud AI & Infrastructure
Role Summary
Design, develop, test, and maintain large‑scale AI and infrastructure software that powers Google’s AI platform, TPU systems, Vertex AI, and global networking services. Collaborate across teams to deliver high‑performance, reliable, and secure distributed systems that scale to billions of users.
Expectations
- 2+ years of programming experience in C++, Python, or Go.
- 2+ years of experience building scalable infrastructure or working with distributed systems, networking, storage, or hardware architecture.
- Solid understanding of data structures, algorithms, performance tuning, and debugging.
- Experience with accessibility and AI/ML infrastructure is a plus.
Key Responsibilities
- Write clean, well‑tested code for products or systems.
- Lead or participate in design reviews and help select appropriate technologies.
- Review peers’ code, enforce style guidelines, and ensure testability and efficiency.
- Maintain and enhance documentation and educational content.
- Triage, debug, and resolve product or system issues, assessing impact on hardware, network, or service operations.
Required Skills
- Proficiency in C++, Python, or Go.
- Expertise in distributed computing, network protocols, storage systems, and hardware architecture.
- Strong knowledge of data structures, algorithms, and performance analysis.
- Debugging, profiling, and visualization skills.
- Familiarity with AI/ML infrastructure (TPUs, Vertex AI) and accessibility best practices.
- Ability to communicate clearly, collaborate with cross‑functional teams, and demonstrate leadership.
Required Education & Certifications
- Bachelor’s degree in Computer Science or related field (equivalent practical experience accepted).
- Master’s or PhD in Computer Science or related technical field preferred.
---