cover image
Sentiro Partners

Sentiro Partners

www.sentiropartners.com

2 Jobs

1 Employees

About the Company

Leadership for the Augmentation Era™. At Sentiro Partners, we architect executive teams that drive innovation, leverage AI and data strategically, and create sustainable competitive advantage for visionary organizations. Providing Executive Search, Talent Augmentation & Talent Advisory Services globally. Our Practices: - Digital & Technology Leadership - Data & AI Leadership - Semiconductor - Private Equity & Growth - CEO & Executive Leadership - Board & Governance Connect with us: explore@sentiropartners.com

Listed Jobs

Company background Company brand
Company Name
Sentiro Partners
Job Title
Machine Learning Researcher (Foundational Models) East Coast
Job Description
**Job title** Machine Learning Researcher – Pretraining Systems (Foundational Models) **Role Summary** Lead experimental and theoretical research on large‑scale model pretraining. Design, execute, and analyze pretraining runs (10B+ parameters), develop diagnostics, and optimize distributed training for efficiency and scalability. Bridge modeling, system engineering, and empirical science to uncover principles that enable efficient learning in massive models. **Expectations** - Deliver reproducible, quantifiable results on scaling, data mixtures, and training dynamics. - Reduce pretraining cost while maintaining or improving validation performance. - Publish actionable insights on scaling laws, mixture design, and system optimizations. - Mentor junior researchers and collaborate cross‑functionally with engineering teams. **Key Responsibilities** - Conduct controlled ablations and large‑scale experiments on pretraining objectives and data mixtures. - Build and maintain instrumentation for profiling loss surfaces, gradient flow, activation distributions, and inflection point prediction. - Optimize distributed training pipelines: scheduling, sharding, checkpointing, and resource utilization on multi‑node GPU/TPU clusters. - Design evaluation harnesses for emergent behaviors (reasoning, tool‑use, temporal consistency). - Analyze and interpret results, quantify trade‑offs (e.g., tokenization choices, mixture composition) and communicate findings. - Contribute to open‑source tool development and share insights with the broader research community. **Required Skills** - 2–5 years post‑PhD in machine learning research (PhD preferred, high‑performing MA accepted). - Proven experience designing/scaling pretraining runs (≥10B parameters) and distributed training systems. - Deep familiarity with FSDP, DeepSpeed, Megatron‑LM, JAX/TPU frameworks. - Strong profiling and diagnostics expertise: gradient noise scale, loss curvature, tokenization effects. - Data‑centric experimentation: dataset filtering, mixture sampling, quality assessment. - Proficiency in Python (PyTorch/JAX); C++ (or equivalent) for system instrumentation. - Quantitative mindset: rigorous statistical analysis, reproducibility, and metric‑driven hypothesis testing. **Required Education & Certifications** - PhD in Computer Science, Machine Learning, or related field (or equivalent high‑level experience). - Optional certifications in distributed systems or high‑performance computing are welcome.
California, United states
On site
Junior
05-11-2025
Company background Company brand
Company Name
Sentiro Partners
Job Title
Software Engineer – Low Latency C++
Job Description
**Job Title** Software Engineer – Low Latency C++ **Role Summary** Design, develop, and maintain high‑performance C++ systems that process massive data volumes with single‑digit microsecond latency. Own the end‑to‑end lifecycle from whiteboard to production, continuously monitoring and optimizing for correctness and speed in globally distributed infrastructure. **Expectations** - 2–6 years of production experience in core systems, preferably within performance‑critical domains (search, databases, game engines, trading). - Proven ability to write idiomatic, high‑quality C++ that balances elegant design with measurable efficiency. - Deep understanding of computer architecture, memory hierarchies, concurrency primitives, and branch‑prediction. **Key Responsibilities** - Implement lock‑free algorithms and cache‑aware data structures for extreme latency requirements. - Profile and optimize hot code paths using tools (e.g., flame graphs, compiler diagnostics). - Deploy and monitor systems in production, diagnosing and resolving load‑specific regressions. - Collaborate with researchers and fellow engineers to translate theoretical insights into production‑ready code. - Own feature ownership from design, code review, deployment, to post‑release monitoring. - Communicate complex technical decisions to non‑technical stakeholders clearly and concisely. **Required Skills** - Advanced C++ (C++17/20) programming. - Expertise in low‑level systems concepts: memory ordering semantics, thread scheduling, cache optimization. - Strong analytical mindset; adept at root‑cause analysis and performance debugging. - Proficient in modern debugging and profiling tools (gdb, perf, llvm‑clang, etc.). - Excellent written and verbal communication; ability to convey technical depth to diverse audiences. **Required Education & Certifications** - Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field (or equivalent practical experience). ---
New york, United states
On site
Junior
26-01-2026