Job Specifications
About the job
At ClearML, our mission is to make infrastructure management effortless across every phase of the AI lifecycle -- from building and training AI models to large-scale production. Trusted by more than 2,000 organizations, AI builders and IT teams use our AI infrastructure platform to power everything from early-stage R&D to mission-critical public sector and enterprise-grade AI pipelines.
We’re growing quickly and looking for curious, self-driven individuals who are excited to shape the future of AI and the infrastructure that powers it. Our customers are tackling some of the world’s most important challenges -- revolutionizing healthcare, discovering new medicines, securing global finance, protecting national security, and preserving our planet’s ecosystems.
About the Role
At ClearML, our SDK is the bridge between data scientists and our infrastructure. We are seeking a Senior Python Engineer to work on the client-side clearml open-source library. This is not a standard backend role; you won't be tuning databases. Instead, you will be building the "magic" that allows ClearML to automatically track experiments, creating wheels for PyPI, and ensuring our code runs smoothly in diverse user environments.
You will dive deep into Python internals and build integrations with the latest AI frameworks (PyTorch, TensorFlow, vLLM, etc.). If you love metaprogramming, understanding how imports work under the hood, and packaging libraries for thousands of users, this role is for you.
ClearML is a fully remote, open-source company that promotes independence, personal growth, and accountability. We look for people who thrive in environments where the technology they develop quickly affects the work and productivity of people around the world. If you like getting quick feedback from real-world users, rapidly shipping features and technologies to production, and seeing the difference you make, ClearML is the place for you!
Responsibilities
Core SDK Development: Design and maintain the client-side Python SDK, ensuring high performance and backward compatibility across Python versions.
Automatic Instrumentation: Develop logic to automatically capture code execution, parameters, and metrics using advanced Python techniques (monkey patching, decorators, introspection).
ML Framework Integration: Build and maintain deep integrations with leading AI libraries (PyTorch, TensorFlow, Keras, Hugging Face, vLLM, scikit-learn).
Packaging & Distribution: Manage the build lifecycle, including creating and optimizing Python wheels and eggs and managing releases to PyPI and Conda.
Cross-Platform Reliability: Ensure the SDK functions seamlessly across Linux, macOS, and Windows environments.
Debug User Environments: Troubleshoot complex client-side issues related to multiprocessing, serialization (pickle/dill), and memory management.
Requirements
8+ years of professional software development experience.
Deep Python Expertise: You understand metaclasses, descriptors, and how Python handles imports and memory.
Packaging Experience: Proven experience building and packaging Python libraries (setup.py, Poetry, building wheels) and publishing to public repositories (PyPI).
Runtime Modification: Strong familiarity with monkey patching, AST manipulation, or writing custom import hooks.
AI Ecosystem Knowledge: Familiarity with the internals of frameworks like PyTorch, TensorFlow, vLLM, or llama.cpp.
Collaboration: Experience working in a remote, distributed team and contributing to code reviews.
Excellent communication skills and ability to work independently or as part of a team.
Advantages
Active contributor to major open-source Python projects.
Experience with C extensions (Cython, pybind11).
Experience with CI/CD pipelines for library publishing (GitHub Actions).
About the Company
As the leading infrastructure platform for unleashing AI in organizations worldwide, ClearML is used by more than 2,100 customers to manage GPU clusters and optimize utilization, streamline AI/ML workflows, and deploy GenAI models effortlessly. ClearML is an NVIDIA partner and is trusted by more than 300,000 forward-thinking AI builders and IT teams at leading Fortune 500 companies, enterprises, academia, public sector agencies, and innovative start-ups worldwide. To learn more, visit the company’s website at https://clear.m...