Job Specifications
IT EDW Operations
Full Time
87012BR
Job Summary
The Machine Learning and Data engineer role will lead the development, implementation, and maintenance of data pipelines and infrastructure to support the deployment and continuous monitoring of Machine Learning (ML) and generative Artificial Intelligence (AI) tools within UCSF’s APeX Enabled Research (AER) team. Most projects will be in partnership with other UCSF technical teams and involve highly customized research solutions. Communication skills and inventive technical solutioning are crucial.
The AER Team Provides a Large Array Of Services To The UCSF Research Community, Including Project Consultation, Grant Support, Budget Estimations, And Project Implementation And Support. Project Examples Include
Development of EHR-based interventions via clinical trials embedded within healthcare delivery systems to generate scientific evidence while delivering healthcare.
Enabling UCSF researchers with algorithms, digital tools and / or clinical interventions with strong evidence of feasibility and acceptability.
Develop technical approaches and budgets in order to implement these tools within the electronic medical record.
Supporting the development of scalable, low cost infrastructure to enable ongoing research.
This role primarily involves managing and optimizing the data and monitoring pipelines of the Health IT Platform for Advanced Computing (HIPAC), a cloud infrastructure that supports the development and deployment of AI/ML tools, including large language models (LLMs) in the EHR. Specifically, the ML/data engineer will work on implementing new data integrations, enhancing HIPAC’s ETL functionalities, productionizing AI/ML tools developed by UCSF data scientists/researchers, and designing and implementing metrics to continuously monitor AI/ML tools deployed at UCSF Health.
Competitive applicants for this position are software, machine learning, or data engineers with 5+ years of experience in implementing and maintaining AI/ML pipelines. Proficiency in MLOps, Python, SQL, and CI/CD is required. This role also requires a deep understanding of Epic data models (Clarity and Caboodle). Successful candidates either have or are able to obtain Epic Clinical/Clarity data model certification shortly after onboarding.
The final salary and offer components are subject to additional approvals based on UC policy.
Your placement within the salary range is dependent on a number of factors including your work experience and internal equity within this position classification at UCSF. For positions that are represented by a labor union, placement within the salary range will be guided by the rules in the collective bargaining agreement.
The salary range for this position is $132,000 - $180,000 (Annual Rate).
To learn more about the benefits of working at UCSF, including total compensation, please visit: https://ucnet.universityofcalifornia.edu/compensation-and-benefits/index.html
Department Description
The University of California, San Francisco (UCSF) Department of Information Technology Academic Research Systems (ARS) group is chartered to provide data services and infrastructure that support the UCSF Research Community’s computing and analytic requirements through centralized informatics services in the areas of Data, Tools, Secure Compute Environments, and Consulting Services.
Required Qualifications
Bachelor's degree in Computer Science, Computer Engineering, or related area and/or equivalent experience/training.
5+ years of experience in positions of increasing responsibility designing, implementing, and maintaining complex AI/ML applications.
Advanced experience with Python; ability to write clean, efficient, and production-level Python code
Advanced experience with SQL (e.g., SQLServer, PostgreSQL)
Experience with data analysis and machine learning tools such as Jupyter, Pandas, scikit-learn, Numpy/Scipy, PyTorch, etc.
Demonstrated advanced knowledge of full software development lifecycle
Demonstrated experience deploying, monitoring, and maintaining AI/ML models and pipelines
Advanced experience in database systems, data warehousing solutions, and understanding of ETL pipelines
Advanced experience in designing, building, or maintaining data infrastructure for efficient ML model training and inference.
Demonstrated experience working with MLOps, DevOps, and CI/CD pipeline toolsets
Experience in developing complex, automated testing
Experience with cloud-based architecture in platforms such as AWS, GCP, Azure
Demonstrated effective communication and interpersonal skills
Demonstrated ability to communicate technical information to technical and non-technical personnel at various levels in the organization
Self-motivated and works independently and as part of a team. Able to learn effectively and meet deadlines
Demonstrated broad problem-solving skills
Demonstrated ability to interface with management on a regular basis
Excellent project leadership and
About the Company
UC San Francisco is driven by the idea that when the best research, the best education and the best patient care converge, great breakthroughs are achieved. We pursue this integrated excellence with singular focus, fueled by collaboration among our top-ranked professional and graduate schools, medical center, research programs and support teams.
Know more