- Company Name
- Realty Income Corporation
- Job Title
- Machine Learning Operations & Data Engineering Intern
- Job Description
-
Job Title: Machine Learning Operations & Data Engineering Intern
Role Summary: Collaborate with a predictive analytics team to design, build, and deploy end‑to‑end machine learning pipelines. Leverage Databricks, Spark, SQL, and MLflow to ingest data, engineer features, train models, and support production deployment. Deliver reusable, scalable analytics components while ensuring code quality, testing, and documentation.
Expectations: Rising senior student pursuing a bachelor’s degree in Computer Science, Software Engineering, Information Technology, Data Science, Machine Learning, or related field. Minimum GPA 3.5. Self‑motivated, detail‑oriented, and able to communicate complex ideas clearly. Available for a 10‑week summer internship with a hybrid in‑office schedule.
Key Responsibilities:
- Engineer data pipelines in Databricks using Spark, SQL, and automated data validation.
- Develop features and modular code for predictive modeling, integrating with scikit‑learn, PySpark, and MLflow.
- Train, validate, tune, and monitor machine learning models, identifying and resolving bottlenecks.
- Write unit and integration tests; maintain version control with Git and enforce coding standards.
- Build reusable analytics components, including pipelines, feature libraries, and dashboards.
- Collaborate across data science, engineering, and business stakeholders to translate analytical needs into production‑ready solutions.
- Support data governance by documenting data pipelines and ensuring compliance with internal data policies.
Required Skills:
- Proficiency in Python, SQL, and Spark with practical experience in pandas, scikit‑learn, and PySpark.
- Familiarity with MLflow tracking, Git-based version control (GitHub, Azure DevOps), and CI/CD principles.
- Understanding of data structures, data modeling, and software architecture.
- Experience with cloud platforms such as Azure or AWS and associated data services.
- Strong analytical and critical‑thinking abilities.
- Excellent written and verbal communication, teamwork, and collaboration skills.
Required Education & Certifications:
- Current enrollment, rising senior, in a bachelor’s program in Computer Science, Software Engineering, Information Technology, Data Science, Machine Learning, or a related discipline.
- Minimum cumulative GPA of 3.5.
- No mandatory certifications, but experience with cloud services and version control is advantageous.