Skills

Communication Creativity Python Go Monitoring Problem-solving Decision-making Customer Service Content Creation Training Machine Learning Large Language Models Natural Language Processing

Job Specifications

Responsibilities

Team Introduction

This position is responsible for researching and building the company's LLMs. The role involves exploring new applications and solutions for related technologies in areas such as search, recommendation, advertising, content creation, and customer service. The goal is to meet the increasing demand for intelligent interactions from users and to significantly enhance their lifestyle and communication in the future.

The Student Researcher position provides unique opportunities that go beyond the constraints of our standard internship program, allowing for flexibility in duration, time commitment, and location of work. The Student Researcher program offers a flexible format that can accommodate both Onsite and Remote arrangements, as well as Part-Time or Full-Time commitments, depending on the needs of the project and the researcher.

We are looking for talented individuals to join us for a Student Researcher opportunity in 2025. Student Researcher opportunities aim to offer students industry exposure and hands-on experience. Turn your ambitions into reality as your inspiration brings infinite opportunities.

Candidates can apply to a maximum of two positions and will be considered for jobs in the order you apply. The application limit is applicable to TikTok and its affiliates' jobs globally. Applications will be reviewed on a rolling basis - we encourage you to apply early.

Responsibilities:

LLMPost Training
Reinforcement Learning from Human Feedback: Design advanced reinforcement learning algorithms for large language models by integrating technologies such as heuristic-guided search, multi-agent reinforcement learning, and other related techniques.
Reward modeling: Formulate a novel reward modeling methodology aimed at significantly enhancing robustness, improving generalization capabilities, and increasing overall accuracy.
Scalable Oversight: Develop scalable oversight mechanisms that enable efficient monitoring and control of LLMs as they grow in size and complexity, ensuring consistent alignment with predefined objectives.
Interpretability in LLMs: Focus on enhancing the interpretability of language models, ensuring that their decision-making processes and outputs are transparent and understandable to users and stakeholders.
LLM Horizon
Reasoning and planning for foundation models. Enhance reasoning and planning throughout the entire development process, encompassing data acquisition, model evaluation,pretraining, SFT, reward modeling, and reinforcement learning, to bolster overall performance.
Synthesize large-scale, high-quality (multi-modal) data through methods such as rewriting, augmentation, and generation to improve the abilities of foundation models in various stages (pretraining, SFT, RLHF).
Solve complex tasks via system 2 thinking, leverage advanced decoding strategies such as MCTS, A*.
Investigate and implement robust evaluation methodologies to assess model performance at various stages, unravel the underlying mechanisms and sources of their abilities, and utilize this understanding to drive model improvements.
Teach foundation models to use tools, interact with APIs and code interpreters. Build agents and multi-agents to solve complex tasks.

Qualifications

Minimum Qualifications

Currently enrolled in a PhD degree in Computer Science, Linguistics, Statistics, or related technical field.

Excellent knowledge of theory and practice of Large Language Models, Reinforcement Learning, Natural Language Processing, Machine Learning.

Strong publication record at leading conferences (NeurIPS, ICML, ACL, EMNLP etc.).
Excellent coding ability, familiar with data structures, and fundamental algorithm skills, proficient in Python, winners of competitions such as ACM/ICPC, USACO/NOI/IOI, Top Coder, Kaggle, etc. are preferred.
Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.

Preferred Qualifications

Demonstrated Deep Reinforcement Learning or Natural Language Processing, Machine Learning experience from previous internships, work experience, coding competitions, or publications.
High levels of creativity and quick problem-solving capabilities.

Job Information

For Pay TransparencyCompensation Description (Hourly) - Campus Intern

The hourly rate range for this position in the selected city is $60- $60.

Benefits may vary depending on the nature of employment and the country work location. Interns have day one access to health insurance, life insurance, wellbeing benefits and more. Interns also receive 10 paid holidays per year and paid sick time (56 hours if hired in first half of year, 40 if hired in second half of year). Interns who are not working 100% remote may also be eligible for housing allowance.

The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates:

Qualified appl

About the Company

ByteDance is a global incubator of platforms at the cutting edge of commerce, content, entertainment and enterprise services - over 2.5bn people interact with ByteDance products including TikTok. Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. At ByteDance, we create together and grow together. That's how we dri... Know more