2025 Summer Intern - Data Analytics and Visualization (Early Clinical Development)
Department SummaryThe mission of the Data Intelligence (DI) team in the Early Clinical Development (ECD) department at Genentech is to optimize the support of clinical systems to provide high-quality clinical trial data insights and to offer data science-related consultation to the study teams. We leverage cutting-edge data science and operation data management to facilitate the study of start-up planning. We develop advanced analytical data visualization and conduct predictive analysis to increase the efficiency of clinical data review and clinical operations with the ultimate goal of driving data-driven decision-making throughout the study lifecycle.
This internship position is located in South San Francisco, on-site.
The OpportunityWe are seeking a motivated and detail-oriented intern to join our DI team. This position offers the opportunity to work on impactful projects that integrate data science, natural language processing, and data visualization to address real-world challenges in early-phase clinical trials.
Analyze clinical trial and real-world data to generate actionable insights and improve decision-making.
Design, develop, and implement analytical workflows and visualizations using tools such as R, Python, SAS, and Spotfire.
Contribute to data quality improvement initiatives and advanced data science efforts, including statistical analysis and advanced data analytics.
Develop and implement advanced text preprocessing techniques and apply string similarity algorithms and fuzzy matching techniques to standardize and reconcile inconsistent text data.
Manage and query large datasets using database management systems, such as SQL.
Participate in team brainstorming sessions to identify opportunities for improving data analytics and visualization capabilities.
Program HighlightsIntensive 12-week, full-time (40 hours/week) paid internship.
Program start date: May/June (Summer).
A stipend, based on location, will be provided to help offset costs associated with the internship.
Ownership of impactful, business-critical projects.
Opportunity to collaborate with some of the most talented people in the biotechnology industry.
Who You Are (Required)Required Education: You meet one of the following criteria:Must be pursuing a Master's degree.
Must have attained a Master's degree.
Must be pursuing a PhD.
Must have attained a PhD.
Required Majors: Data Science, Computer Science, Statistics/Biostatistics, Bioinformatics, or a related field with strong quantitative and computational focus.
Required SkillsProficiency in programming languages such as R, Python, SAS, and SQL.
Familiarity with data visualization tools (e.g., Spotfire, ggplot2, Matplotlib, Plotly).
Knowledge of natural language processing techniques and libraries.
Database management skills for handling and querying large datasets.
Strong analytical and problem-solving capabilities
Ability to work both independently and collaboratively in a team environment.
Excellent communication, collaboration, and interpersonal skills.
Complements our culture and the standards that guide our daily behavior & decisions: Integrity, Courage, and Passion.
Understanding of early-phase clinical trial data and operational workflows.
Experience with advanced statistical methods and calculations.
Familiarity with address parsing and geospatial tools (e.g., geopy, usaddress) for location standardization.
Knowledge of Git or other version control systems.
Relocation benefits are not available for this job posting.
The expected salary range for this position based on the primary location of California is $50.00 hour. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. This position also qualifies for paid holiday time off benefits.
#GNE-R&D-Interns-2025
Genentech is an equal opportunity employer, and we embrace the increasingly diverse world around us. Genentech prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin or ancestry, age, disability, marital status and veteran status.