Research Intern
Job Title: Research Intern — Statistical Computing and High-Dimensional Data Analysis
Job Location: 4th Floor, AP Area, Pediatric Endocrinology, Department of Pediatrics, University of Colorado School of Medicine
Position Purpose:
The intern will work closely with the Principal Investigator to design, implement, and validate an iterative testing procedure for large-scale genomic, exposomic, and other high-dimensional data. This position supports development of novel statistical methods and software for high-dimensional, low-sample-size hypothesis testing, with the goal of generating preliminary data and results to support a grant submission.
Key responsibilities include:
(1) Algorithm development: implement an iterative set-based testing procedure that partitions high-dimensional feature spaces into sets, applies multivariate hypothesis tests, eliminates non-significant sets, and recursively subdivides significant sets until individual features are tested.
(2) Statistical research: review computer science and statistical literature on spending functions for controlling error rates; synthesize and apply relevant algorithms.
(3) Mathematical derivation: assist with original mathematical work to derive conditional distributional functions underlying the testing procedure.
(4) Software development: develop well-modularized code in Python (with AI assistance), then translate completed modules to R and SAS/IML, following open-source principles.
(5) Quality assurance: maintain meticulous records of software updates, bugs, and fixes; develop testbeds using test-driven development (TDD) and behavior-driven development (BDD) practices; write unit, integration, and regression tests; apply agile sprint-based workflows and continuous integration principles; construct simulations to verify correctness of statistical implementations.
(6) Communication: create presentations to document and communicate research progress.
(7) Manuscript preparation (contingent on successful algorithm and software development): participate in writing and editing scientific manuscripts for peer-reviewed publication.
Eligibility Requirements:
Minimum: High school diploma or equivalent; at least two years of completed coursework toward a degree in Computer Science, Statistics, Mathematics, or a closely related field.
Preferred: Experience with Python; familiarity with R or SAS; exposure to hypothesis testing, multiple comparisons, or linear models; interest in genomics, epidemiology, or biomedical data science; familiarity with version control (e.g., Git), open-source development, and agile or test-driven workflows.
Length of Employment:
Summer semester; earliest start date May 20, 2026; latest end date October 1, 2026.
Pay Range: $20–$25 per hour