
Data Scientist Intern (Scientific Data Modeling & Analysis)
Help teach AI to understand how scientific evidence works.
About Filyn
Filyn is an early-stage startup reimagining how scientific and regulatory data is understood.
Our mission is to accelerate drug approval by developing LLM-based agents that can reason over complex biomedical evidence with human-like precision.
We’re building AI systems that go beyond computation — systems that comprehend. Filyn’s work lives at the intersection of data science, biostatistics, and scientific reasoning, where questions of validity, provenance, and interpretation are as central as algorithms.
This is not a routine data science internship. It’s an opportunity to help define how knowledge itself is modeled — how data becomes evidence.
The Role
You’ll collaborate with Filyn’s founding team to analyze, reconstruct, and model quantitative findings drawn from life science and regulatory datasets — results, endpoints, trial data, pharmacokinetic tables, and beyond.
You will:
- Analyze structured and semi-structured scientific datasets to uncover their underlying quantitative logic.
- Reconstruct calculations, statistical summaries, and derived measures from source data.
- Translate complex or opaque tables and formulae into reproducible computational models.
- Identify inconsistencies, missing relationships, or questionable assumptions in data logic.
- Document reasoning and calculations in a transparent, auditable format for both humans and AI systems.
You’ll act as a data modeler of evidence, connecting statistical reasoning with computational structure.
Who You Are
- A Master’s (2nd year preferred) or PhD student in Data Science, Statistics, Biostatistics, Computational Biology, Applied Math, or a related quantitative field.
- Fluent in Python or R, with experience handling biomedical or regulatory datasets.
- Comfortable with SQL and relational data structures.
- Skilled in statistical modeling, data interpretation, and analytical writing — able to explain how numbers acquire meaning.
- Curious, skeptical, and detail-oriented — you investigate data until it makes sense.
- Independent yet collaborative, able to learn new scientific contexts and data standards quickly.
You think like a statistician, code like a scientist, and communicate like a researcher.
Bonus Points
- Experience with clinical trials, omics, or regulatory data (FDA, EMA, etc.).
- Familiarity with data provenance, computational reproducibility, or knowledge graphs.
- Background in Bayesian modeling, causal inference, or experimental design.
- Contributions to open data or scientific transparency projects.
Why Join Filyn
- Work at the frontier: Help build the reasoning engine that allows AI to interpret scientific evidence.
- Engage with meaningful data: Analyze high-stakes biomedical information where correctness and interpretation truly matter.
- Bridge disciplines: Collaborate across data science, life sciences, and AI reasoning.
- Grow with purpose: Join an ambitious, fast-moving team where your curiosity and insight directly shape the product.
If you’re a data scientist or statistician who believes understanding is more powerful than automation — and you want to help teach AI how science thinks — Filyn is where you’ll thrive.