
Data intern
I'm looking for a part time (~20 hrs/week) data intern to help find, aggregate, clean various public and private datasets to include in the American Dream Index. The datasets vary from well structured labor statistics from 2000-2024 to inconsistently labeled personal spend benchmarks by region, which vary greatly in structure.
The right person for this role will be hungry, creative and technical, and will bring the following skills to the table:
- Excellent Excel/Google Sheets skills
- Mid-level Python coding and SQL skills for cleaning and uploading data. You'll have done some of this before
- Advanced use of AI tools (Deepseek, Claude, ChatGPT, Brightdata) to conduct deep research to find, clean, compile required data in a tabular form. Bonus if you're an advanced-level prompt user, as tougher datasets require clever prompt engineering
- Creativity and resourcefulness. I'm here to help guide this work, but the more creative and self-guided you are, the better this will work
- Nice-to-have: website coding skills in next.js using modern frameworks (this is not a core part of the job, but helps to test new exposing new data sources and fields in the production website)
- Degree with good GPA from respected institution. CS, engineering, data science etc. preferred
- Great communication and judgment skills
Ideally based near San Mateo, CA, but can be flexible for the right person.