AI Research Scientist | Phizenix Inc

Should have a PhD in Computer Science, Machine Learning, or a related field

Hands-on experience with PyTorch and LLM fundamentals (transformers, KV caching, etc.)

Brings deep expertise in inference optimization for LLMs, including model quantization, CUDA/GPU tuning, and deployment of VLLMs for low-latency, high-throughput serving.

Should have recent/or any ICLR/ICML publications in LLM inference optimization would be ideal.

Familiarity with diffusion models and distributed model training

Solid research-to-production mindset with 2+ years in an ML/AI role