I'm an undergraduate student at the University of Virginia, studying computer science.
I'm currently working under Professor Chen-Yu Wei and his PhD student Haolin Liu on RLHF and process reward models for dense-reward RL. I've also explored preference-based reinforcement learning for control optimization with him in the past.
I also worked with Professor Judy Fox at the Biocomplexity Institute in Summer 2023, and did a short stint with Dr. Rohan Chandra, looking at applying vision-language models (VLMs) to help robots learn non-verbal social cues.
Email: contact
LinkedIn: linkedin