- I'm actively seeking a research or engineering position in the market related to LLM Alignment or Agent. My research work primarily focuses on RLHF (Reinforcement Learning from Human Feedback).
- Here is my academic page
fakerbaby Goto Github PK
Name: Wei Shen
Type: User
Company: Fudan University
Bio: Focus on LLM Alignment (RLHF)
Location: shanghai