xuehaipan

followers: 520 following: 49 repos: 73 gists: 0

Name: Xuehai Pan

Type: User

Company: CFCS @ PKU

Bio: Ph.D. student at Peking University. Interested in Reinforcement Learning & Multi-Agent Systems & Distributed Computing. Working on LLMs and AI Alignment.

Twitter: XuehaiPan

Location: Peking University, Beijing

Hi there 👋

Xuehai Pan (/ʃwɛˈhaɪ pæn/, 潘学海 in Mandarin, [email protected]) is a final-year Ph.D. student in Applied Computer Science at Peking University. His research interests lie at the intersection of Reinforcement Learning, Multi-Agent Systems, and Distributed Computing, with a focus on developing scalable and automated algorithms and exploring their theoretical and practical aspects. He has a solid background in both research and engineering, having obtained a B.S. degree in Physics with honors and a B.S. degree in Computer Science (double major) from Peking University before pursuing his Ph.D. degree. His earlier achievements include gold medals in the Chinese Physics Olympiad (CPhO) and the Asian Physics Olympiad (APhO) during high school.

Xuehai is now working on the development of Large Language Models (LLMs) and on ensuring that they align with human intentions and values through AI Alignment techniques (essentially balancing helpfulness and harmlessness). Specifically, he is exploring automated data synthesis, red teaming, and evolutionary training via multi-agent interaction and self-play. The ultimate goal is to build a scalable and fully automated system that covers training, evaluation, inference, and governance.

Beyond academia, Xuehai is an open-source enthusiast and an active contributor to influential projects such as PyTorch, CPython, Ray, Transformers, DeepSpeed, and Homebrew. He enjoys spending his spare time helping people and sharing knowledge in the community.

Xuehai Pan's Projects

addlicense

A program that ensures source code files have copyright license headers by scanning directory patterns recursively

alpa

Auto parallelization for large-scale neural networks

auditwheel

Auditing and relabeling cross-distribution Linux wheels.

baichuan-7b

A large-scale 7B pretraining language model developed by Baichuan

brew

🍺 The missing package manager for macOS (or Linux)

conda

OS-agnostic, system-level binary package manager and ecosystem

deepspeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

dev-setup

Automation scripts for setting up a basic development environment.

fastchat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.

flake8-pyi

A plugin for Flake8 that provides specializations for type hinting stub files

go-nvml

Go Bindings for the NVIDIA Management Library (NVML)

gpustat

📊 A simple command-line utility for querying and monitoring GPU status

gymnasium

A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)

homebrew-core

🍻 Default formulae for the missing package manager for macOS

isort

A Python utility / library to sort imports.

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

latex-templates

A collection of LaTeX templates in English/Chinese, with VS Code settings for LaTeX Workshop.

latex-workshop

Boost LaTeX typesetting efficiency with preview, compile, autocomplete, colorize, and more.

malib

A parallel framework for population-based multi-agent reinforcement learning.
