Coder Social home page Coder Social logo

Hi, welcome to my Github 👋

I am Xiao Liu, a third-year PhD student in Tsinghua University since 2021.

  • 🔭 Interested in Machine Learning, Natural Language Processing, and Foundation Models.

  • 🌱 Find my up-to-date publication list in Google Scholar! Some of my proud leading works:

    Large Language Model (LLM) Training and Prompt Learning
    • P-tuning and P-tuning v2 (ACL'22): pioneer works on prompt tuning
    • GLM-130B (ICLR'23): an open bilingual (Enligsh & Chinese) pre-trained model with 130 billion parameters based on GLM (ACL'22); better than GPT-3 175B on LAMBADA and MMLU.
    • ChatGLM-6B & ChatGLM2-6B & ChatGLM3-6B: an open bilingual dialogue language model that requires only 6GB to run. Receiving GitHub stars, GitHub stars, and GitHub starsGitHub Stars!
    • WebGLM (KDD'23): an efficient web-enhanced question answering system based on GLM-10B, outperforming WebGPT-13B and approaching WebGPT-175B performance in human evaluation.
    • ChatGLM-Math: employing self-critique with RFT and DPO to enable SOTA mathematical capabilities wihtouth compromising language abilities.
    Foundational Agents For Real-world Challenging Missions
    • AgentBench (ICLR'24): the first systematic multi-dimensional benchmark to evaluate LLMs as Agents in 8 distinct environments deriving from real-world practical missions. Find LLM-as-Agent demos at llmbench.ai/agent!
    • AutoWebGLM (KDD'24): a strong web navigating agent constructed upon ChatGLM-3-6B, outperforming prompted GPT-4 on Mind2Web, WebArena, and our constructed new dataset AutoWebBench.
    Alignment and Scalable Oversights over LLMs and Diffusers
    • ImageReward (NeurIPS'23): the first general-purpose text-to-image human preference reward model (RM) for RLHF, outperforming CLIP/BLIP/Aesthetic by 30% in terms of human preference prediction.
    • BPO (Black-box Prompt Optimization, ACL'24): a novel direction to align LLMs via preference-aware prompt optimization. Improving ChatGPT, Claude, LLaMA on human preference's win rates by 20%+ without training them.
    • AlignBench (ACL'24): the first comprehensive benchmark on evaluating LLMs' Chinese alignment, deriving from ChatGLM's online real scenarios. Submit your LLMs to acquire CritiqueLLM's judgement on AlignBench on llmbench.ai/align!
    • CritiqueLLM (ACL'24): scaling LLM-as-Critic for scalable oversights on LLM alignment. A series of strong critqiue LLMs ranging from 6B to 66B.
    Self-supervised Learning and Reasoning
  • 🤔 Dedicated to building next-generation of AI systems via both Large Pre-trained Model and Symbolic Agent Reasoning.

  • 💬 Feel free to drop me an email for:

    • Any form of collaboration
    • Any issue about my works or code
    • Interesting ideas to discuss or just chatting

Shaw's Projects

awesome-bert icon awesome-bert

bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目

big-bench icon big-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

covid-19-tweetids icon covid-19-tweetids

The repository contains an ongoing collection of tweets IDs associated with the novel coronavirus COVID-19 (SARS-CoV-2), which commenced on January 28, 2020.

datasets icon datasets

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

gandissect icon gandissect

Pytorch-based tools for visualizing and understanding the neurons of a GAN. https://gandissect.csail.mit.edu/

glm icon glm

GLM (General Language Model)

glm-130b icon glm-130b

GLM-130B: An Open Bilingual Pre-Trained Model

graphmae icon graphmae

GraphMAE: Self-supervised Masked Graph Autoencoders

mips32-cpu icon mips32-cpu

奋战一学期,造台计算机(编译出的bit文件在release中,可以直接食用)

nanogpt icon nanogpt

The simplest, fastest repository for training/finetuning medium-sized GPTs.

oag icon oag

Source code and dataset for KDD 2019 paper "OAG: Toward Linking Large-scale Heterogeneous Entity Graphs"

oag_know icon oag_know

codes for OAG_know and GloMoCo: Unsupervised Embedding Training for Concept Linking

p-tuning-v2 icon p-tuning-v2

An optimized prompt tuning strategy comparable to fine-tuning across model scales and tasks.

promptpapers icon promptpapers

Must-read papers on prompt-based tuning for pre-trained language models.

promptsource icon promptsource

Toolkit for creating, sharing and using natural language prompts.

rekcarc-tsc-uht icon rekcarc-tsc-uht

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

seeact icon seeact

SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.