Coder Social home page Coder Social logo

yang-su2000 / voice2action Goto Github PK

View Code? Open in Web Editor NEW
26.0 1.0 3.0 1.1 GB

Implemetation of paper and Unity package "Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in VR".

Home Page: https://yang-su2000.github.io/Voice2Action/

License: MIT License

C# 100.00%
unity ai large-language-model llm-agent llms multimodal nlp unity-ml unity-package virtual-reality

voice2action's Introduction

Welcome to Yang's GitHub

Hi there! I am an MS student from Cornell, focusing on efficient, scalable language modeling and agent systems.

I am happy to chat and discuss potential collaborations, feel free to reach out by

Linkedin Twitter Gmail WeChat

๐ŸŒŸ Studying Zone

I am collaborating with Cornell ICPC and Millennium to build efficient LLMs for code and data generation in interactive environments.

  • This work is called ALICE (Aligning Language models for Interactive Code Execution), you can find more about it in this Google Slide.
  • ALICE aims to build actually usable (i.e. low-cost, efficient) code generation system in large-scale interactive environments.
  • We are currently experimenting the physics engine environment due to its simulation and feedback loop flexibility.
  • ALICE is expected to expand to other domains like robotics simulation, VR, autonomous driving, etc.
  • Borader Impact: ALICE can generate high-quality synthetic data with active human intervention, for training LLMs.

We are actively looking for brilliant people to join the ALICE project, shoot me an email if you are interested!

Previously, I led the prior work of ALICE called Voice2Action with Cornell XRC, an Unity Package for real-time code execution in VR.

I am also working on large-scale generation augmented retrieval systems (opposed to RAG) at Cornell NLP.

I used to work on graph machine learning at AWS AI Lab (2021-2022) and contribute to the open source Deep Graph Library.

๐Ÿ‘€ Chilling Zone

I like programming! I lead the "Cornell Tech" Group at Cornell ICPC and won the Top 20% in 2023 Regional!

LeetCode CodeForces Visitors

I enjoy cooking, listening to music of all forms, playing ping-pong, reading science fiction, and more!

โšก Developing Zone

๐Ÿ“ˆ "Accepted" Zone

voice2action's People

Contributors

984580403hyxhj avatar gracenho829 avatar yang-su2000 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.