Coder Social home page Coder Social logo

yinpeidai's Projects

act-plus-plus icon act-plus-plus

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

algorithm_interview_notes-chinese icon algorithm_interview_notes-chinese

2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记

depth-anything icon depth-anything

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

grounded-segment-anything icon grounded-segment-anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs

habitat-lab icon habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

home-robot icon home-robot

Mobile manipulation research tools for roboticists

jiant icon jiant

The jiant toolkit for general-purpose text understanding models

llava icon llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

minicpm-v icon minicpm-v

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

mmcoref icon mmcoref

Code for DSTC 10: SIMMC 2.0 track: Multimodal Coreference Resolution subtask.

multiwoz icon multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

naum icon naum

Code for NAUM project paper

nerf-navigation icon nerf-navigation

Code for the Nerf Navigation Paper. Implements a trajectory optimiser and state estimator which use NeRFs as an environment representation

olmo icon olmo

Modeling, training, eval, and inference code for OLMo

orb_slam2 icon orb_slam2

Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

orb_slam3 icon orb_slam3

ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM

orion icon orion

Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model. Orion-14B 系列模型包括一个具有140亿参数的多语言基座大模型以及一系列相关的衍生模型,包括对话模型,长文本模型,量化模型,RAG微调模型,Agent微调模型等。

peract icon peract

Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation

pirlnav icon pirlnav

Code for training embodied agents using IL and RL finetuning at scale for ObjectNav

pytorch-nec icon pytorch-nec

PyTorch Implementation of Neural Episodic Control (NEC)

rlbench icon rlbench

A large-scale benchmark and learning environment.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.