mrjohncv Goto Github PK
Type: User
Type: User
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!
Reading list for research topics in embodied vision
A curated list of image captioning and related area resources. :-)
LLM for robotics reasoning toward AGI / Awesome repos&surveys / Chain of Thought / LLM / Prompt engineering / Reasoning / Robot / Agent / Planning / Reinforcement Learning / Created by @shure-dev / Check Wiki
Collection of papers and resources on Multimodal Reasoning, including Vision-Language Models, Multimodal Chain-of-Thought, Visual Inference, and others.
β¨β¨Latest Papers and Benchmarks in Reasoning with Foundation Models
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancement.
Collection of papers and resources on Reasoning in Large Language Models (LLMs), including Chain-of-Thought (CoT), Instruction-Tuning, and others.
Survey Paper of foundation models for robotics
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
π A curated list of visual reasoning papers.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.