Coder Social home page Coder Social logo

genai_llm_timeline's Introduction

ChatGPT, GenerativeAI and LLMs Timeline

This repository organizes a timeline of key events (products, services, papers, GitHub, blog posts and news) that occurred before and after the ChatGPT announcement.

It's curating a variety of information in this timeline, with a particular focus on LLM and Generative AI.

Maybe it's a scene from the hottest history, so I thought it would be important to keep those memories well, so I organized them.

Statistics

These diagrams were generated by ChatGPT's Code Interpreter.

Contributing

Issues and Pull Requests are greatly appreciated. If you've never contributed to an open source project before I'm more than happy to walk you through how to create a pull request.

You can start by opening an issue describing the problem that you're looking to resolve and we'll go from there.

Emoji

arXiv ❌, PDF 📎, arxiv-vanity 📙, paper page 🏠, papers with code ✳️, Github :octocat:

License

This document is licensed under the MIT license © Jonghong Jeon(전종홍)

Timeline

Date Announcement
12.9 Artificial intelligence act: Council and Parliament strike a deal on the first rules for AI in the world (press)
12.9 Google’s best Gemini AI demo video was fabricated (news)
12.8 GPT4 paper assistant: A daily ArXiv scanner (:octocat:GitHub Repo stars), (demo)
12.7 Announcing Purple Llama: Towards open trust and safety in the new world of generative AI (blog)
12.7 Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations (paper)
12.7 Enhancing Medical Task Performance in GPT-4V: A Comprehensive Study on Prompt Engineering Strategies (), (📖), (📎), (📙), (🏠), (✳️)
12.7 GPT-4V with Emotion: A Zero-shot Benchmark for Multimodal Emotion Understanding (), (📖), (📎), (📙), (🏠), (✳️)
12.7 Chain of Code: Reasoning with a Language Model-Augmented Code Emulator (), (📖), (📎), (📙), (🏠), (✳️)
12.7 OneLLM: One Framework to Align All Modalities with Language (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.6 EU Artificial Intelligence act: potential implications for healthcare AI (blog)
12.6 DiffusionSat: A Generative Foundation Model for Satellite Imagery (), (📖), (📎), (📙), (🏠), (✳️)
12.6 Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.6 Gemini: A Family of Highly Capable Multimodal Models (PDF)
12.6 Google - AlphaCode 2 Technical Report (PDF)
12.6 Google -Introducing Gemini: our largest and most capable AI model (blog), (Hands-on with Gemini: Interacting with multimodal AI - youtube)
12.6 Google - Learn more about Gemini, our most capable AI model (blog), (Welcome to the Gemini era - youtube)
12.6 Pixel 8 Pro — the first smartphone with AI built in — is now running Gemini Nano, plus more AI updates coming to the Pixel portfolio (blog)
12.6 Early LLM-based Tools for Enterprise Information Workers Likely Provide Meaningful Boosts to Productivity (paper), (pdf)
12.5 BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.5 Foundation Models for Weather and Climate Data Understanding: A Comprehensive Survey (), (📖), (📎), (📙), (🏠), (✳️)
12.5 Creative Agents: Empowering Agents with Imagination for Creative Tasks (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.5 Large Language Models on Graphs: A Comprehensive Survey (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.5 Magicoder: Source Code Is All You Need (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.5 Llamafile - Distribute and run LLMs with a single file (:octocat:GitHub Repo stars)
12.5 LLM Visualization (demo)
12.4 Towards General Purpose Vision Foundation Models for Medical Image Analysis: An Experimental Study of DINOv2 on Radiology Benchmarks (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.4 Aligning and Prompting Everything All at Once for Universal Visual Perception (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.4 Hulk: A Universal Knowledge Translator for Human-Centric Tasks (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.4 AI Alliance Launches as an International Community of Leading Technology Developers, Researchers, and Adopters Collaborating Together to Advance Open, Safe, Responsible AI (Meta blog)
12.4 Style Aligned Image Generation via Shared Attention (project), (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.4 Data Management For Large Language Models: A Survey (), (📖), (📎), (📙), (🏠), (✳️)
12.4 Merlin:Empowering Multimodal LLMs with Foresight Minds (), (📖), (📎), (📙), (🏠), (✳️)
12.3 US artificial intelligence leader OpenAI applies for GPT-6, GPT-7 trademarks in China (news)
12.2 Medical AI Tools Can Make Dangerous Mistakes. Can the Government Help Prevent Them? (WSJ news) - (archive)
12.2 Segment and Caption Anything (), (📖), (📎), (📙), (🏠), (✳️)
12.2 SeaLLMs -- Large Language Models for Southeast Asia (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.2 VideoBooth: Diffusion-based Video Generation with Image Prompts (), (📖), (📎), (📙), (🏠), (✳️)
12.2 Mamba: Linear-Time Sequence Modeling with Selective State Spaces (), (📖), (📎), (📙), (🏠), (✳️)
12.2 Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses (), (📖), (📎), (📙), (🏠), (✳️)
12.1 Grounding Everything: Emerging Localization Properties in Vision-Language Transformers (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.1 The Efficiency Spectrum of Large Language Models: An Algorithmic Survey (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.1 An open letter to ChatGPT on its first birthday (CNN news)
12.1 Explanatory Argument Extraction of Correct Answers in Resident Medical Exams (), (📖), (📎), (📙), (🏠), (✳️)
12.1 GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs (), (📖), (📎), (📙), (🏠), (✳️)
12.1 StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
12.1 Dolphins: Multimodal Language Model for Driving (), (📖), (📎), (📙), (🏠), (✳️)
12.1 DREAM: Diffusion Rectification and Estimation-Adaptive Models (), (📖), (📎), (📙), (🏠), (✳️)
12.1 Instruction-tuning Aligns LLMs to the Human Brain (), (📖), (📎), (📙), (🏠), (✳️)
12.1 Text-Guided 3D Face Synthesis -- From Generation to Editing (), (📖), (📎), (📙), (🏠), (✳️)
12.1 FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting (), (📖), (📎), (📙), (🏠), (✳️)
12.1 Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering (), (📖), (📎), (📙), (🏠), (✳️)
12.1 PyNeRF: Pyramidal Neural Radiance Fields (), (📖), (📎), (📙), (🏠), (✳️)
11.30 Tech predictions for 2024 and beyond (blog)
11.30 Meta - Audiobox: Generating audio from voice and natural language prompts (blog)
11.30 Will Generative Artificial Intelligence Deliver on Its Promise in Health Care? (JAMA doi:10.1001/jama.2023.25054)
11.30 Generative AI could revolutionize health care — but not if control is ceded to big tech (Nature doi: https://doi.org/10.1038/d41586-023-03803-y)
11.30 ChatGPT one year on: who is using it, how and why? (Nature https://doi.org/10.1038/d41586-023-03798-6)
11.30 RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.30 X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.30 MoMask: Generative Masked Modeling of 3D Human Motions (), (📖), (📎), (📙), (🏠), (✳️)
11.30 HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models (), (📖), (📎), (📙), (🏠), (✳️)
11.30 Six ways large language models are changing healthcare (Nature medicine https://doi.org/10.1038/s41591-023-02700-1)
11.30 Towards Accurate Differential Diagnosis with Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.30 Generative AI could revolutionize health care — but not if control is ceded to big tech (Nature doi: https://doi.org/10.1038/d41586-023-03803-y)
11.30 Discover, download, and run local LLMs - (LM Studio)
11.30 A timeline of Sam Altman’s firing from OpenAI — and the fallout (news)
11.30 Synthetic data: Anthropic’s CAI, from fine-tuning to pretraining, OpenAI’s Superalignment, tips, types, and open examples (blog)
11.29 Deepfakes, Misinformation, and Disinformation in the Era of Frontier AI, Generative AI, and Large AI Models (), (📖), (📎), (📙), (🏠), (✳️)
11.29 Are we going MAD? Benchmarking Multi-Agent Debate between Language Models for Medical Q&A (), (📖), (📎), (📙), (🏠), (✳️)
11.29 Welcome to a new world of work with Amazon Q - (tweet), (blog)
11.29 Scaling deep learning for materials discovery (Nature https://doi.org/10.1038/s41586-023-06735-9)
11.29 Millions of new materials discovered with deep learning (Google DeepMind blog)
11.29 OpenAI Cookbook
11.29 Announcing ElevenLabs Grants! (tweet), (site)
11.29 SDXL Turbo: A real-time text-to-image generation model (tweet), (news)
11.28 AMA issues new principles for AI development, deployment & use (press), (PDF)
11.28 Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation (), (📖), (📎), (📙), (🏠), (✳️)
11.28 Power Hungry Processing: Watts Driving the Cost of AI Deployment? (), (📖), (📎), (📙), (🏠), (✳️)
11.28 Graph Prompt Learning: A Comprehensive Survey and Beyond (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.28 ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up? (), (📖), (📎), (📙), (🏠), (✳️)
11.28 SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models (project), (), (📖), (📎), (📙), (🏠), (✳️)
11.28 Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models (project), (), (📖), (📎), (📙), (🏠), (✳️)
11.28 Introducing Pika 1.0, the idea-to-video platform that brings your creativity to life (tweet), (site)
11.28 Adversarial Diffusion Distillation (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.28 MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI (project), (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars), (Dataset)
11.28 LEDITS++: Limitless Image Editing using Text-to-Image Models (), (📖), (📎), (📙), (🏠), (✳️)
11.28 MEDITRON-70B: Scaling Medical Pretraining for Large Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.28 Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine (), (📖), (📎), (📙), (🏠), (✳️)
11.28 Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥 (:octocat:GitHub Repo stars)
11.28 The Power of Prompting (blog)
11.27 RO-LLaMA: Generalist LLM for Radiation Oncology via Noise Augmentation and Consistency Regularization (), (📖), (📎), (📙), (🏠), (✳️)
11.27 Applications of Large Scale Foundation Models for Autonomous Driving (), (📖), (📎), (📙), (🏠), (✳️)
11.27 Building the Future of Responsible AI: A Reference Architecture for Designing Large Language Model based Agents (), (📖), (📎), (📙), (🏠), (✳️)
11.27 ChatGPT’s One-Year Anniversary: Generative AI’s Breakout Year (blog)
11.27 GPT-4’s potential in shaping the future of radiology (tweet), (blog)
11.27 Automatic Hallucination detection with SelfCheckGPT NLI (blog)
11.25 Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains (), (📖), (📎), (📙), (🏠), (✳️)
11.25 LLM-Assisted Code Cleaning For Training Accurate Code Generators (), (📖), (📎), (📙), (🏠), (✳️)
11.23 MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model (project), (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars), (demo)
11.23 Challenges of Large Language Models for Mental Health Counseling (), (📖), (📎), (📙), (🏠), (✳️)
11.23 MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.23 ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs (), (📖), (📎), (📙), (🏠), (✳️)
11.23 Visual In-Context Prompting (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.22 Enhancing Summarization Performance through Transformer-Based Prompt Engineering in Automated Medical Reporting (), (📖), (📎), (📙), (🏠), (✳️)
11.22 OpenAI chaos: A timeline of firings, interim CEOs, re-hirings and other twists (blog)
11.22 Here's a timeline of the OpenAI saga with CEO Sam Altman (mashable news)
11.22 A timeline of Sam Altman's firing and dramatic return to OpenAI (Reuters news)
11.22 Sam Altman to return as CEO of OpenAI (news)
11.22 DiffusionMat: Alpha Matting as Sequential Refinement Learning (project), (), (📖), (📎), (📙), (🏠), (✳️)
11.22 GAIA: a benchmark for General AI Assistants (), (📖), (📎), (📙), (🏠), (✳️)
11.22 FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.22 LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes (), (📖), (📎), (📙), (🏠), (✳️)
11.22 Diffusion Model Alignment Using Direct Preference Optimization (), (📖), (📎), (📙), (🏠), (✳️)
11.22 Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.22 PG-Video-LLaVA: Pixel Grounding Large Video-Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.22 Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.22 SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering (), (📖), (📎), (📙), (🏠), (✳️)
11.22 ChatGPT generates fake data set to support scientific hypothesis (Nature doi: https://doi.org/10.1038/d41586-023-03635-w), (PDF)
11.21 It’s Time For ‘Nutrition Labels’ In Artificial Intelligence (Forbes news)
11.21 From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.21 ALPHA: AnomaLous Physiological Health Assessment Using Large Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.21 HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.21 PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics (), (📖), (📎), (📙), (🏠), (✳️)
11.21 NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.21 Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.21 PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction (), (📖), (📎), (📙), (🏠), (✳️)
11.21 GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning (), (📖), (📎), (📙), (🏠), (✳️)
11.21 Accuracy of ChatGPT, Google Bard, and Microsoft Bing for Simplifying Radiology Reports (RSNA https://doi.org/10.1148/radiol.232561), (PDF)
11.21 System 2 Attention (is something you might need too) (), (📖), (📎), (📙), (🏠), (✳️)
11.21 GPQA: A Graduate-Level Google-Proof Q&A Benchmark (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.21 GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration (), (📖), (📎), (📙), (🏠), (✳️)
11.20 Assessing Prompt Injection Risks in 200+ Custom GPTs (), (📖), (📎), (📙), (🏠), (✳️)
11.20 Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.20 Sam Altman to Join Microsoft Following OpenAI Ouster (WSJ news)
11.20 MultiLoRA: Democratizing LoRA for Better Multi-Task Learning (), (📖), (📎), (📙), (🏠), (✳️)
11.19 Meta Prompting for AGI Systems (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.19 M^{2}UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.19 LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.19 AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort (), (📖), (📎), (📙), (🏠), (✳️)
11.19 TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems (), (📖), (📎), (📙), (🏠), (✳️)
11.18 Designing Interpretable ML System to Enhance Trustworthy AI in Healthcare: A Systematic Review of the Last Decade to A Proposed Robust Framework (), (📖), (📎), (📙), (🏠), (✳️)
11.18 MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.18 Make Pixels Dance: High-Dynamic Video Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.18 Orca 2: Teaching Small Language Models How to Reason (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.18 Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers (), (📖), (📎), (📙), (🏠), (✳️)
11.18 Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning (), (📖), (📎), (📙), (🏠), (✳️)
11.18 Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 (), (📖), (📎), (📙), (🏠), (✳️)
11.18 SelfEval: Leveraging the discriminative nature of generative models for evaluation (), (📖), (📎), (📙), (🏠), (✳️)
11.18 Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.18 Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections (), (📖), (📎), (📙), (🏠), (✳️)
11.17 PEFT-MedAware: Large Language Model for Medical Awareness (), (📖), (📎), (📙), (🏠), (✳️)
11.17 OpenAI’s Sam Altman exits as CEO because ‘board no longer has confidence’ in his ability to lead (CNBC news)
11.17 Testing Language Model Agents Safely in the Wild (), (📖), (📎), (📙), (🏠), (✳️)
11.17 Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression (), (📖), (📎), (📙), (🏠), (✳️)
11.17 The Chosen One: Consistent Characters in Text-to-Image Diffusion Models (), (📖), (📎), (📙), (🏠), (✳️)
11.17 Adaptive Shells for Efficient Neural Radiance Field Rendering (), (📖), (📎), (📙), (🏠), (✳️)
11.17 JaxMARL: Multi-Agent RL Environments in JAX (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:![GitHub Repo stars](https://img.shields.io/github/stars/flairox/jaxmarl ?style=social))
11.16 By the Numbers: Tracking The AI Executive Order (HAI news)
11.16 Change to policy on the use of generative AI and large language models (SCience blog)
11.16 ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks (), (📖), (📎), (📙), (🏠), (✳️)
11.16 MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.16 HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.16 Do Physicians Know How to Prompt? The Need for Automatic Prompt Optimization Help in Clinical Note Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.16 AI: The Coming Revolution (report), (presentation)
11.16 VideoCon: Robust Video-Language Alignment via Contrast Captions (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.16 UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.16 Exponentially Faster Language Modelling (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.16 Memory Augmented Language Models through Mixture of Word Experts (), (📖), (📎), (📙), (🏠), (✳️)
11.16 ToolTalk: Evaluating Tool-Usage in a Conversational Setting (), (📖), (📎), (📙), (🏠), (✳️)
11.16 Video-LLaVA: Learning United Visual Representation by Alignment Before Projection (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.16 MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture (), (📖), (📎), (📙), (🏠), (✳️)
11.16 I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization (), (📖), (📎), (📙), (🏠), (✳️)
11.16 Contrastive Chain-of-Thought Prompting (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.16 Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying (), (📖), (📎), (📙), (🏠), (✳️)
11.16 DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model (), (📖), (📎), (📙), (🏠), (✳️)
11.16 Single-Image 3D Human Digitization with Shape-Guided Diffusion (), (📖), (📎), (📙), (🏠), (✳️)
11.16 GRIM: GRaph-based Interactive narrative visualization for gaMes (), (📖), (📎), (📙), (🏠), (✳️)
11.16 PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers (), (📖), (📎), (📙), (🏠), (✳️)
11.16 SiRA: Sparse Mixture of Low Rank Adaptation (), (📖), (📎), (📙), (🏠), (✳️)
11.16 Fusion-Eval: Integrating Evaluators with LLMs (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Towards Publicly Accountable Frontier LLMs: Building an External Scrutiny Ecosystem under the ASPIRE Framework (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Can AI solve medical mysteries? It’s worth finding out (WP news), (archive)
11.15 UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Drivable 3D Gaussian Avatars (), (📖), (📎), (📙), (🏠), (✳️)
11.15 EDMSound: Spectrogram Based Diffusion Models for Efficient and High-Quality Audio Synthesis (), (📖), (📎), (📙), (🏠), (✳️)
11.15 UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations (), (📖), (📎), (📙), (🏠), (✳️)
11.15 UT5: Pretraining Non autoregressive T5 with unrolled denoising (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Thread of Thought Unraveling Chaotic Contexts (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Instant3D: Instant Text-to-3D Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Fine-tuning Language Models for Factuality (), (📖), (📎), (📙), (🏠), (✳️)
11.15 Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster (), (📖), (📎), (📙), (🏠), (✳️)
11.14 Extrinsically-Focused Evaluation of Omissions in Medical Summarization (), (📖), (📎), (📙), (🏠), (✳️)
11.14 Artificial General Intelligence, Existential Risk, and Human Risk Perception (), (📖), (📎), (📙), (🏠), (✳️)
11.14 Story-to-Motion: Synthesizing Infinite and Controllable Character Animation from Long Text (), (📖), (📎), (📙), (🏠), (✳️)
11.14 One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion (), (📖), (📎), (📙), (🏠), (✳️)
11.14 A Survey on Language Models for Code (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.14 DiLoCo: Distributed Low-Communication Training of Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.14 Instruction-Following Evaluation for Large Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.14 The ART of LLM Refinement: Ask, Refine, and Trust (), (📖), (📎), (📙), (🏠), (✳️)
11.14 MART: Improving LLM Safety with Multi-round Automatic Red-Teaming (), (📖), (📎), (📙), (🏠), (✳️)
11.14 Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.14 GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.14 SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.14 MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks (), (📖), (📎), (📙), (🏠), (✳️)
11.13 Applying Large Language Models for Causal Structure Learning in Non Small Cell Lung Cancer (), (📖), (📎), (📙), (🏠), (✳️)
11.13 The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4 (), (📖), (📎), (📙), (🏠), (✳️)
11.13 To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.13 Music ControlNet: Multiple Time-varying Controls for Music Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.13 Prompt Engineering a Prompt Engineer (), (📖), (📎), (📙), (🏠), (✳️)
11.12 Trusted Source Alignment in Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.12 ChatAnything: Facetime Chat with LLM-Enhanced Personas (), (📖), (📎), (📙), (🏠), (✳️)
11.12 Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.12 Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data (), (📖), (📎), (📙), (🏠), (✳️)
11.12 Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer (), (📖), (📎), (📙), (🏠), (✳️)
11.11 LayoutPrompter: Awaken the Design Ability of Large Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.11 GOAT: GO to Any Thing (), (📖), (📎), (📙), (🏠), (✳️)
11.11 Language Models can be Logical Solvers (), (📖), (📎), (📙), (🏠), (✳️)
11.11 Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model (), (📖), (📎), (📙), (🏠), (✳️)
11.11 Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization (), (📖), (📎), (📙), (🏠), (✳️)
11.11 A Strengths, Weaknesses, Opportunities, and Threats (SWOT) Analysis of ChatGPT Integration in Nursing Education: A Narrative Review (Cureus DOI: 10.7759/cureus.48643)
11.11 The Impact of Chat Generative Pre-trained Transformer (ChatGPT) on Oncology: Application, Expectations, and Future Prospects (Cureus DOI: 10.7759/cureus.48670)
11.10 How to Bridge the Gap between Modalities: A Comprehensive Survey on Multimodal Large Language Model (), (📖), (📎), (📙), (🏠), (✳️)
11.10 ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.10 JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.10 China proposes new regulations for generative AI focusing on data security, evaluation (news) - (生成式人工智能服务安全基本要求)
11.10 New international consortium formed to create trustworthy and reliable generative AI models for science (news) - (Trillion Parameter Consortium)
11.10 AI robotics’ ‘GPT moment’ is near (TC news)
11.10 ♥️ ChatGPT in prostate cancer: myth or reality? (Prostate Cancer Prostatic Dis https://doi.org/10.1038/s41391-023-00750-7)
11.10 LCM-LoRA: A Universal Stable-Diffusion Acceleration Module (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.10 LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents (), (📖), (📎), (📙), (🏠), (✳️)
11.9 Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5? (), (📖), (📎), (📙), (🏠), (✳️)
11.9 Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.9 Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.9 ♥️ Accuracy of a Vision-Language Model on Challenging Medical Cases (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.9 The testing framework for ML models, from tabular to LLMs (:octocat:GitHub Repo stars)
11.9 A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions (), (📖), (📎), (📙), (🏠), (✳️)
11.9 ♥️ A Survey of Large Language Models in Medicine: Progress, Application, and Challenge (), (📖), (📎), (📙), (🏠), (SS), (✳️), (:octocat:GitHub Repo stars)
11.9 GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs (), (📖), (📎), (📙), (🏠), (✳️)
11.9 u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model (), (📖), (📎), (📙), (🏠), (✳️)
11.9 On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.9 Humane AI Pin: ChatGPT Wearable to Launch with $699 Price Tag (news)
11.9 Microsoft briefly restricted employee access to OpenAI’s ChatGPT, citing security concerns (news)
11.8 Unveiling Safety Vulnerabilities of Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.8 Video Instance Matting (), (📖), (📎), (📙), (🏠), (✳️)
11.8 I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models (), (📖), (📎), (📙), (🏠), (✳️)
11.8 OtterHD: A High-Resolution Multi-modality Model (), (📖), (📎), (📙), (🏠), (✳️)
11.8 TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
11.8 Holistic Evaluation of Text-To-Image Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.8 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features (), (📖), (📎), (📙), (🏠), (✳️)
11.8 NExT-Chat: An LMM for Chat, Detection and Segmentation (), (📖), (📎), (📙), (🏠), (✳️)
11.8 LRM: Large Reconstruction Model for Single Image to 3D (), (📖), (📎), (📙), (🏠), (✳️)
11.8 Prompt Cache: Modular Attention Reuse for Low-Latency Inference (), (📖), (📎), (📙), (🏠), (✳️)
11.8 Role play with large language models (Nature https://doi.org/10.1038/s41586-023-06647-8)
11.8 How Accurate was ChatGPT for Common Allergy Myths? Pretty Accurate (news)
11.8 Amazon is reportedly racing to build an AI model called Olympus to take on ChatGPT and Bard (news)
11.8 The AI boom is shaking up the tech industry and moving markets. But is it all a mirage? (news)
11.8 Samsung unveils ChatGPT alternative Samsung Gauss that can generate text, code and images (news)
11.7 Benefits and Harms of Large Language Models in Digital Mental Health (), (📖), (📎), (📙), (🏠), (✳️)
11.7 Evaluating Large Language Models in Ophthalmology (), (📖), (📎), (📙), (🏠), (✳️)
11.7 Evaluating multiple large language models in pediatric ophthalmology (), (📖), (📎), (📙), (🏠), (✳️)
11.7 Leveraging Large Language Models for Automated Proof Synthesis in Rust (), (📖), (📎), (📙), (🏠), (✳️)
11.7 SoundCam: A Dataset for Finding Humans Using Room Acoustics (), (📖), (📎), (📙), (🏠), (✳️)
11.7 Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning (), (📖), (📎), (📙), (🏠), (✳️)
11.7 Random Field Augmentations for Self-Supervised Representation Learning (), (📖), (📎), (📙), (🏠), (✳️)
11.7 Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.7 mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.7 GPT4All: An Ecosystem of Open Source Compressed Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.7 GLaMM: Pixel Grounding Large Multimodal Model (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.7 S-LoRA: Serving Thousands of Concurrent LoRA Adapters (), (📖), (📎), (📙), (🏠), (✳️)
11.7 Ziya2: Data-centric Learning is All LLMs Need (), (📖), (📎), (📙), (🏠), (✳️)
11.7 LDM3D-VR: Latent Diffusion Model for 3D VR (), (📖), (📎), (📙), (🏠), (✳️)
11.6 A Foundation Model for Music Informatics (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.6 Can LLMs Follow Simple Rules? (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.6 ‘ChatGPT detector’ catches AI-generated papers with unprecedented accuracy (Nature doi: https://doi.org/10.1038/d41586-023-03479-4)
11.6 CogVLM: Visual Expert for Pretrained Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.6 Introducing GPTs (blog)
11.6 New models and developer products announced at DevDay (blog)
11.6 OpenAI DevDay, Opening Keynote (Youtube), (tweet)
11.6 All the news from OpenAI’s first developer conference (news)
11.6 OpenAI Wants Everyone to Build Their Own Version of ChatGPT (Wired news)
11.6 ChatGPT subscribers may get a ‘GPT builder’ option soon (news)
11.5 Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.5 Levels of AGI: Operationalizing Progress on the Path to AGI (), (📖), (📎), (📙), (🏠), (✳️)
11.4 Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions (), (📖), (📎), (📙), (🏠), (✳️)
11.4 MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.4 Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.4 Ultra-Long Sequence Distributed Transformer (), (📖), (📎), (📙), (🏠), (✳️)
11.4 Meet Grok – Elon Musk’s Answer to ChatGPT (Tweet), (news)
11.4 EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision (), (📖), (📎), (📙), (🏠), (✳️)
11.3 LLM-driven Multimodal Target Volume Contouring in Radiation Oncology (), (📖), (📎), (📙), (🏠), (✳️)
11.3 FinGPT: Large Generative Models for a Small Language (), (📖), (📎), (📙), (🏠), (✳️)
11.3 ♥️ An Introduction to Natural Language Processing Techniques and Framework for Clinical Implementation in Radiation Oncology (), (📖), (📎), (📙), (🏠), (SS), (✳️)
11.3 Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.3 FLAP: Fast Language-Audio Pre-training (), (📖), (📎), (📙), (🏠), (✳️)
11.3 PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion (), (📖), (📎), (📙), (🏠), (✳️)
11.3 The world’s week on AI safety: powerful computing efforts launched to boost research (nature doi: https://doi.org/10.1038/d41586-023-03472-x)
11.3 Forget ChatGPT, why Llama and open source AI win 2023 (news)
11.3 RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation (), (📖), (📎), (📙), (🏠), (✳️)
11.3 Idempotent Generative Network (), (📖), (📎), (📙), (🏠), (✳️)
11.2 ProAgent: From Robotic Process Automation to Agentic Process Automation (), (📖), (📎), (📙), (🏠), (✳️)
11.2 A Survey of Large Language Models for Autonomous Driving (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.2 TopicGPT: A Prompt-based Topic Modeling Framework (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.2 GOV.UK - Introducing the AI Safety Institute (news)
11.2 US to launch its own AI safety institute (news)
11.2 U.S. ARTIFICIAL INTELLIGENCE SAFETY INSTITUTE (FAQ)
11.2 NIST Seeks Collaborators for Consortium Supporting Artificial Intelligence Safety (news)
11.2 RoboVQA: Multimodal Long-Horizon Reasoning for Robotics (), (📖), (📎), (📙), (🏠), (✳️)
11.2 E3 TTS: Easy End-to-End Diffusion-based Text to Speech (), (📖), (📎), (📙), (🏠), (✳️)
11.2 In-Context Prompt Editing For Conditional Audio Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.2 FlashDecoding++: Faster Large Language Model Inference on GPUs (), (📖), (📎), (📙), (🏠), (✳️)
11.2 The AI Engineer Foundation: Open Source for the Future of AI (news), (:octocat:GitHub Repo stars)
11.2 LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing (), (📖), (📎), (📙), (🏠), (✳️)
11.2 De-Diffusion Makes Text a Strong Cross-Modal Interface (), (📖), (📎), (📙), (🏠), (✳️)
11.2 Controllable Music Production with Diffusion Models and Guidance Gradients (), (📖), (📎), (📙), (🏠), (✳️)
11.1 Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.1 AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning (), (📖), (📎), (📙), (🏠), (✳️)
11.1 ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation (), (📖), (📎), (📙), (🏠), (✳️)
11.1 Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans? (), (📖), (📎), (📙), (🏠), (✳️)
11.1 ChipNeMo: Domain-Adapted LLMs for Chip Design (), (📖), (📎), (📙), (🏠), (✳️)
11.1 Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.1 The Generative AI Paradox: "What It Can Create, It May Not Understand" (), (📖), (📎), (📙), (🏠), (✳️)
11.1 Learning From Mistakes Makes LLM Better Reasoner (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
11.1 Exclusive: Stability AI brings advanced 3D and image fine-tuning to Stable Diffusion (VentureBeatnews)
11.1 An Early Look at Stability AI's New Text to 3D Model (news)
11.1 Microsoft 365 Copilot is available for purchase starting today. Here's what to know (ZDnet news)
11.1 GOV.UK - The Bletchley Declaration by Countries Attending the AI Safety Summit, 1-2 November 2023 (Policy paper)
11.1 Generative AI for Beginners - A Course (:octocat:GitHub Repo stars)
11.1 GOV.UK - Countries agree to safe and responsible development of frontier AI in landmark Bletchley Declaration (press)
11.1 JADE: A Linguistics-based Safety Evaluation Platform for LLM (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.31 Taking control: Policies to address extinction risks from advanced AI (), (📖), (📎), (📙), (🏠), (✳️)
10.31 Does GPT-4 Pass the Turing Test? (), (📖), (📎), (📙), (🏠), (✳️), (test)
10.31 ♥️ A Comprehensive Study of GPT-4V's Multimodal Capabilities in Medical Imaging (), (📖), (📎), (📙), (🏠), (✳️)
10.31 Artificial intelligence - UK Regulatory Outlook October 2023 (news)
10.31 MM-VID: Advancing Video Understanding with GPT-4V(ision) (), (📖), (📎), (📙), (🏠), (✳️)
10.30 EHRTutor: Enhancing Patient Understanding of Discharge Instructions (), (📖), (📎), (📙), (🏠), (✳️)
10.30 Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities (), (📖), (📎), (📙), (🏠), (✳️)
10.30 Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.30 RedPajama-Data-v2: an Open Dataset with 30 Trillion Tokens for Training Large Language Models (blog), (:octocat:GitHub Repo stars)
10.30 Awesome LLMs Evaluation Papers (:octocat:GitHub Repo stars)
10.30 Evaluating Large Language Models: A Comprehensive Survey (), (📖), (📎), (📙), (🏠), (✳️), (SS), (:octocat:GitHub Repo stars)
10.30 Phishing emails increase over 1,200 percent since ChatGPT launch (news)
10.30 G7 Leaders’ Statement on the Hiroshima AI Process (statement), (download), (white house)
10.30 Commission welcomes G7 leaders' agreement on Guiding Principles and a Code of Conduct on Artificial Intelligence (news)
10.30 Hiroshima Process International Guiding Principles for Advanced AI system (news), (download)
10.30 Hiroshima Process International Code of Conduct for Advanced AI Systems (news), (download)
10.30 FACT SHEET: President Biden Issues Executive Order on Safe, Secure, and Trustworthy Artificial Intelligence (news)
10.30 Joe Biden’s Sweeping New Executive Order Aims to Drag the US Government Into the Age of ChatGPT (Wired news)
10.30 ♥️ Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.30 Atom: Low-bit Quantization for Efficient and Accurate LLM Serving (), (📎), (📙), (🏠), (✳️)
10.30 Skywork: A More Open Bilingual Foundation Model (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.30 VideoCrafter1: Open Diffusion Models for High-Quality Video Generation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.30 Text-to-3D with classifier score distillation (), (📎), (📙), (🏠), (✳️)
10.29 TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise (), (📎), (📙), (🏠), (✳️)
10.28 Overview of Current Applications of Large Language Models in Various Medical Specialities (), (📖), (📎), (📙), (🏠), (✳️)
10.28 Punica: Multi-Tenant LoRA Serving (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.28 Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation (), (📎), (📙), (🏠), (✳️)
10.27 ♥️ Qilin-Med-VL: Towards Chinese Large Vision-Language Model for General Healthcare (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.27 JudgeLM: Fine-tuned Large Language Models are Scalable Judges (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.27 A Framework for Automated Measurement of Responsible AI Harms in Generative AI Applications (), (📎), (📙), (🏠), (✳️)
10.27 ControlLLM: Augment Language Models with Tools by Searching on Graphs (), (📎), (📙), (🏠), (✳️)
10.27 FP8-LM: Training FP8 Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.27 United Nations creates advisory body to address AI governance (Reuters news, UN AI Advisory Body)
10.27 Guarding the AI frontier: A proposal for federal regulation (news)
10.27 GOV.UK - Emerging processes for frontier AI safety (white paper - HTML, PDF)
10.27 GOV.UK - Leading frontier AI companies publish safety policies (news)
10.26 Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting (), (📖), (📎), (📙), (🏠), (✳️)
10.26 UK Prime Minister announces world’s first AI Safety Institute (news)
10.26 Using fine-tuned large language models to parse clinical notes in musculoskeletal pain disorders (Lancet https://doi.org/10.1016/S2589-7500(23)00202-9)
10.26 Large Language Models as Generalizable Policies for Embodied Tasks (project), (), (📖), (📎), (📙), (🏠), (✳️)
10.26 How the Foundation Model Transparency Index Distorts Transparency (blog)
10.26 Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.26 Controlled Decoding from Language Models (), (📎), (📙), (🏠), (✳️)
10.26 HyperFields: Towards Zero-Shot Generation of NeRFs from Text (), (📎), (📙), (🏠), (✳️)
10.26 CodeFusion: A Pre-trained Diffusion Model for Code Generation (), (📎), (📙), (🏠), (✳️)
10.26 BostonDynamics - a robot tour guide using Spot integrated with Chat GPT and other AI models as a proof of concept for the robotics applications of foundational models (Youtube)
10.26 A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation (), (📎), (📙), (🏠), (✳️)
10.26 DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior (), (📎), (📙), (🏠), (✳️)
10.25 Qualcomm Raises Bar for On-Device Generative AI at Snapdragon Summit (news) - (Keynote)
10.25 Artificial Intelligence in Health Care: Peter Lee on Empathy, Empowerment, and Equity (blog)
10.25 ♥️ An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.25 OpenAI - Frontier risk and preparedness (Blog)
10.25 ogether with Anthropic, Google, and Microsoft, we’re announcing the new Executive Director of the Frontier Model Forum and a new $10 million AI Safety Fund (blog)
10.25 An Early Evaluation of GPT-4V(ision) (), (📖), (📎), (📙), (🏠), (✳️)
10.25 In-Context Learning Creates Task Vectors (), (📎), (📙), (🏠), (✳️)
10.25 Woodpecker: Hallucination Correction for Multimodal Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.25 Dissecting In-Context Learning of Translations in GPTs (), (📎), (📙), (🏠), (✳️)
10.24 BLESS: Benchmarking Large Language Models on Sentence Simplification (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.24 NoteChat: A Dataset of Synthetic Doctor-Patient Conversations Conditioned on Clinical Notes (), (📖), (📎), (📙), (🏠), (✳️)
10.24 Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature (), (📖), (📎), (📙), (🏠), (✳️)
10.24 SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding (), (📎), (📙), (🏠), (✳️)
10.24 Wonder3D: Single Image to 3D using Cross-Domain Diffusion (), (📎), (📙), (🏠), (✳️)
10.24 Matryoshka Diffusion Models (), (📎), (📙), (🏠), (✳️)
10.24 DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design (), (📎), (📙), (🏠), (✳️)
10.24 FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.24 Branch-Solve-Merge Improves Large Language Model Evaluation and Generation (), (📎), (📙), (🏠), (✳️)
10.23 Systematic AI Approach for AGI: Addressing Alignment, Energy, and AGI Grand Challenges (), (📖), (📎), (📙), (🏠), (✳️)
10.23 Evaluating Large Language Models on Controlled Generation Tasks (), (📖), (📎), (📙), (🏠), (✳️)
10.23 AlpaCare:Instruction-tuned Large Language Models for Medical Application (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.23 Large Search Model: Redefining Search Stack in the Era of LLMs (), (📖), (📎), (📙), (🏠), (✳️)
10.23 InstructExcel: A Benchmark for Natural Language Instruction in Excel (), (📎), (📙), (🏠), (✳️)
10.23 HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.23 Moral Foundations of Large Language Models (), (📎), (📙), (🏠), (✳️)
10.23 Exploring the Boundaries of GPT-4 in Radiology (), (📎), (📙), (🏠), (✳️)
10.22 An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI (), (📖), (📎), (📙), (🏠), (✳️)
10.22 Assessing the Utilization of Large Language Models in Medical Education: Insights From Undergraduate Medical Students (Cureus DOI: 10.7759/cureus.47468)
10.21 TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models (), (📎), (📙), (🏠), (✳️)
10.21 Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs (), (📎), (📙), (🏠), (✳️)
10.21 Specific versus General Principles for Constitutional AI (), (📎), (📙), (🏠), (✳️)
10.21 Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models (), (📎), (📙), (🏠), (✳️)
10.21 Contrastive Preference Learning: Learning from Human Feedback without RL (), (📎), (📙), (🏠), (✳️)
10.20 Democratizing Reasoning Ability: Tailored Learning from Large Language Model (), (📎), (📙), (🏠), (✳️)
10.20 DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.20 Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models (), (📎), (📙), (🏠), (✳️)
10.20 Localizing and Editing Knowledge in Text-to-Image Generative Models (), (📎), (📙), (🏠), (✳️)
10.20 Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots (), (📎), (📙), (🏠), (✳️)
10.20 SALMONN: Towards Generic Hearing Abilities for Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.20 Teaching Language Models to Self-Improve through Interactive Demonstrations (), (📎), (📙), (🏠), (✳️)
10.20 DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation (), (📎), (📙), (🏠), (✳️)
10.20 Creative Robot Tool Use with Large Language Models (), (📎), (📙), (🏠), (✳️)
10.20 Tuna: Instruction Tuning using Feedback from Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.20 ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search (), (📎), (📙), (🏠), (✳️)
10.20 SILC: Improving Vision Language Pretraining with Self-Distillation (), (📎), (📙), (🏠), (✳️)
10.20 Towards Understanding Sycophancy in Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.20 ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection (), (📎), (📙), (🏠), (✳️)
10.20 3D-GPT: Procedural 3D Modeling with Large Language Models (), (📎), (📙), (🏠), (✳️)
10.20 Eureka: Human-Level Reward Design via Coding Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.20 AgentTuning: Enabling Generalized Agent Abilities for LLMs (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.20 Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning (), (📎), (📙), (🏠), (✳️)
10.20 AutoMix: Automatically Mixing Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.20 An Emulator for Fine-Tuning Large Language Models using Small Language Models (), (📎), (📙), (🏠), (✳️)
10.20 ChatGPT parent OpenAI seeks $86bn valuation (FT (news)
10.19 The Foundation Model Transparency Index (), (📖), (📎), (📙), (🏠), (✳️), (SS), (:octocat:GitHub Repo stars)
10.19 An Image is Worth Multiple Words: Learning Object Level Concepts using Multi-Concept Prompt Learning (), (📎), (📙), (🏠), (✳️)
10.19 Loop Copilot: Conducting AI Ensembles for Music Generation and Iterative Editing (), (📎), (📙), (🏠), (✳️)
10.19 Safe RLHF: Safe Reinforcement Learning from Human Feedback (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.19 Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.18 DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.18 Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.18 MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.18 Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts (), (📎), (📙), (🏠), (✳️)
10.18 BitNet: Scaling 1-bit Transformers for Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.18 4K4D: Real-Time 4D View Synthesis at 4K Resolution (), (📎), (📙), (🏠), (✳️)
10.18 VeRA: Vector-based Random Matrix Adaptation (), (📎), (📙), (🏠), (✳️)
10.18 Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V (), (📖), (📎), (📙), (🏠), (✳️)
10.18 EvalCrafter: Benchmarking and Evaluating Large Video Generation Models (), (📎), (📙), (🏠), (✳️)
10.17 Integrating LLM, EEG, and Eye-Tracking Biomarker Analysis for Word-Level Neural State Classification in Semantic Inference Reading Comprehension (), (📖), (📎), (📙), (🏠), (✳️)
10.17 Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
10.17 TEQ: Trainable Equivalent Transformation for Quantization of LLMs (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.17 LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.17 CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (), (📎), (📙), (🏠), (✳️)
10.17 Context-Aware Meta-Learning (), (📎), (📙), (🏠), (✳️)
10.17 H2O Open Ecosystem for State-of-the-art Large Language Models (), (📎), (📙), (🏠), (✳️)
10.17 In-Context Pretraining: Language Modeling Beyond Document Boundaries (), (📎), (📙), (🏠), (✳️)
10.17 Interactive Task Planning with Language Models (), (📎), (📙), (🏠), (✳️)
10.17 Video Language Planning (), (📎), (📙), (🏠), (✳️)
10.16 OpenAgents: An Open Platform for Language Agents in the Wild (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.16 How ChatGPT is transforming the postdoc experience (Nature 622, 655-657 (2023) (doi: https://doi.org/10.1038/d41586-023-03235-8)
10.16 Llemma: An Open Language Model For Mathematics (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.15 AutoAgents: A Framework for Automatic Agent Generation (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.15 Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical Diagnosis (), (📖), (📎), (📙), (🏠), (✳️)
10.14 MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.14 Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.14 Table-GPT: Table-tuned GPT for Diverse Table Tasks (), (📎), (📙), (🏠), (✳️)
10.14 PaLI-3 Vision Language Models: Smaller, Faster, Stronger (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.13 Multinational AGI Consortium (MAGIC): A Proposal for International Coordination on AI (), (📖), (📎), (📙), (🏠), (✳️)
10.13 A Zero-Shot Language Agent for Computer Control with Structured Reflection (), (📎), (📙), (🏠), (✳️)
10.13 The Consensus Game: Language Model Generation via Equilibrium Search (), (📎), (📙), (🏠), (✳️)
10.13 LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models (), (📎), (📙), (🏠), (✳️)
10.13 CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules (), (📎), (📙), (🏠), (✳️)
10.13 Toward Joint Language Modeling for Speech Units and Text (), (📎), (📙), (🏠), (✳️)
10.13 Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation (), (📎), (📙), (🏠), (✳️)
10.13 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion (), (📎), (📙), (🏠), (✳️)
10.13 GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors (), (📎), (📙), (🏠), (✳️)
10.13 MotionDirector: Motion Customization of Text-to-Video Diffusion Models (), (📎), (📙), (🏠), (✳️)
10.12 Organizational preparedness for the use of large language models in pathology informatics (Journal of Pathology Informatics, https://doi.org/10.1016/j.jpi.2023.100338)
10.12 ♥️ FDA creates new advisory committee for digital health and AI (news)
10.12 EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation (), (📎), (📙), (🏠), (✳️)
10.12 LangNav: Language as a Perceptual Representation for Navigation (), (📎), (📙), (🏠), (✳️)
10.12 Octopus: Embodied Vision-Language Programmer from Environmental Feedback (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.12 Prometheus: Inducing Fine-grained Evaluation Capability in Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.12 Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams (), (📎), (📙), (🏠), (✳️)
10.11 Exploring the Landscape of Large Language Models In Medical Question Answering: Observations and Open Questions (), (📖), (📎), (📙), (🏠), (✳️)
10.11 Lemur: Harmonizing Natural Language and Code for Language Agents (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.10 Teaching Language Models to Hallucinate Less with Synthetic Tasks (), (📖), (📎), (📙), (🏠), (✳️)
10.10 Towards Mitigating Hallucination in Large Language Models via Self-Reflection (), (📖), (📎), (📙), (🏠), (✳️)
10.10 Multilingual Jailbreak Challenges in Large Language Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.10 Feasibility of Using the Privacy-preserving Large Language Model Vicuna for Labeling Radiology Reports (RSNA Radiology https://doi.org/10.1148/radiol.231147)
10.10 Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.10 How ChatGPT and other AI tools could disrupt scientific publishing (Nature 622, 234-236 (2023) doi: https://doi.org/10.1038/d41586-023-03144-w)
10.9 GraphLLM: Boosting Graph Reasoning Ability of Large Language Model (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:![GitHub Repo stars](https://img.shields.io/github/stars/ mistyreed63849/graph-llm?style=social))
10.9 HyperAttention: Long-context Attention in Near-Linear Time (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.9 ♥️ A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.8 ♥️ ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data (), (📖), (📎), (📙), (🏠), (✳️)
10.7 Data-Centric Financial Large Language Models (), (📎), (📙), (🏠), (✳️)
10.6 Segmented Harmonic Loss: Handling Class-Imbalanced Multi-Label Clinical Data for Medical Coding with Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
10.6 Governments race to regulate AI tools (Reuters news)
10.6 MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.6 Improved Baselines with Visual Instruction Tuning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.6 Aligning Text-to-Image Diffusion Models with Reward Backpropagation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.6 DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.6 Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency (), (📎), (📙), (🏠), (✳️)
10.6 A Long Way to Go: Investigating Length Correlations in RLHF (), (📎), (📙), (🏠), (✳️)
10.6 Drag View: Generalizable Novel View Synthesis with Unposed Imagery (), (📎), (📙), (🏠), (✳️)
10.6 HeaP: Hierarchical Policies for Web Actions using LLMs (), (📎), (📙), (🏠), (✳️)
10.5 Redefining Digital Health Interfaces with Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
10.5 Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report (), (📖), (📎), (📙), (🏠), (✳️)
10.5 Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT (Nat Comput Sci 3, 833–838 (2023). https://doi.org/10.1038/s43588-023-00527-x)
10.5 Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning (), (📎), (📙), (🏠), (✳️)
10.5 FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.5 Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.4 EcoAssistant: Using LLM Assistant More Affordably and Accurately (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.4 How FaR Are Large Language Models From Agents with Theory-of-Mind? (), (📎), (📙), (🏠), (✳️)
10.3 Low-Resource Languages Jailbreak GPT-4 (), (📖), (📎), (📙), (🏠), (✳️)
10.3 Can large language models provide useful feedback on research papers? A large-scale empirical analysis (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.3 ♥️ Conversational Health Agents: A Personalized LLM-Powered Agent Framework (), (📎), (📙), (🏠), (✳️)
10.3 Large Language Models Cannot Self-Correct Reasoning Yet (), (📎), (📙), (🏠), (✳️)
10.3 ImagenHub: Standardizing the evaluation of conditional image generation models (), (📎), (📙), (🏠), (✳️)
10.3 Large Language Models as Analogical Reasoners (), (📎), (📙), (🏠), (✳️)
10.3 SmartPlay : A Benchmark for LLMs as Intelligent Agents (), (📎), (📙), (🏠), (✳️)
10.3 Conditional Diffusion Distillation (), (📎), (📙), (🏠), (✳️)
10.2 Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.2 Investigating the Efficacy of Large Language Models in Reflective Assessment Methods through Chain of Thoughts Prompting (), (📖), (📎), (📙), (🏠), (✳️)
10.2 Mirror Diffusion Models for Constrained and Watermarked Generation (), (📎), (📙), (🏠), (✳️)
10.2 UniAudio: An Audio Foundation Model Toward Universal Audio Generation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
10.2 Enable Language Models to Implicitly Learn Self-Improvement From Data (), (📎), (📙), (🏠), (✳️)
10.1 PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.30 Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives (), (📎), (📙), (🏠), (✳️)
9.29 An evaluation of GPT models for phenotype concept recognition (), (📖), (📎), (📙), (🏠), (✳️)
9.29 Vision Transformers Need Registers (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.29 The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision) (), (📖), (📎), (📙), (🏠), (✳️)
9.29 DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.29 Text-to-3D using Gaussian Splatting (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.29 Qwen Technical Report (), (📎), (📙), (🏠), (✳️), (:octocat:![GitHub Repo stars](https://img.shields.io/github/stars/qwenlm/qwen ?style=social))
9.29 Deep Geometrized Cartoon Line Inbetweening (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.29 Demystifying CLIP Data (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.29 MotionLM: Multi-Agent Motion Forecasting as Language Modeling (), (📎), (📙), (🏠), (✳️)
9.29 GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.29 RealFill: Reference-Driven Generation for Authentic Image Completion (), (📎), (📙), (🏠), (✳️)
9.29 CCEdit: Creative and Controllable Video Editing via Diffusion Models (), (📎), (📙), (🏠), (✳️)
9.29 ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning (), (📎), (📙), (🏠), (✳️)
9.28 Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack (), (📎), (📙), (🏠), (✳️)
9.28 Language models in molecular discovery (), (📎), (📙), (🏠), (✳️)
9.28 Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.28 AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models (), (📎), (📙), (🏠), (✳️)
9.28 AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model (), (📎), (📙), (🏠), (✳️)
9.28 Effective Long-Context Scaling of Foundation Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.28 Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.27 NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.27 Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition (), (📎), (📙), (🏠), (✳️)
9.27 Jointly Training Large Autoregressive Multimodal Models (), (📎), (📙), (🏠), (✳️)
9.27 DECO: Dense Estimation of 3D Human-Scene Contact In The Wild (), (📎), (📙), (🏠), (✳️)
9.27 Finite Scalar Quantization: VQ-VAE Made Simple (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.27 VPA: Fully Test-Time Visual Prompt Adaptation (), (📎), (📙), (🏠), (✳️)
9.27 LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models (), (📎), (📙), (🏠), (✳️)
9.27 VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning (), (📎), (📙), (🏠), (✳️)
9.27 Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models (), (📎), (📙), (🏠), (✳️)
9.26 ♥️ Creating Trustworthy LLMs: Dealing with Hallucinations in Healthcare AI (), (📖), (📎), (📙), (SS), (🏠), (✳️)
9.26 QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models (), (📎), (📙), (🏠), (✳️)
9.26 Aligning Large Multimodal Models with Factually Augmented RLHF (), (📎), (📙), (🏠), (✳️)
9.26 DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models (), (📎), (📙), (🏠), (✳️)
9.26 Efficient Post-training Quantization with FP8 Formats (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.26 DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.26 Small-scale proxies for large-scale Transformer training instabilities (), (📎), (📙), (🏠), (✳️)
9.25 VidChapters-7M: Video Chapters at Scale (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.25 Evaluating Cognitive Maps and Planning in Large Language Models with CogEval (), (📎), (📙), (🏠), (✳️)
9.23 MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.23 Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model (), (📎), (📙), (🏠), (✳️)
9.23 Robotic Offline RL from Internet Videos via Value-Function Pre-Training (), (📎), (📙), (🏠), (✳️)
9.23 Exploring Large Language Models' Cognitive Moral Development through Defining Issues Test (), (📎), (📙), (🏠), (✳️)
9.23 Calibrating LLM-Based Evaluator (), (📎), (📙), (🏠), (✳️)
9.22 Affect Recognition in Conversations Using Large Language Models (), (📖), (📎), (📙), (🏠), (✳️)
9.22 DRG-LLaMA : Tuning LLaMA Model to Predict Diagnosis-related Group for Hospitalized Patients (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.22 CodePlan: Repository-level Coding using LLMs and Planning (), (📎), (📙), (🏠), (✳️)
9.22 DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion (), (📎), (📙), (🏠), (✳️)
9.22 LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.22 LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent (), (📎), (📙), (🏠), (✳️)
9.22 MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models (), (📎), (📙), (🏠), (✳️)
9.22 Game of Thrones author sues ChatGPT owner OpenAI (BBC news)
9.21 Foundation Metrics: Quantifying Effectiveness of Healthcare Conversations powered by Generative AI (), (📖), (📎), (📙), (🏠), (✳️)
9.21 How Robust is Google's Bard to Adversarial Image Attacks? (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.21 SCREWS: A Modular Framework for Reasoning with Revisions (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.21 OpenAI release preview of Dall-E 3 (tweet), (DALL·E 3)
9.21 LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset (), (📎), (📙), (🏠), (✳️)
9.21 BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model (), (📎), (📙), (🏠), (✳️)
9.21 A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.21 DreamLLM: Synergistic Multimodal Comprehension and Creation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.21 FreeU: Free Lunch in Diffusion U-Net (), (📎), (📙), (🏠), (✳️)
9.21 Kosmos-2.5: A Multimodal Literate Model (), (📎), (📙), (🏠), (✳️)
9.21 Chain-of-Verification Reduces Hallucination in Large Language Models (), (📎), (📙), (🏠), (✳️)
9.20 OpenChat: Advancing Open-source Language Models with Mixed-Quality Data (), (📖),, (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.20 A Large-scale Dataset for Audio-Language Representation Learning (), (📎), (📙), (🏠), (✳️)
9.20 LMDX: Language Model-based Document Information Extraction and Localization (), (📎), (📙), (🏠), (✳️)
9.20 The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute (), (📎), (📙), (🏠), (✳️)
9.20 OpenAI’s Dall-E 3 Is an Art Generator Powered by ChatGPT (Wired news)
9.20 OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.19 Enhancing Health Data Interoperability with Large Language Models: A FHIR Study (), (📖), (📎), (📙), (🏠), (✳️)
9.19 OpenCog Hyperon: A Framework for AGI at the Human Level and Beyond (), (📎), (📙), (🏠), (✳️)
9.19 SlimPajama-DC: Understanding Data Combinations for LLM Training (), (📎), (📙), (🏠), (✳️)
9.19 Baichuan 2: Open Large-scale Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.19 Stabilizing RLHF through Advantage Model and Selective Rehearsal (), (📎), (📙), (🏠), (✳️)
9.19 360^circ Reconstruction From a Single Image Using Space Carved Outpainting (), (📎), (📙), (🏠), (✳️)
9.19 Language Modeling Is Compression (), (📎), (📙), (🏠), (✳️)
9.19 Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions (), (📎), (📙), (🏠), (✳️)
9.18. Data Formulator: AI-powered Concept-driven Visualization Authoring (), (📎), (📙), (🏠), (✳️)
9.18 MindAgent: Emergent Gaming Interaction (), (📎), (📙), (🏠), (✳️)
9.18 An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.18 LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.18 Multimodal Foundation Models: From Specialists to General-Purpose Assistants (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.18 Adapting Large Language Models via Reading Comprehension (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.17 OWL: A Large Language Model for IT Operations (), (📎), (📙), (🏠), (✳️)
9.17 CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages (), (📎), (📙), (🏠), (✳️)
9.17 Contrastive Decoding Improves Reasoning in Large Language Models (), (📎), (📙), (🏠), (✳️)
9.16 PDFTriage: Question Answering over Long, Structured Documents (), (📎), (📙), (🏠), (✳️)
9.16 Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT) (), (📎), (📙), (🏠), (✳️)
9.16 Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data? (), (📎), (📙), (🏠), (✳️)
9.15 Compositional Foundation Models for Hierarchical Planning (), (📎), (📙), (🏠), (✳️)
9.15 Scaling Laws for Sparsely-Connected Foundation Models (), (📎), (📙), (🏠), (✳️)
9.15 Investigating Answerability of LLMs for Long-Form Question Answering (), (📎), (📙), (🏠), (✳️)
9.15 Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers (), (📎), (📙), (🏠), (✳️)
9.15 TextBind: Multi-turn Interleaved Multimodal Instruction-following (), (📎), (📙), (🏠), (✳️)
9.15 LASER: LLM Agent with State-Space Exploration for Web Navigation (), (📎), (📙), (🏠), (✳️)
9.14 Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges (), (📖), (📎), (📙), (🏠), (✳️)
9.14 The Rise and Potential of Large Language Model Based Agents: A Survey (), (📎), (📙), (🏠), (✳️) ,(:octocat:GitHub Repo stars)
9.14 Agents: An Open-source Framework for Autonomous Language Agents (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.14 Generative Image Dynamics (), (📎), (📙), (🏠), (✳️)
9.14 Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.14 AudioSR: Versatile Audio Super-resolution at Scale (), (📎), (📙), (🏠), (✳️)
9.14 Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? (), (📎), (📙), (🏠), (✳️)
9.14 Ambiguity-Aware In-Context Learning with Large Language Models (), (📎), (📙), (🏠), (✳️)
9.13 RAIN: Your Language Models Can Align Themselves without Finetuning (), (📎), (📙), (🏠), (✳️)
9.13 Text-Guided Generation and Editing of Compositional 3D Avatars (), (📎), (📙), (🏠), (✳️)
9.13 MagiCapture: High-Resolution Multi-Concept Portrait Customization (), (📎), (📙), (🏠), (✳️)
9.13 DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models (), (📎), (📙), (🏠), (✳️)
9.12 Re-Reading Improves Reasoning in Language Models (), (📎), (📙), (🏠), (✳️)
9.12 A Survey of Hallucination in Large Foundation Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.12 Learning Disentangled Avatars with Hybrid 3D Representations (), (📎), (📙), (🏠), (✳️)
9.12 InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.12 Efficient Memory Management for Large Language Model Serving with PagedAttention (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.12 Large Language Model for Science: A Study on P vs. NP (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.12 AstroLLaMA: Towards Specialized Foundation Models in Astronomy (), (📎), (📙), (🏠), (✳️)
9.12 Uncovering mesa-optimization algorithms in Transformers (), (📎), (📙), (🏠), (✳️)
9.11 MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning (), (📎), (📙), (🏠), (✳️)
9.11 PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models (), (📎), (📙), (🏠), (✳️)
9.11 Large Language Models for Compiler Optimization (), (📎), (📙), (🏠), (✳️)
9.11 NExT-GPT: Any-to-Any Multimodal LLM (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.11 Textbooks Are All You Need II: phi-1.5 technical report (), (📎), (📙), (🏠), (✳️)
9.11 Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.10 Neurons in Large Language Models: Dead, N-gram, Positional (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.9 MADLAD-400: A Multilingual And Document-Level Large Audited Dataset (), (📎), (📙), (🏠), (✳️)
9.9 When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale (), (📎), (📙), (🏠), (✳️)
9.9 FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning (), (📎), (📙), (🏠), (✳️)
9.8 From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting (), (📎), (📙), (🏠), (✳️)
9.8 Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts (), (📎), (📙), (🏠), (✳️)
9.7 GOV.UK - Frontier AI Taskforce: first progress report (report)
9.7 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks (), (📎), (📙), (🏠), (✳️)
9.7 ImageBind-LLM: Multi-modality Instruction Tuning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.7 ProPainter: Improving Propagation and Transformer for Video Inpainting (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.7 Tracking Anything with Decoupled Video Segmentation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.7 FLM-101B: An Open LLM and How to Train It with $100K Budget (), (📎), (📙), (🏠), (✳️)
9.7 Large-Scale Automatic Audiobook Creation (), (📎), (📙), (🏠), (✳️)
9.7 Large Language Models as Optimizers (), (📎), (📙), (🏠), (✳️)
9.7 SyncDreamer: Generating Multiview-consistent Images from a Single-view Image (), (📎), (📙), (🏠), (✳️)
9.7 Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model (), (📎), (📙), (🏠), (✳️)
9.7 DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.7 XGen-7B Technical Report (), (📎), (📙), (🏠), (✳️)
9.7 Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation (), (📎), (📙), (🏠), (✳️)
9.7 SLiMe: Segment Like Me (), (📎), (📙), (🏠), (✳️)
9.6 GPT Can Solve Mathematical Problems Without a Calculator (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.6 Physically Grounded Vision-Language Models for Robotic Manipulation (), (📎), (📙), (🏠), (✳️)
9.6 Doppelgangers: Learning to Disambiguate Images of Similar Structures (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.5 Artificial General Intelligence for Radiation Oncology (), (📖), (📎), (📙), (🏠), (✳️)
9.5 Cognitive Architectures for Language Agents (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.5 Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning (), (📎), (📙), (🏠), (✳️)
9.5 One Wide Feedforward is All You Need (), (📎), (📙), (🏠), (✳️)
9.5 AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections (), (📎), (📙), (🏠), (✳️)
9.5 PromptTTS 2: Describing and Generating Voices with Text Prompt (), (📎), (📙), (🏠), (✳️)
9.5 Hierarchical Masked 3D Diffusion Model for Video Outpainting (), (📎), (📙), (🏠), (✳️)
9.4 Concepts is All You Need: A More Direct Path to AGI (), (📖), (📎), (📙), (🏠), (✳️)
9.4 StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation (), (📎), (📙), (🏠), (✳️)
9.4 ControlMat: A Controlled Generative Approach to Material Capture (), (📎), (📙), (🏠), (✳️)
9.3 ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.2 Bias and Fairness in Large Language Models: A Survey (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.2 Contrastive Feature Masking Open-Vocabulary Vision Transformer (), (📎), (📙), (🏠), (✳️)
9.2 Efficient RLHF: Reducing the Memory Usage of PPO (), (📎), (📙), (🏠), (✳️)
9.2 MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation (), (📎), (📙), (🏠), (✳️)
9.2 Google's search for an AI future as it turns 25 (BBC news)
9.2 ChatGPT Glossary: 41 AI Terms that Everyone Should Know (blog)
9.2 CityDreamer: Compositional Generative Model of Unbounded 3D Cities (), (📎), (📙), (🏠), (✳️)
9.2 Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.1 RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback (), (📎), (📙), (🏠), (✳️)
9.1 YaRN: Efficient Context Window Extension of Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
9.1 VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation (), (📎), (📙), (🏠), (✳️)
9.1 Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior (), (📎), (📙), (🏠), (✳️)
9.1 FACET: Fairness in Computer Vision Evaluation Benchmark (), (📎), (📙), (🏠), (✳️)
9.1 UT Researchers Use AI to Translate Thoughts Into Text (blog)
9.1 Baidu launches Ernie chatbot after Chinese government approval (news)
9.1 The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants (), (📎), (📙), (🏠), (✳️)
8.31 AI Agents – Build and Host LLM Apps At Scale (blog)
8.31 UAE launches Arabic large language model in Gulf push into generative AI (blog)
8.31 UK MPs Propose Allies Form AI Union to Guard Against Adversaries (news)
8.31 OpenAI released a new Teaching with AI (blog)
8.31 BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.31 Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images (), (📎), (📙), (🏠), (✳️)
8.31 LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models (), (📎), (📙), (🏠), (✳️)
8.31 MVDream: Multi-view Diffusion for 3D Generation (), (📎), (📙), (🏠), (✳️)
8.31 Emergence of Segmentation with Minimalistic White-Box Transformers (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.31 Learning Vision-based Pursuit-Evasion Robot Policies (project), (), (📎), (📙), (🏠), (✳️)
8.30 SAM-Med2D (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.30 Large language models aren’t people. Let’s stop testing them as if they were (MIT TR blog)
8.30 Sobering Reports on AI for CPR, Cancer Treatment Advice (blog)
8.30 Why Generative AI Needs Another Breakthrough Moment (blog)
8.30 Chinese ChatGPT alternatives just got approved for the general public (MIT TR news)
8.30 OpenAI Nears $1 Billion of Annual Sales as ChatGPT Takes Off (news), (archive.today)
8.30 International Governance of Civilian AI: A Jurisdictional Certification Approach (), (📎), (📙), (🏠), (✳️)
8.30 AnomalyGPT: Detecting Industrial Anomalies using Large Vision-Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.30 LLaSM: Large Language and Speech Model (proejct), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.30 RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation (project), (), (📎), (📙), (🏠), (✳️)
8.29 Vector Search with OpenAI Embeddings: Lucene Is All You Need (), (📎), (📙), (🏠), (✳️)
8.29 Radiology-Llama2: Best-in-Class Large Language Model for Radiology (), (📎), (📙), (🏠), (✳️)
8.29 Inside Google's Plans To Fix Healthcare With Generative AI (Forbes news)
8.29 Google’s new Vertex AI features to unlock advanced LLM capabilities (blog)
8.29 The company landscape for artificial intelligence in large-molecule drug discovery (nature reviews drug discovery doi: https://doi.org/10.1038/d41573-023-00139-0)
8.29 Full Code Medical Launches Full Code AI, the First Integration of ChatGPT in Software-Based Medical Simulation (blog)
8.29 ChatGPT in Medical Education and Research: A Boon or a Bane? (DOI: 10.7759/cureus.44316)
8.29 OpenAI Unveils ChatGPT for Businesses, Stepping Up Revenue Push (news), (archive.today)
8.28 Graph Meets LLMs: Towards Large Graph Models (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.28 AI Deception: A Survey of Examples, Risks, and Potential Solutions (), (📎), (📙), (🏠), (✳️)
8.28 Is the AI boom already over? (blog)
8.28 Most Americans haven’t used ChatGPT; few think it will have a major impact on their job ([Pew Research Center news)
8.28 OpenAI - Introducing ChatGPT Enterprise (blog)
8.28 PointHPS: Cascaded 3D Human Pose and Shape Estimation from Point Clouds (), (📎), (📙), (🏠), (✳️)
8.27 MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records (), (📎), (📙), (🏠), (✳️)
8.26 ORES: Open-vocabulary Responsible Visual Synthesis (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.25 The One Generative AI Risk That No One Is Talking About (blog)
8.25 Korea’s Naver joins generative AI race with HyperCLOVA X large language model (blog)
8.25 Can ChatGPT Transform Healthcare? (blog)
8.25 OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.25 Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.25 Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.25 SoTaNa: The Open-Source Software Development Assistant (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.24 Code Llama: Open Foundation Models for Code (), (📖), (📎), (📙), (🏠), (✳️), (SS), (:octocat:GitHub Repo stars)
8.24 Evaluating large language models on medical evidence summarization (npj Digital Medicine volume 6, https://doi.org/10.1038/s41746-023-00896-7)
8.24 Harnessing AI for Psychiatric Use Requires More Nuanced Discussion (blog)
8.24 Use of Artificial Intelligence Chatbots for Cancer Treatment Information (JAMA Oncol. Published online August 24, 2023. doi:10.1001/jamaoncol.2023.2954)
8.24 Prompt2Model: Generating Deployable Models from Natural Language Instructions (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.24 American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.24 Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities (), (📎), (📙), (🏠), (✳️)
8.24 Problems in using LLMs in commercial products (blog)
8.24 Code LLaMA is now on Perplexity’s LLaMa Chat! (tweet), (labs)
8.24 Meta AI released Code Llama, a large language model built on top of Llama 2, fine-tuned for coding & state-of-the-art (tweet), (blog), (paper), (:octocat:GitHub Repo stars), (Model)
8.23 Efficient Benchmarking (of Language Models) (), (📎), (📙), (🏠), (✳️)
8.23 New Study Gives ChatGPT High Marks as a CDS Tool (news)
8.23 OpenAI launched fine-tuning for GPT-3.5 Turbo! Fine-tuning (tweet), (blog)
8.23 Seamless4MT: Massive Multilingual Multimodal Machine Translation (paper), (code), (blog), (demo), (tweet)
8.22 A Survey on Large Language Model based Autonomous Agents (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.22 Comparison of Ophthalmologist and Large Language Model Chatbot Responses to Online Patient Eye Care Questions (JAMA Netw Open. 2023;6(8):e2330320. doi:10.1001/jamanetworkopen.2023.30320)
8.22 Assessing the Utility of ChatGPT Throughout the Entire Clinical Workflow: Development and Usability Study (J Med Internet Res 2023;25:e48659 doi: 10.2196/48659)
8.22 Giraffe - 32K Long Context Open-Source LLMs (tweet), (blog), (Model)
8.22 Language to rewards for robotic skill synthesis (Google blog), (tweet)
8.22 Stabilizing Unsupervised Environment Design with a Learned Adversary (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.21 Giraffe: Adventures in Expanding Context Lengths in LLMs (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.21 TADA! Text to Animatable Digital Avatars (project), (), (📎), (📙), (🏠), (✳️), (tweet)
8.21 Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts (), (📎), (📙), (🏠), (✳️)
8.21 Instruction Tuning for Large Language Models: A Survey (), (📎), (📙), (🏠), (✳️)
8.21 Large Language Models in Hematology Case Solving: A Comparative Study of ChatGPT-3.5, Google Bard, and Microsoft Bing (DOI: 10.7759/cureus.43861 )
8.20 LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models (project), (), (📎), (📙), (🏠), (✳️)
8.19 HumanLiff: Layer-wise 3D Human Generation with Diffusion Model (), (📎), (📙), (🏠), (✳️)
8.19 Meet FraudGPT: The Dark Side Twin of ChatGPT (news)
8.19 AI2 Dolma: 3 Trillion Token Open Corpus for Language Model Pretraining (blog)
8.19 AI2 drops biggest open dataset yet for training language models (TechCrunch news)
8.18 Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies (), (📎), (📙), (🏠), (✳️)
8.18 Graph of Thoughts: Solving Elaborate Problems with Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.18 NYU Langone Health Holds First Generative AI "Prompt-a-Thon" (tweet), (news), (Nature paper)
8.18 Autonomous visual information seeking with large language models (Google blog)
8.18 Mind + Machine: ChatGPT as a Basic Clinical Decisions Support Tool (DOI: 10.7759/cureus.43690)
8.17 Reinforced Self-Training for Language Modeling (), (📎), (📙), (🏠), (✳️)
8.17 Consciousness in Artificial Intelligence: Insights from the Science of Consciousness (), (📎), (📙), (🏠), (✳️)
8.17 OpenAI acquires start-up Global Illumination to work on core products, ChatGPT (Reuters news)
8.16 RAVEN: In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models (), (📎), (📙), (🏠), (✳️)
8.16 Atom-by-atom protein generation and beyond with language models (), (📎), (📙), (🏠), (✳️)
8.16 TeCH: Text-guided Reconstruction of Lifelike Clothed Humans (), (📎), (📙), (🏠), (✳️)
8.16 Open challenges in LLM research (blog)
8.16 Microsoft Introduces Azure ChatGPT: A Private Version of ChatGPT Tailored for the Enterprise (news)
8.15 GOV.UK - Artificial Intelligence for Decarbonisation innovation programme: Stream 3 (announcement)
8.15 Introducing DeciCoder: The New Gold Standard in Efficient and Accurate Code Generation (blog), (project)
8.15 CALYPSO: LLMs as Dungeon Masters' Assistants (), (📎), (📙), (🏠), (✳️)
8.15 CoDeF: Content Deformation Fields for Temporally Consistent Video Processing (project), (Hires Demo), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.15 Link-Context Learning for Multimodal LLMs (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.15 Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification (), (📎), (📙), (🏠), (✳️)
8.15 Teach LLMs to Personalize -- An Approach inspired by Writing Education (), (📎), (📙), (🏠), (✳️)
8.14 LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked (), (📎), (📙), (🏠), (✳️)
8.14 GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text (), (📎), (📙), (🏠), (✳️)
8.14 Chatbots in Drug Discovery: A Case Study on Anti-Cocaine Addiction Drug Development with ChatGPT (), (📎), (📙), (🏠), (✳️)
8.14 Large Language Models for Information Retrieval: A Survey (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.14 Bayesian Flow Networks (), (📎), (📙), (🏠), (✳️)
8.14 Mind your Language (Model): Fact-Checking LLMs and their Role in NLP Research and Practice (), (📎), (📙), (🏠), (✳️)
8.14 The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation (), (📎), (📙), (🏠), (✳️)
8.14 CausalLM is not optimal for in-context learning (), (📎), (📙), (🏠), (✳️)
8.14 OctoPack: Instruction Tuning Code Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.14 SpeechX: Neural Codec Language Model as a Versatile Speech Transformer (), (📎), (📙), (🏠), (✳️)
8.13 What if Generative AI turned out to be a Dud? (blog)
8.13 The most powerful open source instructions dataset: Flan (378 Million samples) (tweet), (HF)
8.13 VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use (), (📎), (📙), (🏠), (✳️)
8.13 IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models (), (📎), (📙), (🏠), (✳️)
8.12 A new solution and concrete implementation steps for Artificial General Intelligence (), (📖), (📎), (📙), (🏠), (✳️)
8.12 AI Town - a virtual town where AI characters live, chat and socialize 🏠💻💌 (:octocat:GitHub Repo stars), (Live Demo)
8.12 GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher (project), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.12 Release the Platypus family of finetuned LLMs (tweet), (project), (paper), (:octocat:GitHub Repo stars)
8.12 Self-Alignment with Instruction Backtranslation (), (📎), (📙), (🏠), (✳️)
8.11 BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents (), (📎), (📙), (🏠), (✳️)
8.11 AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining (), (📎), (📙), (🏠), (✳️)
8.11 Follow Anything: Open-set detection, tracking, and following in real-time (), (📎), (📙), (🏠), (✳️)
8.11 ChatGPT expands its ‘custom instructions’ feature to free users (TechCrunch news)
8.10 The Multi-modality Cell Segmentation Challenge: Towards Universal Solutions (), (📎), (📙), (🏠), (✳️)
8.10 DOD Announces Establishment of Generative AI Task Force (U.S. Department of Defense, Release)
8.10 Metacognitive Prompting Improves Understanding in Large Language Models (), (📎), (📙), (🏠), (✳️)
8.10 Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems (), (📎), (📙), (🏠), (✳️)
8.10 Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment (), (📎), (📙), (🏠), (✳️)
8.10 OpenProteinSet: Training data for structural biology at scale (), (📎), (📙), (🏠), (✳️)
8.9 Inst-Inpaint: Instructing to Remove Objects with Diffusion Models (project), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars), (demo)
8.9 A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology (), (📎), (📙), (🏠), (✳️)
8.9 LLMeBench: A Flexible Framework for Accelerating LLMs Benchmarking (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.9 Extrapolating Large Language Models to Non-English by Aligning Languages (), (📎), (📙), (🏠), (✳️)
8.9 ChatGPT answers more than half of software engineering questions incorrectly (ZDnet (news)
8.9 Releasing Claude Instant 1.2 (Blog)
8.9 Shepherd: A Critic for Language Model Generation (), (📎), (📙), (🏠), (✳️)
8.9 JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (), (📎), (📙), (🏠), (✳️)
8.9 Accelerating LLM Inference with Staged Speculative Decoding (), (📎), (📙), (🏠), (✳️)
8.9 Could a Large Language Model Be Conscious? (news)
8.9 🚀Exciting news! Stability AI has launched StableCode, the revolutionary generative AI LLM for coding! (tweet), (blog)
8.9 New research visualizes the political bias of all major AI language models (tweet)
8.8 Gentopia: A Collaborative Platform for Tool-Augmented LLMs (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.8 AgentSims: An Open-Source Sandbox for Large Language Model Evaluation (), (📎), (📙), (🏠), (✳️)
8.8 Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.8 MedMine: Examining Pre-trained Language Models on Medication Mining (), (📎), (📙), (🏠), (✳️)
8.8 Separate Anything You Describe (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.8 AI regulation is taking shape, but startups are being left out (Verge news)
8.8 Accelerating LLM Inference with Staged Speculative Decoding (), (📎), (📙), (🏠), (✳️)
8.8 3D Gaussian Splatting for Real-Time Radiance Field Rendering (), (📎), (📙), (🏠), (✳️)
8.8 OpenAI launches webcrawler GPTBot, and instructions on how to block it (mashable news)
8.8 FLIRT: Feedback Loop In-context Red Teaming (), (📎), (📙), (🏠), (✳️)
8.8 SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore (), (📎), (📙), (🏠), (✳️)
8.8 Study Tests Large Language Models’ Ability to Answer Clinical Questions (JAMA. 2023;330(6):496. doi:10.1001/jama.2023.12553)
8.8 Why Are So Many Organizations Banning ChatGPT? (BlackBerry Blog)
8.7 Coupling Symbolic Reasoning with Language Modeling for Efficient Longitudinal Understanding of Unstructured Electronic Medical Records (), (📎), (📙), (🏠), (✳️)
8.7 Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.7 Extracting detailed oncologic history and treatment plan from medical oncology notes with large language models (), (📎), (📙), (🏠), (✳️)
8.7 UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition (project), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.7 Studying Large Language Model Generalization with Influence Functions (), (📎), (📙), (🏠), (✳️)
8.7 "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.7 RecycleGPT: An Autoregressive Language Model with Recyclable Module (), (📎), (📙), (🏠), (✳️)
8.7 AgentBench: Evaluating LLMs as Agents (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.7 Simple synthetic data reduces sycophancy in large language models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.7 Creation and Adoption of Large Language Models in Medicine (Jama doi:10.1001/jama.2023.14217)
8.7 Doctors Vs. ChatGPT: Which Is More Empathetic? (Forbes news)
8.7 Criminals Have Created Their Own ChatGPT Clones (Wired news)
8.7 Large Language Models Answer Medical Questions Accurately, but Can’t Match Clinicians’ Knowledge (Jama doi:10.1001/jama.2023.14311)
8.6 Scaling Clinical Trial Matching Using Large Language Models: A Case Study in Oncology (), (📎), (📙), (🏠), (✳️)
8.6 Pre-Trained Large Language Models for Industrial Control (), (📎), (📙), (🏠), (✳️)
8.6 Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.6 A Simple AI Governance Framework In The Age Of ChatGPT (Forbes news)
8.5 ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation (), (📎), (📙), (🏠), (✳️)
8.4 Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization (), (📎), (📙), (🏠), (✳️)
8.4 Who Answers It Better? An In-Depth Analysis of ChatGPT and Stack Overflow Answers to Software Engineering Questions (), (📎), (📙), (🏠), (✳️)
8.4 Towards Generalist Foundation Model for Radiology (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.3 Emergent Analogical Reasoning in Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.3 The Capability of Large Language Models to Measure Psychiatric Functioning (), (📎), (📙), (🏠), (✳️)
8.3 Local Large Language Models for Complex Structured Medical Tasks (), (📎), (📙), (🏠), (✳️)
8.3 Huge set of ChatGPT updates (tweet)
8.3 Accuracy of Vitreoretinal Disease Information From an Artificial Intelligence Chatbot (JAMA Ophthalmology doi: 10.1001/jamaophthalmol.2023.3314)
8.2 XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models (), (📎), (📙), (🏠), (✳️)
8.2 4 Charts That Show Why AI Progress Is Unlikely to Slow Down (Time news)
8.2 Do Multilingual Language Models Think Better in English? (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.2 Exploring the psychology of GPT-4's Moral and Legal Reasoning (), (📎), (📙), (🏠), (✳️)
8.2 DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales (), (📎), (📙), (🏠), (✳️)
8.2 Flows: Building Blocks of Reasoning and Collaborating AI (), (📎), (📙), (🏠), (✳️)
8.1 MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework (), (📖), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.1 Retrieval Augmented Generation and Representative Vector Summarization for large unstructured textual data in Medical Education (), (📎), (📙), (🏠), (✳️)
8.1 MetaGPT: Meta Programming for Multi-Agent Collaborative Framework (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.1 Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models (), (📎), (📙), (🏠), (✳️)
8.1 Upstage LLM #1 in Open LLM Leaderboard (Leaderboard)
8.1 ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
8.1 ChatGPT app for Android is now available in all countries and regions (tweet), (blog)
7.31 LLMs4OL: Large Language Models for Ontology Learning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.31 Plotting Progress in AI (blog)
7.31 Getting from Generative AI to Trustworthy AI: What LLMs might learn from Cyc (), (📎), (📙), (🏠), (✳️)
7.31 Learning to Model the World with Language (), (📎), (📙), (🏠), (✳️)
7.30 Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.30 Unified Model for Image, Video, Audio and Language Tasks (), (📎), (📙), (🏠), (✳️)
7.29 The shaky foundations of large language models and foundation models for electronic health records (npj digital medicine, https://doi.org/10.1038/s41746-023-00879-8), (PDF)
7.29 Uncertainty in Natural Language Generation: From Theory to Applications (), (📎), (📙), (🏠), (✳️)
7.28 Exploring Format Consistency for Instruction Tuning (), (📎), (📙), (🏠), (✳️)
7.28 ⭐ Med-HALT: Medical Domain Hallucination Test for Large Language Models (), (📎), (📙), (🏠), (✳️)
7.28 Med-Flamingo: a Multimodal Medical Few-shot Learner (), (📎), (📙), (🏠), (✳️)
7.28 Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding (), (📎), (📙), (🏠), (✳️)
7.28 How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges (), (📎), (📙), (🏠), (✳️)
7.27 Generative AI for Medical Imaging: extending the MONAI Framework (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.27 Guidance for Authors, Peer Reviewers, and Editors on Use of AI, Language Models, and Chatbots (Jama doi:10.1001/jama.2023.12500)
7.27 Chatbots, Artificial Intelligence, and the Future of Scientific Reporting (JAMA Ophthalmology doi: 10.1001/jamaophthalmol.2023.3344)
7.27 Matching Patients to Clinical Trials with Large Language Models (), (📎), (📙), (🏠), (✳️)
7.27 Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback (), (📎), (📙), (🏠), (✳️)
7.27 Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition (), (📎), (📙), (🏠), (✳️)
7.27 NeurIPS 2023 Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day (site)
7.27 Google DeepMind RT-2: Vision-Language-Action Models (tweet), (blog), (project), (PDF)
7.27 Multilingual Code Co-Evolution Using Large Language Models (), (📎), (📙), (🏠), (✳️)
7.27 The Guardian's updated editorial code guidance now includes a section on generative AI (PDF)
7.27 Training Data Extraction From Pre-trained Language Models: A Survey (report), (PDF)
7.27 ⭐ Universal and Transferable Adversarial Attacks on Aligned Language Models (project), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars), (SS)
7.27 NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection (), (📎), (📙), (🏠), (✳️)
7.27 PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback (), (📎), (📙), (🏠), (✳️)
7.27 WavJourney: Compositional Audio Creation with Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.26 Supporting Open Source and Open Science in the EU AI Act (Blog), (PDF)
7.26 Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation (), (📎), (📙), (🏠), (✳️)
7.26 Tracking Anything in High Quality (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.26 Stability AI Announces Stable Diffusion XL 1.0, Featured on Amazon Bedrock (blog), (SD-XL 1.0-base Model Card), (SD-XL 1.0-refiner Model Card), (:octocat:GitHub Repo stars)
7.26 Towards Generalist Biomedical AI (), (📎), (📙), (🏠), (✳️)
7.26 ⭐ Microsoft, Anthropic, Google, and OpenAI launch Frontier Model Forum (Microsoft), Google, OpenAI, anthropic)
7.26 Evaluating the Moral Beliefs Encoded in LLMs (), (📎), (📙), (🏠), (✳️)
7.26 WebArena: A Realistic Web Environment for Building Autonomous Agents (project), (📎), (:octocat:GitHub Repo stars
7.26 ARB: Advanced Reasoning Benchmark for Large Language Models (), (📎), (📙), (🏠), (✳️)
7.26 OpenAI scuttles AI-written text detector over ‘low rate of accuracy’ (news)
7.25 Foundational Models Defining a New Era in Vision: A Survey and Outlook (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.25 Evaluating Large Language Models for Radiology Natural Language Processing (), (📎), (📙), (🏠), (✳️)
7.25 LLM-Rec: Personalized Recommendation via Prompting Large Language Models (), (📎), (📙), (🏠), (✳️)
7.25 How Can Large Language Models Help Humans in Design and Manufacturing? (), (📎), (📙), (🏠), (✳️)
7.25 UK House of Lords Announces Inquiry into Large Language Models (news)
7.25 FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.25 LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition (), (📎), (📙), (🏠), (✳️)
7.25 ChatGPT is a black box: how AI research can break it open (Nature doi: https://doi.org/10.1038/d41586-023-02366-2)
7.25 ChatGPT broke the Turing test — the race is on for new ways to assess AI (Nature doi: https://doi.org/10.1038/d41586-023-02361-7), (PDF)
7.25 Evaluating the Ripple Effects of Knowledge Editing in Language Models (), (📎), (📙), (🏠), (✳️)
7.25 3D-LLM: Injecting the 3D World into Large Language Models (), (📎), (📙), (🏠), (✳️)
7.25 RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment (), (📎), (📙), (🏠), (✳️)
7.24 A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.24 ⭐ Aligning Large Language Models with Human: A Survey (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.24 A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis (), (📎), (📙), (🏠), (✳️)
7.24 LLMs get a medical education (Nature DOI: 10.1038/d41591-023-00064-0)
7.24 MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised Learning of Motion and Content Features (), (📎), (📙), (🏠), (✳️)
7.24 Interpolating between Images with Diffusion Models (), (📎), (📙), (🏠), (✳️)
7.24 PUMA: Secure Inference of LLaMA-7B in Five Minutes (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.24 A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis (), (📎), (📙), (🏠), (✳️)
7.23 GitHub repo for Generative Agents: Interactive Simulacra of Human Behavior (:octocat:GitHub Repo stars)
7.23 Optimized Network Architectures for Large Language Model Training with Billions of Parameters (), (📎), (📙), (🏠), (✳️)
7.22 A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks (), (📎), (📙), (🏠), (✳️)
7.22 Introducing FreeWilly1 and FreeWilly2 - The latest groundbreaking LLMs from Stability AI's and @carperai lab! ⭐ (tweet)
7.22 llama2-webui: Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac) (:octocat:GitHub Repo stars)
7.22 ChatGPT for Android launches next week (news)
7.22 Expedia launches ChatGPT travel planning tool (news)
7.21 CohortGPT: An Enhanced GPT for Participant Recruitment in Clinical Study (), (📎), (📙), (🏠), (✳️)
7.21 FACT SHEET: Biden-⁠Harris Administration Secures Voluntary Commitments from Leading Artificial Intelligence Companies to Manage the Risks Posed by AI (White House news)
7.21 Prompting Large Language Models with Speech Recognition Abilities (), (📎), (📙), (🏠), (✳️)
7.21 L-Eval: Instituting Standardized Evaluation for Long Context Language Models (), (📎), (📙), (🏠), (✳️)
7.21 CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields (), (📎), (📙), (🏠), (✳️)
7.21 FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance Fields (), (📎), (📙), (🏠), (✳️)
7.21 Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.21 Meet FreeWilly, Our Large And Mighty Instruction Fine-Tuned Models (stability.ai announcement)
7.21 WormGPT: ChatGPT For Cybercriminals (news)
7.21 Brain2Music: Reconstructing Music from Human Brain Activity (), (📎), (📙), (🏠), (✳️)
7.21 OpenAI launches customized instructions for ChatGPT (news)
7.20 L-Eval: Instituting Standardized Evaluation for Long Context Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.20 DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.20 LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs (), (📎), (📙), (🏠), (✳️)
7.20 DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.20 FABRIC: Personalizing Diffusion Models with Iterative Feedback (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.20 ⭐ FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets (), (📎), (📙), (🏠), (✳️)
7.20 Instruction-following Evaluation through Verbalizer Manipulation (), (📎), (📙), (🏠), (✳️)
7.20 PASTA: Pretrained Action-State Transformer Agents (), (📎), (📙), (🏠), (✳️)
7.20 TokenFlow: Consistent Diffusion Features for Consistent Video Editing (), (📎), (📙), (🏠), (✳️)
7.20 ⭐ SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.20 Artificial intelligence is making the union movement’s case–and even ChatGPT knows it (news)
7.20 Apple is testing a ChatGPT-like AI chatbot (news)
7.20 Someone Used ChatGPT to Finish the Game of Thrones Book Series (news)
7.20 ⭐ Meta-Transformer: A Unified Framework for Multimodal Learning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.19 PharmacyGPT: The AI Pharmacist (), (📎), (📙), (🏠), (✳️)
7.19 IvyGPT: InteractiVe Chinese pathwaY language model in medical domain (), (📎), (📙), (🏠), (✳️)
7.19 Study Tests Large Language Models’ Ability to Answer Clinical Questions (Jama doi: 10.1001/jama.2023.12553)
7.19 (Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.19 Text2Layer: Layered Image Generation using Latent Diffusion Model (), (📎), (📙), (🏠), (✳️)
7.19 Towards A Unified Agent with Foundation Models (), (📎), (📙), (🏠), (✳️)
7.19 ⭐ Challenges and Applications of Large Language Models (), (📎), (📙), (🏠), (✳️)
7.19 On the Origin of LLMs: An Evolutionary Tree and Graph for 15,821 Large Language Models (), (📎), (📙), (🏠), (✳️), (Constellation)
7.18 Augmenting CLIP with Improved Visio-Linguistic Reasoning (), (📎), (📙), (🏠), (✳️)
7.18 How generative AI will reshape the enterprise (report)
7.18 How is ChatGPT's behavior changing over time? (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.18 NU-MCC: Multiview Compressive Coding with Neighborhood Decoder and Repulsive UDF (), (📎), (📙), (🏠), (✳️)
7.18 Measuring Faithfulness in Chain-of-Thought Reasoning (PDF)
7.18 🦙 Llama 2 and Claude 2 are now live on Chatbot Arena! (arena)
7.18 Statement of Support for Meta’s Open Approach to Today’s AI (blog)
7.18 Llama 2: Open Foundation and Fine-Tuned Chat Models (paper), (PDF), (:octocat:GitHub Repo stars)
7.18 Meta and Microsoft Introduce the Next Generation of Llama (tweet), (news), (Llama2), (download)
7.18 Retentive Network: A Successor to Transformer for Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.18 Diffusion Models Beat GANs on Image Classification (), (📎), (📙), (🏠), (✳️)
7.18 BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs (), (📎), (📙), (🏠), (✳️)
7.18 TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT (), (📎), (📙), (🏠), (✳️)
7.17 Abductive Reasoning with the GPT-4 Language Model: Case studies from criminal investigation, medical practice, scientific research (), (📎), (📙), (🏠), (✳️)
7.17 Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination (Jama doi: 10.1001/jamapediatrics.2023.2373)
7.17 Large language models in medicine (nature medicine https://doi.org/10.1038/s41591-023-02448-8)
7.17 Chatbot vs Medical Student Performance on Free-Response Clinical Reasoning Examinations (JAMA, doi:10.1001/jamainternmed.2023.2909)
7.17 AlpaGasus: Training A Better Alpaca with Fewer Data (), (📎), (📙), (🏠), (✳️)
7.16 Communicative Agents for Software Development (), (📎), (📙), (🏠), (✳️)
7.16 Planting a SEED of Vision in Large Language Model (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.15 DreamTeacher: Pretraining Image Backbones with Deep Generative Models (), (📎), (📙), (🏠), (✳️)
7.15 INVE: Interactive Neural Video Editing (), (📎), (📙), (🏠), (✳️)
7.14 Are Large Language Models a Threat to Digital Public Goods? Evidence from Activity on Stack Overflow (), (📎), (📙), (🏠), (✳️)
7.14 China takes major step in regulating generative AI services like ChatGPT (news), (生成式人工智能服务管理暂行办法)
7.14 What Happens When You Ask a Chinese Chatbot About Taiwan? (news)
7.14 In-context Autoencoder for Context Compression in a Large Language Model (), (📎), (📙), (🏠), (✳️)
7.14 Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts (), (📎), (📙), (🏠), (✳️)
7.14 Learning to Retrieve In-Context Examples for Large Language Models (), (📎), (📙), (🏠), (✳️)
7.14 Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation (), (📎), (📙), (🏠), (✳️)
7.14 CoTracker: It is Better to Track Together (), (📎), (📙), (🏠), (✳️)
7.14 The Practical Guides for Large Language Models (:octocat:GitHub Repo stars)
7.14 Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning (paper), (PDF)
7.14 Introducing CM3leon, a more efficient, state-of-the-art generative model for text and images (blog)
7.14 Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.14 Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models (), (📎), (📙), (🏠), (✳️)
7.14 Generating Benchmarks for Factuality Evaluation of Language Models (), (📎), (📙), (🏠), (✳️)
7.13 F.T.C. Opens Investigation Into ChatGPT Maker Over Technology’s Potential Harms (news)
7.13 Instruction Mining: High-Quality Instruction Data Selection for Large Language Models (), (📎), (📙), (🏠), (✳️)
7.13 Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution (), (📎), (📙), (🏠), (✳️)
7.13 T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.13 Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events (), (📎), (📙), (🏠), (✳️)
7.13 DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations (), (📎), (📙), (🏠), (✳️)
7.13 Copy Is All You Need (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.13 AniFaceDrawing: Anime Portrait Exploration during Your Sketching (), (📎), (📙), (🏠), (✳️)
7.13 HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models (project), (), (📎), (📙), (🏠), (✳️)
7.13 Stability AI releases Stable Doodle, a sketch-to-image tool (news), (announcement)
7.12 Efficient 3D Articulated Human Generation with Layered Surface Volumes (), (📎), (📙), (🏠), (✳️)
7.12 SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Task Planning (), (📎), (📙), (🏠), (✳️)
7.12 Stack More Layers Differently: High-Rank Training Through Low-Rank Updates (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.12 PolyLM: An Open Source Polyglot Large Language Model (), (📎), (📙), (🏠), (✳️)
7.12 Today we announce the formation of xAI (announcement)
7.12 Large language models encode clinical knowledge (Nature, https://doi.org/10.1038/s41586-023-06291-2), (PDF)
7.12 Google's NotebookLM (waitlist)
7.12 27% of jobs at high risk from AI revolution, says OECD (news)
7.12 Objaverse-XL: A Universe of 10M+ 3D Objects (PDF)
7.12 EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone (), (📎), (📙), (🏠), (✳️)
7.12 Differentiable Blocks World: Qualitative 3D Decomposition by Rendering Primitives (), (📎), (📙), (🏠), (✳️)
7.11 AmadeusGPT: a natural language interface for interactive animal behavioral analysis (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.11 3 principles for regulatory-grade large language model application (CIO news)
7.11 Generative Pretraining in Multimodality (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.11 DNAGPT: A Generalized Pretrained Tool for Multiple DNA Sequence Analysis Tasks (), (📎), (📙), (🏠), (✳️)
7.11 VampNet: Music Generation via Masked Acoustic Token Modeling (), (📎), (📙), (🏠), (✳️)
7.11 Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement (), (📎), (📙), (🏠), (✳️)
7.11 International Institutions for Advanced AI (), (📎), (📙), (🏠), (✳️)
7.11 Semantic-SAM: Segment and Recognize Anything at Any Granularity (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.11 AI tools are designing entirely new proteins that could transform medicine (Nature, doi: https://doi.org/10.1038/d41586-023-02227-y), (PDF)
7.11 Shutterstock expands deal with OpenAI to build generative AI tools (news)
7.11 Generative Pretraining in Multimodality (), (📎), (📙), (🏠), (✳️)
7.11 Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration (), (📎), (📙), (🏠), (✳️)
7.11 Secrets of RLHF in Large Language Models Part I: PPO (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.11 Anthropic's Claude-2 was just released (blog), (claude)
7.11 Large Language Models as General Pattern Machines (), (📎), (📙), (🏠), (✳️)
7.10 Self-Diagnosis and Large Language Models: A New Front for Medical Misinformation (), (📎), (📙), (🏠), (✳️)
7.10 Google is testing its medical AI chatbot at the Mayo Clinic (news)
7.10 RLTF: Reinforcement Learning from Unit Test Feedback (), (📎), (📙), (🏠), (✳️)
7.10 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.10 GPT Researcher - GPT based autonomous agent that does online comprehensive research on any given topic (:octocat:GitHub Repo stars)
7.9 Chapyter: ChatGPT Code Interpreter in Jupyter Notebooks (:octocat:GitHub Repo stars)
7.9 DragGAN - Drag Your GAN - Face Inversion: Interactive Point-based Manipulation on the Generative Image Manifold (tweet), (HF demo)
7.8 Opening up ChatGPT: Tracking openness, transparency, and accountability in instruction-tuned text generators (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.8 Google’s medical AI chatbot is already being tested in hospitals (news)
7.8 Large Language Models for Supply Chain Optimization (), (📎), (📙), (🏠), (✳️)
7.8 Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation (), (📎), (📙), (🏠), (✳️)
7.8 AutoDecoding Latent 3D Diffusion Models (), (📎), (📙), (🏠), (✳️)
7.8 Awesome Generative AI Techniques: a curated list of Generative AI Techniques (:octocat:GitHub Repo stars)
7.8 Robots say they won't steal jobs, rebel against humans (news)
7.7 CheXmask: a large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.7 Teaching Arithmetic to Small Transformers (), (📎), (📙), (🏠), (✳️)
7.7 GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest (), (📎), (📙), (🏠), (✳️)
7.7 Lost in the Middle: How Language Models Use Long Contexts (), (📎), (📙), (🏠), (✳️)
7.6 What Should Data Science Education Do with Large Language Models? (), (📎), (📙), (🏠), (✳️)
7.6 A.I. Will Change Medicine but Not What It Means to Be a Doctor (NYT, news)
7.6 Frontier AI Regulation: Managing Emerging Risks to Public Safety (), (📎), (📙), (🏠), (✳️)
7.6 The imperative for regulatory oversight of large language models (or generative AI) in healthcare (npj Digital Medicine, https://doi.org/10.1038/s41746-023-00873-0), (PDF)
7.6 OpenAI launches ChatGTP code interpreter for better coding using only natural language (tweet), (blog), (news)
7.6 Jailbroken: How Does LLM Safety Training Fail? (), (📎), (📙), (🏠), (✳️)
7.6 Building Cooperative Embodied Agents Modularly with Large Language Models (), (📎), (📙), (🏠), (✳️)
7.6 What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? (), (📎), (📙), (🏠), (✳️)
7.6 DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.6 Elastic Decision Transformer (), (📎), (📙), (🏠), (✳️)
7.6 Releasing 🚀 CodeGen2.5 🚀, a small but mighty LLM for code (tweet), (blog), (:octocat:GitHub Repo stars)
7.6 Training Models to Generate, Recognize, and Reframe Unhelpful Thoughts (), (📎), (📙), (🏠), (✳️)
7.6 Lost in the Middle: How Language Models Use Long Contexts (), (📎), (📙), (🏠), (✳️)
7.6 Artificial Intelligence in Clinical Diagnosis Opportunities, Challenges, and Hype (JAMA, doi:10.1001/jama.2023.11440)
7.6 AI Chatbots, Health Privacy, and Challenges to HIPAA Compliance (JAMA, doi:10.1001/jama.2023.9458)
7.6 Health Care Privacy Risks of AI Chatbots (JAMA, doi:10.1001/jama.2023.9618)
7.6 Generative AI in Health Care and Liability Risks for Physicians and Safety Concerns for Patients (JAMA, doi:10.1001/jama.2023.9630)
7.6 The Challenges for Regulating Medical Use of ChatGPT and Other Large Language Models (JAMA, doi:10.1001/jama.2023.9651)
7.6 A Survey on Evaluation of Large Language Models (), (📎), (📙), (🏠), (✳️), (SS)
7.5 Collaborative Score Distillation for Consistent Visual Synthesis (), (📎), (📙), (🏠), (✳️)
7.5 Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning (), (📎), (📙), (🏠), (✳️)
7.5 OpenAI - Introducing Superalignment (blog)
7.5 Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks (), (📎), (📙), (🏠), (✳️)
7.5 Embodied Task Planning with Large Language Models (), (📎), (📙), (🏠), (✳️)
7.5 Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.5 Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners (), (📎), (📙), (🏠), (✳️)
7.5 Physics-based Motion Retargeting from Sparse Inputs (), (📎), (📙), (🏠), (✳️)
7.5 MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers (), (📎), (📙), (🏠), (✳️)
7.5 Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks (), (📎), (📙), (🏠), (✳️)
7.5 All about the generative tasks in the Generative Medical AI (blog)
7.5 LongNet: Scaling Transformers to 1,000,000,000 Tokens (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.4 PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic Dialogue Convert Patient Dialogues to Medical Records (), (📎), (📙), (🏠), (✳️)
7.4 A ChatGPT Aided Explainable Framework for Zero-Shot Medical Image Diagnosis (), (📎), (📙), (🏠), (✳️)
7.4 Segment Anything Meets Point Tracking (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.4 Career Essentials in Generative AI by Microsoft and LinkedIn (learning)
7.4 SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis (PDF), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.4 Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning (), (📎), (📙), (🏠), (✳️)
7.3 Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.3 EmoGen: Eliminating Subjective Bias in Emotional Music Generation (), (📎), (📙), (🏠), (✳️)
7.3 SketchMetaFace: A Learning-based Sketching Interface for High-fidelity 3D Character Face Modeling (), (📎), (📙), (🏠), (✳️)
7.2 LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance (), (📎), (📙), (🏠), (✳️), (demo)
7.1 Global Mental Health Services and the Impact of Artificial Intelligence–Powered Large Language Models (Jama doi:10.1001/jamapediatrics.2023.2373)
7.1 Personality Traits in Large Language Models (), (📎), (📙), (🏠), (✳️)
7.1 DisCo: Disentangled Control for Referring Human Dance Generation in Real World (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
7.1 BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer (), (📎), (📙), (🏠), (✳️)
7.1 Improve ChatGPT with Knowledge Graphs (blog)
7.1 The Rise of the AI Engineer (Blog)
6.30 DrugGPT: A GPT-based Strategy for Designing Potential Ligands Targeting Specific Proteins (), (📎)
6.30 Doctor Chatbot: The EUʼs Regulatory Prescription for Generative Medical AI (Oslo Law Review, https://doi.org/10.18261/olr.10.1.1), (PDF)
6.30 Preference Ranking Optimization for Human Alignment (), (📎), (📙), (🏠), (✳️)
6.30 Reliability of Medical Information Provided by ChatGPT: Assessment Against Clinical Guidelines and Patient Information Quality Instrument (JMIR, doi: 10.2196/47479), (PDF)
6.30 Large language model AI chatbots require approval as medical devices (Nature Medicine, https://doi.org/10.1038/s41591-023-02412-6)
6.30 LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding (), (📎), (📙), (🏠), (✳️)
6.30 Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors (), (📎), (📙), (🏠), (✳️)
6.30 Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation (), (📎), (📙), (🏠), (✳️)
6.30 Generate Anything Anywhere in Any Scene (), (📎), (📙), (🏠), (✳️)
6.30 Benchmarking Large Language Model Capabilities for Conditional Generation (), (📎), (📙), (🏠), (✳️)
6.29 End-to-end Autonomous Driving: Challenges and Frontiers (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.29 UMASS_BioNLP at MEDIQA-Chat 2023: Can LLMs generate high-quality synthetic note-oriented doctor-patient conversations? (), (📎), (📙), (🏠), (✳️)
6.29 ⭐ A Survey of Large Language Models - version 11 (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars), (SS)
6.29 June 2023, A Stage Review of Instruction Tuning (notion)
6.29 Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.29 Towards Measuring the Representation of Subjective Global Opinions in Language Models (), (📎), (📙), (🏠), (✳️)
6.29 REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction (), (📎), (📙), (🏠), (✳️)
6.29 One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization (), (📎), (📙), (🏠), (✳️)
6.29 DreamDiffusion: Generating High-Quality Images from Brain EEG Signals (), (📎), (📙), (🏠), (✳️)
6.28 Regulations to govern use of AI in health records could come later this year (news)
6.28 On the Exploitability of Instruction Tuning (), (📎), (📙), (🏠), (✳️)
6.28 ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.28 RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model (), (📎), (📙), (🏠), (✳️), (demo)
6.28 Extending Context Window of Large Language Models via Positional Interpolation (), (📎), (📙), (🏠), (✳️)
6.28 CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a \10,000 Budget; An Extra 4,000 Unlocks 81.8% Accuracy (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.28 Automatic Calibration and Error Correction for Large Language Models via Pareto Optimal Self-Supervision (), (📎), (📙), (🏠), (✳️
6.28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment (project), (📎), (:octocat:GitHub Repo stars)
6.28 BrainGPT - A Large Language Model tool to assist neuroscientific research (home)
6.28 Toward Actionable Generative AI - LAMs: From Large Language Models to Large Action Models (blog)
6.28 The official #DragGAN app and code (tweet), (application), (:octocat:GitHub Repo stars)
6.27 Introducing ERNIE 3.5: Baidu’s Knowledge-Enhanced Foundation Model Takes a Giant Leap Forward (blog)
6.27 Beyond the Hype: Assessing the Performance, Trustworthiness, and Clinical Suitability of GPT3.5 (), (📎), (📙), (🏠), (✳️)
6.27 Vision Augmented Language Models: Computer vision through the LENS of natural language (blog), (demo), (:octocat:GitHub Repo stars)
6.27 Restart Sampling for Improving Generative Processes (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.27 3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.27 MIMIC: Masked Image Modeling with Image Correspondences (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.27 LeanDojo: Theorem Proving with Retrieval-Augmented Language Models (project), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.27 Any Image to 3D (blog)
6.27 ⭐️LangChain Integrations⭐️ Hub (link)
6.27 MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion (project), (demo)
6.27 Extending Context Window of Large Language Models via Positional Interpolation (), (📎), (📙), (🏠), (✳️)
6.27 Salesforce open-source LLMs with 8k sequence length - Xgen 7B (tweet), (blog), (:octocat:GitHub Repo stars)
6.27 Embracing change and resetting expectations (blog)
6.27 Baby steps in evaluating the capacities of large language models (Nature Reviews Psychology, https://doi.org/10.1038/s44159-023-00211-x), (preview)
6.26 MedLSAM: Localize and Segment Anything Model for 3D Medical Images (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.26 MotionGPT: Human Motion as a Foreign Language (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.26 Faster Segment Anything: Towards Lightweight SAM for Mobile Applications (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.26 Aligning Large Multi-Modal Model with Robust Instruction Tuning (), (📎), (📙), (🏠), (✳️)
6.26 InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback (project), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.26 LongCoder: A Long-Range Pre-trained Language Model for Code Completion (), (📎), (📙), (🏠), (✳️)
6.26 Kosmos-2: Grounding Multimodal Large Language Models to the World (), (📎), (📙), (🏠), (✳️)
6.26 ViNT: A Foundation Model for Visual Navigation (project), (), (📎), (📙), (🏠), (✳️), (video)
6.26 DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing (), (📎), (📙), (🏠), (✳️)
6.25 Generative AI — LLMOps Architecture Patterns (blog)
6.25 DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data (), (📎), (📙), (🏠), (✳️)
6.25 H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models (), (📎), (📙), (🏠), (✳️)
6.25 Thinking Like an Annotator: Generation of Dataset Labeling Instructions (), (📎), (📙), (🏠), (✳️)
6.25 Language models are weak learners (), (📎), (📙), (🏠), (✳️)
6.25 Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning (), (📎), (📙), (🏠), (✳️)
6.25 Chat with Hacker News in real-time using natural language (demo)
6.24 Zero-shot spatial layout conditioning for text-to-image diffusion models (), (📎), (📙), (🏠), (✳️)
6.24 Beyond Scale: the Diversity Coefficient as a Data Quality Metric Demonstrates LLMs are Pre-trained on Formally Diverse Data (), (📎), (📙), (🏠), (✳️)
6.24 On the paper “Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models” (MIT)
6.24 A critical analysis of “Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models” (blog)
6.24 System-Level Natural Language Feedback (), (📎), (📙), (🏠), (✳️)
6.24 OpenMask3D: Open-Vocabulary 3D Instance Segmentation (), (📎), (📙), (🏠), (✳️)
6.24 Scaling MLPs: A Tale of Inductive Bias (), (📎), (📙), (🏠), (✳️)
6.23 MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.23 What's going on with the Open LLM Leaderboard? (blog)
6.23 A Survey on Multimodal Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.23 LLM Powered Autonomous Agents (blog)
6.23 DreamEditor: Text-Driven 3D Scene Editing with Neural Fields (), (📎), (📙), (🏠), (✳️)
6.23 Long-range Language Modeling with Self-retrieval (), (📎), (📙), (🏠), (✳️)
6.23 Bring Your Own Data! Self-Supervised Evaluation for Large Language Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.22 reliableGPT: Stop OpenAI Errors in Production (:octocat:GitHub Repo stars)
6.22 Lit-GPT : Implementation of Falcon, StableLM, Pythia, INCITE language models based on nanoGPT (:octocat:GitHub Repo stars)
6.22 Perspective Fields for Single Image Camera Calibration (project page), (video), (demo), (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars), (CVPR 2023)
6.22 Event Stream GPT (ESGPT), for "event stream" datasets, particularly Electronic Health Record (EHR) datasets (tweet), (:octocat:GitHub Repo stars)
6.22 MPT-30B is here (tweet), (blog), (HF), (MosaicML MPT-30B-Chat)
6.22 How continuous batching enables 23x throughput in LLM inference while reducing p50 latency (blog)
6.22 DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation (), (📎), (📙), (🏠), (✳️)
6.22 Stability AI launches SDXL 0.9: A Leap Forward in AI Image Generation (news)
6.21 ChatGPT Poses New Regulatory Questions for FDA, Medical Industry (Bloomber news), Youtube)
6.21 Understanding Social Reasoning in Language Models with Language Models (), (📎), (📙), (🏠), (✳️)
6.21 Opportunities and Risks of LLMs for Scalable Deliberation with Polis (), (📎), (📙), (🏠), (✳️)
6.21 Training Transformers with 4-bit Integers (), (📎), (📙), (🏠), (✳️)
6.21 Fast Segment Anything (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.21 DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models (), (📎), (📙), (🏠), (✳️)
6.20 Visual Foundation Models for Medical Image Analysis (blog)
6.20 Learning to Generate Better Than Your LLM (), (📎), (📙), (🏠), (✳️)
6.20 Sound reconstruction from human brain activity via a generative model with brain-like auditory features (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.20 A Simple and Effective Pruning Approach for Large Language Models (), (📎), (📙), (🏠), (:eight_spoked_asterisk:, (:octocat:GitHub Repo stars)
6.20 Radiology Report Expert Evaluation (ReXVal) Dataset (PhysioNet https://doi.org/10.13026/2fp8-qr71)
6.20 RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.20 Segment Anything Model (SAM) for Radiation Oncology (), (📎), (📙), (🏠), (✳️)
6.20 RepoFusion: Training Code Models to Understand Your Repository (), (📎), (📙), (🏠), (✳️)
6.20 Textbooks Are All You Need (), (📎), (📙), (🏠), (✳️)
6.19 Path to Medical AGI: Unify Domain-specific Medical LLMs with the Lowest Cost (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.19 CounselGPT - Korean psychological counseling dataset (:octocat:GitHub Repo stars)
6.19 MotionGPT: Finetuned LLMs are General-Purpose Motion Generators (), (📎), (📙), (🏠), (✳️)
6.18 Point-Cloud Completion with Pretrained Text-to-image Diffusion Models (), (📎), (📙), (🏠), (✳️)
6.18 Mercedes-Benz Installs ChatGPT Artificial Intelligence in 900,000 Cars (Newsweek), (Mercedes Benz)
6.18 OpenLLaMA-13B released (tweet), (:octocat:GitHub Repo stars)
6.17 Generation of Radiology Findings in Chest X-Ray by Leveraging Collaborative Knowledge (), (📎), (📙), (🏠), (✳️)
6.17 Beware of Unreliable Data in Model Evaluation: A LLM Prompt Selection case study with Flan-T5 (blog)
6.17 GPT Engineer - specify what you want it to build, the AI asks for clarification, and then builds it (:octocat:GitHub Repo stars)
6.17 Demystifying GPT Self-Repair for Code Generation (), (📎), (📙), (🏠), (✳️)
6.17 Introducing GAIA-1: A Cutting-Edge Generative AI Model for Autonomy (blog)
6.17 Understanding Encoder And Decoder LLMs (blog)
6.16 Evaluating Superhuman Models with Consistency Checks (), (📎), (📙), (🏠), (✳️)
6.16 AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation (project), (), (📎), (📙), (🏠), (✳️)
6.16 Gradient is All You Need? (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.16 LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.16 AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology (), (📎), (📙), (🏠), (✳️)
6.16 Meta - Introducing Voicebox: The Most Versatile AI for Speech Generation (news)
6.16 Explore, Establish, Exploit: Red Teaming Language Models from Scratch (), (📎), (📙), (🏠), (✳️)
6.16 Full Parameter Fine-tuning for Large Language Models with Limited Resources (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.16 ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data and Comprehensive Evaluation (), (📎), (📙), (🏠), (✳️)
6.16 CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models (), (📎), (📙), (🏠), (✳️)
6.16 Language-Guided Music Recommendation for Video via Prompt Analogies (), (📎), (📙), (🏠), (✳️)
6.16 QR Code AI Art Generator (tweet), (Hugging face), (SD art)
6.16 Standford CRFM - Transparency Index for Foundation Model Provider's Compliance measurement with the Draft EU AI Act (tweet), (:octocat:GitHub Repo stars)
6.16 The economic potential of generative AI: The next productivity frontier (McKinsey & Company. report)
6.15 Med-MMHL: A Multi-Modal Dataset for Detecting Human- and LLM-Generated Misinformation in the Medical Domain (), (📎), (📙), (🏠), (✳️)
6.15 Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health (), (📎), (📙), (🏠), (✳️), (SS)
6.15 Introducing the ElevenLabs AI Speech Classifier: Elevating Safety Standards for AI-generated Audio Content (news)
6.15 ChatGPT AI Shines in Challenging Medical Cases (news)
6.15 Accuracy of a Generative Artificial Intelligence Model in a Complex Diagnostic Challenge (JAMA doi:10.1001/jama.2023.8288)
6.15 LOVM: Language-Only Vision Model Selection (), (📎), (📙), (🏠), (✳️)
6.15 WizardCoder: Empowering Code Large Language Models with Evol-Instruct (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.15 Segment Any Point Cloud Sequences by Distilling Vision Foundation Models (), (📎), (📙), (🏠), (✳️), (:octocat:GitHub Repo stars)
6.15 Seeing the World through Your Eyes (), (📎), (📙), (🏠), (✳️)
6.15 Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models (), (📎), (📙), (🏠), (✳️)
6.15 Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students via Theory of Mind (), (📎), (📙), ([:house:](https://huggingfac

genai_llm_timeline's People

Contributors

hollobit avatar bcho avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.