quester-one / smartplay Goto Github PK
View Code? Open in Web Editor NEWThis project forked from microsoft/smartplay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.
License: Creative Commons Attribution 4.0 International