Coder Social home page Coder Social logo

Comments (5)

mali-nuist avatar mali-nuist commented on May 20, 2024 3

放出来厂商就可以作弊了 lol

from superclue.

stenlylee avatar stenlylee commented on May 20, 2024 3

看到人类得分那么高,就知道这个项目不靠谱。

from superclue.

littlepan0413 avatar littlepan0413 commented on May 20, 2024

同文+1,具体的题目数量有多少呢

from superclue.

qiangmzsx avatar qiangmzsx commented on May 20, 2024

期待把每一期的题目公布出来,大家一起共创。

from superclue.

brightmart avatar brightmart commented on May 20, 2024

1)我看到基础能力评测中人类各项分数都接近100分,是不是题目出的太少太简单? 2) 项目上说一共三个人用投票机制,作为人类的分数,请问是什么水平的人类?另外三个人是否太少~ 3)尤其是代码能力方面 以我自己使用的体验 gpt-4 写代码能力很强 而且属于全栈 ,各种语言都会一些,这个应该没人能达到吧。但是这个评测中人类、gpt-4、gpt-3.5-turbo分数一样,是否题目的区分度还不够

1)当前报告的分数是采用开卷形式做题目的分数,所以结果比较高。我们也计划报告一下闭卷形式的分数。
2)人类的水平是本科生、研究生的水平
3)代码生成方面gpt-4还是很强的。只是我们的题目是客观题,而不是纯生成题,所以gpt-4强大的生成能力,可能没有那么明显。

from superclue.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.