monica_b1998@lemmy.world to Artificial Intelligence @lemmy.sdf.org · English · 4 days ago

Measuring AI Ability to Complete Long Tasks

metr.org

We propose measuring AI performance in terms of the *length* of tasks AI agents can complete. We show that this metric has been consistently exponentially increasing over the past 6 years, with a doubling time of around 7 months. Extrapolating this trend predicts that, in under a decade, we will see AI agents that can independently complete a large fraction of software tasks that currently take humans days or weeks.
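A rough way to sanity-check the extrapolation in the abstract: the sketch below assumes a clean exponential with the quoted 7-month doubling time. The function names and the starting and target horizons in the usage note are hypothetical and chosen only for illustration; none of them come from the linked article.

```python
import math

# Doubling time quoted in the abstract (months). Everything else here is an
# illustrative assumption, not a figure from the linked article.
DOUBLING_MONTHS = 7.0


def growth_factor(months: float, doubling_months: float = DOUBLING_MONTHS) -> float:
    """Multiplicative growth in task length after `months`, assuming a pure exponential trend."""
    return 2.0 ** (months / doubling_months)


def months_to_reach(start_hours: float, target_hours: float,
                    doubling_months: float = DOUBLING_MONTHS) -> float:
    """Months needed for the task-length horizon to grow from start_hours to target_hours."""
    if start_hours <= 0 or target_hours <= 0:
        raise ValueError("horizons must be positive")
    return doubling_months * math.log2(target_hours / start_hours)


if __name__ == "__main__":
    # Growth implied over the 6 years (72 months) mentioned in the abstract.
    print(f"growth over 72 months: x{growth_factor(72):.0f}")
    # Hypothetical example: going from a 1-hour horizon to an 8-hour one.
    print(f"1h -> 8h takes {months_to_reach(1, 8):.0f} months")
```

Under these assumptions, three doublings (1 hour to 8 hours) take 21 months, and 72 months of steady doubling multiplies the horizon by a bit over a thousand. Whether the trend actually holds that long is a separate question.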
  • Zachariah@lemmy.world
    4 days ago

    …successfully and accurately?

    • ImgurRefugee114@reddthat.com
      4 days ago

      No <3

Artificial Intelligence @lemmy.sdf.org

artificialintelligence@lemmy.sdf.org


Chat about and share AI stuff

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 4 users / week
  • 5 users / month
  • 13 users / 6 months
  • 0 local subscribers
  • 261 subscribers
  • 61 Posts
  • 9 Comments
  • mods: Pokey@lemmy.sdf.org