ATProto Browser

Experimental browser for the Atmosphere

Post

At a high level, our method is simple: 1. We ask both skilled humans and AI systems to attempt tasks in similar conditions. 2. We measure how long the humans take. 3. We then measure how AI success rates vary depending on how long the humans took to do those tasks.

Mar 19, 2025, 5:43 PM
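
The third step in the post above (relating AI success rates to how long humans took) can be sketched as a simple bucketing computation. This is only an illustration, not the authors' actual analysis: the function name `success_rate_by_human_time`, the bucket edges, and the input shape of `(human_minutes, ai_succeeded)` pairs are all assumptions made for this example.

```python
from collections import defaultdict


def success_rate_by_human_time(results, bucket_edges):
    """Group tasks by human completion time and compute the AI success rate per group.

    results: iterable of (human_minutes, ai_succeeded) pairs (hypothetical input shape).
    bucket_edges: ascending upper bounds, in minutes, for each bucket (assumed values).
    """
    counts = defaultdict(lambda: [0, 0])  # label -> [successes, attempts]
    for human_minutes, ai_succeeded in results:
        # Pick the first bucket whose upper bound covers this task's human time;
        # anything longer than the last edge falls into an overflow bucket.
        label = next(
            (f"<= {edge} min" for edge in bucket_edges if human_minutes <= edge),
            f"> {bucket_edges[-1]} min",
        )
        counts[label][0] += int(ai_succeeded)
        counts[label][1] += 1
    return {label: wins / total for label, (wins, total) in counts.items()}
```

Called with a list of per-task outcomes and edges such as `[4, 60]`, this returns one success rate per time bucket, which is the quantity the post describes comparing across task lengths.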


Record data

{
  "uri": "at://did:plc:oky5czdrnfjpqslsw2a5iclo/app.bsky.feed.like/3lohaigcimg2t",
  "cid": "bafyreihm6a5wjdlgqwnuofy33ygx5u3hrbscfzw54avespzrba75gpof2m",
  "value": {
    "$type": "app.bsky.feed.like",
    "subject": {
      "cid": "bafyreidt7e6rmonj4g7677x2bitmkknnukwzs3muwwkovbblza66tchisu",
      "uri": "at://did:plc:dll3hepzq76nymel5c3yt6nk/app.bsky.feed.post/3lkqqb5sk2s2m"
    },
    "createdAt": "2025-05-05T20:47:46.387Z"
  }
}
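
The `uri` fields in the record above follow the AT URI layout `at://<authority>/<collection>/<record key>`. As a minimal sketch, assuming only that three-part form (no query or fragment handling), the pieces can be pulled apart as follows. The names `AtUri` and `parse_at_uri` are hypothetical helpers written for this example, not part of any SDK.

```python
from typing import NamedTuple


class AtUri(NamedTuple):
    authority: str   # the repo owner, here a DID such as did:plc:...
    collection: str  # the record type (NSID), e.g. app.bsky.feed.like
    rkey: str        # the record key within that collection


def parse_at_uri(uri: str) -> AtUri:
    """Split a record-level AT URI into authority, collection, and record key."""
    prefix = "at://"
    if not uri.startswith(prefix):
        raise ValueError(f"not an AT URI: {uri!r}")
    parts = uri[len(prefix):].split("/")
    # This sketch only accepts full record URIs with exactly three non-empty segments.
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected at://authority/collection/rkey, got {uri!r}")
    return AtUri(*parts)
```

Applied to this record, the outer `uri` yields the collection `app.bsky.feed.like`, while the `subject.uri` yields `app.bsky.feed.post`, which shows that the like record points at a post in a different repository.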