Experimental browser for the Atmosphere
Loading post...
{ "uri": "at://did:plc:oky5czdrnfjpqslsw2a5iclo/app.bsky.feed.like/3lohaigcimg2t", "cid": "bafyreihm6a5wjdlgqwnuofy33ygx5u3hrbscfzw54avespzrba75gpof2m", "value": { "$type": "app.bsky.feed.like", "subject": { "cid": "bafyreidt7e6rmonj4g7677x2bitmkknnukwzs3muwwkovbblza66tchisu", "uri": "at://did:plc:dll3hepzq76nymel5c3yt6nk/app.bsky.feed.post/3lkqqb5sk2s2m" }, "createdAt": "2025-05-05T20:47:46.387Z" } }
At a high level, our method is simple: 1. We ask both skilled humans and AI systems to attempt tasks in similar conditions. 2. We measure how long the humans take. 3. We then measure how AI success rates vary depending on how long the humans took to do those tasks.
Mar 19, 2025, 5:43 PM