Experimental browser for the Atmosphere
{
  "uri": "at://did:plc:johrx4zvjlf3mpovalc733zg/app.bsky.feed.like/3loj4uvevti2h",
  "cid": "bafyreig3r3rkfuo77p3lwo4tr4zkqsyurv32nuip2e2dr7dop6aq7vghtm",
  "value": {
    "$type": "app.bsky.feed.like",
    "subject": {
      "cid": "bafyreicqejkjea7bwswh36wm6nu7lqpsca3qmk33mvpvryiamurrk54n6i",
      "uri": "at://did:plc:evvussoazdkvsld475dfbuci/app.bsky.feed.post/3loizm7whl22b"
    },
    "createdAt": "2025-05-06T14:48:31.679Z"
  }
}
Delighted to see the BigGen Bench paper receive the best paper award at NAACL 2025! BigGen Bench introduces fine-grained, scalable, & human-aligned evaluations: 77 hard, diverse tasks; 765 exs w/ ex-specific rubrics; more human-aligned than previous rubrics; 10 languages, by native speakers 1/
May 6, 2025, 1:49 PM