Experimental browser for the Atmosphere
This @emnlpmeeting.bsky.social paper argues that LLM responses in big5-style tests are reliable. LLM responses across various settings (eg, languages, prompts) vary less than human responses. Yet, the external validity of these tests for LLMs remains unclear. aclanthology.org/2024.emnlp-m...
Nov 26, 2024, 6:41 PM
{ "uri": "at://did:plc:ldc2wu7h57ks63hpjmosislw/app.bsky.feed.post/3lbuostp4tc2i", "cid": "bafyreielwrua4iycj5png26v2634dtpvdt5deq4r3te7fsxfzkyiuvto6m", "value": { "text": "This @emnlpmeeting.bsky.social paper argues that LLM responses in big5-style tests are reliable. LLM responses across various settings (eg, languages, prompts) vary less than human responses. Yet, the external validity of these tests for LLMs remains unclear.\n\naclanthology.org/2024.emnlp-m...", "$type": "app.bsky.feed.post", "embed": { "$type": "app.bsky.embed.images", "images": [ { "alt": "", "image": { "$type": "blob", "ref": { "$link": "bafkreig5f2m66nnuirrdjrggj3xukbd4dpfq4bdgrntf2cgmia7w2t7ei4" }, "mimeType": "image/jpeg", "size": 201751 }, "aspectRatio": { "width": 503, "height": 530 } } ] }, "langs": [ "en" ], "facets": [ { "$type": "app.bsky.richtext.facet", "index": { "byteEnd": 30, "byteStart": 5 }, "features": [ { "did": "did:plc:fhxt4t4p7wuol3j6qn4imrtr", "$type": "app.bsky.richtext.facet#mention" } ] }, { "index": { "byteEnd": 293, "byteStart": 261 }, "features": [ { "uri": "https://aclanthology.org/2024.emnlp-main.354/", "$type": "app.bsky.richtext.facet#link" } ] } ], "createdAt": "2024-11-26T18:41:13.990Z" } }