Experimental browser for the Atmosphere
Ironically, two days before acceptance of our paper at EMNLP, OpenAI removed the ability to access token logprobs from gpt-3.5-turbo-instruct. This is a timely issue. We need to establish best practices for LLM evaluation based on scientific merit, not just convenience. 5/5
Oct 24, 2023, 3:05 PM
{ "uri": "at://did:plc:t7jbj4w3uo2sus4bc3znsspa/app.bsky.feed.post/3kciypxetdk2h", "cid": "bafyreiel5xhbelzvr5wsavgmg3q6fhitahiqajtriodygtwxnwuqjatjse", "value": { "text": "Ironically, two days before acceptance of our paper at EMNLP, OpenAI removed the ability to access token logprobs from gpt-3.5-turbo-instruct. \n\nThis is a timely issue. We need to establish best practices for LLM evaluation based on scientific merit, not just convenience. 5/5", "$type": "app.bsky.feed.post", "langs": [ "en" ], "reply": { "root": { "cid": "bafyreiclw56tvjjsi4r5tv53erbkmmzwjf4ohc2rttnowg6tivm4k7cdfi", "uri": "at://did:plc:t7jbj4w3uo2sus4bc3znsspa/app.bsky.feed.post/3kciylu4bpi2w" }, "parent": { "cid": "bafyreiaffb7qs7ejsfoccg4jwdyhtplzvg4b523x6ljgaosrnjh4tmszda", "uri": "at://did:plc:t7jbj4w3uo2sus4bc3znsspa/app.bsky.feed.post/3kciyph6w4c2n" } }, "createdAt": "2023-10-24T15:05:37.100Z" } }