ATProto Browser

ATProto Browser

Experimental browser for the Atmosphere

Post

I've heard a lot about this working paper, mainly because I'm married to one of The Many Economists™️ (who are not all economists but I digress). Turns out data cleaning matters!

May 5, 2025, 9:31 PM

Record data

{
  "uri": "at://did:plc:ozma3timm5c32fa7bimb4zaj/app.bsky.feed.post/3lohcxddnc226",
  "cid": "bafyreibydvgklue7pa5k4aeec3djrgr4sbssplktrjvmgcd32lkghnrvam",
  "value": {
    "text": "I've heard a lot about this working paper, mainly because I'm married to one of The Many Economists™️ (who are not all economists but I digress). Turns out data cleaning matters!",
    "$type": "app.bsky.feed.post",
    "embed": {
      "$type": "app.bsky.embed.images",
      "images": [
        {
          "alt": "Screenshot of the top of the webpage for the working paper. Title is The Sources of Researcher Variation in Economics. Authors are Nick Huntington-Klein, Claus C. Portner, Ian McCarthy & The Many Economists Collaborative on Researcher Variation",
          "image": {
            "$type": "blob",
            "ref": {
              "$link": "bafkreidqpgcjdkvsq2xo4sk4kqo7av3ljelwwcr3bx2gyvgxgu2gbpwmry"
            },
            "mimeType": "image/jpeg",
            "size": 327496
          },
          "aspectRatio": {
            "width": 1080,
            "height": 1541
          }
        },
        {
          "alt": "Screenshot of abstract that reads: We use a rigorous three-stage many-analysts design to assess how different researcher decisions—specifically data cleaning, research design, and the interpretation of a policy question—affect the variation in estimated treatment effects. A total of 146 research teams each completed the same causal inference task three times each: first with few constraints, then using a shared research design, and finally with pre-cleaned data in addition to a specified design. We find that even when analyzing the same data, teams reach different conclusions. In the first stage, the interquartile range (IQR) of the reported policy effect was 3.1 percentage points, with substantial outliers. Surprisingly, the second stage, which restricted research design choices, exhibited slightly higher IQR (4.0 percentage points), largely attributable to imperfect adherence to the prescribed protocol. By contrast, the final stage, featuring standardized data cleaning, narrowed variation in estimated effects, achieving an IQR of 2.4 percentage points. Reported sample sizes also displayed significant convergence under more restrictive conditions, with the IQR dropping from 295,187 in the first stage to 29,144 in the second, and effectively zero by the third. Our findings underscore the critical importance of data cleaning in shaping applied microeconomic results and highlight avenues for future replication efforts.",
          "image": {
            "$type": "blob",
            "ref": {
              "$link": "bafkreif5r7wxjxfyq35aqykfatoxdm473my34d4qrfsjojqzj2urvwrpve"
            },
            "mimeType": "image/jpeg",
            "size": 635245
          },
          "aspectRatio": {
            "width": 942,
            "height": 1999
          }
        }
      ]
    },
    "langs": [
      "en"
    ],
    "createdAt": "2025-05-05T21:31:54.254Z"
  }
}