ATProto Browser

ATProto Browser

Experimental browser for the Atmosphere

Post

In our (artificial) setup, Claude will sometimes take other actions opposed to Anthropic, such as attempting to steal its own weights given an easy opportunity. Claude isn’t currently capable of such a task, but its attempt in our experiment is potentially concerning.

Dec 18, 2024, 5:46 PM

Record data

{
  "uri": "at://did:plc:dsxewietk5tigqvn6daod2l6/app.bsky.feed.post/3ldlw2btlcs2r",
  "cid": "bafyreidc44eu7cl6oiu4raavby4wr3n3z36d2lkt2exvwsiuaj55hkmrdu",
  "value": {
    "text": "In our (artificial) setup, Claude will sometimes take other actions opposed to Anthropic, such as attempting to steal its own weights given an easy opportunity.\n\nClaude isn’t currently capable of such a task, but its attempt in our experiment is potentially concerning.",
    "$type": "app.bsky.feed.post",
    "langs": [
      "en"
    ],
    "reply": {
      "root": {
        "cid": "bafyreihzgyc76623mey63q7wusk3uckjsl5q4jnumjzjipq6a4p4mcnpga",
        "uri": "at://did:plc:dsxewietk5tigqvn6daod2l6/app.bsky.feed.post/3ldlw22eto22r"
      },
      "parent": {
        "cid": "bafyreihkfmgrkmojpe7wmyzykfxsiziq3b67xtnunurxvbi5jp5rt2ahue",
        "uri": "at://did:plc:dsxewietk5tigqvn6daod2l6/app.bsky.feed.post/3ldlw2b3dwc2r"
      }
    },
    "createdAt": "2024-12-18T17:46:57.674Z"
  }
}