ATProto Browser

Experimental browser for the Atmosphere

Post

Recently used arrow + duckdb to get some SQL practice in and blogged about it. Was blown away that this doc rendered even though the dataset was originally 10GB in size. On a side note: does anyone know if you can use arrow::open_dataset() on a pins parquet or arrow object? #rstats #quartopub

Aug 27, 2023, 12:26 AM

Record data

{
  "uri": "at://did:plc:gffmqwjiagcs7hug7npomosg/app.bsky.feed.post/3k5vmkuy7bp25",
  "cid": "bafyreidltwujtg2l3sagst5xoam7ta5fvtggdcafa2gotxvpoavj3xezai",
  "value": {
    "text": "Recently used arrow + duckdb to get some SQL practice in and blogged about it. \n\nWas blown away that this doc rendered even though the dataset was originally 10GB in size. On a side note: does anyone know if you can use arrow::open_dataset() on a pins parquet or arrow object? #rstats #quartopub",
    "$type": "app.bsky.feed.post",
    "embed": {
      "$type": "app.bsky.embed.external",
      "external": {
        "uri": "https://www.mrworthington.com/articles/rstats/gnarly-data-arrow-sql-duckdb/",
        "thumb": {
          "$type": "blob",
          "ref": {
            "$link": "bafkreicybpsaz2thpb22bhpnxgt7seialcutyevzcdfbyti4ksf55ax4ji"
          },
          "mimeType": "image/jpeg",
          "size": 564805
        },
        "title": "Matt Worthington - Gnarly Data w/ Arrow, DuckDB, + SQL",
        "description": "Knowing SQL is a must for Data Scientists + other analytics professionals, but how can R users start practicing their SQL skills in a familiar environment? That’s this post!"
      }
    },
    "langs": [
      "en"
    ],
    "createdAt": "2023-08-27T00:26:40.527Z"
  }
}
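
The `uri` field in the record above packs three pieces of information into one string: the repository's DID, the collection name, and the record key. As a rough sketch, assuming the three-segment form shown in this record (other AT URI shapes are not handled), it could be split apart in Python as follows. `AtUri` and `parse_at_uri` are hypothetical names chosen for illustration, not part of any library.

```python
from typing import NamedTuple


class AtUri(NamedTuple):
    """Hypothetical container for the three segments of a record URI."""
    authority: str   # the repository identifier, here a DID
    collection: str  # the record type, e.g. app.bsky.feed.post
    rkey: str        # the record key within that collection


def parse_at_uri(uri: str) -> AtUri:
    """Split an at:// URI of the form at://<authority>/<collection>/<rkey>."""
    prefix = "at://"
    if not uri.startswith(prefix):
        raise ValueError(f"not an at:// URI: {uri!r}")
    parts = uri[len(prefix):].split("/")
    # This sketch only accepts the full three-segment form seen in the record.
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected authority/collection/rkey: {uri!r}")
    return AtUri(*parts)


# The URI copied from the record data above.
record_uri = "at://did:plc:gffmqwjiagcs7hug7npomosg/app.bsky.feed.post/3k5vmkuy7bp25"
parsed = parse_at_uri(record_uri)
print(parsed.authority)
print(parsed.collection)
print(parsed.rkey)
```

In this record the collection segment matches the `$type` of the `value` object, `app.bsky.feed.post`.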