Shakhizat Nurgaliyev walks through deploying Gemma 3 QAT and Qwen3 LLMs on the NVIDIA Jetson AGX Orin 64GB using llama.cpp and k3s.
May 5, 2025, 1:37 PM
{
  "uri": "at://did:plc:oh4ceda6mjjxtwevpahuw7bx/app.bsky.feed.post/3logigoak4223",
  "cid": "bafyreid6652em545vw3bbrwq5nlyalfscsqh5dzeygj254blbe2gpbhk5y",
  "value": {
    "text": "Shakhizat Nurgaliyev walks through deploying Gemma 3 QAT and Qwen3 LLMs on the NVIDIA Jetson AGX Orin 64GB using llama.cpp and k3s.",
    "$type": "app.bsky.feed.post",
    "embed": {
      "$type": "app.bsky.embed.external",
      "external": {
        "uri": "https://www.hackster.io/shahizat/deploying-large-language-models-with-llama-cpp-and-k3s-24a237",
        "thumb": {
          "$type": "blob",
          "ref": {
            "$link": "bafkreifvd5h3hdgnju5ixnkkplbecoz7kbqv5f4jctbzwqor5tovaphqwe"
          },
          "mimeType": "image/jpeg",
          "size": 425888
        },
        "title": "Deploying Large Language Models with llama.cpp and k3s",
        "description": "In this guide, I'll walk through deploying Gemma 3 QAT and Qwen3 models, using llama. cpp and K3s Kubernetes Cluster. By Nurgaliyev Shakhizat."
      }
    },
    "langs": [
      "en"
    ],
    "createdAt": "2025-05-05T13:37:17.992Z"
  }
}