NaN: the community's first month in numbers
117 billion tokens, 3.68 million requests, 21 countries, and 99.98% uptime. NaN is a community of builders with its own inference infrastructure and a private platform to deploy apps and agents.
7 articles
In this post we'll learn what parameters and quantization are, so we can figure out how much space AI models take up.
In this post I'll walk you through how the community's inference servers are set up: the hardware we use, the stack we run, and the models we serve.
Over the past several days I've spent hours documenting and optimizing my entire local environment so I can "mechanize" the daily work of managing infrastructure for multiple startups.
This post isn't a guide to using Clawd. Instead, it looks at how we're rolling it out at Helmcode as an AI agent that helps with our day-to-day work managing the cloud infrastructure of multiple startups.
Kubernetes is one of the most widely used infrastructure tools, and it has become the standard for running containerized applications at scale.
Before we start, a bit of context. The infrastructure is hosted on AWS and the architecture was based on Serverless services: