LLM Memory Usage Calculator
Model Name:
Parameters Quantity (Billions):
Quantization (bits):
4-bit
4-bit GAT (30% more efficient)
8-bit
16-bit
Context Length (tokens):
Number of Users (simultaneous):
CPU Cores:
RAM Capacity (GB):
Calculate