You can try something like this to get a rough estimate: https://huggingface.co/spaces/NyxKrage/LLM-Model-VRAM-Calcul...
But you really don't know the exact numbers until you try, a lot of it is runtime/environment context specific.
But you really don't know the exact numbers until you try, a lot of it is runtime/environment context specific.