I want to achieve decent performance and low latency while running DeepSeekCoder 6.7B on 8 GB of VRAM. The model mentions that GPU offloading lowers RAM consumption, but I'm using an AMD GPU. Does Tabby support GPU offloading on AMD GPUs?
RiQuY
Asked on Apr 11, 2024