I'm wondering if it's possible to apply a setup that optimizes CPU usage when launching the Tabby server. Clement mentioned exploiting the M3 macOS CPU with LM Studio, and shared the command used to start the Tabby server along with the output of /v1/health. Meng Zhang suggested trying 1.3B deepseekcoder for --model.
clement-igonet
Asked on Feb 27, 2024
Try 1.3B deepseekcoder for the --model parameter.
Example command with optimized --model parameter:
$ TABBY_DISABLE_USAGE_COLLECTION=1 tabby serve --device metal --model TabbyML/DeepseekCoder-1.3B --chat-model TabbyML/Mistral-7B --webserver
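
To check that the server picked up the smaller model, the /v1/health endpoint mentioned above can be queried once Tabby is running. This is a minimal sketch, not taken from the thread: it assumes the server listens on localhost:8080 (adjust if a different host or port was passed), and TABBY_URL and tabby_health are hypothetical names used only for illustration.

```shell
# Assumption: Tabby is reachable at localhost:8080; override TABBY_URL otherwise.
TABBY_URL="${TABBY_URL:-http://localhost:8080}"

# Hypothetical helper: fetch the server's health report from /v1/health.
tabby_health() {
  curl -s "${TABBY_URL}/v1/health"
}
```

Calling tabby_health after startup should return the same kind of report that was shared in the thread, which can then be compared before and after changing --model.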