没有 GPU，还能跑大模型吗？vLLM vs llama.cpp 实测对比_开源_GPUStack