模型微服务化(Model Microserv […]
模型服务(Model Serving)是指将 […]
模型推理服务器是一种专门用于执行人工智能模型 […]
数据中心LLM(Data Center LL […]
边缘LLM(Edge Large Langu […]
FP16量化(Half Precision […]
非结构化剪枝(Unstructured Pr […]
边缘部署(Edge Deployment)是 […]
容器化(Containerization)是 […]
知识蒸馏(Knowledge Distill […]