PC Guff - 搜索

约 6,710,000 个结果

时间不限

在新选项卡中打开链接

CSDN博客
https://blog.csdn.net › article › details
第二十一篇-LM-Studio安装部署gguf模型到windows电脑_lm …
2024年2月29日 · (1) 使用LMStudio，LM Studio 是一个桌面应用程序，允许本地运行大型语言模型（LLMs，LM Studio支持的是guff格式的模型，这类模型经过重新编译打包，整体尺寸更小，文 …
chatstream.org
https://www.chatstream.org › cn › download
下载 DeepSeek V3 和 R1 模型 - 完整版、量化版和蒸馏版
首先，从官方网站下载并安装 Ollama。然后，您可以运行以下任何命令来使用不同版本的 DeepSeek-R1： macOS Linux Windows # Base Model (67.1B) ollama run deepseek-r1:671b …
fluxai.cn
https://www.fluxai.cn › detail
如何使用 ComfyUI 运行 FLUX GGUF 文件
2024年9月18日 · FLUX.1 dev和schnell开源文本到图像生成模型提供了卓越的质量，但运行FLUX.1 dev需要16-24GB的VRAM。量化模型通过压缩全尺寸模型，降低了VRAM需求，但可能牺 …
阿里云开发者社区
https://developer.aliyun.com › article
魔搭社区GGUF模型怎么玩！看这篇就够了 - 阿里云开发 ...
2024年6月20日 · 本文内容由阿里云实名注册用户自发贡献，版权归原作者所有，阿里云开发者社区不拥有其著作权，亦不承担相应法律责任。
Github
https://github.com › LostRuins › koboldcpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
Overview
Windows Usage
Run on Colab
OSX and Linux
OSX and Linux Manual Compiling
Compiling on Windows
Compiling on Android (Termux Installation)
AMD
Docker
Questions and Help
KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models. It's a single self contained distributable from Concedo, that builds off llama.cpp, and adds a versatile Kobold API endpoint, additional format support, backward compatibility, as well as a fancy UI with persistent stories, editing tools, save formats, memory, world i...
在github.com上查看更多信息
Github
https://github.com › antirez › gguf-tools
GitHub - antirez/gguf-tools: GGUF implementation in C as a library …
This is a work in progress library to manipulate GGUF files. While the library aims to be useful, one of the main goals is to provide an accessible code base that as a side effect documents …
Hugging Face
https://huggingface.co › bartowski
bartowski/DeepSeek-V2.5-GGUF - Hugging Face
Filename Quant type File Size Split Description; DeepSeek-V2.5-Q8_0.gguf: Q8_0: 250.62GB: true: Extremely high quality, generally unneeded but max available quant.
Stack Overflow
https://stackoverflow.com › questions › how...
How to run any quantized GGUF model on CPU for local inference?
2023年12月9日 · llama-cpp-python is my personal choice, because it is easy to use and it is usually one of the first to support quantized versions of new models. To install it for CPU, just …
hugging-face.cn
https://hugging-face.cn › docs › hub › gguf
GGUF - Hugging Face 机器学习平台
GGUF. Hugging Face Hub 支持所有文件格式，但内置了对 GGUF 格式的支持，该格式是二进制格式，针对模型的快速加载和保存进行了优化，使其在推理方面非常高效。 GGUF 被设计用 …
DevPress官方社区
https://devpress.csdn.net › article › detail
如何用hugging face hub一键下载模型并将模型转换为gguf格式（ …
量化就是减少模型所占内存的大小，量化位数越少，模型占有内存越小，但相对可能模型表现会更差，所以建议在模型过大的时候可以适当采用。进入本地【E:\llamaapp\llama.cpp】文件 …
某些结果已被删除
分页
- 1
- 2
- 3
- 4
- 下一页