GPT Neo - Search

huggingface.co
https://huggingface.co › docs › transformers › model_doc › gpt_neo
GPT Neo - Hugging Face
GPT Neo Overview. The GPTNeo model was released in the EleutherAI/gpt-neo repository by Sid Black, Stella Biderman, Leo Gao, Phil Wang and Connor Leahy. It is a GPT2 like causal language model trained on the Pile dataset. The architecture is similar to GPT2 except that GPT Neo uses local attention in every other layer with a window size of 256 ...
github.com
https://github.com › EleutherAI › gpt-neo
GitHub - EleutherAI/gpt-neo: An implementation of model parallel GPT …
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library. - EleutherAI/gpt-neo
csdn.net
https://blog.csdn.net › xiexiecn › article › details
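EleutherAI GPT-Neo: The Poor Man's Hope - CSDN Blog
Nov 27, 2022 · Eleuther AI recently released an open-source project on GitHub called GPT-Neo: pretrained models that replicate GPT-3 (1.3B & 2.7B) and can be fine-tuned on Colab. Although GPT-Neo's parameter count is still small compared with the real GPT-3 (even the larger version only matches the parameter count of the smallest GPT-3 model), ...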
huggingface.co
https://huggingface.co › EleutherAI
EleutherAI/gpt-neo-2.7B - Hugging Face
GPT-Neo 2.7B is a transformer model designed using EleutherAI's replication of the GPT-3 architecture. GPT-Neo refers to the class of models, while 2.7B represents the number of parameters of this particular pre-trained model.
eleuther.ai
https://www.eleuther.ai › artifacts › gpt-neo
GPT-Neo - EleutherAI
Mar 21, 2021 · A series of large language models trained on the Pile. It was our first attempt to produce GPT-3-like language models and comes in 125M, 1.3B, and 2.7B parameter variants.
csdn.net
https://blog.csdn.net › article › details
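GPT-Neo 2.7B Model Installation and Usage Tutorial - CSDN Blog
Dec 12, 2024 · This article takes an in-depth look at the "gpt-neo_dungeon" project and shows how to use a fine-tuned GPT-Neo-2.7B model for text generation and interaction in a Colab environment. First, we need to understand what GPT-Neo is. GPT-Neo is a large language model built on the Transformer architecture, whose parameter count...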
datalearner.com
https://www.datalearner.com › ai-models › foundation-models › GPT-Neo
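GPT-Neo(X) Foundation Model Details | DataLearner
Mar 22, 2021 · The GPT-Neo models are intended as an open and transparent alternative to the GPT models, which are proprietary models developed by OpenAI. The GPT-Neo series includes several models of different sizes, from GPT-Neo 125M with 125 million parameters to the larger GPT-Neo 1.3B with 1.3 billion parameters.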
dongaigc.com
https://www.dongaigc.com › gpt-neo-learning-resources
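GPT-Neo Learning Resources Roundup - An Open-Source GPT-3-Class Large Language Model - 懂AI
GPT-Neo gives researchers and developers a powerful NLP tool for text generation, language understanding, and many other tasks. To get started quickly, you can load a pretrained model through the Hugging Face Transformers library: This will generate text using the 2.7B-parameter GPT-Neo model. EleutherAI provides the following pretrained GPT-Neo models for download: These models were trained on The Pile dataset and can be used directly for inference or fine-tuned on specific tasks. GPT-Neo provides a powerful open-source tool for NLP research and applications. With the resources in this article, readers can quickly get started with GPT-Neo and explore …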
csdn.net
https://blog.csdn.net › article › details
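Exploring GPT-Neo: An Open-Source Pretrained Language Model - CSDN Blog
Mar 20, 2024 · GPT-Neo is based on the Transformer architecture, a revolutionary deep learning model for sequence tasks. Its main feature is the self-attention mechanism, which lets the model capture global dependencies across the input sequence; this is essential for understanding and generating complex natural language. GPT-Neo comes in different sizes, including 1.3B and 2.7B parameters, suited to different compute and performance needs. As for training data, GPT-Neo draws on public internet text, which gives it broad corpus knowledge and varied expression. In addition, the project's training code is open source …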
github.com
https://github.com › ... › source › en › model_doc › gpt_neo.md
transformers/docs/source/en/model_doc/gpt_neo.md at main
The GPTNeo model was released in the EleutherAI/gpt-neo repository by Sid Black, Stella Biderman, Leo Gao, Phil Wang and Connor Leahy. It is a GPT2 like causal language model trained on the Pile dataset. The architecture is similar to GPT2 except that GPT Neo uses local attention in every other layer with a window size of 256 tokens.
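The "local attention in every other layer" described in the snippet above can be illustrated with a small mask sketch. This is a standard-library illustration only, not the EleutherAI or Hugging Face implementation. The names `attention_kind` and `causal_mask` are hypothetical, and two details are assumptions the snippet does not state: that even-numbered layers are the global ones, and that the window counts the current token.

```python
from typing import List, Optional


def attention_kind(layer_index: int) -> str:
    """Alternate attention types by layer.

    Assumption: even layers are global and odd layers are local. The snippet
    only says local attention is used in every other layer.
    """
    return "global" if layer_index % 2 == 0 else "local"


def causal_mask(seq_len: int, window: Optional[int] = None) -> List[List[bool]]:
    """Return mask[i][j], True when query position i may attend to key position j.

    window=None gives an ordinary causal mask. With a window, position i sees
    only itself and the (window - 1) positions before it (assumed convention).
    """
    mask = []
    for i in range(seq_len):
        row = []
        for j in range(seq_len):
            visible = j <= i  # causal: never look at future tokens
            if window is not None:
                visible = visible and (i - j) < window
            row.append(visible)
        mask.append(row)
    return mask


if __name__ == "__main__":
    # A tiny window keeps the printout readable; the snippet cites 256 tokens.
    for layer in range(4):
        kind = attention_kind(layer)
        mask = causal_mask(6, window=3 if kind == "local" else None)
        print(f"layer {layer} ({kind})")
        for row in mask:
            print("".join("x" if allowed else "." for allowed in row))
```

In the local layers each row keeps at most `window` visible positions, so the attention span stops growing with sequence length, while the global layers still see the whole prefix.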