
EleutherAI/gpt-j-6b - Hugging Face
GPT-J 6B is a transformer model trained using Ben Wang's Mesh Transformer JAX. "GPT-J" refers to the class of model, while "6B" represents the number of trainable parameters. Each layer consists of one feedforward block and one self-attention block.
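One way to check these architectural details yourself is to inspect the published configuration. The sketch below is illustrative only: it assumes the Hugging Face transformers library and downloads just the config (no weights) for the EleutherAI/gpt-j-6b checkpoint named above.

```python
from transformers import AutoConfig

# Fetch only the published configuration for GPT-J-6B (no weights are downloaded).
config = AutoConfig.from_pretrained("EleutherAI/gpt-j-6b")

# Each of the n_layer transformer blocks pairs one self-attention block
# with one feedforward block; these fields describe the overall shape.
print(config.model_type)  # "gptj"
print(config.n_layer)     # number of transformer layers
print(config.n_head)      # attention heads per layer
print(config.n_embd)      # hidden (embedding) size
```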
GPT-J - Hugging Face
To get proper results, you should use EleutherAI/gpt-j-6B instead of hf-internal-testing/tiny-random-gptj. If you get an out-of-memory error when loading that checkpoint, you can try adding device_map="auto" to the from_pretrained call.
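As a rough illustration of that advice, a minimal loading snippet might look like the following. It assumes the transformers and accelerate packages are installed and that half precision (torch.float16) is acceptable on your hardware; only the checkpoint name and device_map="auto" come from the snippet above.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Use the full checkpoint rather than the tiny testing one.
model_id = "EleutherAI/gpt-j-6B"

tokenizer = AutoTokenizer.from_pretrained(model_id)

# device_map="auto" lets accelerate spread the weights across available
# GPUs and CPU RAM to avoid out-of-memory errors when loading the 6B
# checkpoint; loading in float16 roughly halves the memory footprint.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    torch_dtype=torch.float16,
)
```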
GPT-J - Wikipedia
GPT-J or GPT-J-6B is an open-source large language model (LLM) developed by EleutherAI in 2021. [1] As the name suggests, it is a generative pre-trained transformer model designed to produce human-like text that continues from a prompt.
EleutherAI - text generation testing UI
A web UI for testing text generation with GPT-J-6B, powered by Mystic.ai. The model code is available on GitHub, a prompt list offers classic prompts evaluated on other models, and the default sampling settings are top-p 0.9 and temperature 0.8.
graphcore/gpt-j: Notebook for running GPT-J/GPT-J-6B - GitHub
GPT-J is an open-source alternative from EleutherAI to OpenAI's GPT-3. Available for anyone to download, GPT-J can be successfully fine-tuned to perform just as well as large models on a range of NLP tasks including question answering, sentiment analysis, and …
GPT-J-6B: 6B JAX-Based Transformer – Aran Komatsuzaki
June 4, 2021 · We have released GPT-J-6B, a 6B-parameter JAX-based (Mesh) Transformer LM. GPT-J-6B performs nearly on par with the 6.7B GPT-3 (Curie) on various zero-shot downstream tasks. You can try out this Colab notebook or the free web demo. This library also serves as an example of model parallelism with xmap on JAX.
Gpt J 6b · Models · Dataloop
GPT-J 6B is a powerful transformer-based language model that generates text from prompts with remarkable efficiency. With 6 billion trainable parameters, it's designed to learn an inner representation of the English language, enabling it to extract …
GPT-J-6B: An Introduction to the Largest Open Source GPT …
October 14, 2021 · GPT-J-6B is an open-source, autoregressive language model created by a group of researchers called EleutherAI. It's one of the most advanced alternatives to OpenAI's GPT-3 and performs well...
Inference with GPT-J-6B - Google Colab
In this notebook, we are going to perform inference (i.e. generate new text) with EleutherAI's GPT-J-6B model, which is a 6 billion parameter GPT model trained on The Pile, a huge publicly...
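A minimal inference sketch along those lines is shown below. It is not the notebook's exact code: it assumes the transformers and accelerate libraries, reuses the float16 + device_map="auto" loading shown earlier, and borrows the top-p 0.9 / temperature 0.8 sampling defaults advertised by the EleutherAI demo above; the prompt string is made up for illustration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "EleutherAI/gpt-j-6B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    torch_dtype=torch.float16,
)

# Encode a prompt and move it to the device holding the model's first weights.
prompt = "The Pile is a large, diverse dataset that"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Nucleus sampling with top-p 0.9 and temperature 0.8 (illustrative settings).
outputs = model.generate(
    **inputs,
    do_sample=True,
    top_p=0.9,
    temperature=0.8,
    max_new_tokens=64,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```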
gpt-j-6b | AI Model Details
May 28, 2024 · The gpt-j-6b is a large language model trained by EleutherAI, a research group dedicated to developing open-source AI systems. The model has 6 billion trainable parameters and uses the same tokenizer as GPT-2 and GPT-3, with a vocabulary size of 50,257. It utilizes Rotary Position Embedding (RoPE) for positional encoding.
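The shared GPT-2-style tokenizer is easy to verify without downloading the 6B weights; the quick check below assumes the transformers library and only pulls the tokenizer files for the checkpoint.

```python
from transformers import AutoTokenizer

# GPT-J reuses the GPT-2 BPE tokenizer, so the base vocabulary has 50,257 entries.
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6b")
print(tokenizer.vocab_size)  # base BPE vocabulary size (50,257 per the snippet above)
print(tokenizer.tokenize("Rotary Position Embedding"))
```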