ACL Fine - 搜索

约 3,680,000 个结果

在新选项卡中打开链接

时间不限

zhihu.com
https://zhuanlan.zhihu.com
论文笔记：ReFT Reasoning with Reinforced Fine-Tuning - 知乎
2024年12月21日 · ReFT使用强化学习，不是为了像RLHF那样将模型输出与人类偏好相对齐，而是为了微调增强模型的推理能力。原CoT数据集每一条样本分成三个部分（question，CoT，answer），分别用（x，e，y）表示。其中at可取词表中的任一token，st是当前的状态，即问题+所有上文tokens。其中L是CoT总长度（总token数）。使用“ (question, CoT)” tuples: (x, e)。这一步使用SFT训几个epoch，让LLM先看几遍所有的推理路径，获得一定的生 …
aclanthology.org
https://aclanthology.org
FineSurE: Fine-grained Summarization Evaluation using LLMs - ACL …
2025年3月15日 · To remedy those limitations, we propose FineSurE, a fine-grained evaluator specifically tailored for the summarization task using large language models (LLMs). It also employs completeness and conciseness criteria, in addition to faithfulness, enabling multi-dimensional assessment.
aclanthology.org
https://aclanthology.org
Towards Faithful Multi-step Reasoning through Fine ... - ACL …
2 天之前 · While CoT facilitates multi-step reasoning, the dependencies between reasoning steps are not always clearly discernible, which may lead to inconsistent reasoning. In this paper, we introduce fine-grained attribution reasoning distillation (FARD), which incorporates grounded citations to consolidate the relationships between reasoning steps.
aclanthology.org
https://aclanthology.org
ACL Anthology
2 天之前 · The ACL Anthology currently hosts 105850 papers on the study of computational linguistics and natural language processing.
arxiv.org
https://arxiv.org › abs
[2401.08967] ReFT: Reasoning with Reinforced Fine-Tuning
2024年1月17日 · To address this issue, we propose a simple yet effective approach called Reinforced Fine-Tuning (ReFT) to enhance the generalizability of learning LLMs for reasoning, with math problem-solving as an example.
arxiv.org
https://arxiv.org › abs
[2403.13372] LlamaFactory: Unified Efficient Fine-Tuning of 100 ...
2024年3月20日 · We present LlamaFactory, a unified framework that integrates a suite of cutting-edge efficient training methods. It provides a solution for flexibly customizing the fine-tuning of 100+ LLMs without the need for coding through the built-in web UI LlamaBoard.
aclanthology.org
https://aclanthology.org
ReFT: Reasoning with Reinforced Fine-Tuning - ACL Anthology
2025年3月17日 · To address this issue, we propose a simple yet effective approach called Reinforced Fine-Tuning (ReFT) to enhance the generalizability of learning LLMs for reasoning, with math problem-solving as an example.
zhihu.com
https://zhuanlan.zhihu.com
一文速览 | ACL 2021 最全论文分类（主会+Findings） - 知乎
ACL-IJCNLP 2021 是CCF A类会议，是人工智能领域自然语言处理（ Natural Language Processing，NLP）方向最权威的国际会议。计算语言学协会第59届年会暨第11届自然语言处理国际联席会议（The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL-IJCNLP 2021）计划于今年8月1日-8月6日以线上会议形式召开。
basicutils.com
https://basicutils.com › ... › spring-security-access-control-list-acl
Implementing Access Control Lists (ACL) in Spring Security
Learn how to implement Access Control Lists (ACL) in Spring Security for fine-grained control over resource access. This guide covers the components, setup, best practices, and examples of ACL in Spring Boot applications.
google.com
https://cloud.google.com › storage › docs › access-control
Overview of access control | Cloud Storage | Google Cloud
2025年3月5日 · Fine-grained: The fine-grained option enables you to use IAM and Access Control Lists (ACLs) together to manage permissions. ACLs are a legacy access control system for Cloud Storage designed...
某些结果已被删除
分页
- 1
- 2
- 3
- 4
- 下一页

论文笔记：ReFT Reasoning with Reinforced Fine-Tuning - 知乎

FineSurE: Fine-grained Summarization Evaluation using LLMs - ACL …

Towards Faithful Multi-step Reasoning through Fine ... - ACL …

ACL Anthology

[2401.08967] ReFT: Reasoning with Reinforced Fine-Tuning

[2403.13372] LlamaFactory: Unified Efficient Fine-Tuning of 100 ...

ReFT: Reasoning with Reinforced Fine-Tuning - ACL Anthology

一文速览 | ACL 2021 最全论文分类（主会+Findings） - 知乎

Implementing Access Control Lists (ACL) in Spring Security

Overview of access control | Cloud Storage | Google Cloud