Tag: Human-AI Interaction

All the articles with the tag "Human-AI Interaction".

Diverse, not Short: A Length-Controlled Self-Learning Framework for Improving Response Diversity of Language Models

Published: 26 May, 2025 at 11:24 AM

87.09 🤔

本文提出Diverse-NS框架，通过长度控制的自学习和偏好优化显著提升了大型语言模型在创造性任务中的响应多样性，同时在大多数情况下保持了输出质量，并验证了小模型作为大模型多样性教师的可行性。
AI agents may be worth the hype but not the resources (yet): An initial exploration of machine translation quality and costs in three language pairs in the legal and news domains

Published: 8 May, 2025 at 12:22 AM

86.86 🤔

本文通过实证评估五种机器翻译范式，发现推理增强的大型语言模型（如o1-preview）在人工评估中表现出色，超越传统NMT，而多智能体系统虽具潜力，但因高计算成本和语言对表现不一致而受限。
Activated LoRA: Fine-tuned LLMs for Intrinsics

Published: 7 May, 2025 at 12:17 AM

86.84 🤔

本文提出 Activated LoRA (aLoRA)，一种改进的 LoRA 框架，通过仅对激活后 token 适配权重，复用基础模型 KV 缓存，实现高效动态适配，并在多个任务上保持与标准 LoRA 相当的性能，同时显著降低推理成本。
Hybrid Latent Reasoning via Reinforcement Learning

Published: 3 Jun, 2025 at 11:43 AM

86.71 🤔

本文提出HRPO，一种基于强化学习的混合潜在推理框架，通过门控机制结合离散token和连续隐状态，显著提升了大型语言模型在知识和推理任务上的性能，同时减少了对链式思维数据的依赖。
Self-Interpretability: LLMs Can Describe Complex Internal Processes that Drive Their Decisions, and Improve with Training

Published: 30 May, 2025 at 11:15 AM

86.28 🤔

本文通过微调GPT-4o和GPT-4o-mini，展示了大型语言模型能够量化报告其内部决策过程（如属性权重），并通过内省训练显著提升报告准确性，且这种能力可泛化至原生偏好，为AI可解释性和安全性提供了新路径。

Tag: Human-AI Interaction

Diverse, not Short: A Length-Controlled Self-Learning Framework for Improving Response Diversity of Language Models

AI agents may be worth the hype but not the resources (yet): An initial exploration of machine translation quality and costs in three language pairs in the legal and news domains

Activated LoRA: Fine-tuned LLMs for Intrinsics

Hybrid Latent Reasoning via Reinforcement Learning

Self-Interpretability: LLMs Can Describe Complex Internal Processes that Drive Their Decisions, and Improve with Training