Tag: Test Time

All the articles with the tag "Test Time".

Test-time Correlation Alignment

Published: 8 May, 2025 at 12:21 AM

93.31 🤔

本文提出测试时相关性对齐（TCA）范式，通过构建伪源域相关性并应用线性变换对齐测试数据特征，显著提升测试时适应（TTA）性能，同时保持高效性和源域知识。
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Published: 22 May, 2025 at 11:16 AM

92.95 🤔

本文提出 LATENTSEEK 框架，通过在潜在空间中基于策略梯度的测试时实例级适应（TTIA），显著提升大型语言模型的推理能力，同时探索测试时扩展的新方向。
Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging

Published: 23 May, 2025 at 11:15 AM

92.79 🤔

本文提出测试时模型合并（TTMM）方法，通过在训练时预训练大量专家模型并在测试时动态合并参数，以几乎无测试时开销的方式逼近测试时训练（TTT）的语言建模性能。
Reward Reasoning Model

Published: 24 May, 2025 at 11:08 AM

92.11 🤔

本文提出奖励推理模型（RRMs），通过链式推理过程在生成奖励前自适应利用测试时计算资源，在多个奖励建模基准和实际应用中显著提升性能，尤其在复杂推理任务上表现优异。
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning

Published: 24 May, 2025 at 11:12 AM

90.98 🤔

本文提出 PLAN-AND-BUDGET 框架，通过结构化推理和基于不确定性的自适应 token 预算分配，显著提升大型语言模型在推理任务中的计算效率，E3 指标最高提升 187.5%，同时保持准确率。

Tag: Test Time

Test-time Correlation Alignment

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging

Reward Reasoning Model

Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning