Tag: Chain-of-Thought Learning

All the articles with the tag "Chain-of-Thought Learning".

Pushing the boundary on Natural Language Inference

Published: 4 May, 2025 at 04:30 PM

56.51 🤔

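This paper proposes combining Group Relative Policy Optimization with Chain-of-Thought learning to improve performance on natural language inference tasks. The method requires no annotated reasoning paths and, using parameter-efficient fine-tuning, achieves state-of-the-art results on adversarial benchmarks.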