Tag: Classification

All the articles with the tag "Classification".

When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator

Published: 8 May, 2025 at 06:13 PM

88.68 🤔

This paper demonstrates that a 1.5B parameter reasoning model (Distill-R1) outperforms larger non-reasoning LLMs as a discriminator in a text-to-SQL planning framework by leveraging a novel soft score extraction method from chain-of-thought outputs, though it struggles significantly as a generator.
Who Taught You That? Tracing Teachers in Model Distillation

Published: 22 May, 2025 at 11:11 AM

88.50 🤔

本文提出了一种基于句法模式（PoS 模板）的方法，通过学生模型输出的高阶语言特征识别其教师模型，并在多个任务和数据集上验证了其优于传统相似度和困惑度方法的性能，但准确率仍有待提升。
Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning

Published: 5 Jun, 2025 at 11:24 AM

87.16 🤔

本文通过将自然语言理解任务转化为强化学习问题，使用PPO算法微调中小规模LLMs，在GLUE和SuperGLUE基准上显著提升性能，超越监督微调和BERT-large，并展现出优于GPT-4o的零样本泛化能力。
Boltzmann Classifier: A Thermodynamic-Inspired Approach to Supervised Learning

Published: 14 May, 2025 at 11:08 AM

86.86 🤔

The Boltzmann Classifier introduces a thermodynamically inspired supervised learning approach that uses an energy-based model derived from the Boltzmann distribution to estimate class probabilities, achieving competitive accuracy on benchmark datasets while offering interpretability and computational efficiency.
Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning

Published: 31 May, 2025 at 11:19 AM

86.54 🤔

本文通过隐藏状态的几何特性（可分离性和对齐性）提出统一框架，揭示上下文学习（ICL）在分类任务中的两阶段机制——早期层通过PTH增强可分离性，后期层通过IH优化对齐性，并解释了任务向量的有效性。

Tag: Classification

When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator

Who Taught You That? Tracing Teachers in Model Distillation

Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning

Boltzmann Classifier: A Thermodynamic-Inspired Approach to Supervised Learning

Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning