Posts
All the articles I've posted.
-
A Comprehensive Analysis of Adversarial Attacks against Spam Filters
This paper conducts a comprehensive analysis of adversarial attacks on deep learning-based spam filters, revealing significant vulnerabilities across character, word, sentence, and AI-generated paragraph levels using novel scoring functions like spam weights, with distilBERT showing relative resilience at paragraph-level attacks.
-
Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning
Selftok introduces a non-spatial autoregressive visual tokenizer using diffusion timesteps, unifying vision-language models and enabling effective reinforcement learning for superior text-to-image generation, as demonstrated on GenEval and DPG-Bench benchmarks.
-
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
本文提出 Universal Reasoner (UniR),一种轻量级、可组合的推理模块,通过将预定义奖励转化为 token 级别指导信号,为冻结的大型语言模型提供高效的推理能力增强,并在数学推理与机器翻译任务上展现出优于部分基线的性能与跨模型迁移能力。
-
Warm Up Before You Train: Unlocking General Reasoning in Resource-Constrained Settings
本文提出了一种两阶段训练框架,通过领域无关的Knights & Knaves逻辑游戏预热激活通用推理能力,并结合少量目标领域数据的RLVR训练,在资源受限环境下显著提升大型语言模型的推理性能和跨领域泛化能力。
-
MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning
本文提出MoRE方法,通过将LoRA的不同秩视为专家并设计自适应秩选择器,显著提升了大型语言模型在多任务场景中的微调效率和性能,同时保持较低的参数量。