Posts
All the articles I've posted.
-
A Large-Scale Empirical Analysis of Custom GPTs' Vulnerabilities in the OpenAI Ecosystem
This paper conducts a large-scale empirical analysis of 14,904 custom GPTs in the OpenAI store, revealing over 95% lack adequate security against attacks like roleplay (96.51%) and phishing (91.22%), introduces a multi-metric popularity ranking system, and highlights the need for enhanced security in both custom and base models.
-
VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation
This paper introduces VideoUFO, a million-scale dataset of 1.09 million video clips across 1,291 user-focused topics for text-to-video generation, curated from YouTube with minimal overlap with existing datasets, demonstrating improved performance on worst-performing topics when training a simple model like MVDiT.
-
Contaminated Multivariate Time-Series Anomaly Detection with Spatio-Temporal Graph Conditional Diffusion Models
TSAD-C introduces a pioneering unsupervised framework for multivariate time-series anomaly detection on contaminated data, using a Decontaminator with S4-based diffusion, long-range dependency modeling via a time-then-graph approach, and anomaly scoring, achieving state-of-the-art performance across diverse datasets.
-
CB-cPIR: Code-Based Computational Private Information Retrieval
CB-cPIR introduces a code-based single-server computational private information retrieval scheme that enhances security against subquery attacks by using high-weight secret vectors and dual queries, achieving lower communication and computational costs compared to lattice-based schemes like XPIR and SimplePIR.
-
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
ζ¬ζιθΏ pass@k ζζ η³»η»θ―δΌ° RLVR ε¨ε€§εθ―θ¨ζ¨‘εζ¨ηθ½εθΎΉηδΈηζζοΌεη° RLVR δ» ζι«ιζ ·ζηθζͺεΌε ₯ζ°ζ¨η樑εΌοΌε Άθ½εειδΊεΊη‘樑εοΌεΌΊθ°ιζΉθΏ RL θεΌδ»₯ζΏεηζ£ηζ°ζ¨ηθ½εγ