The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Sycophantic Praise: Evaluating Excessive Praise in Language Models

Sycophancy in language models is typically studied as excessive agreement or validation, while explicit praise and flattery have received comparatively little attention. We argue that sycophantic praise is a distinct alignment problem that cannot be reliably measured using current methods. We introduce a parameterized framework that measures whether praise is excessive relative to contribution quality and expected user ability. We show that our framework substantially outperforms generic LLM judges in agreement with human annotations, and that sycophantic praise occurs far more frequently in ...

Daniel Vennemeyer · 2 months ago

Re-imagining ISO 26262 in the Age of Autonomous Vehicles: Enhancing Controllability through Transferability and Predictability

The Lipreading Gap: Do VSR Models Perceive Visual Speech Like Human Lipreaders?

Watch, Remember, Reason: Human-View Video Understanding with MLLMs

Discovering Multiscale Deep Formulas in Complex Systems via Neural-Guided Lambda Calculus

The Masked Advantage: Uncovering Local-Language Access to Cultural Knowledge in LLMs

Video-Based Prediction of In-Flight Particle Characteristics in Atmospheric Plasma Spraying

Sparsely gated tiny linear experts

Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills

A Comprehensive Anatomy of Human and DeepSeek-R1 LLM Mathematical Reasoning

Reversible Foundations: Training a 120B Sparse MoE through State-Preserving Scaling

The Proxy Benders Decomposition

M$^3$Exam: Benchmarking Multimodal Memory for Realistic User-Agent Interactions

Generative Modeling of Discrete Latent Structures via Dynamic Policy Gradients

Automatic, Debiased, and Invariant Counterfactual Generation under General Interventions

The Fitbit Air is a good wearable weighed down by a chatty AI "coach"

Online Pandora's Box for Contextual LLM Cascading

New York lawmakers pass one-year ban on new data centers

Making the Most of Limited Data: Score-Aware Training for Text-to-Music Generation

Unified Geometry-Guided ML-FTLE for Tracking Transient Chaos from Scalar Time Series

RhinoVLA Technical Report

Covariance Shrinkage via Stochastic Interpolation

Impact of Synthetic Lesional MR Images in Automated Focal Cortical Dysplasia Detection in Low-Data Scenarios

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Mitosis Detection in the Wild: Multi-Tumor and Context-Aware Generalization in the MIDOG 2025 Challenge

Self-evolving LLM agents with in-distribution Optimization

Dash2Sim: Closed-Loop Driving Simulation from in-the-wild Dashcam Videos

A robust PPG foundation model using multimodal physiological supervision

Breaking the Ice: Analyzing Cold Start Latency in vLLM

DirectAudioEdit: Inversion-Free Text-Guided Audio Editing via Diffusion Prediction Contrast