The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Study on German language model training trade-offs between data diversity and quality filtering, testing hierarchical filters on 500M documents.

Ansar Aynetdinov·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Unified Framework of Hyperbolic Graph Representation Learning Methods

Open-source framework for unified evaluation and comparison of hyperbolic graph representation learning methods across implementations.

Sofía Pérez Casulo·2 months ago

NVIDIA Dev Blog· INFRA

How to Build, Run, and Scale High-Quality Creator Workflows in ComfyUI

Creative and visualization teams today produce more assets, in more formats, with leaner teams. Generative AI can accelerate that work – compressing tasks... Creative and visualization teams today produce more assets, in more formats, with leaner teams. Generative AI can accelerate that work – compressing tasks that once took hours of manual effort into automated, repeatable pipelines. ComfyUI is an open-source, node-based creative tool that runs locally on NVIDIA RTX GPUs. It connects image generation, video synthesis, and language models into… Source

Joel Pennington·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Assessing the Role of Intersection Proximity in Pedestrian Crashes: Insights from Data Mining Approach

Data mining analysis of pedestrian crash patterns near intersections in Louisiana (2017-2021), using distance-to-intersection framework.

Ahmed Hossain·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Measuring research data reuse in scholarly publications using generative artificial intelligence: Open Science Indicator development and preliminary results

PLOS and DataSeer use LLMs to measure research data reuse in scholarly publications, finding 43% reuse rate via AI-based detection.

Lauren Cadwallader·2 months ago

TechCrunch AI· PRESS

Salesforce is crowdsourcing its AI roadmap — with customers

Salesforce lets its customers lead its product roadmap with the thinking that if one enterprise customer has a problem, the others likely do too.

Rebecca Szkutak·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses

RHyVE framework verifies and deploys LLM-generated reward hypotheses in RL, accounting for policy competence and training phase.

Feiyu Wu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer's Disease Progression and Dynamic Tracking

PROMISE-AD predicts Alzheimer's disease progression from tabular clinical histories using survival estimation with leakage mitigation.

Qing Lyu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems

Scoping review identifies organizational and technical factors driving non-development and abandonment of AI systems pre-deployment.

Shreya Chappidi·2 months ago

The Verge AI· PRESS

Here’s how the new Microsoft and OpenAI deal breaks down

Microsoft's relationship with OpenAI has always been complicated, so I expected the close partnership-turned-situationship to end in tears. After all, executive disagreements, rearranged contracts, and frustrations over AI infrastructure have all regularly been part of the partnership, creating plenty of tension along the way. But against all odds, Microsoft and OpenAI divorced this week in a way that looks strangely amicable. Microsoft announced the updates to its long-standing OpenAI deal on Monday, with the most important change allowing OpenAI to make its products and services available a...

Tom Warren·2 months ago

The Verge AI· PRESS

Gemini is rolling out to cars with Google built-in

Here’s an early look at the new Gemini assistant on a vehicle infotainment system. | Image: Google Google is preparing to update vehicles that have Google built-in with its Gemini AI assistant. This will be an upgrade from the current Google Assistant according to Google's announcement, and promises to provide an improved experience for natural conversations, fetching vehicle-specific information, settings adjustments, and more. "When cars with Google built-in first hit the road in 2020, we made a commitment that your car will get better over time," Google senior product manager Alankar Agnih...

Jess Weatherbed·2 months ago·+ covered by others

MIT Tech Review· PRESS

This startup’s new mechanistic interpretability tool lets you debug LLMs

The San Francisco–based startup Goodfire just released a new tool, called Silico, that lets researchers and engineers peer inside an AI model and adjust its parameters—the settings that determine a model’s behavior—during training. This could give model makers more fine-grained control over how this technology is built than was once thought possible. Goodfire claims Silico…

Will Douglas Heaven·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agent-Agnostic Evaluation of SQL Accuracy in Production Text-to-SQL Systems

STEF enables schema-agnostic evaluation of text-to-SQL agents in production without ground-truth queries, addressing real-world deployment gaps.

Taslim Jamal Arif·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception

Study finds persona prompting in multimodal LLMs produces stable but limited behavioral variation in urban sentiment judgment tasks.

Neemias B da Silva·2 months ago

r/LocalLLaMA· COMMUNITY

New Stealth Model : Owl Alpha

Reddit speculation on Owl Alpha, an unidentified model with 1M context window and China-based safety patterns.

u/Kingwolf4·2 months ago·40 pts / 37 comm

r/LocalLLaMA· COMMUNITY

Are Qwen 3.6 27B and 35B making other ~30B models obsolete?

Qwen 3.6 27B/35B models outperform older ~30B alternatives (Qwen Coder, Gemma) on coding and agent tasks.

u/nikhilprasanth·2 months ago·41 pts / 79 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

CARE methodology systematizes LLM agent engineering in scientific domains via three-party collaboration between SMEs, developers, and helper agents.

Rahul Ramachandran·2 months ago

NVIDIA Dev Blog· INFRA

Automating GPU Kernel Translation with AI Agents: cuTile Python to cuTile.jl

NVIDIA CUDA Tile (cuTile) is a tile-based programming model that enables developers to write GPU kernels in terms of tile-level operations—loads, stores, and... NVIDIA CUDA Tile (cuTile) is a tile-based programming model that enables developers to write GPU kernels in terms of tile-level operations—loads, stores, and matrix multiply-accumulate—rather than manually coordinating threads, warps, and shared memory. cuTile.jl brings the same tile-based approach to the dynamic programming language Julia. Users can write custom GPU kernels without dropping… Source

Zhengyi Zhang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images

SpecVQA benchmark evaluates multimodal LLMs on spectral understanding with 620 expert-annotated scientific images across 7 spectrum types.

Jialu Shen·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Early Detection of Water Stress by Plant Electrophysiology: Machine Learning for Irrigation Management

ML framework detects water stress in tomato plants via electrophysiology signals for precision agriculture and irrigation optimization.

Eduard Buss·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Exponential families from a single KL identity

Theoretical paper derives unified KL identity for exponential families applicable to softmax, Gaussians, variational inference, and RLHF.

Marc Dymetman·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Ease of dependency distance minimization in star-like structures

Linguistic study on syntactic dependency distance minimization in star-like sentence structures; narrow theoretical interest.

Emília Garcia-Casademont·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Shuffling-Aware Optimization for Private Vector Mean Estimation

Differential privacy optimization for mean estimation in shuffle model; foundational theory without AI systems application.

Shun Takagi·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation

DriftBench evaluates constraint adherence across 7 LLM models in iterative ideation; shows models lose fidelity under refinement pressure.

Garvin Kruthof·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MIFair: A Mutual-Information Framework for Intersectionality and Multiclass Fairness

MIFair framework for bias assessment via mutual information; addresses intersectionality and multiclass fairness in ML systems.

Jeanne Monnier·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Reliable Answers for Recurring Questions: Boosting Text-to-SQL Accuracy with Template Constrained Decoding

TeCoD system improves Text-to-SQL accuracy via template-constrained decoding from query pattern reuse in labeled workloads.

Smit Jivani·2 months ago

r/singularity· COMMUNITY

Claude Mythos supports Image outputs - Anthropic's first image gen model

https://preview.redd.it/u1ik0uejlcyg1.png?width=1080&format=png&auto=webp&s=d2ea7758fbfe5fdf2b65a3a79f2bb99711a07db8 As you can see in the outputs, Mythos can output images.

u/exordin26·2 months ago·107 pts / 22 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

FedHarmony: Harmonizing Heterogeneous Label Correlations in Federated Multi-Label Learning

FedHarmony addresses label correlation drift in federated multi-label learning across heterogeneous client datasets.

Zhiqiang Kou·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Universal statistical laws governing culinary design

Empirical study finds statistical laws in global recipe structures via NER; cultural/linguistic interest, not AI-relevant.

Ganesh Bagler·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Cost-Aware Learning

Cost-Aware SGD algorithm for finite-sum objectives with heterogeneous sampling costs; applied to RL with language models.

Clara Mohri·2 months ago

← Front Page30 stories

← Newer Older →

The Archive

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

A Unified Framework of Hyperbolic Graph Representation Learning Methods

How to Build, Run, and Scale High-Quality Creator Workflows in ComfyUI

Assessing the Role of Intersection Proximity in Pedestrian Crashes: Insights from Data Mining Approach

Measuring research data reuse in scholarly publications using generative artificial intelligence: Open Science Indicator development and preliminary results

Salesforce is crowdsourcing its AI roadmap — with customers

RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses

PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer's Disease Progression and Dynamic Tracking

To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems

Here&#8217;s how the new Microsoft and OpenAI deal breaks down

Gemini is rolling out to cars with Google built-in

This startup’s new mechanistic interpretability tool lets you debug LLMs

Agent-Agnostic Evaluation of SQL Accuracy in Production Text-to-SQL Systems

Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception

New Stealth Model : Owl Alpha

Are Qwen 3.6 27B and 35B making other ~30B models obsolete?

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

Automating GPU Kernel Translation with AI Agents: cuTile Python to cuTile.jl

SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images

Early Detection of Water Stress by Plant Electrophysiology: Machine Learning for Irrigation Management

Exponential families from a single KL identity

Ease of dependency distance minimization in star-like structures

Shuffling-Aware Optimization for Private Vector Mean Estimation

Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation

MIFair: A Mutual-Information Framework for Intersectionality and Multiclass Fairness

Reliable Answers for Recurring Questions: Boosting Text-to-SQL Accuracy with Template Constrained Decoding

Claude Mythos supports Image outputs - Anthropic's first image gen model

FedHarmony: Harmonizing Heterogeneous Label Correlations in Federated Multi-Label Learning

Universal statistical laws governing culinary design

Cost-Aware Learning

Here’s how the new Microsoft and OpenAI deal breaks down