The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Concordance Comparison as a Means of Assembling Local Grammars

Concordance comparison method assembles local grammars for Portuguese named entity recognition via pairwise grammar analysis.

Juliana Pirovani·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

EvoNav: Evolutionary Reward Function Design for Robot Navigation with Large Language Models

EvoNav uses LLMs to evolve reward functions for robot navigation via reinforcement learning, automating design of navigation policies.

Zhikai Zhao·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond Parameter Aggregation: Semantic Consensus for Federated Fine-Tuning of LLMs

Semantic consensus framework enables federated fine-tuning of LLMs through model behavior rather than parameter aggregation, supporting heterogeneous architectures.

Amr Abourayya·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs

UniVLR unifies text and vision in multimodal LLMs by rendering reasoning traces as shared visual workspace, improving latent reasoning efficiency.

Houcheng Jiang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Improving the Performance and Learning Stability of Parallelizable RNNs Designed for Ultra-Low Power Applications

Bistable Memory Recurrent Units (BMRU) improve gradient flow and stability for ultra-low power sequence models via hardware-software co-design.

Julien Brandoit·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Self-Distilled Trajectory-Aware Boltzmann Modeling: Bridging the Training-Inference Discrepancy in Diffusion Language Models

Self-distilled trajectory-aware Boltzmann modeling closes training-inference gap in diffusion language models via multi-step denoising trajectories.

Kecheng Chen·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

GEAR: Granularity-Adaptive Advantage Reweighting for LLM Agents via Self-Distillation

GEAR enables fine-grained credit assignment in RL-trained LLM agents via adaptive-granularity advantage reweighting at token and segment levels.

Sijia Li·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Constrained Stochastic Spectral Preconditioning Converges for Nonconvex Objectives

Proximal spectral gradient methods with convergence guarantees for nonconvex constrained optimization under heavy-tailed noise.

Konstantinos Oikonomidis·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Fast and Energy-Efficient Latch-Based Memristive Analog Content-Addressable Memory

Memristor-based analog content-addressable memory architecture for edge AI inference with improved scalability.

Paul-Philipp Manea·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Martingale-Consistent Self-Supervised Learning

Self-supervised learning framework enforcing martingale consistency across coarse and refined data views.

Moritz Gögl·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Probabilistic Calibration Is a Trainable Capability in Language Models

Fine-tuning approach to improve language model calibration for user-specified output probability distributions.

Davide Baldelli·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Minimax Rates and Spectral Distillation for Tree Ensembles

Spectral analysis of tree ensembles deriving minimax convergence rates and compression schemes for random forests and gradient boosting.

Binh Duc Vu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Trade-offs in Decentralized Agentic AI Discovery Across the Compute Continuum

Comparative analysis of decentralized agent discovery mechanisms (Chord, Pastry, Kademlia) across edge and cloud compute.

Patrizio Dazzi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Gradient Clipping Beyond Vector Norms: A Spectral Approach for Matrix-Valued Parameters

Spectral gradient clipping method preserving matrix structure and controlling singular value decay during neural network training.

Alexander Yukhimchuk·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing

Lifelong normalization mechanism enabling stable sequential model editing in LLMs without catastrophic forgetting.

Xin Ma·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Multi-Timescale Conductance Spiking Networks: A Sparse, Gradient-Trainable Framework with Rich Firing Dynamics for Enhanced Temporal Processing

Spiking neural network framework with gradient-trainable multi-timescale conductance dynamics for sparse temporal processing.

Alex Fulleda-Garcia·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Bin Latent Transformer (BiLT): A shift-invariant autoencoder for calibration-free spectral unmixing of turbid media

Shift-invariant transformer autoencoder for calibration-robust spectral unmixing in pharmaceutical and biomedical analysis.

Martin Hohmann·1 month ago

r/ClaudeAI· COMMUNITY

Converted Karpathy's coding skill from Pro to free plan. Here's the full thing:

The Karpathy coding skill is locked behind Pro. It doesn't use any Pro-only features, so I rewrote it for free plan chat workflows. Same philosophy, tuned for no terminal, no subagents, and a shorter context window where mistakes are expensive. Paste the whole thing into a Project's custom instructions or use it as a system prompt. It auto-triggers on any coding request. --- name: karpathy-coding description: Apply Karpathy-inspired coding discipline to any programming task. Use this skill whenever the user asks you to write, fix, refactor, extend, or review code — even casually...

u/flarenz·1 month ago·22 pts / 6 comm

r/ClaudeAI· COMMUNITY

I may have uncovered the real reason they're sunsetting Sonnet 4.5. They could barely contain its true power

u/purloinedspork·1 month ago·32 pts / 5 comm

r/singularity· COMMUNITY

GPT-5.5 was used to flag fatal errors in FrontierMath problems

FrontierMath is supposed to be one of the hard benchmarks for frontier models, and now Epoch is saying an AI-assisted review found fatal errors in about a third of Tiers 1-4. Noam Brown says the initial flags came from GPT-5.5. Obviously we’ll have to wait for the corrected scores, but this is a pretty interesting moment: the model is already strong enough to sanity-check the benchmark.

u/Eyeswideshut_91·1 month ago·121 pts / 20 comm

r/ClaudeAI· COMMUNITY

I made an AI concierge for my wedding guests. The second most popular thing they did with it was try to jailbreak it.

Wedding guest deployed Claude-based concierge; users attempted jailbreak attempts as second most common interaction.

u/Thin_Sky·1 month ago·90 pts / 25 comm

r/ClaudeAI· COMMUNITY

Claude Code just shipped a "run until done" mode. Upgrade to v2.1.139 for /goal.

Morning Everyone! Big one today (**104 changes!**): Claude Code just went async. The new `/goal` command lets you set a completion condition ("all tests pass and the PR is ready"), then Claude keeps grinding across turns until it's hit. The new `claude agents` view shows every session you've got running: working, blocked on you, or done. Translation: kick off a goal -> let claude cook -> come back later. First proper fire-and-forget loop CC has shipped. Pretty huge unlock if you've been juggling multiple sessions and losing track of which one needs you. Full notes: [https://www.luk...

u/oh-keh·1 month ago·29 pts / 8 comm

r/ClaudeAI· COMMUNITY

Why Claude users are systematically missing from AI psychology research (and what that means)

Reddit post identifies systematic absence of Claude users from published AI psychology research, raising methodological concerns about chatbot adoption studies.

u/esuremu·1 month ago·20 pts / 19 comm

r/OpenAI· COMMUNITY

When you ask ChatGPT a question about VSCode but it pulls in VictoriaSecret for context 😂

Reddit user posts anecdote about ChatGPT confusing VSCode with Victoria's Secret in context retrieval.

u/DollarAkshay·1 month ago·105 pts / 11 comm

TechCrunch AI· PRESS

Thinking Machines wants to build an AI that actually listens while it talks

Right now, every AI model you've ever used works the same way. You talk, it listens. It responds, you listen. Thinking Machines is trying to change that by building a model that processes your input and generates a response at the same time, so it's more like a phone call than a text chain.

Connie Loizos·1 month ago

r/singularity· COMMUNITY

Unitree Launches World’s First Mass-Produced Manned Mecha GD01

Unitree announces GD01, a manned exoskeleton mecha; hardware milestone with unclear AI integration or technical specs.

u/givemeanappple·1 month ago·172 pts / 43 comm

r/OpenAI· COMMUNITY

ChatGPT seeing me write a whole sentence by myself

User observation about ChatGPT's real-time display behavior during text input; no technical substance.

u/imfrom_mars_·1 month ago·165 pts / 10 comm

Latent Space· ANALYST

[AINews] Thinking Machines' Native Interaction Models - TML-Interaction-Small 276B-A12B - advances SOTA Realtime Voice and kills standard VAD

well done, Team Thinky.

Latent Space·1 month ago

r/OpenAI· COMMUNITY

Ex OpenAI CTO Mira Murati is giving them a serious fight for the bucks. Her new “Interaction Model” makes “GPT-Realtime-2” look like caveman, current capabilities level wise

Reddit post claims Mira Murati's new project outperforms OpenAI's GPT-Realtime-2; lacks specifics on model name, capabilities, or verification.

u/py-net·1 month ago·72 pts / 14 comm

r/MachineLearning· COMMUNITY

ICML Author Removal [D]

PhD student. Need advice. After the ICML abstract deadline, industry coauthors asked to be removed, they missed their employer's internal approval window. They had contributed (discussions and written feedback) but I hadn't explicitly asked before adding them. January: wrote to PC chairs, got written confirmation from all coauthors, got explicit written approval. Chairs said they'd implement. Never happened. Paper accepted four months later with original author list. At camera-ready we followed up. Chairs reversed: blanket policy, no exceptions, keep the list or withdraw. What do you t...

u/Prize_Hospital6525·1 month ago·30 pts / 14 comm

← Front Page30 stories

← Newer Older →