The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

A Komi-Yazva–Russian Parallel Corpus and Evaluation Protocol for Zero- and Few-Shot LLM Translation

We present the first Komi-Yazva–Russian parallel corpus together with an explicit evaluation protocol for studying LLM translation in an endangered, extremely low-resource setting. The dataset contains 457 aligned sentence pairs from 74 narrative texts and is accompanied by documented provenance, sentence-level alignment, and story identifiers that enable leakage-aware evaluation. We use this setup to compare modern large language models on Komi-Yazva-to-Russian translation under severe parallel-data scarcity in zero-shot and retrieval-based few-shot regimes. The protocol includes story-leve...

Petr Parshakov · 2 months ago

Double Preconditioning (DoPr): Optimization for Test-Time Performance, not Validation Loss

Unsupervised Skill Discovery for Agentic Data Analysis

A Vision-language Framework for Comparative Reasoning in Radiology

CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments

The Post-GCN Decade Revisited: Curvature-Stratified Evaluation of Relational Learning

Risk Assessment of Autonomous Driving: Integrating Technical Failures, Ethical Dilemmas, and Policy Frameworks

Proper Scoring Rules for Right-Censored Survival Data

Conformal Risk Sharing: Certified Cost Allocation with Participation Guarantees

HomeWorld: A Unified Floorplan-to-Furnished Framework for Generating Controllable, Densely Interactive Whole-Home Scenes

Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

Learned Response-Field Inertia Operator for HEC-RAS 2D Water-Surface Elevation Prediction

Emergent Language as an Approach to Conscious AI

EasyLens: A Training-Free Plug-and-Play Subtle-Lesion Representation Amplifier for Medical Vision-Language Models

Rethinking Infrastructure Inspection as Image Difference Classification: A Traffic Sign Case Study

LatentWave: JEPA Pretraining for Wireless Foundation Models

Quoting Emanuel Maiberg, 404 Media

End-to-End Subgraph Detection with GraphDETR

Meta rolls out a new AI creator assistant on Facebook

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

An Infectious Disease Spread Simulation Based on Large Language Model Decision Making

F3-Tokenizer: Taming Audio Autoencoder Latents for Understanding and Generation

Where Should Knowledge Enter? A Layered Framework for Knowledge Infusion in Multimodal Iterative Generative Mo

Maximising the Set-Piece Return: Optimising Football Corner Tactics with Graph Reinforcement Learning

Function-Space Priors for Bayesian Neural ODEs with Application to Vessel Trajectory Prediction

EDIT: Evidence-Diagnosed Intervention Training for Rule-Faithful LLM Grading

"Chi nas dal soch el sent de legn" -- Auditing Text Corpora for Lombard

Performance Evaluation of GraphCast for Medium-Range Weather Forecasting over Brazil

Attack Detection using Time Series Foundation Models

Boosting Brain-to-Image Decoding with TRIBE v2 Data Augmentation