Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling
Study on German language model training trade-offs between data diversity and quality filtering, testing hierarchical filters on 500M documents.
Open-source framework for unified evaluation and comparison of hyperbolic graph representation learning methods across implementations.
Creative and visualization teams today produce more assets, in more formats, with leaner teams. Generative AI can accelerate that work – compressing tasks that once took hours of manual effort into automated, repeatable pipelines. ComfyUI is an open-source, node-based creative tool that runs locally on NVIDIA RTX GPUs. It connects image generation, video synthesis, and language models into…
Data mining analysis of pedestrian crash patterns near intersections in Louisiana (2017-2021), using distance-to-intersection framework.
PLOS and DataSeer use LLMs to measure research data reuse in scholarly publications, finding 43% reuse rate via AI-based detection.
Salesforce lets its customers lead its product roadmap with the thinking that if one enterprise customer has a problem, the others likely do too.
RHyVE framework verifies and deploys LLM-generated reward hypotheses in RL, accounting for policy competence and training phase.
PROMISE-AD predicts Alzheimer's disease progression from tabular clinical histories using survival estimation with leakage mitigation.
Scoping review identifies organizational and technical factors driving non-development and abandonment of AI systems pre-deployment.
Microsoft's relationship with OpenAI has always been complicated, so I expected the close partnership-turned-situationship to end in tears. After all, executive disagreements, rearranged contracts, and frustrations over AI infrastructure have all regularly been part of the partnership, creating plenty of tension along the way. But against all odds, Microsoft and OpenAI divorced this week in a way that looks strangely amicable. Microsoft announced the updates to its long-standing OpenAI deal on Monday, with the most important change allowing OpenAI to make its products and services available a...
Google is preparing to update vehicles that have Google built-in with its Gemini AI assistant. This will be an upgrade from the current Google Assistant, according to Google's announcement, and promises to provide an improved experience for natural conversations, fetching vehicle-specific information, settings adjustments, and more. "When cars with Google built-in first hit the road in 2020, we made a commitment that your car will get better over time," Google senior product manager Alankar Agnih...
The San Francisco–based startup Goodfire just released a new tool, called Silico, that lets researchers and engineers peer inside an AI model and adjust its parameters—the settings that determine a model’s behavior—during training. This could give model makers more fine-grained control over how this technology is built than was once thought possible. Goodfire claims Silico…
STEF enables schema-agnostic evaluation of text-to-SQL agents in production without ground-truth queries, addressing real-world deployment gaps.
Study finds persona prompting in multimodal LLMs produces stable but limited behavioral variation in urban sentiment judgment tasks.
Reddit speculation on Owl Alpha, an unidentified model with 1M context window and China-based safety patterns.
Qwen 3.6 27B/35B models outperform older ~30B alternatives (Qwen Coder, Gemma) on coding and agent tasks.
CARE methodology systematizes LLM agent engineering in scientific domains via three-party collaboration between SMEs, developers, and helper agents.
NVIDIA CUDA Tile (cuTile) is a tile-based programming model that enables developers to write GPU kernels in terms of tile-level operations (loads, stores, and matrix multiply-accumulate) rather than manually coordinating threads, warps, and shared memory. cuTile.jl brings the same tile-based approach to the dynamic programming language Julia. Users can write custom GPU kernels without dropping…
SpecVQA benchmark evaluates multimodal LLMs on spectral understanding with 620 expert-annotated scientific images across 7 spectrum types.
ML framework detects water stress in tomato plants via electrophysiology signals for precision agriculture and irrigation optimization.
Theoretical paper derives unified KL identity for exponential families applicable to softmax, Gaussians, variational inference, and RLHF.
Linguistic study on syntactic dependency distance minimization in star-like sentence structures; narrow theoretical interest.
Differential privacy optimization for mean estimation in shuffle model; foundational theory without AI systems application.
DriftBench evaluates constraint adherence across 7 LLMs in iterative ideation; shows models lose fidelity under refinement pressure.
MIFair framework for bias assessment via mutual information; addresses intersectionality and multiclass fairness in ML systems.
TeCoD system improves Text-to-SQL accuracy via template-constrained decoding from query pattern reuse in labeled workloads.
https://preview.redd.it/u1ik0uejlcyg1.png?width=1080&format=png&auto=webp&s=d2ea7758fbfe5fdf2b65a3a79f2bb99711a07db8 As you can see in the outputs, Mythos can output images.
FedHarmony addresses label correlation drift in federated multi-label learning across heterogeneous client datasets.
Empirical study finds statistical laws in global recipe structures via NER; cultural/linguistic interest, not AI-relevant.
Cost-Aware SGD algorithm for finite-sum objectives with heterogeneous sampling costs; applied to RL with language models.