mdok-style at SemEval-2026 Task 10: Finetuning LLMs for Conspiracy Detection
mdok-style system finetuned Qwen3-32B using data augmentation for SemEval-2026 conspiracy detection task, ranking 8th of 52 submissions.
Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.
mdok-style system finetuned Qwen3-32B using data augmentation for SemEval-2026 conspiracy detection task, ranking 8th of 52 submissions.
Elon Musk made inflammatory public comments toward OpenAI leadership over corporate governance disputes.
Empirical study of 557 healthcare agent skills from ClawHub showing capability gaps and governance challenges for cross-setting AI agent deployment.
SAIL framework provides anatomy-aligned post-hoc explanations for OCT-based retinal disease detection using structure-aware interpretable learning.
Federated reinforcement learning approach for mobile crowdsensing platform optimization under dynamic task and resource constraints.
ProPACT is a proactive adaptive collaborative tutor using multimodal dyadic learner models to optimize pair programming collaboration.
PS-Clip-SGD gradient clipping method achieves optimal convergence rates for non-convex optimization under heavy-tailed noise with high-probability guarantees.
PIEGraph combines analytical physics with graph neural networks for data-efficient object dynamics learning in robotic manipulation of rigid and deformable bodies.
mdok-style system applied QLoRA finetuning on mid-size LLMs for SemEval-2026 multilingual polarization detection across detection, type, and manifestation subtasks.
OpenAI accuses Musk of trying to "coerce" a settlement days before trial started.
Fréchet-based random-effects algorithm for non-Euclidean data in metric spaces; extends mixed-effects models beyond Hilbert spaces.
ParaRNN: parallelizable RNN architecture combining autoregressive models with neural networks for interpretable time-series forecasting.
Bayesian optimization with dimensionality reduction for tuning Hyperledger Fabric blockchain configuration parameters via black-box benchmarking.
Elon Musk reportedly messaged Sam Brockman regarding settlement discussions in ongoing legal dispute, per court filing.
MSMixer: multi-scale MLP architecture with parallel branches at different temporal resolutions for long-term time-series forecasting.
Spectral Model eXplainer: domain-grounded explainability framework for chemometrics ML models operating on spectral data.
Zero-trust authorization framework for LLM agents with hybrid inspection and task-based access control to mitigate tool-use and resource-access risks.
Structural Causal Decision Models for modeling multi-agent economics with cognitive resource constraints and value discounting.
Online generalised predictive coding: extension of data assimilation for joint state inference, parameter learning, and uncertainty estimation.
ACII-DaiKon 2026 benchmark for dyadic conversation affect modeling: interpersonal influence, timing coordination, and rapport development challenges.
DILER Benchmark: drug-induced liver injury dataset with mechanistic hypotheses; reframes DILI prediction as explainable hypothesis generation.
Community demo comparing Talkie-1930 (13B retro LM) and Gemma 4 31B in side-by-side chat on Opper.ai platform.
The BOGO offer is live. For a limited time, buy one pass to TechCrunch Disrupt 2026 and get 50% off a second of the same ticket type. Offer ends this Friday, May 8. Save here.
Reddit user reports Qwen 3.6 27B found a bug that GPT 5.5 and Claude Opus 4.7 missed, attributing success to extended reasoning.
After \~3 weeks of experimentation in OpenAI's Parameter Golf competition, I wrote up why SSMs are structurally disadvantaged relative to transformers in a time- and size-constrained regime (10 min training, 16MB artifact, 25M parameters) on 8xH100s: [https://mradassaad.github.io/posts/why-ssms-struggle-in-parameter-golf/](https://mradassaad.github.io/posts/why-ssms-struggle-in-parameter-golf/) Main findings: 1. SSM in\_proj weights compress up to 3.26x worse than attention QKV under LZMA, directly taxing the compressed parameter budget 2. Architectural wins validated at SP4096 flipped sign...
Social media report of user exploiting Grok chatbot to extract funds; unverified claim lacking technical details.
Anthropic partners with Blackstone, Hellman & Friedman, and Goldman Sachs to launch enterprise AI services company.
LLMSearchIndex: open-source Python library for local, offline web search with 200M indexed pages, enabling RAG without paid APIs.
DoorDash on Monday added new AI-powered tools that let merchants speed up onboarding, edit photos to make dishes look better, and create new websites from existing content.
Reddit post speculating about a hypothetical AI-themed game; lacks substantive technical or industry content.