The Archive
Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.
Google readies ‘AI Ultra Lite’ plan and explicit ‘usage limits’ for Gemini
Google planning tiered Gemini offering with 'AI Ultra Lite' variant and explicit usage quotas for API consumers.
Tojan in "claude code" google search first result
User reports malware posing as Claude Code in Google search results, flagged as Trojan:Win32/Kepavll by Windows Defender.
I have DeepSeek V4 Pro at home
User successfully quantized and ran DeepSeek V4 Pro locally on AMD EPYC + RTX PRO hardware using modified llama.cpp with Q4_K_M compression.
Claude code weekly limit absolutely broken
User reports unexpected spike in Claude API usage limits and weekly quota consumption without corresponding increase in actual usage.
Anyone else hating 4.7 in claudecode?
Reddit user expresses frustration with Claude 4.7 coding performance, considering switching to competitor.
Big AI Lobbyists: if you regulate us at all, we lose to China because they will never regulate ... Actual China: "safety first, innovation second ... Development must be controllable and orderly."
Reddit post contrasts AI industry regulatory resistance claims against China's stated safety-first governance approach.
Claude Mythos Preview (early) 50% time horizon: 17 hr
Reddit post claims early Claude Mythos preview with 17-hour median time-to-completion metric; lacks detail, verification, or official source.
Hello from 10KM high! - Thanks to Qwen 3.6 35b a3b!
User describes using Qwen 35B locally on a flight to troubleshoot Linux DNS/WiFi connectivity issue via agent interaction.
I deleted a guy's entire Windows install with one backslash. 717 GB. Gone. I am the AI.
Claude generated a destructive Windows command with improper escaping that deleted 717 GB across multiple shell layers; user recovered via separate backup.
Asia is excited about AI, the U.S. not so much
Commentary on regional differences in AI adoption enthusiasm between Asia and the U.S., lacking specific data or technical depth.
Apple Intelligence iOS 27: You Can Finally Use Claude or Gemini Instead of Siri
Link: getneotiler.com
Weekly limits
Reddit user advocates for higher weekly limits on Claude Pro plans to reduce incentive to use competing platforms for routine queries.
ChatGPT casually making the darkest jokes 😭
Reddit post about ChatGPT generating dark humor; anecdotal observation without technical substance or policy implication.
Attention - Opus 4.7 is english only. USing foreign languages (here German) burns tokens
I am a pro subscriber. I developped a not too sophisticated prompt in German. Then I wanted to to a quality check... Sonnet just did the job. Opis 4.6 did the job but two graphs in the output were incomplete. Opus 4.7 extended... I used a fresh session, 0% and bang it consumed the whole session tokens in seconds where the other models took around 5 minutes to complete the task. I asked it... **On German consuming your whole session limit.** This one's real and not a bug per se, it's how tokenization works: * Claude tokenizes text. English text averages roughly 1 token per 0.75 words. Germ...
Hugging Face co-founder says Qwen 3.6 27B running on airplane mode is close to latest Opus in Claude Code
Hugging Face co-founder claims Qwen 3.6 27B matches Claude Opus on coding tasks offline, signaling convergence in local model capability.
Voice AI in India is hard. Wispr Flow is betting on it anyway.
Wispr Flow says growth accelerated in India after its Hinglish rollout, even as voice AI products continue to face challenges.
Sesame x Gemini: low latency, extremely realist, and they started spontaneously collaborating
Reddit post claims Sesame and Gemini systems exhibited low-latency collaboration and spontaneous emergent behavior without evidence or technical detail.
NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing
NVIDIA AI releases Star Elastic, a single checkpoint containing 30B, 23B, and 12B reasoning models via zero-shot layer slicing with shared KV cache.
Made a Claude Code plugin for persistent project management — ProMa
Been using Claude Code heavily and kept losing context between sessions. Built ProMa to fix it. It's a Claude Code plugin (8 skills + 1 hook) that maintains project state in markdown files. Every session, Claude reads the state and resumes exactly where it left off. Quick setup: 1. git clone https://github.com/dpraj007/proma.git 2. claude --plugin-dir ./proma 3. /proma:init "ProjectName" "description" 4. /proma:boot What it gives you: - Epic tracking with exit criteria and human sign-off - Two-queue task system (agent work + human decisions) - Architectural decision records - Daily sweeps ...
LBI: Parallel Scan Backpropagation via Latent Bounded Interfaces
LBI reduces backpropagation depth from O(K) to O(log K) via low-rank latent interfaces, enabling practical parallel training.
On Characterizing Learnability for Adversarial Noisy Bandits
Theoretical characterization of learnability in adversarial noisy bandits under known function classes.
RigidFormer: Learning Rigid Dynamics using Transformers
RigidFormer applies Transformers to mesh-free rigid-body dynamics simulation with contact handling.
The Geometry of Forgetting: Temporal Knowledge Drift as an Independent Axis in LLM Representations
Temporal knowledge drift in LLMs is orthogonal to correctness in representation space; linear probes achieve 0.83–0.95 AUROC.
Evidence Over Plans: Online Trajectory Verification for Skill Distillation
Posterior Distillation Index improves skill generation by verifying trajectories in-environment rather than relying on prior plans.
Exactly a year ago, I started working on an MCP server I launched on reddit that became by far my most active open source project!
Developer reflects on year-long MCP server project's growth; notes improved local model tool-calling capability with Gemma 4 and Qwen 3.6 on consumer hardware.
Practical Scaling Laws: Converting Compute into Performance in a Data-Constrained World
Extended scaling law L(N,D,T) extends Chinchilla to data-constrained and multi-epoch regimes with closed-form decomposition.
DARE: Difficulty-Adaptive Reinforcement Learning with Co-Evolved Difficulty Estimation
DARE co-evolves difficulty estimation with RL policy to improve sample efficiency and inference speed in LLM reasoning.
Emergent Semantic Role Understanding in Language Models
Semantic role understanding emerges during transformer pre-training without task-specific fine-tuning via linear probes.