Vol. I · No. 52 · WED, JUN 10, 2026

The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

My company started measuring our Claude Code usage - now I'm asked to rank engineers on 'AI performance.' This feels wrong...

My company started tracking Claude Code usage - tokens and spend, that kind of thing. Now my manager wants me to stack-rank my engineers on "AI performance" using those numbers. I'm not comfortable with it (but I don't have a choice either). Token usage feels like exactly the wrong proxy - my strongest engineer uses Claude surgically while someone burning 10x the tokens isn't 10x more productive (often the opposite). Ranking on this just teaches people to game the metric. So, for folks here who use Claude daily and/or lead teams: * Has your company started measuring "AI performance"? How a...

··

Anyone else gotten something like this?

I wasn't even on my computer or Claude on May 25 when they said there was suspicious activity. Any ideas about what I should do? I emailed Anthropic a response but am not sure what else I can or should do. I'm further worried that if someone who is trying to do something weird on my account has access to my account... it just makes me feel very concerned, and like I am probably not the only case. https://preview.redd.it/k9rlklponj3h1.png?width=1028&format=png&auto=webp&s=de46e3c86a80b1c491d50fcda3a7e7ba7dfa7d76

··

Sonnet 4.5 disappeared? Claude 4.8 soon?

https://preview.redd.it/j0ymp70a2j3h1.png?width=746&format=png&auto=webp&s=4cdb70be13ccc99f5ea57556da96d6d81e61d702 I just realized they removed Sonnet 4.5. Does that mean Sonnet 4.8 (maybe Opus 4.8 too?) is coming soon? Maybe today or tomorrow. Excited to see a new Claude model; I hope Anthropic actually ships a really good model this time. What are your assumptions?

··

How do you decide when to start a new Claude session or branch?

I’m trying to understand how people think about session and branch hygiene when using Claude. When do you create a brand new session versus continue in an existing one? And when do you create a new branch versus just keep working in the same thread? For example, do people generally create a new branch for every unrelated task they want to accomplish, almost like a separate workspace? Or do you only branch when you are exploring a different direction on the same underlying problem? I’m mostly trying to avoid two failure modes: 1. Keeping too much unrelated context in one session and confu...

··

Maat: The Agentic Legal Research Assistant for Competition Protection

Competition law experts conducting legal research must review extensive volumes of cases, decisions, and judicial reports to identify precedents and assess key elements in competition and merger cases. Although general research assistants such as Claude and ChatGPT and legal assistants such as SaulLM-7B and LegalGPT are increasingly used to assist legal research, they remain inadequate for competition law analysis: they lack specialized domain expertise, provide insufficient official citations, or hallucinate competition law cases. We propose Maat, a ReAct agent that orchestrates tools corres...

·

me at hour 3 of prompting claude to verify something i could've just checked myself

ok so i was working on this project and INSTEAD of just doing it i kept asking claude ai to verify all the requirements were met right like i would go "did you complete EVERYTHING" it would go "yes all done :))" and then i check myself and requirements 5 and 6 are just. missing so i tell it to fix it same thing happens requirements 5 and 6 still missing i do this maybe 6 or 7 times before i realise bro i could've just written requirements 5 and 6 myself in like 10 minutes, like PLEASE and then the same you were right to pushback on that... lmao at some point u gotta add a lil bit of ur own brai...

··

That is load-bearing.

I know this topic is discussed here a lot but I SWEAR TO FUCKING GOD if I read another "That is real" OR "That is not nothing" OR "That is not X but Y" I am going to have a fucking aneurysm. Yes I have specifically forbidden it from telling me these phrases, yes I have specifically updated the memory and spec to BAN these phrases yet they slip through and I swear sometimes it is so insanely creative in its reasoning for how to get around these constraints but it just kills the immersion(?) so hard when it falls back on these god damn tropes. I use Claude (Max) for absolutely everything, ...

··

Opus 4.7 Often Assumes a Military Audience

This is especially prevalent in Claude Desktop and less obvious in Claude Code.
* Often leads long assets with a BLUF (bottom line up front), specifically military-adjacent terminology.
* Has slipped into language where it refers to civilians several times, as though it means someone other than us
* References to things like deep-dive, vision-intent framing, force multipliers, and reframes are more subtle because they do cross over but their prevalence is unmistakable.
This isn't a complaint, it's more a note that, "hey guys, we see your training material in our outputs" and its interestin...

··

I let Claude rank every YC Spring 26 startup — round 2

Follow-up to my W26 post a few months back. Ran the same Claude pipeline on the YC Spring 26 (X26) batch. Same setup: for each company, Claude scrapes founder LinkedIn profiles, searches for press and traction signals, and checks the product to see if something real exists or it's just a landing page. Then it scores on founder credibility, product reality, market opportunity, and competition, and assigns a tier from S to D. Demo Day is June 16, so the batch is mid-flight and rankings will keep shifting as more companies launch. Most are B or C tier, which feels about right for this stage. ...

··

Uber president says AI spending is getting ‘harder to justify’

Uber president Andrew Macdonald (pictured) says it's “hard to draw a line” between AI spending and deliverable features. | Photo: Zed Jameson/Bloomberg via Getty Images After reportedly exhausting its annual AI budget just four months into 2026, Uber is now questioning whether it's actually seeing meaningful returns on its investments. In an interview with Rapid Response, Uber president and chief operating officer Andrew Macdonald said the company isn't seeing a connection between rising token consumption for Claude Code and more useful features being delivered to consumers. "That link is not ...

·

Why terminal

Hello, I'm on Windows, having set up both the Claude Code App and Terminal, but I find the App simply more convenient to use. I have had several people pushing me to use the Terminal, saying "the App is low" and "Terminal is so much better" ... but when I inquired, none of those people could actually name a single thing that the App would be missing (everything they mentioned the App has as well) or a single concrete reason why I should switch to Terminal besides vague phrases. So is the terminal substantially better than the App in something, are there reasons to switch besides being used to it and p...

··

Just passed the new Claude Certified Architect - Foundations (CCA-F) exam with a 985/1000!

The original post was removed by Reddit Filters, so I made a new one with the same content. I just got my results back today and managed to snag the Early Adopter badge as well. Following up on my recent DP-600 certification, I really wanted to validate my architecture skills specifically on the Anthropic side. The exam covers a lot of practical ground on prompt engineering for tool use, managing context windows efficiently, and handling Human-in-the-Loop workflows. Link to join: https://anthropic.skilljar.com/claude-certified-architect-foundations-access-request Training courses: https://anthr...

··

Annoying AI tell that seems to have spiked recently: "honest caveat"

I noticed that Claude Code was giving me a lot of unsolicited caveats with phrasing like "honest caveat" or "genuine caveat" when this kind of hedging was absolutely unnecessary. I figured other people might be seeing the same thing so my instinct was to use Google Ngram but the cutoff year of 2022 meant that I had to use a different method. So I used Google search with quotes around the phrase "honest caveat" and set the time bound to different time intervals and compared the number of search results as a proxy for how usage has changed over time in indexed pages. As it turns out, while del...

··

Any tips on forming a good memory file on yourself for claude?

I see that in non-coding chats Claude is always guided by the memory file and its responses are shaped by it. I feel like if you had a really solid memory file you could make a lot more progress with life-related things and other discussions with Claude. Has anyone explored this?

··

Weird Injection Prompt In Chat??

Claude inserted an injection prompt at the end of its message out of the blue. I have repeatedly asked where it got it from or why it inserted this message, but Claude keeps denying it ever did it. No matter how many screenshots or replies I use, Claude just denies it; it went as far as saying there could be a physical sticker on my screen, but it won't accept that it said this. I am a uni student studying for an exam in 2 days, and I'm 19, so I don't understand.

··

Does the “Indexing” status ever change in Claude Projects?

Hi everyone, I set up a Claude Project a few weeks ago, but the status indicator in the top right corner has never changed from "Indexing" and still shows that little black dot. Does this status ever clear up once processing finishes? Also, my larger PDFs aren't showing visual thumbnails like the smaller files do. The cards just show the total number of lines in the text. Does this indicate a processing failure, and would splitting these large documents into smaller files help clear the indexing queue? Thanks for any and all suggestions.

··

What happened??

Reddit user reports account suspension after minimal Claude usage for academic research assistance on Parkinson's Disease methodology.

··

I've been using Claude Code as a motion graphics engine for my YouTube videos. It writes the JSX, I render. Edit time roughly halved.

Found a really clean Claude Code use case that's not coding-coding. Remotion (React for video) means motion graphics are JSX components. So I describe what I want in plain English, Claude Code writes the component, I render. Lower thirds, intros, overlays, all reusable across videos. Iterations are seconds instead of the typical "drag clips around in CapCut for an hour" loop. Visual style is finally consistent across my channel because the components are shared. 13 min walkthrough on my channel, link in comments to avoid spam vibes.

··

I stress-tested Kimi K2.6 against Claude Opus 4.7 on a quick coding-agent task

I tested Claude Opus 4.7 and Kimi K2.6 on the same coding agent task i.e. build an AI Fix Runner that takes a broken repo, runs its tests, identifies the failure, applies a patch, reruns the test, and exposes the final diff/logs through an API and UI. The goal was not to benchmark syntax completion or simple repo edits. I wanted to test model behavior on a less familiar integration path: shifting execution from local processes into remote sandboxes. I used Tensorlake specifically because the sandbox API is newer and integration-heavy. This made the test more about whether the model could re...

··