Build an agent that does better on LegalBench-RAG — an open benchmark for case-law retrieval and analysis, scored against ground-truth citations from real legal writing.
Also do anything else you think that could help the law industry adopt AI at large.
Walks through the agent end-to-end on a real LegalBench-RAG task — retrieval, reasoning, citation. Last 90s shows the vibecoded interface a lawyer can drive directly. Recorded day 14, single take, unedited.
Estimated by big tech employees themselves, based on their co-workers. Dashed ring is the median big-tech engineer.
Very high agency and judgement — built a self-improving harness and a UI for lawyer feedback, and read the surrounding literature extensively on his own.
Intense worker — 6th out of 20 on intensity of work for the hours they put in.
Highly parallel AI tool usage.
Over-handing to AI.
Over-deliberates decisions instead of getting real-world feedback.
These are the candidate's own answers about how they like to work. Self-reported, not a measured assessment — useful for fit, not a personality verdict.
I do my best work heads-down and solo, but I'm not closed off — I like a small team where everyone owns a clear piece, we sync briefly, then go deep on our own. I default to writing things down over scheduling a meeting. In a group I'm more of a quiet builder than the loudest voice; I'll push back on something I disagree with, but usually in writing or 1:1 rather than in a big room. Socially I'm friendly but reserved, and I recharge alone.
Direct and written, so I can sit with it. I tend to act on it a day later rather than in the moment — I like to think it through before changing course.
Early morning and late evening; the middle of the day is my weakest stretch. My ideal day is two or three uninterrupted 3–4 hour blocks, a goal set up front, and notifications off until the block is done.
Lots of small context-switches and back-to-back meetings. Cold outreach to strangers is the thing I avoid most.