Hacker News

Subscribe to Hacker News feed
Hacker News RSS
Updated: 45 min 40 sec ago

Show HN: Agent-estimate, how long a coding task takes, at agent speed

Thu, 05/21/2026 - 6:46pm

I have used Codex & Claude Code for coding for a while, but how long a coding task will actually take? When I ask Claude Code to estimate, the result is often from training data, which is based on human speed. That’s why I built this tool, to estimate effort in ai agent speed. I run it every morning before I dispatch coding tasks to my agents.

What's in it: task sizing: auto-classifies XS to XL from the description, then runs PERT on that tier human-equivalent comparison: a per-task-type multiplier so you see the speedup METR p80 thresholds: warns when an estimate exceeds a model's reliability horizon wave planning: schedules independent tasks in parallel across a multi-agent fleet

The estimation data is from my daily coding tasks from past few weeks: per-runtime calibration: Opus 4.7, GPT-5.5, different models have different reliability horizons and costs per-task-type priors: backend, frontend, app development, docs, and brainstorm PR review: I usually let Codex and Claude Code review each other’s code, and the tool takes that into consideration a calibration loop that keeps me honest: dispatch data is validated at end of day by my coordinator agent

Try it: pip install agent-estimate, read the code https://github.com/kiloloop/agent-estimate/ , or the writeup https://kiloloop.com/agent-estimate/

Comments URL: https://news.ycombinator.com/item?id=48229752

Points: 2

# Comments: 0

Categories: Hacker News

An LLM on a Sony PSP

Thu, 05/21/2026 - 6:10pm
Categories: Hacker News

Creating a Full PII Framework for Agents

Thu, 05/21/2026 - 5:59pm

Article URL: https://pii-firewall.com/

Comments URL: https://news.ycombinator.com/item?id=48229356

Points: 1

# Comments: 0

Categories: Hacker News

Pages