Feed aggregator
The Sunk Cost Fallacy and How It Influences Our Decisions
Article URL: https://almossawi.substack.com/p/the-sunk-cost-fallacy
Comments URL: https://news.ycombinator.com/item?id=48197879
Points: 1
# Comments: 0
Andrej Karpathy Joins Anthropic
Article URL: https://www.thevccorner.com/p/breaking-andrej-karpathy-joins-anthropic
Comments URL: https://news.ycombinator.com/item?id=48197859
Points: 2
# Comments: 0
Google Antigravity CLI
Article URL: https://antigravity.google/blog/introducing-google-antigravity-cli
Comments URL: https://news.ycombinator.com/item?id=48197843
Points: 2
# Comments: 0
Google introduces Gemini Spark, a 24/7 agentic assistant with Gmail integration
Article URL: https://techcrunch.com/2026/05/19/google-introduces-gemini-spark-a-24-7-agentic-assistant-with-gmail-integration/
Comments URL: https://news.ycombinator.com/item?id=48197827
Points: 1
# Comments: 0
Show HN: Logbox – let Claude monitor your dev logs
TL;DR: logbox is an open-source tool that pipes dev server logs to a local sqlite db with ` | logbox collect`. Give Claude Code access by running `claude mcp add logbox -- logbox serve`.
I used to copy & paste logs into Claude Code when manually testing my server in dev. I wanted to give it its own verification loop.
I initially tried having it boot the server itself and follow the logs. It was good at knowing if the server booted properly, but it capped out and missed details when the logs started flowing in.
I also tried piping the logs to a local file and telling Claude to read them from there. It worked, but became annoying once we had multiple services or wanted to reference past dev server sessions.
So I built logbox for ourselves at Struct and decided to open-source it. It’s a simple Rust CLI that pipes logs into a local SQLite db with an MCP server that gives coding agents the ability to search them.
Once it could reliably monitor the dev server logs totally autonomously after testing its changes, I stopped needing to fish for log snippets and keep nudging it to get a manual test working end-to-end.
Everything stays local. `logbox serve` is an stdio MCP server and it just works with the local SQLite db.
Comments URL: https://news.ycombinator.com/item?id=48197815
Points: 2
# Comments: 0
Likely AI-generated short story won a major prize
Article URL: https://twitter.com/nabeelqu/status/2056397504824963296
Comments URL: https://news.ycombinator.com/item?id=48197814
Points: 2
# Comments: 0
Show HN: Melogen – Generate MIDI melodies for free
Melogen generates short melodic phrases as .mid files, no rendered audio, just notes you drop into your DAW. Free to try, no signup required.
I built this because every "AI music" tool I'd tried produces finished audio, which is the opposite of what I want when I'm stuck on a hook. I wanted something that hands me a MIDI seed I can take into Ableton/FL/Logic and shape myself.
Under the hood: - Decoder-only transformer trained on MIDI - Outputs note sequences with timing and velocity - Tuned for short, usable phrases rather than full arrangements
It's still in beta and rough in places. I'd value blunt feedback on: - Whether the generations are actually musical, or just "notes that fit a key" - MIDI handling: timing, velocity, channel assignment - What's missing before it would slot into a real workflow
Happy to get into the model, dataset, or pipeline in the comments.
Comments URL: https://news.ycombinator.com/item?id=48197775
Points: 1
# Comments: 0
Show HN: FastBack end – schema-first back end runtime with OpenAPI output
Hi HN - author here. FastBackend reads a SQLAlchemy or Prisma schema, compiles it to a framework-agnostic IR, and a runtime adapter serves CRUD + relationships as REST with OpenAPI on the side. Try it in ~2 min: npm i -g @fastbackend/cli git clone https://github.com/darula-hpp/fastbackend cd fastbackend/examples/sqlalchemy-fastapi pip install -r requirements.txt fastbackend generate && fastbackend dev open http://localhost:8301/docs What's shipped: - CLI + core on npm (@fastbackend/cli 0.1.2) - FastAPI adapter on PyPI (fastbackend-fastapi) - Express + Prisma adapter on npm - Custom routes and overrides for non-CRUD logic Honest limits: - FastAPI adapter uses in-memory storage (MVP) - Express + Prisma is more production-shaped - No hosted platform — self-hosted codegen/runtime The IR is the interesting part: same schema could be served by different backend adapters. OpenAPI is the frontend handoff (Orval, openapi-typescript, etc.). Roadmap (not shipped): declarative wiring for storage/OAuth with secrets in .env. Happy to answer architecture questions or comparisons to Supabase, PostgREST, FastAPI-CRUD, etc.
Comments URL: https://news.ycombinator.com/item?id=48197768
Points: 1
# Comments: 0
The Gemini app becomes more agentic, delivering proactive, 24/7 help
Article URL: https://blog.google/innovation-and-ai/products/gemini-app/next-evolution-gemini-app/
Comments URL: https://news.ycombinator.com/item?id=48197720
Points: 2
# Comments: 0
Disney Erased FiveThirtyEight
Article URL: https://www.natesilver.net/p/disney-erased-fivethirtyeight
Comments URL: https://news.ycombinator.com/item?id=48197703
Points: 6
# Comments: 0
Gemini Voice Capabilities and Gemini Spark Coming to MacOS This Summer
Which campaigns actually drive your leads?
Article URL: https://www.digitalpilot.app/
Comments URL: https://news.ycombinator.com/item?id=48197673
Points: 1
# Comments: 0
Show HN: Coding agent where a second agent QAs every PR in a real browser
Hi HN. I've been building this for the last few months and it's at a state where outside eyes would help more than another week of solo iteration.
It's a kanban board where each ticket runs two agents back to back:
Build agent: runs in a sandboxed temp dir against a shallow clone of the user's repo, makes the change, pushes a branch, opens a PR. Uses the Claude Agent SDK.
QA agent: waits for the preview deploy to come up, then drives a real browser via Browserbase against the preview and verifies the change works against the ticket's acceptance criteria. Screenshots and an mp4 of the QA session get attached to the PR.
If QA fails, the build agent reruns with the QA report as context, up to 3 iterations. Before each retry, a classifier reads the failure and decides whether it was a real code bug or environmental (Clerk didn't load, preview never deployed, Browserbase session got 403'd, etc). Environmental failures break the loop instead of iterating on infra noise. This was the single biggest reliability win.
The other side is input. The platform exposes an MCP server, so from Claude Code or any MCP client you can say "make a ticket for X" and it lands in the backlog. The original reason I built any of this was that writing tickets was the bottleneck for me, not writing code.
A few implementation notes that might be interesting:
The build agent's system prompt forbids the Task / Agent (subagent) tool. Spawning subagents inside the SDK was hanging for 4+ minutes consistently. Staying in the main session with Read/Edit/Bash/Glob/Grep is dramatically more reliable.
Postgres schema is applied on startup from a single schema.sql, idempotent with IF NOT EXISTS everywhere. No migrations directory. Adding a column is "edit schema.sql, push, restart." This is the highest-leverage decision I've made on the backend.
QA has a fast mode (local Chromium for anonymous routes) and a deep mode (Browserbase + residential proxies + stealth, for anything behind auth). The mode is per-ticket because cheap-and-fast loses signal once you go past the login wall.
A background sweeper force-fails any job running over 60 min. The SDK can hang in ways asyncio.wait_for doesn't always clean up through the subprocess boundary, so the kill switch is a belt-and-suspenders guard.
Stack: FastAPI on Railway, Postgres, Claude Agent SDK, Browserbase, Vercel for previews, Clerk for auth, Resend for transactional email, MCP over HTTP. Frontend is one HTML file on Vercel, no build step, no framework, just vanilla JS and Clerk loaded from CDN.
What's not working well yet: deep-mode QA still occasionally gets stuck on CAPTCHAs in unfamiliar OAuth flows. The classifier's environmental-failure list is hand-curated keywords, which is fragile. Spend tracking is per-job but I haven't built per-workspace budget caps yet. PR previews on Vercel sometimes take 2-3 min to come up which the QA agent has to wait through.
It's in alpha with a waitlist. Free during alpha, paid plans later. The whole platform was built using Claude Code, so this has been dogfooding itself for the entire build.
Site: https://notesasm.com
Would love feedback, especially on: the dual-agent loop design, the classifier approach, what kinds of tickets would actually break this on your repo, and prior art I should be aware of (I know about Devin, OpenHands, SWE-agent; what else?).
Comments URL: https://news.ycombinator.com/item?id=48197661
Points: 1
# Comments: 0
The missing men of the American marriage market
Article URL: https://www.npr.org/sections/planet-money/2026/05/19/g-s1-122695/the-missing-men-of-the-american-marriage-market
Comments URL: https://news.ycombinator.com/item?id=48197651
Points: 2
# Comments: 0
Scientists worried about de-extinction ethics as biotech co. touts breakthrough
Google and Samsung's Warby Parker and Gentle Monster Glasses Are Coming This Fall
Show HN: I reviewed 271 event tech tools and turned the data into a visual story
I run YourEventKit, an independent directory of event technology tools.
Over the past few months I collected and enriched 271 tool profiles with structured data, and once that was done, I wanted to do something fun with it.
So event tech wrapped is the result of my fun curiosity while playing with the dataset. Its a scoll-driven data story where one SVG constellation of 271 dots morph through 10 different states as you scroll.
Each dot is a tool. Each state is a different lens shining upon the dataset.
Built with Astro and D3. The 10 layouts are precomputed with D3 force simulations at build time, stored as JSON. Scroll position interpolates between states via requestAnimationFrame -- no runtime simulation.
Some findings that surprised me:
Only 54% of tools publish a real starting price. The rest require a sales call. 150 out of 271 tools say they are not built for small or one-off events.
Some of you might be able to see the influence of Shirley wu, Emil Kowalski and FiveThirtyEight
All observations and questions are welcome. The experience is enjoyed over desktop as I could not figure out a good experience for mobile yet.
Comments URL: https://news.ycombinator.com/item?id=48197138
Points: 1
# Comments: 0
Microsoft's Durabletask Package on PyPI Compromised. Mini Shai Hulud
Article URL: https://www.aikido.dev/blog/durabletask-package-compromised-mini-shai-hulud
Comments URL: https://news.ycombinator.com/item?id=48197131
Points: 1
# Comments: 0
