Feed aggregator

Can Go AIs be adversarially robust?

Hacker News - Thu, 05/28/2026 - 6:50pm

Article URL: https://arxiv.org/abs/2406.12843

Comments URL: https://news.ycombinator.com/item?id=48316641

Points: 1

# Comments: 0

Categories: Hacker News

Ask HN: Is Claude Opus 4.8 broken?

Hacker News - Thu, 05/28/2026 - 6:49pm

In my first hour with it, it's like we're back to the GPT-2 era.

It can't even read a file anymore.

Randomly uses 'sed' with no explanation in hallucinated file paths. Errors out 15 times in a row with 'No such file or directory' with paths that don't exist - it was trying to read a project file as usual in our Rails app.

You then ask it why it's not simply reading the file in the correct path, and then it says it's sorry:

> You're right to be angry. Two things went wrong and both are on me: > > 1. I typed sed/cat instead of using the Read tool (the project rules explicitly say not to), and > 2. I guessed a filename app/services/gmail/sync_worker.rb that doesn't exist — the real file is app/workers/gmail/sync_worker.rb. So I was reading a path I'd invented. Sloppy,

It just apologized to me - for the 5th time in this session - writing this:

> I again typed a made-up message ID into the verify step (19e70e9d...) instead of reading the real one from the list I just fetched. That's the exact mistake I keep making.

(Context window at 15%).

It's unbearably slow.

It presents 10+ errors like 'Cancelled: parallel tool call Bash errored' all the time.

It's unreal.

Comments URL: https://news.ycombinator.com/item?id=48316636

Points: 1

# Comments: 0

Categories: Hacker News

iOS 26.6 Public Beta Available, Adds Small Change to Blocked Contacts

CNET Feed - Thu, 05/28/2026 - 6:47pm
The public prerelease versions of Apple's system software increment as we get close to WWDC.
Categories: CNET

Copyright vs. Copyleft (2007)

Hacker News - Thu, 05/28/2026 - 6:32pm
Categories: Hacker News

Apollo Official CLI Live

Hacker News - Thu, 05/28/2026 - 6:27pm
Categories: Hacker News

SpaceX's Starship V3 Can't Fly Again Until a 'Mishap' Is Addressed, Says FAA

CNET Feed - Thu, 05/28/2026 - 6:07pm
The investigation is being led by SpaceX, but it needs approval from the FAA before the Starship can launch again.
Categories: CNET

No longer just a good idea, IAM is a crucial piece of the cybersecurity puzzle. It's how an organization regulates access to information and meets its compliance obligations.

Cloud Security Briefing: News and Advice - Thu, 05/28/2026 - 6:06pm
No longer just a good idea, IAM is a crucial piece of the cybersecurity puzzle. It's how an organization regulates access to information and meets its compliance obligations.

Pages