Hacker News


Ask HN: Would ChatGPT semantic token coloring give insight into its replies?

Sun, 04/07/2024 - 3:32pm

For example, it could highlight important tokens in the context window as it outputs each word in its reply.

Comments URL: https://news.ycombinator.com/item?id=39963146

Points: 1

# Comments: 1

Categories: Hacker News

Ask HN: What's the right way to scale K8s for GPU workloads?

Sun, 04/07/2024 - 3:27pm

I'm running a relatively simple app that uses EKS clusters for GPU workloads. So far everything is simple and working great, despite all of the warnings about k8s being complicated.

However, everything I have is currently scaled up and down manually. I looked into the Horizontal Pod Autoscaler (HPA), but it doesn't seem to be built for GPU tasks where each pod can handle at most one unit of work at a time. I have both async GPU workers and a small Flask API that uses GPUs. These all currently run on g4dn.xlarge EC2 instances. Essentially, I want a scaling scheme where the number of nodes approximates the number of concurrent requests, up to a maximum.
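To make the desired rule concrete, here is a minimal sketch of "one pod per unit of work, up to a maximum". The function name `desired_gpu_pods` and its parameters are hypothetical, not part of any Kubernetes API. The sketch assumes each pod occupies one GPU node, and that something else (a small controller loop or an external autoscaler) reads the counts and applies the result to the cluster.

```python
def desired_gpu_pods(in_flight: int, queued: int,
                     min_pods: int = 0, max_pods: int = 10) -> int:
    """Return a target pod count for workers that handle one job at a time.

    All names here are illustrative assumptions:
      in_flight: jobs or requests currently being processed
      queued:    jobs or requests waiting for a free worker
    """
    if in_flight < 0 or queued < 0:
        raise ValueError("counts must be non-negative")
    if min_pods > max_pods:
        raise ValueError("min_pods must not exceed max_pods")

    # Each pod serves at most one unit of work, so demand equals
    # the total number of outstanding units.
    demand = in_flight + queued

    # Clamp to the configured floor and ceiling.
    return max(min_pods, min(max_pods, demand))


if __name__ == "__main__":
    print(desired_gpu_pods(in_flight=2, queued=3))   # within limits
    print(desired_gpu_pods(in_flight=8, queued=7))   # capped at max_pods
```

For the async workers, the `queued` and `in_flight` numbers could plausibly come from counting rows in the Postgres job queue. For the API, they would have to come from a concurrency metric the app exports itself.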

I did look into simpler solutions such as Replicate, but found them too limiting and a little oversimplified for us.

Edit: I know I should probably use k8s Jobs for the async worker stuff, but when I looked into it there seemed to be more operational complexity than I can handle. I'm a solo dev, so I set up workers that poll a Postgres job queue and manage themselves, rather than adding yet another service to schedule and maintain k8s Jobs.

Comments URL: https://news.ycombinator.com/item?id=39963105

Points: 1

# Comments: 2

Categories: Hacker News

IBM System/360

Sun, 04/07/2024 - 2:33pm

Categories: Hacker News

Results of Show HN: No winning entries

Sun, 04/07/2024 - 2:21pm

There were no successful entrants to this contest: https://news.ycombinator.com/item?id=39949966

I was not able to complete the challenge myself in two attempts.

I recorded my attempts here: https://www.reddit.com/r/RoboInnovateChallenge/comments/1bwzpkg/prealpha_challenge_1_bring_fruit_to_mouth/kyi4cr7/

And a video walkthrough is available here: (pending)

The $10 prize pool will be rolled over into Challenge 2, which will be slightly more complicated.

Comments URL: https://news.ycombinator.com/item?id=39962620

Points: 1

# Comments: 0

Categories: Hacker News
