Hacker News
Show HN: Open-source API for real-time Google Meet transcription with bots
Article URL: https://www.vexa.ai/
Comments URL: https://news.ycombinator.com/item?id=43785050
Points: 1
# Comments: 0
Show HN: Lemon Slice Live, a real-time video-audio AI model
Hey HN, this is Lina, Andrew, and Sidney from Lemon Slice. We’ve trained a custom diffusion transformer (DiT) model that achieves video streaming at 25fps and wrapped it into a demo that allows anyone to turn a photo into a real-time, talking avatar. Here’s an example conversation from co-founder Andrew: https://www.youtube.com/watch?v=CeYp5xQMFZY. Try it for yourself at: https://lemonslice.com/live.
(Btw, we used to be called Infinity AI and did a Show HN under that name last year: https://news.ycombinator.com/item?id=41467704.)
Unlike existing avatar video chat platforms like HeyGen, Tolan, or Apple Memoji filters, we do not require training custom models, rigging a character ahead of time, or having a human drive the avatar. Our tech allows users to create and immediately video-call a custom character by uploading a single image. The character image can be any style - from photorealistic to cartoons, paintings, and more.
To achieve this demo, we had to do the following (among other things! but these were the hardest):
1. Training a fast DiT model. To make our video generation fast, we had to both design a model that made the right trade-offs between speed and quality, and use standard distillation approaches. We first trained a custom video diffusion transformer (DiT) from scratch that achieves excellent lip and facial expression sync to audio. To further optimize the model for speed, we applied teacher-student distillation. The distilled model achieves 25fps video generation at 256-px resolution. Purpose-built transformer ASICs will eventually allow us to stream our video model at 4k resolution.
2. Solving the infinite video problem. Most video DiT models (Sora, Runway, Kling) generate 5-second chunks. They can iteratively extend it by another 5sec by feeding the end of the 1st chunk into the start of the 2nd in an autoregressive manner. Unfortunately the models experience quality degradation after multiple extensions due to accumulation of generation errors. We developed a temporal consistency preservation technique that maintains visual coherence across long sequences. Our technique significantly reduces artifact accumulation and allows us to generate indefinitely-long videos.
3. A complex streaming architecture with minimal latency. Enabling an end-to-end avatar zoom call requires several building blocks, including voice transcription, LLM inference, and text-to-speech generation in addition to video generation. We use Deepgram as our AI voice partner. Modal as the end-to-end compute platform. And Daily.co and Pipecat to help build a parallel processing pipeline that orchestrates everything via continuously streaming chunks. Our system achieves end-to-end latency of 3-6 seconds from user input to avatar response. Our target is <2 second latency.
More technical details here: https://lemonslice.com/live/technical-report.
Current limitations that we want to solve include: (1) enabling whole-body and background motions (we’re training a next-gen model for this), (2) reducing delays and improving resolution (purpose-built ASICs will help), (3) training a model on dyadic conversations so that avatars learn to listen naturally, and (4) allowing the character to “see you” and respond to what they see to create a more natural and engaging conversation.
We believe that generative video will usher in a new media type centered around interactivity: TV shows, movies, ads, and online courses will stop and talk to us. Our entertainment will be a mixture of passive and active experiences depending on what we’re in the mood for. Well, prediction is hard, especially about the future, but that’s how we see it anyway!
We’d love for you to try out the demo and let us know what you think! Post your characters and/or conversation recordings below.
Comments URL: https://news.ycombinator.com/item?id=43785044
Points: 7
# Comments: 2
OpenVSX, which VSCode forks rely on for extensions, down for 24 hours
Article URL: https://status.open-vsx.org/
Comments URL: https://news.ycombinator.com/item?id=43785039
Points: 1
# Comments: 0
Good Company
Article URL: https://store.steampowered.com/app/911430/Good_Company/
Comments URL: https://news.ycombinator.com/item?id=43785033
Points: 1
# Comments: 0
News Is Blocked on Meta's Feeds in Canada. Here's What Fills the Void
Article URL: https://www.nytimes.com/2025/04/21/technology/canada-election-facebook-instagram-meta.html
Comments URL: https://news.ycombinator.com/item?id=43785031
Points: 2
# Comments: 2
Adverse Drug Reaction: Midazolam-Induced Extrapyramidal Symptoms: A Case Report
Article URL: https://pmc.ncbi.nlm.nih.gov/articles/PMC7323825/
Comments URL: https://news.ycombinator.com/item?id=43785028
Points: 2
# Comments: 0
They Stole a Quarter-Billion in Crypto and Got Caught Within a Month
Article URL: https://www.nytimes.com/2025/04/24/magazine/crybercrime-crypto-minecraft.html
Comments URL: https://news.ycombinator.com/item?id=43784546
Points: 1
# Comments: 0
White House Proposal Could Gut Climate Modeling the World Depends On
Article URL: https://www.propublica.org/article/trump-noaa-budget-cuts-climate-change-modeling-princeton-gfdl
Comments URL: https://news.ycombinator.com/item?id=43784542
Points: 1
# Comments: 0
My More-hardcore Theanine Self-experiment: Coffee is bad
Article URL: https://dynomight.substack.com/p/theanine-2
Comments URL: https://news.ycombinator.com/item?id=43784540
Points: 1
# Comments: 0
Understanding Why Count(*) Can Be Slow in PostgreSQL
Article URL: https://vaibhavjha.substack.com/p/understanding-why-count-can-be-slow
Comments URL: https://news.ycombinator.com/item?id=43784522
Points: 1
# Comments: 0
What do you think of this blog for getting AI startup ideas?
Article URL: https://michaelmallari.bitbucket.io/
Comments URL: https://news.ycombinator.com/item?id=43784494
Points: 1
# Comments: 2
Hooded pitohui, one of the only toxic birds
Article URL: https://www.australiangeographic.com.au/blogs/creatura-blog/2014/06/hooded-pitohui-bird/
Comments URL: https://news.ycombinator.com/item?id=43784489
Points: 1
# Comments: 0
Prepper Disk
Article URL: https://www.prepperdisk.com/
Comments URL: https://news.ycombinator.com/item?id=43784475
Points: 1
# Comments: 1
AI Voice Agent Building Experience as a Contractor
Article URL: https://www.indiehackers.com/post/ai-voice-agent-building-experience-as-a-contractor-9ee96ec7ff
Comments URL: https://news.ycombinator.com/item?id=43784471
Points: 1
# Comments: 0
From Sludge to Strategy: How Smart Nudges Reboot HCP Engagement
Article URL: https://blog.doceree.com/hcp-engagement-nudges
Comments URL: https://news.ycombinator.com/item?id=43784463
Points: 2
# Comments: 0
Asus releases fix for AMI bug that lets hackers brick servers
Article URL: https://www.bleepingcomputer.com/news/security/asus-releases-fix-for-ami-bug-that-lets-hackers-brick-servers/
Comments URL: https://news.ycombinator.com/item?id=43784460
Points: 1
# Comments: 0
Ask HN: How do you retain both technical and domain knowledge long-term?
I'm exploring a learning system that addresses the dual challenge many of us face: remembering both technical concepts AND the business domain knowledge needed to apply them effectively. After years of coding in different industries, I've noticed that understanding the domain (finance, healthcare, e-commerce, etc.) is often as challenging as mastering the technical stack, yet most learning tools focus solely on the technical side. Some questions I'm curious about:
How do you currently capture and retain domain-specific knowledge alongside technical concepts? What's your biggest challenge when onboarding to a new codebase with an unfamiliar business domain? Have you tried using flash cards or spaced repetition for either technical or domain knowledge? What worked or didn't? Would you find value in a tool that could help teams build shared mental models of both their tech stack and business domain? How do you currently transfer domain knowledge between team members?
I'm in early validation stages and would appreciate your insights before building anything. If there's enough interest, I'll share what I learn from this thread.
Comments URL: https://news.ycombinator.com/item?id=43784449
Points: 2
# Comments: 0
MCP's 3 U's: Making a Tool Useful, Usable, and Used by and for an LLM
Article URL: https://blog.owulveryck.info/2025/04/22/mcps-3-us-making-a-tool-useful-usable-and-used-by-and-for-an-llm.html
Comments URL: https://news.ycombinator.com/item?id=43784442
Points: 1
# Comments: 0
Not for private gain – An open letter
Article URL: https://notforprivategain.org/
Comments URL: https://news.ycombinator.com/item?id=43784434
Points: 1
# Comments: 0
Google forcing some remote workers to come back 3 days a week
Article URL: https://www.cnbc.com/2025/04/23/google-teams-are-including-remote-workers-in-their-cuts.html
Comments URL: https://news.ycombinator.com/item?id=43784427
Points: 1
# Comments: 0