Feed aggregator
FedRAMP Marketplace
Article URL: https://marketplace.fedramp.gov/products
Comments URL: https://news.ycombinator.com/item?id=43789656
Points: 1
# Comments: 0
An Interactive Overview of Grammar-Based Sampling for LLMs
Article URL: http://michaelgiba.com/grammar-based/index.html
Comments URL: https://news.ycombinator.com/item?id=43789653
Points: 1
# Comments: 0
Do Not Train" Meta Tags: The Robots.txt of AI – Will Anyone Respect Them?
I've been noticing more creators and platforms quietly adding things like to their pages - kind of like a robots.txt, but for LLMs. For those unfamiliar, robots.txt is a standard file websites use to tell search engines which pages they shouldn't crawl. These new "noai" tags serve a similar purpose, but for AI training models instead of search crawlers.
Some examples of platforms implementing these opt-out mechanisms: - Sketchfab now offers creators an option to block AI training in their account settings - DeviantArt pioneered these tags as part of their content protection approach - ArtStation added both meta tags and updated their Terms of Service - Shutterstock created a compensation model for contributors whose images are used in AI training
But here's where things get concerning - there's growing evidence these tags are being treated as optional suggestions rather than firm boundaries:
- Various creators have reported issues with these tags being ignored. For instance, a discussion on DeviantArt (https://www.deviantart.com/lumaris/journal/NoAI-meta-tag-is-NOT-honored-by-DA-941468316) documents cases where the tags weren't honored, with references to GitHub conversations showing implementation issues
- In a GitHub pull request for an image dataset tool (https://github.com/rom1504/img2dataset/pull/218), developers made respecting these tags optional rather than default, which one commenter described as having "gutted it so that we can wash our hands of responsibility without actually respecting anyone's wishes"
- Raptive Support, a company implementing these tags, admits they "are not yet an industry standard, and we cannot guarantee that any or all bots will respect them" (https://help.raptive.com/hc/en-us/articles/13764527993755-NoAI-Meta-Tag-FAQs)
- A proposal to the HTML standards body (https://github.com/whatwg/html/issues/9334) acknowledges these tags don't enforce consent and compliance "might not happen short of robust regulation"
Some creators have become so cynical that one prominent artist David Revoy announced they're abandoning tags like #NoAI because "the damage has already been done" and they "can't remove [their] art one by one from their database." (https://www.davidrevoy.com/article977/artificial-inteligence-why-i-ll-not-hashtag-my-art-humanart-humanmade-or-noai)
This raises several practical questions:
- Will this actually work in practice without enforcement mechanisms?
- Could it be legally enforceable down the line?
- Has anyone successfully used these tags to prevent unauthorized training?
Beyond the technical implementation, I think this points to a broader conversation about creator consent in the AI era. Is this more symbolic - a signal that people want some version of "AI consent" for the open web? Or could it evolve into an actual standard with teeth?
I'm curious if folks here have added something like this to their own websites or content. Have you implemented any technical measures to detect if your content is being used for training anyway? And for those working in AI: what's your take on respecting these kinds of opt-out signals?
Would love to hear what others think.
Comments URL: https://news.ycombinator.com/item?id=43789634
Points: 1
# Comments: 1
Congress Republicans seek $27 billion for Golden Dome in Trump tax bill
Article URL: https://www.cnbc.com/2025/04/24/congress-republicans-seek-27-billion-for-golden-dome-in-trump-tax-bill-reuters.html
Comments URL: https://news.ycombinator.com/item?id=43789630
Points: 4
# Comments: 0
US Agency To Ease Self-Driving Vehicle Deployment Hurdles, Retain Reporting Rules
Keep It Simple: New Slate Truck Could Be Next Year's Cheapest EV
This $25,000 Electric Slate Truck Transforms Into an SUV
Nothing Janky About This New Programming Language
Article URL: https://thenewstack.io/nothing-janky-about-this-new-programming-language/
Comments URL: https://news.ycombinator.com/item?id=43789541
Points: 2
# Comments: 0
Trump to target ActBlue in presidential memorandum
Article URL: https://www.politico.com/news/2025/04/24/trump-to-target-actblue-in-presidential-memorandum-00307251
Comments URL: https://news.ycombinator.com/item?id=43789530
Points: 4
# Comments: 0
Desert reservoirs capture and store organic carbon, according to research
Article URL: https://phys.org/news/2025-04-reservoirs-capture-carbon.html
Comments URL: https://news.ycombinator.com/item?id=43789517
Points: 1
# Comments: 0
Half of the universe's hydrogen gas, long unaccounted for, has been found
Article URL: https://news.berkeley.edu/2025/04/11/half-of-the-universes-hydrogen-gas-long-unaccounted-for-has-been-found/
Comments URL: https://news.ycombinator.com/item?id=43789510
Points: 2
# Comments: 0
Show HN: Memory Chess – Spatial memory training with randomized chess boards
Article URL: https://thememorychess.com
Comments URL: https://news.ycombinator.com/item?id=43789507
Points: 3
# Comments: 2
Petition to the Open Source Initiative: Publish the Full 2025 Election Results
Article URL: https://codeberg.org/OSI-Concerns/election-results-2025#readme
Comments URL: https://news.ycombinator.com/item?id=43789501
Points: 1
# Comments: 0
I used simple rules to make DFAs that kinda match accepted physics models
Article URL: https://keweizhou1996-df477.web.app/dfa.html
Comments URL: https://news.ycombinator.com/item?id=43789471
Points: 1
# Comments: 0
AlexLib Translation Feedback
Article URL: https://alexlibfeedback.com/
Comments URL: https://news.ycombinator.com/item?id=43789470
Points: 1
# Comments: 0
Show HN: Rust CLI to generate LLM directory context (XML supported)
dead simple rust CLI to "rivet" together the structure and content of a project directory, with XML output supported.
I saw many new python and TS tools pop up that do this, only a matter of time until someone built it in rust :)
Comments URL: https://news.ycombinator.com/item?id=43789469
Points: 1
# Comments: 0
Government censorship comes to Bluesky, but not its third-party apps yet
Article URL: https://techcrunch.com/2025/04/23/government-censorship-comes-to-bluesky-but-not-its-third-party-apps-yet/
Comments URL: https://news.ycombinator.com/item?id=43789460
Points: 10
# Comments: 0
Can LLMs be useful? (re: Hank Green)
Article URL: https://blog.melodysium.gay/blog/can-llms-be-useful/
Comments URL: https://news.ycombinator.com/item?id=43789452
Points: 1
# Comments: 1
Growth Mindset
Article URL: https://mentorloop.com/blog/growth-mindset-vs-fixed-mindset-what-do-they-really-mean/
Comments URL: https://news.ycombinator.com/item?id=43789432
Points: 1
# Comments: 1