Hacker News

Current embedding-based RAG systems primarily rely on semantic similarity. Given a document and a query, the system usually retrieves multiple sections that appear semantically relevant. However, in domain-specific applications, such as financial analysis or legal research, users often have domain-specific preferences for which parts of a document to consult first. These preferences are typically driven by experience about where answers are typically found or which sections are considered more trustworthy sources of information.

For example:

- When querying about financial performance metrics (e.g., earnings adjustments), experienced analysts typically look first at the Management’s Discussion and Analysis (MD&A) section or related financial statement footnotes.

- For questions about company risks, they usually prioritize the Risk Factors section before turning to broader disclosures.

These expert-driven navigation patterns are difficult to capture using embedding-based RAG alone. Fine-tuning embedding models to reflect such preferences is possible, but it tends to be costly and resource-intensive.

An alternative approach is to incorporate reasoning-based retrieval, which mimics how humans find information. For example, when reading a long document, a human typically starts by reviewing the table of contents to determine which sections to read first, based on the context of the query and preference. Similarly, one can build an LLM agent that analyzes the "table of contents" and then navigates through the document according to expert preferences. This can be achieved by using few-shot prompting, where the system learns from sample user preference examples provided in the prompt, allowing it to prioritize sections based on the user’s needs.

To support this paradigm, we developed an open-sourced tool called PageIndex. It can transform any long documents into an LLM-friendly "table-of-contents" tree index, which is ready for the LLM agents to navigate. With PageIndex, you can easily build RAG agents that align with user preferences and domain logic.

Would love any feedback, particularly thoughts on reasoning-based RAG or other potential applications of PageIndex.

Comments URL: https://news.ycombinator.com/item?id=43707928

Points: 7

# Comments: 1

Categories: Hacker News

Vibe Check: o3 is here and it's great

Hacker News - Wed, 04/16/2025 - 1:17pm

Article URL: https://every.to/chain-of-thought/vibe-check-o3-is-out-and-it-s-great

Comments URL: https://news.ycombinator.com/item?id=43707925

Points: 2

# Comments: 1

Categories: Hacker News

Most detailed brain map constructed from speck of mouse tissue

Hacker News - Wed, 04/16/2025 - 1:17pm

Article URL: https://www.cnn.com/2025/04/15/science/3d-brain-map-mouse-mammal-breakthrough/index.html

Comments URL: https://news.ycombinator.com/item?id=43707919

Points: 1

# Comments: 1

Categories: Hacker News

New open-source model GLM-4-32B with performance comparable to Qwen 2.5 72B

Hacker News - Wed, 04/16/2025 - 1:16pm

Article URL: https://github.com/THUDM/GLM-4

Comments URL: https://news.ycombinator.com/item?id=43707907

Points: 1

# Comments: 0

Categories: Hacker News

Birth of Basic [video]

Hacker News - Wed, 04/16/2025 - 1:15pm

Article URL: https://www.youtube.com/watch?v=WYPNjSoDrqw

Comments URL: https://news.ycombinator.com/item?id=43707894

Points: 1

# Comments: 0

Categories: Hacker News

Trump derails Chinese H20 GPU sales, forcing Nvidia to eat $5.5B this quarter

Hacker News - Wed, 04/16/2025 - 1:15pm

Article URL: https://www.theregister.com/2025/04/16/trump_responds_to_nvidias_us/

Comments URL: https://news.ycombinator.com/item?id=43707891

Points: 7

# Comments: 0

Categories: Hacker News

Biographical Information Summary - This is Just a Summary Joe Pearce
About Joe Pearce joeintenn
Links Joe Pearce
Flounder's Keylime Pie is the Best in the World, At Least I Think So... Joe Pearce
Harley Ride Joe Pearce
Cobra with New Cover Joe Pearce
Mustang Cobra After Ceramic Coating Joe Pearce
Carter County Cruise In Joe Pearce
2003 Ford Mustang SVT Cobra Convertible NAPA Auto Car Show Top 10 Joe Pearce
Ponies in the Smokies - Mustang Trophy Joe Pearce

Hacker News

The new Framework 13 HX370

Erlang Solutions' Blog round-up

Which year: guess which year each photo was taken

US Government threatens Harvard with foreign student ban

Getting better performance out of object storage

Why I Support Privacy

UK startups now have an advantage in recruiting

Flatten Your Data

Can you spot the real AI model names?

Thalamic nuclei observed driving conscious perception

Ask HN: How do you raise your kids in the age of AI?

Miami-Dade to empower cops in immigration crackdown

Freaky Tales and Johnny Canuck

Why All Engineers Need to Learn Sales

How to align with user preference in a RAG system?

Vibe Check: o3 is here and it's great

Most detailed brain map constructed from speck of mouse tissue

New open-source model GLM-4-32B with performance comparable to Qwen 2.5 72B

Birth of Basic [video]

Trump derails Chinese H20 GPU sales, forcing Nvidia to eat $5.5B this quarter

Pages

Welcome to Joe Pearce's Home Page.

Web page offered by Joe Pearce © 2004 - 2025 - All rights reserved.

Thanks to the ETSU Computer and Information Sciences Department.

Thanks to the NSTCC Computer and Information Sciences and Computer Engineering Technologies Department.

This is my Favicon.

You are here

Hacker News

Pages

Welcome to Joe Pearce's Home Page.

Web page offered by Joe Pearce © 2004 - 2025 - All rights reserved.

Thanks to the ETSU Computer and Information Sciences Department.

Thanks to the NSTCC Computer and Information Sciences and Computer Engineering Technologies Department.

This is my Favicon.