Naive vector RAG
Split the document into fixed-size chunks, embed each, store in a vector DB. At query time, embed the query and return the top-k nearest chunks.
- Fast to build, easy to ship — the default in every framework
- Chunks have no awareness of headings, sections, or document tree
- Cross-references and entities are pulverized at chunking time
- "Why was this chunk returned?" is a hard question