This approach also has limitations. Namely, your ability to retrieve information is bounded by how well embedding search can surface the relevant snippets.
This could be addressed by using different search methodologies, by making multiple GPT requests to summarize the available information, or by using a structured knowledge framework to prepare prompts (instead of just raw text).
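For anyone unfamiliar with the baseline being discussed, here is a minimal sketch of embedding-based snippet retrieval. The embedding vectors themselves are assumed to come from some external model (e.g. an embeddings API); everything here just ranks precomputed vectors by cosine similarity.

```python
import math

def cosine_similarity(a, b):
    # Cosine of the angle between two vectors; 1.0 means same direction.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def top_k_snippets(query_vec, snippet_vecs, snippets, k=3):
    # Rank snippets by similarity of their embedding to the query embedding
    # and return the k best matches as prompt context.
    scored = sorted(
        zip(snippets, snippet_vecs),
        key=lambda pair: cosine_similarity(query_vec, pair[1]),
        reverse=True,
    )
    return [text for text, _ in scored[:k]]
```

The retrieved snippets are then pasted into the prompt as context, which is exactly where the limitation above bites: anything the similarity ranking misses never reaches the model.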
There's so much scope for creativity and improvement here. That's one of the things that excites me about this technique: it's full of opportunities for exploring new ways of using language models.
In my experience, semantic search is great for finding implicit relationships (bad guy => villain) but sometimes fails in unpredictable ways on more elementary matches (friends => friend). That's why it can be good to combine semantic search with something like BM25, which is what I use in my blog search [1]. N-gram term-frequency algorithms like TF-IDF and BM25 are also lightning fast compared to semantic search.
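For reference, BM25 is simple enough to sketch in a few lines. This is a bare-bones version of the standard Okapi BM25 scoring formula over pre-tokenized documents (a real setup would also handle stemming, stopwords, and an inverted index):

```python
import math
from collections import Counter

def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    # docs: list of token lists. Returns one BM25 score per document.
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    # Document frequency: in how many docs does each term appear?
    df = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for t in query_terms:
            if t not in tf:
                continue
            # Smoothed inverse document frequency.
            idf = math.log((n - df[t] + 0.5) / (df[t] + 0.5) + 1)
            # Term frequency saturation (k1) and length normalization (b).
            denom = tf[t] + k1 * (1 - b + b * len(d) / avgdl)
            score += idf * tf[t] * (k1 + 1) / denom
        scores.append(score)
    return scores
```

A simple hybrid is to normalize the BM25 scores and the cosine similarities to [0, 1] and take a weighted sum, so exact lexical matches like "friend" can't be lost by the embedding side.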
gpt_index does that. It builds a tree whose leaves are document chunks and whose parent nodes are increasingly condensed GPT-generated summaries of their children.
The tree is then traversed to find the most relevant chunk, with GPT asked at each level to compare entries by relevance to the question. The traversal ends at an original document chunk, which is given as context in a final prompt asking to answer the query.
This is great and powerful, but not very cost-effective: a traversal takes O(log n) requests to the completion API for n documents.
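The traversal described above can be sketched as follows. This is not gpt_index's actual code; `choose_child` stands in for the real GPT call that picks the most relevant child summary, and the call counter makes the O(log n) cost visible.

```python
class Node:
    def __init__(self, summary, children=None, chunk=None):
        self.summary = summary          # summary text (GPT-generated at build time)
        self.children = children or []  # empty for leaves
        self.chunk = chunk              # original document text, only set on leaves

def retrieve_chunk(root, question, choose_child):
    # Walk from the root to a leaf; at each level, one LLM call picks the
    # child whose summary is most relevant to the question. For a balanced
    # tree over n chunks that is O(log n) completion requests.
    node = root
    calls = 0
    while node.children:
        node = choose_child(question, node.children)
        calls += 1
    return node.chunk, calls
```

The returned chunk is then used as context in one final completion request that answers the query, so the total cost per question is the traversal depth plus one.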
The embedding search is probably necessary for bigger datasets.
Any other ideas that I'm missing?