Ask HN: How to build a LLM model on local files?
2 points by recvonline on May 21, 2023 | hide | past | favorite | 4 comments
I am part of a larger community that organizes itself through loads of e-mails, PDFs, etc. Many questions about the current state of affairs could, in my opinion, be answered through a ChatGPT-like interface.

How would one go about training a model based on local files? Is it possible? What would I have to do?



For non-commercial use? To answer your question: finetune a LLaMA-based instruction model, for example using the lit-llama repo. You will need to rent a fairly beefy cloud instance for the training, and you will need to resume the finetuning (or train a LoRA) whenever new data comes in. Then host the result on a cheaper server behind a llama.cpp frontend.
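Whatever you finetune with, the e-mails and PDF text first have to be turned into instruction-style training examples. A minimal sketch, assuming an Alpaca-style JSONL layout (the field names and example data here are illustrative, not tied to any particular repo):

```python
import json

# Toy Q/A pairs extracted from the community's e-mails/PDFs (illustrative data).
documents = [
    {"question": "When is the next general meeting?",
     "answer": "The next general meeting is on June 12 in the main hall."},
    {"question": "Who manages the mailing list?",
     "answer": "The mailing list is managed by the communications team."},
]

def to_instruction_record(doc):
    """Convert one Q/A pair into an Alpaca-style instruction record."""
    return {
        "instruction": doc["question"],
        "input": "",
        "output": doc["answer"],
    }

def write_jsonl(docs, path):
    """Write one JSON object per line, the layout most finetuning scripts expect."""
    with open(path, "w", encoding="utf-8") as f:
        for doc in docs:
            f.write(json.dumps(to_instruction_record(doc)) + "\n")

write_jsonl(documents, "train.jsonl")
```

The expensive part is producing good Q/A pairs from unstructured mail archives; the format itself is trivial.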

But what you might really want is vector search: retrieving the most relevant documents for a given question seems like a better fit for this use case.
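A toy sketch of that retrieval step, using bag-of-words cosine similarity as a stand-in for a real embedding model (in practice you would embed chunks with something like sentence-transformers and store them in a vector database; all names and data below are illustrative):

```python
import math
from collections import Counter

def embed(text):
    """Toy 'embedding': lowercase bag-of-words counts.
    A real system would call an embedding model here instead."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def top_k(query, chunks, k=2):
    """Return the k chunks most similar to the query."""
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

# Illustrative document chunks (e.g. split out of e-mails/PDFs).
chunks = [
    "The general meeting takes place on June 12 in the main hall.",
    "Membership fees are due at the start of each year.",
    "Minutes from the last meeting are attached as a PDF.",
]

print(top_k("when is the general meeting", chunks, k=1))
```

The retrieved chunks are then pasted into the LLM's prompt as context, so the model answers from your documents instead of from memory.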


You might want both: finetuning provides what is loosely analogous to “understanding” across the whole corpus, while vector search provides exact recall of the most relevant specific items.


There are some "drag and drop" type solutions, like https://www.chatbase.co/. There are plenty more - search for "custom chatgpt" on Product Hunt and you'll find a lot.




