Best way to add knowledge to an LLM: r/LocalLLaMA
DESCRIPTION: Studies like this one show GPT-4 gets 75% accuracy with prompting alone. With GPT-4 + RAG you get 80% accuracy, GPT-4 + finetuning 81%, and GPT-4 + RAG + finetuning 86%. Other studies like this one say that for pure knowledge retrieval from huge datasets, RAG alone is enough.
Kaggle's LLM Science Exam competition had participants answer hard science questions. The winning solution showed Llama-2 70b with prompting gets 80%; with finetuning via SFT you get 86%; but with finetuning + RAG you get 93%. All solutions had to undergo finetuning, since the output was MMLU's classification type, i.e. output A, B, C, D, etc. (so a classification problem).
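To make the classification framing concrete, here is a minimal sketch (my own illustration, not the winning solution's code) of how an MMLU-style question gets formatted so the model's answer is a single letter:

```python
# Illustrative sketch of MMLU-style multiple-choice framing.
# The helper name and prompt layout are assumptions for illustration.

def format_mc_prompt(question, choices):
    """Build a multiple-choice prompt whose expected answer is one letter."""
    letters = "ABCDE"
    lines = [f"Question: {question}"]
    for letter, choice in zip(letters, choices):
        lines.append(f"{letter}. {choice}")
    lines.append("Answer:")  # the model completes with "A", "B", ...
    return "\n".join(lines)

prompt = format_mc_prompt(
    "Which particle carries the electromagnetic force?",
    ["Gluon", "Photon", "W boson", "Graviton"],
)
print(prompt)
```

Finetuning then teaches the model to emit exactly one of the option letters after "Answer:", which is what turns the task into classification.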
I would try RAG first to see if it works. The issues then are which embedding model, which vector database, what chunk size, whether to rerank, etc.
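The moving parts mentioned above (chunking, embedding, retrieval) can be sketched in a toy form. This is purely illustrative: a real pipeline would swap the bag-of-words "embedding" for a proper embedding model and the linear scan for a vector database, and the helper names are mine:

```python
# Toy RAG retrieval sketch: chunk -> embed -> rank by similarity.
# Bag-of-words vectors stand in for real embeddings (an assumption
# made so this stays self-contained); reranking would refine the top-k.
import math
from collections import Counter

def chunk(text, size=50):
    """Split text into fixed-size word chunks (chunk size is a tunable)."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def embed(text):
    """Stand-in 'embedding': a word-count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, chunks, k=2):
    """Return the k chunks most similar to the query."""
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]
```

The retrieved chunks are then pasted into the prompt as context, which is where the 75% → 80% lift in the cited study comes from.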
If you find RAG too annoying to set up, another approach is to feed your dataset directly into finetuning. The result will be a text-completion model, so you might need, say, GPT-4 to create some instructions from the dataset to "prime" your model.
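The "priming" step above can be sketched as follows. The prompt wording and the training-example schema are my assumptions (the post doesn't prescribe either), and the GPT-4 call itself is left out so the sketch stays self-contained:

```python
# Sketch of turning raw dataset passages into instruction/response pairs.
# A stronger model (e.g. GPT-4) would answer build_question_prompt();
# here we only build the prompt and assemble the resulting pair.

def build_question_prompt(passage):
    """Prompt you'd send a stronger model to invent a question for a passage."""
    return (
        "Write one question that the following passage answers.\n\n"
        f"Passage: {passage}\nQuestion:"
    )

def to_training_example(passage, generated_question):
    """Pair the generated question with the passage as the target answer."""
    return {"instruction": generated_question, "output": passage}
```

Finetuning on such pairs teaches the base completion model to follow instructions about the new knowledge, rather than just continuing arbitrary text.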
So RAG definitely works, pushing accuracy from 75% to 80%. But with finetuning on top you get 86%. There are some bad theories spreading that finetuning does not inject new knowledge, but these studies and the Kaggle competition prove otherwise.
Likewise, see Open Hermes and any other finetuned model: finetuning is just continued pretraining. The weights of the model are definitely being edited to account for new information.
I'm also the dev of Unsloth :) If you're going to do finetuning, I have a free Colab notebook to finetune Mistral 7b 2x faster with 70% less VRAM. Colab Notebook
All in all, I would try prompt engineering first, then RAG, then finetuning, then RAG + finetuning as the final step.
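That escalation order can be written down as a simple checklist, annotated with the accuracy numbers cited earlier (the helper and list names are mine):

```python
# The suggested try-cheapest-first escalation order, as a checklist.
STRATEGIES = [
    "prompt engineering",  # cheapest: no training, no retrieval infra
    "RAG",                 # ~75% -> 80% in the cited GPT-4 study
    "finetuning",          # ~81% alone in the same study
    "RAG + finetuning",    # ~86%, the combined best
]

def next_strategy(current):
    """Return what to try after `current`, or None if you've tried it all."""
    i = STRATEGIES.index(current)
    return STRATEGIES[i + 1] if i + 1 < len(STRATEGIES) else None
```

The point of the ordering is cost: each step only gets attempted if the cheaper one before it wasn't accurate enough.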
URL: r/LocalLLaMA