Prerequisites.

This project is only runnable on Ubuntu or Windows with WSL. This project utilizes Nvidia Container Toolkit, Docker, TensorRT-LLM, ONNX-Runtime and more.

All dependencies are installed and deployed with docker.

Prepare your system.

Run following script to install Nvidia Container Toolkit in your Linux environment.

bash install-cuda-container-toolkit.sh

It will also install docker. If you experienced any docker related errors on later steps, there can be issues with installation of it on this step.

Build project

Project is defined mainly in 4 files: a docker-compose.yml and 3 Dockerfiles for each container.

semantic-search represents a container that performs a semnantic search over a FAISS Vector Index using llama_index package and a custom class OptimumEmbedding to do Indexing and Searching.

llm TensorRT environment with a TensorRT-LLM built engine to run Mistran-7b-Instruct-v0.2-int4. All conversion to TRT engine are performed inside the container during build time.

api API service to perform basic answering for prompts including prompts history. JSON-strings that correpospond to the following pydantic-format are expected:

class Query(BaseModel):
    query: str
    history: Optional[List[Dict[str,str]]]

To build the project run:

docker compose build

Installation process takes about 15-20 minutes. So, take a coffee break. All container will be equipped with conda environments with Python 3.10.14 and a dependencies defined in corresponding environment.yml's.

Running the containers

docker-compose up

api container will wait for semantic-search and llm, they willa also have GPU capabilities.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
api		api
llm		llm
semantic-search		semantic-search
.gitignore		.gitignore
RAG.rar		RAG.rar
README.md		README.md
docker-compose.yml		docker-compose.yml
install-cuda-container-toolkit.sh		install-cuda-container-toolkit.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Prerequisites.

Prepare your system.

Build project

Running the containers

About

Uh oh!

Releases

Packages

Languages

InfroLab/RAG

Folders and files

Latest commit

History

Repository files navigation

Prerequisites.

Prepare your system.

Build project

Running the containers

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages