
UBOX-AI

Plug-and-Play AI Document Processing & Knowledge Machine for Every Business with Full Data Privacy

Description

UBOX-AI is an AI-powered machine designed to revolutionize document processing, simplify repetitive tasks, and boost productivity for small businesses and home offices. With UBOX-AI, you can bring the latest in artificial intelligence directly into your workspace with ease and efficiency.

  • We believe in "AI democracy" - making AI accessible to all businesses, regardless of size. UBOX-AI is a showcase of how even a Small Language Model (SLM) can deliver powerful results, automating tasks and streamlining workflows that were once time-consuming and complex.

  • For businesses intrigued by AI but concerned about cost, UBOX-AI is a budget-friendly solution that leverages open-source technologies. It allows companies to experience AI firsthand, without high upfront investments - after all, the whole AI system is optimized to run on a single gaming-class GPU such as an RTX 4090 (or even a lesser card), keeping the hardware cost in the single digits of thousands rather than double digits.

  • UBOX-AI is also a plug-and-play solution, meaning it’s incredibly easy to deploy. Simply connect UBOX-AI to your company’s intranet, and it immediately serves as a web-based AI server. Employees can access AI tools securely through a browser, without the need for complicated setups or specialized software.

  • Your data privacy and security are the top priority. UBOX-AI operates as a private deployment, ensuring that all AI processes run locally, keeping your business data secure and confidential while delivering the benefits of cutting-edge AI technology.

Current features include:

  • Dochat - chat with your document
  • Doctract - extract key elements from your document
  • Docompare - compare two documents to highlight differences
  • DocKnow - chat with your knowledge warehouse (FAQ & documents)

Future work:

  • Improvements to the Knowledge Warehouse Manager.
  • More AI applications to come.

Architecture

All services are deployed on a single host machine.

  • You can opt to host the AI services in a Docker container (the easiest way to deploy).
  • For the LLM, Ollama is supported: set up Ollama on the host machine.
    • If you prefer not to set up Ollama, you can use OpenAI instead by providing your OpenAI API key (a quick reachability check is sketched below).
(Architecture diagram screenshot)
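
Before installing UBOXAI, it can help to confirm the LLM backend is reachable. A minimal sketch, assuming Ollama running on its default port 11434 on the host machine:

    # List the models Ollama has pulled; a JSON response means the server is up
    curl http://localhost:11434/api/tags

    # Equivalent check using the Ollama CLI
    ollama list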

Installation

  • Docker installation

  • Qdrant vector database installation

  • Ollama & models (if you plan to use OpenAI, skip this step). Note: OpenAI and Ollama models use different embeddings, so search results will not be satisfactory if you mix them. A setup sketch follows below.
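
A minimal prerequisite-setup sketch for a Linux host. Assumptions: Qdrant is run from its official Docker image on its default port 6333, and llama3.1 stands in for whichever model you actually plan to use:

    # Install Ollama (Linux) and pull a model to serve as the local LLM
    curl -fsSL https://ollama.com/install.sh | sh
    ollama pull llama3.1    # example model name; substitute your own

    # Start the Qdrant vector database in Docker, persisting data to ./qdrant_storage
    docker run -d --name qdrant -p 6333:6333 \
        -v $(pwd)/qdrant_storage:/qdrant/storage:z \
        qdrant/qdrant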

  • UBOXAI installation: either pull the Docker image (the easiest way) or clone the git repository, build, and run.

    • Docker way
      • docker pull awtestergit/uboxai:latest
      • If you use Ollama as set up above:
      • docker run --name uboxai -p 11434:11434 -p 5000:5000 -p 5010:5010 -p 5050:5050 -it -d --privileged -v ./:/orig -v $(pwd)/qdrant_storage:/qdrant/storage:z -v dind-certs:/certs -v /var/run/docker.sock:/var/run/docker.sock -e DOCKER_TLS_CERTDIR=/certs awtestergit/uboxai:latest ./entry_start.sh
      • If you use OpenAI:
      • docker run --name uboxai -p 11434:11434 -p 5000:5000 -p 5010:5010 -p 5050:5050 -it -d --privileged -v ./:/orig -v $(pwd)/qdrant_storage:/qdrant/storage:z -v dind-certs:/certs -v /var/run/docker.sock:/var/run/docker.sock -e DOCKER_TLS_CERTDIR=/certs awtestergit/uboxai:latest ./entry_start.sh your_openai_key
      • That is it! You can go to http://localhost:5050 on your host machine, or http://<host_machine_ip>:5050 from another machine on the same network, where host_machine_ip is the IP of your host machine, e.g., 192.168.x.xx.
      • Dochat, Doctract, and Docompare work at this point; for DocKnow, you first need to populate the Knowledge Warehouse (see below).
    • Git pull way
      • Prepare a virtual environment (use your favorite environment management tool), for example using conda:
        conda create -n uboxai python
        conda activate uboxai
      • Clone the repository and install dependencies:
        git clone https://github.com/awtestergit/UBOXAI.git
        cd UBOXAI/server
        pip install -r requirements.txt
      • Nginx installation
        • https://nginx.org/en/docs/install.html
        • Configure sites (shown as a Linux example; check the nginx documentation for other platforms):
          • Copy the 'uboxai' file in the UBOXAI folder you just cloned (this is the nginx configuration) to /etc/nginx/sites-available (e.g., cp uboxai /etc/nginx/sites-available)
            • Modify the 'root /uboxai/ui/build;' line in the uboxai file, replacing the path with your own ui/build path as necessary
          • ln -s /etc/nginx/sites-available/uboxai /etc/nginx/sites-enabled/ (you will probably need sudo)
        • Restart nginx (on Linux: service nginx restart); a consolidated command sketch follows this list
      • Start uboxai
        • In server/config.json, change RUN_IN_DOCKER=1 to RUN_IN_DOCKER=0
        • At the uboxai root, start ./entry_start.sh if using Ollama, or ./entry_start.sh <your_openai_key> if using OpenAI
          • Note: if you need other machines on the same subnet to access UBOXAI, replace 'source start_uboxai.sh 127.0.0.1' with 'source start_uboxai.sh <host_machine_ip>' in entry_start.sh, where host_machine_ip is the IP of your host machine, e.g., 192.168.x.xx.
          • If you get 'permission denied: ./entry_start.sh', you need to set the execution permission, e.g.: chmod +x ./entry_start.sh
        • That is it! You can go to http://localhost:5050 on your host machine, or http://<host_machine_ip>:5050 from another machine on the same network (a start-and-verify sketch follows this list).
      • If you want to modify the UI and build it yourself, you need to install node, npm, etc.; take a look at /ui/requirements.txt
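
The nginx steps above, consolidated into commands. This is a sketch for a Debian/Ubuntu-style nginx layout with sites-available/sites-enabled; adjust paths for your distribution:

    # Install the provided nginx site configuration (run from the cloned UBOXAI folder)
    sudo cp uboxai /etc/nginx/sites-available/uboxai
    sudo ln -s /etc/nginx/sites-available/uboxai /etc/nginx/sites-enabled/uboxai

    # After pointing the 'root ...;' line at your own ui/build path,
    # test the configuration and restart nginx
    sudo nginx -t
    sudo service nginx restart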
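
A consolidated start-and-verify sketch for the Git pull way, assuming the quoted line in entry_start.sh and using 192.168.1.20 purely as an example host IP (substitute your own):

    # Optional: let other machines on the subnet reach UBOXAI by binding to the host IP
    sed -i 's/source start_uboxai.sh 127.0.0.1/source start_uboxai.sh 192.168.1.20/' entry_start.sh

    # Make the entry script executable and start UBOXAI (append your OpenAI key if not using Ollama)
    chmod +x ./entry_start.sh
    ./entry_start.sh

    # Verify the web UI responds (this check works for the Docker way as well)
    curl -I http://localhost:5050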

Knowledge Warehouse

Once you have UBOXAI set up, you can populate the Knowledge Warehouse, which can store FAQs and your documents (PDF and DOCX formats are supported). Note: the Knowledge Warehouse Manager still needs plenty of work.

  • If you use the Docker way, you can run this command to start the container, then open a shell inside it (e.g., docker exec -it uboxai /bin/bash):

    •   docker run --name uboxai -p 11434:11434 -p 5000:5000 -p 5010:5010 -p 5050:5050 -it -d --privileged -v ./:/orig -v $(pwd)/qdrant_storage:/qdrant/storage:z -v dind-certs:/certs -v /var/run/docker.sock:/var/run/docker.sock -e DOCKER_TLS_CERTDIR=/certs awtestergit/uboxai:latest
    • Go to the folder /uboxai/server
    • Activate the virtual environment: 'source /uboxai/bin/activate'
    • Start the Knowledge Warehouse Manager: 'python vectordb_manager.py'
  • If you use the Git pull way, activate the virtual environment, e.g., 'conda activate uboxai'

    • Start the Knowledge Warehouse Manager: 'python vectordb_manager.py'
  • After the Knowledge Warehouse Manager starts, use a web browser on the host machine (http://localhost:5010) to populate the warehouse

    • For FAQs, you can bulk load by creating a file from the FAQ template; check the provided 'faq template example.csv' (a hypothetical sketch follows this list)
    • For documents, you can use the manager to upload file(s)
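
For the FAQ bulk load, the actual column layout is defined by the 'faq template example.csv' shipped with the repository; the commands below are only a hypothetical illustration of a question/answer style CSV:

    # Create a hypothetical FAQ file; follow the real template's column names
    printf '%s\n' \
        'question,answer' \
        '"What are your business hours?","Monday to Friday, 9am to 6pm."' \
        '"How do I reset my password?","Use the Forgot password link on the login page."' \
        > my_faq.csv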

After populating the warehouse, you can use DocKnow to query it: if the query hits an FAQ, the answer is returned directly from the FAQ; otherwise, the query searches the documents uploaded to the vector database and the LLM answers accordingly.

About

UBOX-AI: Bringing AI within reach of every business, simplifying work, and amplifying productivity.
