Blog

RAG Chatbot 2: Setting up Open WebUI on a cloud VM

The use-case: Imagine your time wants to use the company’s internal AI system collaboratively. Possibly, your company hosts Ollama with an internal RAG pipeline to make internal documents available to its employees. Open WebUI (OUI) provides a set of capabilities that make it ideal for this situation: It offers a flexible and secure solution for natural language processing (NLP) tasks by providing key features that enable data control, customization, and scalability.

September 22, 2025

RAG Chatbot 1: Project structure

A term one must come across when aiming to deploy a chatbot for the internal use within a company is RAG, translating to Retrieval Augmented Generation. What is it about? RAG combines a generator model with a retriever model to produce high-quality text. The generator produces an initial draft, while the retriever retrieves relevant existing texts and incorporates them into the output. This hybrid approach can improve coherence, variety, and factual accuracy in generated text. (Meta-Llama-3-8B-Instruct.Q4_0.gguf “What is RAG in context of chatbots?”)

July 28, 2025