Skip to main content
Lyzr Studio’s Classic Knowledge Base creates a no-code RAG pipeline for searchable document and website understanding. Use it when an agent needs grounded answers from unstructured content such as PDFs, DOCX files, text, or web pages.

Creating a Knowledge Base

  1. Go to Knowledge Base and select + New.
  2. Select Basic as the Knowledge Base Type. This option uses simple vector-based retrieval with embeddings.
  3. Enter a Name (letters, numbers, and underscores only) and an optional Description.
  4. Select a Vector Store and LLM Embedding Model.
  5. Select Create Knowledge Base.
Screenshot of the Create Knowledge Base modal showing the three KB type options (Basic, Graph, Semantic Data Model), a Name field, Description field, Vector Store selector set to Qdrant, and LLM Embedding Model selector.
  1. Add content through file upload, text, URL, or live source.
  2. Train the KB.
  3. Attach it to an agent through the agent builder’s Knowledge Base feature.

Supported file types

  • .pdf
  • .doc
  • .docx
  • .txt
  • Website URLs

Upload limitations

LimitValue
Files per upload5
File sizeLess than 15 MB each
RecommendationUpload in batches and test retrieval quality between batches

Chunking strategy

Chunking controls how documents are split before embedding. Smaller chunks improve precision; larger chunks preserve context.
SettingDescription
Chunk sizeMaximum number of tokens in each chunk
OverlapNumber of tokens shared between adjacent chunks, preserving context at boundaries
Number of chunksMaximum number of chunks returned per query

Retrieval types

TypeBest for
Basic RetrievalGeneral vector similarity search
MMR (Maximal Marginal Relevance)Reducing duplicate chunks while preserving relevance
HyDE (Hypothetical Document Embeddings)Improving retrieval accuracy on open-ended or vague queries

Score threshold

The score threshold filters out chunks whose similarity score falls below a minimum value. Raising the threshold improves answer precision but may reduce recall on borderline queries. Start at the default and adjust based on test results.

Playground Retrieval

Once the KB is trained, open it and use the Playground Retrieval panel on the right to test retrieval before attaching it to an agent. Type a query in the input field and select Retrieve to see the matching chunks and their similarity scores. This lets you verify that the right content is surfacing for representative questions and catch chunking or configuration issues before doing a full agent deployment. Screenshot of the Knowledge Base detail view showing a file list on the left and the Playground Retrieval panel on the right with a query input field and a Retrieve button.

Live Sources

Live Sources automatically sync content on a configurable frequency. When new content is detected, Lyzr adds the delta instead of re-ingesting everything. Available for:
  • SharePoint: syncs documents from selected SharePoint sites
  • Website: crawls and re-indexes updated pages
  • Google Drive: planned support
Minimum sync frequency is 1 hour.

KB-as-a-Service

A trained Knowledge Base is also accessible as a standalone API endpoint, independent of any agent. This lets other services or agents built outside Lyzr Studio query the same KB directly, which is useful for shared knowledge repositories or multi-product deployments.

Next steps