Files
aurak/server
Developer 0b0a060967 fix: 全部TS错误修复(25->0) + 证书API 500修复 + i18n缺失key补全 + 类型定义修正
- 证书API 500修复: AssessmentCertificate实体注册到app.module.ts
- 前端TS错误25个清零: i18n key 17个, 类型定义8个
- i18n补全: 17个缺失key添加到zh/en/ja
- KnowledgeFile类型: 添加title, content字段
- importService: 改用apiClient.request替代raw fetch
- ModeSelector: 移除jsx prop
- questionBankService: .ok -> .status >= 400
- NotebookDetailView: .filter -> .items.filter
- ImportTasksDrawer: tasks.items提取
- API端点审计: 16/16通过
- 数据库Schema审计: 25表288列一致
- AGENTS.md更新
2026-05-18 08:30:59 +08:00
..

Simple Knowledge Base - Backend Service (Server)

A RAG (Retrieval-Augmented Generation) knowledge base backend system built with the NestJS framework. Provides file parsing, vector indexing, hybrid search, and multi-model management features.

🌟 Key Features

  • Intelligent Document Processing: Integrated with Apache Tika to support text extraction from various formats including PDF, Word, Markdown, and TXT.
  • Efficient Vector Search: Uses Elasticsearch as the vector database. Supports hybrid search combining KNN vector search and full-text search.
  • Flexible RAG Engine: Built on LangChain. Supports customization of chunking rules and reranking.
  • Multi-Model Provider: Supports dynamic connection to OpenAI, Google Gemini, and locally deployed LLM models.
  • Secure Management: Built-in JWT authentication and user permission management.

🛠️ Tech Stack

  • Framework: NestJS (TypeScript)
  • Database: SQLite (TypeORM)
  • Search Engine: Elasticsearch 8.x/9.x
  • AI Framework: LangChain
  • Libraries: RxJS, Class-Validator

📋 Prerequisites

Before running the project, ensure you have the following environment set up:

  • Node.js (v18 or higher recommended)
  • Yarn
  • Docker & Docker Compose (for infrastructure)

🚀 Quick Start

1. Start Infrastructure

Use the docker-compose.yml file in the project root (simple-kb/) to quickly start Elasticsearch and Tika.

# Run from project root directory
docker-compose up -d

After successful startup:

  • Elasticsearch: Listens on port 19200 (maps container's 9200 port)
  • Tika: Listens on port 9998

2. Install Dependencies

Navigate to the server directory and install packages:

cd server
yarn install

3. Environment Configuration

The project uses environment variables for configuration. Make sure the settings are correct (especially Elasticsearch address):

# Database path
DATABASE_PATH=server/data/metadata.db

# JWT secret
JWT_SECRET=your_secure_secret

# Elasticsearch settings (match docker-compose ports)
ELASTICSEARCH_HOST=http://localhost:19200
ELASTICSEARCH_INDEX=knowledge_base

# Tika settings
TIKA_HOST=http://localhost:9998

# File upload storage path
UPLOAD_FILE_PATH=./uploads

4. Start Services

# Development mode (recommended, with hot reload)
yarn run start:dev

# Build and run in production mode
yarn build
yarn run start:prod

The backend service runs on http://localhost:13000 by default, with API prefix /api.

🧪 Testing

# Unit tests
yarn run test

# E2E tests
yarn run test:e2e

⚠️ Notes and Tips

  1. Database Initialization:

    • On first run, TypeORM automatically creates metadata.db (or configured DB) under the server/data/ directory.
    • In development mode, with synchronize: true, table structures are auto-synced.
  2. Elasticsearch Connection:

    • If you encounter a Connection refused error, check if Docker containers are running properly (docker ps).
    • On service startup, an index named knowledge_base is automatically detected and created.
  3. Default Account:

    • If you reset the database, either register a new user or refer to existing admin data. It's recommended to use the frontend registration feature to create the first user.
  4. File Parsing:

    • When uploading large files, parsing by Tika may take a few seconds. Please wait while checking the frontend processing status.