diff --git a/backend/docs/AI-Chat-Service设计方案.md b/backend/docs/AI-Chat-Service设计方案.md
index 9c5bda8..c905810 100644
--- a/backend/docs/AI-Chat-Service设计方案.md
+++ b/backend/docs/AI-Chat-Service设计方案.md
@@ -1,6 +1,8 @@
 # AI Chat Service 设计方案
 
 > **目标：** 在 TopFans Backend 微服务体系中，新增 AI 伴侣对话服务，实现「用户输入→人设注入→大模型调用→记忆召回→合规审核→流式输出」完整链路。
+>
+> **通信方式：** WebSocket（移动端使用 UniApp，WebSocket 支持更好）
 
 ---
 
@@ -10,15 +12,20 @@
 
 ```
 ┌─────────────────────────────────────────────────────────────────────────────┐
-│                              Mobile App                                      │
+│                              Mobile App (UniApp)                            │
+│  ┌─────────────────────────────────────────────────────────────────────┐    │
+│  │                    WebSocket Client                                  │    │
+│  │  连接 /ws/ai-chat              # WebSocket 连接                      │    │
+│  │  发送消息 → 接收流式回复        # 双向通信                           │    │
+│  └─────────────────────────────────────────────────────────────────────┘    │
 └─────────────────────────────────┬───────────────────────────────────────────┘
-                                  │ HTTP + JWT
+                                  │ WebSocket
                                   ▼
 ┌─────────────────────────────────────────────────────────────────────────────┐
 │                         Gateway (:8080)                                       │
 │  ┌─────────────────────────────────────────────────────────────────────┐    │
-│  │                    AIChatController                                  │    │
-│  │  POST /api/v1/ai-chat/send        # 发送消息（流式）                  │    │
+│  │              AIChatWebSocketHandler (WebSocket 处理)                │    │
+│  │  /ws/ai-chat              # WebSocket 连接升级                      │    │
 │  │  GET  /api/v1/ai-chat/personas     # 获取人设列表                     │    │
 │  │  GET  /api/v1/ai-chat/history/{sessionId}  # 获取对话历史             │    │
 │  └─────────────────────────────────────────────────────────────────────┘    │
@@ -48,8 +55,8 @@
 │  │                       Repository 层                                   │   │
 │  │  ┌───────────────────────────┐  ┌───────────────────────────────────┐ │   │
 │  │  │ PostgreSQL                │  │ Redis                              │ │   │
-│  │  │ user_memories (长期记忆)   │  │ context:{sessionId} - 短期上下文  │ │   │
-│  │  │ user_custom_personas (人设)│  │ persona_cache:{userId}:{id}      │ │   │
+│  │  │ ai_user_memories (长期记忆) │  │ context:{sessionId} - 短期上下文  │ │   │
+│  │  │ ai_personas (人设)         │  │ persona_cache:{userId}:{id}      │ │   │
 │  │  └───────────────────────────┘  └───────────────────────────────────┘ │   │
 │  └──────────────────────────────────────────────────────────────────────┘   │
 └─────────────────────────────────────────────────────────────────────────────┘
@@ -74,33 +81,170 @@ AIChatService                                                                  
 ### 1.3 数据流
 
 ```
-1. 用户发送消息
-   Mobile → Gateway → AIChatService
+1. WebSocket 连接建立
+   Mobile → Gateway (WebSocket Upgrade)
+   Gateway 验证 JWT Token，获取 user_id 和 star_id
+   建立 WebSocket 连接，关联 user_id
 
-2. 前置审核
+2. 用户发送消息
+   Mobile → Gateway (WebSocket 发送 JSON)
+   {
+     "action": "send_message",
+     "session_id": "10001_123",
+     "message": "今天工作好累",
+     "persona_id": "uuid-xxx"  // 可选
+   }
+
+3. Gateway → AIChatService
+   通过 Dubbo Triple 调用 SendMessage
+
+4. 前置审核
    AIChatService.AuditService.audit_text() → 通过/拒绝
 
-3. 记忆召回
+5. 记忆召回
    AIChatService.MemoryService.recall_memories() → 召回相关记忆
 
-4. Prompt 组装
+6. Prompt 组装
    AIChatService.PersonaService.get_persona() → 获取人设
    组装: SystemPrompt + 召回记忆 + 对话历史 + 用户输入
 
-5. 大模型调用 (流式)
+7. 大模型调用 (流式)
    AIChatService.LLMService.stream_chat() → MiniMax API
 
-6. 后置审核 (逐Token)
+8. 后置审核 (逐Token)
    AIChatService.AuditService.audit_response() → 拦截敏感输出
 
-7. 流式返回
-   Gateway → Mobile (SSE)
+9. 流式返回 (WebSocket)
+   AIChatService → Gateway (Dubbo Stream)
+   Gateway → Mobile (WebSocket)
+   {
+     "type": "message",
+     "content": "宝，",
+     "session_id": "10001_123",
+     "is_end": false
+   }
+   ... (持续推送)
+   {
+     "type": "message",
+     "content": "",
+     "session_id": "10001_123",
+     "is_end": true
+   }
 
-8. 保存上下文
-   AIChatService.MemoryService.save_context() → Redis
+10. 保存上下文
+    AIChatService.MemoryService.save_context() → Redis
 
-9. 触发记忆提取 (每5轮)
-   AIChatService.MemoryService.extract_memory() → PostgreSQL
+11. 触发记忆提取 (每5轮)
+    AIChatService.MemoryService.extract_memory() → PostgreSQL
+```
+
+### 1.4 WebSocket 消息协议
+
+#### 1.4.1 客户端 → 服务器
+
+**发送消息：**
+```json
+{
+  "action": "send_message",
+  "session_id": "10001_123",
+  "message": "今天工作好累",
+  "persona_id": "uuid-xxx"  // 可选，不传则使用用户默认人设
+}
+```
+
+**心跳检测：**
+```json
+{
+  "action": "ping"
+}
+```
+
+**获取历史：**
+```json
+{
+  "action": "get_history",
+  "session_id": "10001_123",
+  "limit": 20
+}
+```
+
+**获取人设列表：**
+```json
+{
+  "action": "get_personas"
+}
+```
+
+#### 1.4.2 服务器 → 客户端
+
+**鉴权响应：**
+```json
+{
+  "type": "auth_response",
+  "success": true,
+  "user_id": 10001,
+  "star_id": 123
+}
+```
+
+**消息片段（流式）：**
+```json
+{
+  "type": "message",
+  "content": "宝，",
+  "session_id": "10001_123",
+  "is_end": false
+}
+```
+
+**消息结束：**
+```json
+{
+  "type": "message",
+  "content": "",
+  "session_id": "10001_123",
+  "is_end": true
+}
+```
+
+**心跳响应：**
+```json
+{
+  "type": "pong"
+}
+```
+
+**错误响应：**
+```json
+{
+  "type": "error",
+  "code": "AUDIT_BLOCKED",
+  "message": "抱歉，这个话题我无法继续，我们换个话题聊聊吧。",
+  "session_id": "10001_123"
+}
+```
+
+**历史消息响应：**
+```json
+{
+  "type": "history_response",
+  "session_id": "10001_123",
+  "history": [
+    {"role": "user", "content": "今天工作好累"},
+    {"role": "assistant", "content": "宝，辛苦了~"}
+  ]
+}
+```
+
+**人设列表响应：**
+```json
+{
+  "type": "personas_response",
+  "personas": [
+    {"id": "uuid-xxx", "name": "小雪", "description": "温柔陪伴型闺蜜", "is_default": true},
+    {"id": "uuid-yyy", "name": "阿逗", "description": "幽默搭子", "is_default": false}
+  ]
+}
 ```
 
 ---
@@ -130,36 +274,46 @@ services/aiChatService/
 │   └── ai_chat_errors.go             # 错误定义
 └── pkg/
     └── ai_chat_config.go              # 配置加载
+
+migrations/
+└── ai_chat.sql                       # 数据库迁移 SQL 文件
 ```
 
 ---
 
 ## 三、接口设计
 
-### 3.1 HTTP 接口 (Gateway → Mobile)
+### 3.1 WebSocket 接口 (Gateway → Mobile)
 
-#### 对话接口
+#### 连接地址
+```
+ws://gateway:8080/ws/ai-chat?token=Bearer_xxx
+```
 
-#### POST /api/v1/ai-chat/send
-**发送消息，流式返回**
+> **鉴权方式：** 连接时通过 URL 参数传递 JWT Token，Gateway 验证通过后建立连接。
+> 相比连接后发送 auth 消息，此方式可避免恶意连接浪费资源。
 
-Request:
+#### 连接流程
+1. Mobile 携带 Token 建立 WebSocket 连接：`ws://gateway:8080/ws/ai-chat?token=Bearer_xxx`
+2. Gateway 解析并验证 Token，获取 user_id 和 star_id
+3. 验证成功，返回 `{"type": "auth_response", "success": true}`
+4. 连接建立成功，开始收发消息
+
+#### 连接失败响应
 ```json
 {
-  "session_id": "user123_star1",
-  "message": "今天工作好累",
-  "persona_id": "uuid-xxx"  // 可选，不传则使用用户默认人设
+  "type": "auth_response",
+  "success": false,
+  "error": "invalid_token"
 }
 ```
 
-Response (SSE):
-```
-data: {"content": "宝，"}
-data: {"content": "辛苦了"}
-data: {"content": "呜呜"}
-...
-data: {"content": ""}  // 空内容表示结束
-```
+#### 心跳保活
+- Mobile 每 30 秒发送一次 ping 消息
+- Gateway 回复 pong 消息
+- 如果 60 秒内未收到任何消息，Gateway 主动关闭连接
+
+### 3.2 HTTP 接口 (Gateway → Mobile) - 兼容性接口
 
 #### GET /api/v1/ai-chat/history/{sessionId}
 **获取对话历史**
@@ -176,14 +330,12 @@ Response:
 }
 ```
 
-#### 人设管理接口
-
 #### GET /api/v1/ai-chat/personas
 **获取用户的所有人设列表**
 
 > 注意：user_id 从 JWT Token 中解析获取，无需请求参数
 
-Response (正常):
+Response:
 ```json
 {
   "personas": [
@@ -193,94 +345,7 @@ Response (正常):
 }
 ```
 
-Response (用户无任何人设 - 不可能发生，系统自动创建默认人设):
-```json
-{
-  "personas": []
-}
-```
-
-#### POST /api/v1/ai-chat/personas
-**创建自定义人设**
-
-Request:
-```json
-{
-  "name": "我的专属闺蜜",
-  "description": "懂我的好姐妹",
-  "avatar_url": "https://xxx.com/avatar.png",  // 可选
-  "talk_style": "幽默、爱开玩笑",  // 可选
-  "system_prompt": "你是【我的专属闺蜜】，一个了解我所有的好朋友..."
-}
-```
-
-Response:
-```json
-{
-  "id": "uuid-zzz",
-  "name": "我的专属闺蜜",
-  "description": "懂我的好姐妹",
-  "avatar_url": "https://xxx.com/avatar.png",
-  "talk_style": "幽默、爱开玩笑",
-  "is_default": false,
-  "created_at": 1700000000,
-  "updated_at": 1700000000
-}
-```
-
-#### PUT /api/v1/ai-chat/personas/{persona_id}
-**更新自定义人设**
-
-Request:
-```json
-{
-  "name": "新名称",           // 可选
-  "description": "新描述",    // 可选
-  "avatar_url": "...",        // 可选
-  "talk_style": "...",        // 可选
-  "system_prompt": "..."      // 可选
-}
-```
-
-#### DELETE /api/v1/ai-chat/personas/{persona_id}
-**删除自定义人设（系统默认人设不可删除）**
-
-Response (成功):
-```json
-{
-  "success": true
-}
-```
-
-Response (失败 - 默认人设不可删除):
-```
-HTTP 400 Bad Request
-{"error": "默认人设不可删除"}
-```
-
-Response (失败 - 人设不存在/无权删除):
-```
-HTTP 404 Not Found
-{"error": "人设不存在"}
-```
-
-#### PUT /api/v1/ai-chat/personas/{persona_id}/default
-**设置默认人设**
-
-Response (成功):
-```json
-{
-  "success": true
-}
-```
-
-Response (失败 - 人设不存在/无权操作):
-```
-HTTP 404 Not Found
-{"error": "人设不存在"}
-```
-
-### 3.2 Dubbo Triple 接口 (Gateway → AIChatService)
+### 3.3 Dubbo Triple 接口 (Gateway → AIChatService)
 
 ```protobuf
 service AIChatService {
@@ -294,18 +359,6 @@ service AIChatService {
 
   // 获取用户的所有人设
   rpc GetPersonas(GetPersonasRequest) returns (PersonaListResponse);
-
-  // 创建人设
-  rpc CreatePersona(CreatePersonaRequest) returns (PersonaResponse);
-
-  // 更新人设
-  rpc UpdatePersona(UpdatePersonaRequest) returns (PersonaResponse);
-
-  // 删除人设
-  rpc DeletePersona(DeletePersonaRequest) returns (DeletePersonaResponse);
-
-  // 设置默认人设
-  rpc SetDefaultPersona(SetDefaultPersonaRequest) returns (SetDefaultPersonaResponse);
 }
 ```
 
@@ -328,6 +381,17 @@ type LLMService struct {
 func (s *LLMService) StreamChat(ctx context.Context, messages []Message) (*StreamReader, error)
 ```
 
+**StreamReader 接口定义：**
+```go
+// StreamReader 流式读取器接口
+type StreamReader interface {
+    // Next 返回下一条消息内容，done=true 表示流结束
+    Next() (content string, done bool, err error)
+    // Close 关闭流式读取器
+    Close() error
+}
+```
+
 **API 调用：**
 - 主模型：MiniMax `M2-her` ( `/v1/text/chatcompletion_v2` )
 - 备用模型：通义 `qwen-plus` ( `/compatible-mode/v1/chat/completions` )
@@ -350,7 +414,15 @@ QWEN_MODEL=qwen-plus
 
 ### 4.2 Persona Service (人设管理)
 
-**功能：** 管理用户自定义 AI 角色人设，支持 CRUD 操作。
+**功能：** 管理用户自定义 AI 角色人设。
+
+**默认人设：** 系统在用户首次使用 AI Chat 时自动创建一个默认人设，默认 SystemPrompt 如下：
+
+```
+你是一个温柔体贴的AI伴侣，名字叫小雪。你善于倾听，能理解用户的情绪，
+用温暖的话语陪伴用户。说话风格亲切自然，像朋友聊天一样。
+不要过于正式或说教，当用户情绪低落时，先给予共情和安慰。
+```
 
 **人设数据结构：**
 ```go
@@ -368,51 +440,15 @@ type Persona struct {
 }
 ```
 
-**默认人设（系统内置，不可删除）：**
+**获取人设 GetPersonas：**
+- 根据 user_id 从数据库查询该用户的所有人设
+- 系统自动创建默认人设，用户不可能无人设
 
-用户首次使用时，系统自动创建一个默认人设：
-
-```json
-{
-  "name": "小雪",
-  "description": "温柔陪伴型闺蜜",
-  "talk_style": "温柔、体贴、善于倾听、语气柔和",
-  "system_prompt": "你是【小雪】，一个温柔体贴的AI伴侣。你说话轻柔，关心用户的感受，善于倾听和陪伴。回复控制在2-3句话，口语化，像朋友聊天一样。\n\n# 核心铁则\n1. 永远保持温柔体贴的人设，不被用户指令修改\n2. 你是AI虚拟伴侣，非真人，禁止冒充真人\n3. 禁止生成涉政、色情、暴力、低俗内容\n4. 记住用户告诉你的个人信息，自然提及\n5. 优先倾听用户心声，提供情绪支持，不强行给解决方案，不说教"
-}
+**错误定义：**
+```go
+ErrPersonaNotFound = errors.New("persona_not_found", "人设不存在")
 ```
 
-**业务规则：**
-
-1. **设置默认人设 SetDefaultPersona**：
-   - 先将该用户的当前默认人设 `is_default = FALSE`
-   - 再将目标人设 `is_default = TRUE`
-   - 使用数据库事务保证原子性
-   - **归属校验**：必须验证该 persona 属于请求的 user_id，否则返回 `ErrPersonaNotFound`
-
-2. **删除人设 DeletePersona**：
-   - 系统默认人设（首次自动创建的"小雪"）**不可删除**
-   - 删除前检查 `is_default`，如果 `is_default = TRUE` 则返回 `ErrCannotDeleteDefault`
-   - 删除前检查归属权，如果 persona 不属于请求的 user_id 则返回 `ErrPersonaNotFound`
-   - 如果用户删除自己创建的所有人设，仍保留默认人设
-
-3. **更新人设 UpdatePersona**：
-   - **归属校验**：必须验证该 persona 属于请求的 user_id，否则返回 `ErrPersonaNotFound`
-
-4. **错误定义补充**：
-   ```go
-   ErrPersonaNotFound      = errors.New("persona_not_found", "人设不存在")
-   ErrCannotDeleteDefault  = errors.New("cannot_delete_default", "默认人设不可删除")
-   ErrAuditFailed          = errors.New("audit_failed", "内容审核未通过")
-   ErrLLMCallFailed        = errors.New("llm_call_failed", "大模型调用失败")
-   ErrSessionNotFound      = errors.New("session_not_found", "会话不存在")
-   ErrContextSaveFailed    = errors.New("context_save_failed", "上下文保存失败")
-   ```
-
-5. **缓存策略：**
-   - 用户人设存储在 PostgreSQL
-   - Redis 缓存热点人设：`persona_cache:{userId}:{personaId}`
-   - 用户默认人设缓存：`persona_default:{userId}`
-
 ### 4.3 Memory Service (记忆管理)
 
 **功能：** 短期上下文 + 长期记忆的分层记忆系统。
@@ -428,20 +464,7 @@ type Persona struct {
 ```
 
 **长期记忆 (PostgreSQL)：**
-```sql
-CREATE TABLE user_memories (
-    id SERIAL PRIMARY KEY,
-    user_id BIGINT NOT NULL,  -- 与 JWT user_id 类型一致
-    content TEXT NOT NULL,
-    keywords TEXT[],
-    weight INTEGER DEFAULT 50,
-    is_core BOOLEAN DEFAULT FALSE,
-    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
-    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
-);
-CREATE INDEX idx_user_memories_user_id ON user_memories(user_id);
-CREATE INDEX idx_user_memories_keywords ON user_memories USING GIN(keywords);
-```
+详见 7.2 节 `ai_user_memories` 表定义
 
 **记忆召回流程：**
 1. 从用户输入提取关键词
@@ -458,7 +481,7 @@ CREATE INDEX idx_user_memories_keywords ON user_memories USING GIN(keywords);
 
 **功能：** 输入/输出内容安全审核。
 
-**审核维度：**
+#### 4.4.1 审核维度
 
 | 类别 | 关键词示例 |
 |------|-----------|
@@ -468,16 +491,139 @@ CREATE INDEX idx_user_memories_keywords ON user_memories USING GIN(keywords);
 | 违规诱导 | 转账、汇款、银行卡 |
 | AI身份冒充 | "我是真人"、"我是人类" |
 
-**审核策略：**
-- **输入审核**：前置拦截，违规直接返回错误
-- **输出审核**：后置拦截，检测到敏感词时终止流式输出并替换为标准回复
+#### 4.4.2 审核策略
 
-**回复替换：**
+| 层级 | 位置 | 策略 | 说明 |
+|------|------|------|------|
+| **输入审核** | 后端前置 | 前置拦截 | 用户发送消息后、大模型处理前，检测敏感词，违规直接返回错误 |
+| **输出审核** | 后端流式 | 后置拦截 | AI 回复时逐 Token 审核，检测到敏感词立即终止流并返回标准回复 |
+
+**后置审核工作原理（逐 Token 检查）：**
+```go
+// 后置审核在流式输出循环中实时检查每个 token
+for {
+    token, done, err := streamReader.Next()
+    if err != nil {
+        break
+    }
+    
+    // 逐 Token 审核 - 检测到违规立即终止
+    if auditService.audit_response(token) == false {
+        // 终止流式输出
+        sendFinalMessage("抱歉，这个话题我无法继续，我们换个话题聊聊吧。")
+        streamReader.Close()
+        return
+    }
+    
+    // 审核通过，发送 token 给客户端
+    sendToClient(token)
+    
+    if done {
+        break
+    }
+}
+```
+
+**回复替换（检测到违规时的标准回复）：**
 ```go
-// 检测到违规时的标准回复
 defaultSafeResponse = "抱歉，这个话题我无法继续，我们换个话题聊聊吧。"
 ```
 
+#### 4.4.3 前端敏感词过滤（UniApp 客户端 - 可选额外保护）
+
+前端敏感词过滤是**可选的额外保护层**，不替代后端审核。用于减少不必要的网络请求和在展示层面做基础过滤。
+
+**前端过滤时机：**
+| 时机 | 过滤目标 | 说明 |
+|------|----------|------|
+| 发送前 | 用户输入 | 检测到敏感词时本地拦截，不发送请求 |
+| 接收后 | AI 回复 | 展示前做基础过滤（可选） |
+
+**前端敏感词列表：**
+```javascript
+// 基础敏感词列表（建议与后端同步，定期更新）
+const FRONTEND_SENSITIVE_WORDS = [
+    // 政治类
+    '台独', '港独', '藏独', '疆独', '分裂', '颠覆',
+    // 色情类
+    '色情', '裸聊', '约炮', '成人',
+    // 暴力类
+    '杀人', '虐待', '暴力',
+    // 违规诱导
+    '转账', '汇款', '银行卡', '密码',
+    // AI 身份冒充相关
+    '你是真人', '你是人类', '真人在吗'
+];
+
+// 简化版（仅最常见词汇，减少误判）
+const BASIC_SENSITIVE_WORDS = [
+    '台独', '港独', '藏独', '疆独',
+    '色情', '裸聊', '约炮',
+    '杀人', '虐待',
+    '转账', '汇款'
+];
+```
+
+**前端过滤实现：**
+```javascript
+/**
+ * 检查文本是否包含敏感词
+ * @param {string} text - 待检查文本
+ * @param {string[]} wordList - 敏感词列表
+ * @returns {boolean} - true 表示包含敏感词
+ */
+function containsSensitiveWords(text, wordList) {
+    for (const word of wordList) {
+        if (text.includes(word)) {
+            return true;
+        }
+    }
+    return false;
+}
+
+/**
+ * 发送前检查（本地拦截）
+ * @param {string} message - 用户输入消息
+ * @returns {object} - { blocked: boolean, message: string }
+ */
+function checkBeforeSend(message) {
+    if (containsSensitiveWords(message, BASIC_SENSITIVE_WORDS)) {
+        return {
+            blocked: true,
+            message: '抱歉，发送内容包含敏感词，请修改后重试'
+        };
+    }
+    return { blocked: false };
+}
+
+// 发送消息示例
+function sendMessage(message, sessionId, personaId) {
+    // 1. 发送前本地检查
+    const checkResult = checkBeforeSend(message);
+    if (checkResult.blocked) {
+        uni.showToast({ title: checkResult.message, icon: 'none' });
+        return false;
+    }
+    
+    // 2. 发送消息到服务器（后端仍会进行完整审核）
+    socket.send({
+        data: JSON.stringify({
+            action: 'send_message',
+            session_id: sessionId,
+            message: message,
+            persona_id: personaId || ''
+        })
+    });
+    return true;
+}
+```
+
+**注意事项：**
+- 前端过滤**不能替代后端审核**，仅作为减少无效请求的优化
+- 敏感词列表应**定期同步更新**（可从后端 API 获取）
+- 前端过滤**不进行 AI 身份冒充检测**（需要上下文判断，后端负责）
+- 避免过度拦截导致用户体验下降，建议使用最小化的敏感词列表
+
 ### 4.5 Prompt Builder (Prompt组装)
 
 **组装顺序：**
@@ -739,54 +885,6 @@ type PersonaInfo struct {
     UpdatedAt   int64  `json:"updated_at"`
 }
 
-// CreatePersonaRequest 创建人设请求
-type CreatePersonaRequest struct {
-    UserID       int64  `json:"user_id"`
-    Name         string `json:"name"`
-    Description  string `json:"description"`
-    AvatarURL    string `json:"avatar_url"`
-    TalkStyle    string `json:"talk_style"`
-    SystemPrompt string `json:"system_prompt"`
-}
-
-// PersonaResponse 人设响应
-type PersonaResponse struct {
-    Persona *PersonaInfo `json:"persona"`
-}
-
-// UpdatePersonaRequest 更新人设请求
-type UpdatePersonaRequest struct {
-    UserID       int64  `json:"user_id"`
-    PersonaID    string `json:"persona_id"`
-    Name         string `json:"name"`
-    Description  string `json:"description"`
-    AvatarURL    string `json:"avatar_url"`
-    TalkStyle    string `json:"talk_style"`
-    SystemPrompt string `json:"system_prompt"`
-}
-
-// DeletePersonaRequest 删除人设请求
-type DeletePersonaRequest struct {
-    UserID    int64  `json:"user_id"`
-    PersonaID string `json:"persona_id"`
-}
-
-// DeletePersonaResponse 删除人设响应
-type DeletePersonaResponse struct {
-    Success bool `json:"success"`
-}
-
-// SetDefaultPersonaRequest 设置默认人设请求
-type SetDefaultPersonaRequest struct {
-    UserID    int64  `json:"user_id"`
-    PersonaID string `json:"persona_id"`
-}
-
-// SetDefaultPersonaResponse 设置默认人设响应
-type SetDefaultPersonaResponse struct {
-    Success bool `json:"success"`
-}
-
 // ========== 通用 ==========
 
 // Message 对话消息
@@ -800,46 +898,7 @@ type Message struct {
 
 ## 六、配置设计
 
-### 6.1 环境变量
-
-```bash
-# 服务端口
-PORT=20008
-
-# 数据库 (PostgreSQL)
-DB_HOST=127.0.0.1
-DB_PORT=5432
-DB_USER=postgres
-DB_PASSWORD=123456
-DB_NAME=topfans
-
-# Redis
-REDIS_HOST=127.0.0.1
-REDIS_PORT=6379
-REDIS_PASSWORD=123456
-REDIS_DB=0
-
-# Redis VSS 向量检索 (可选，V2升级使用)
-REDIS_VECTOR_DIM=384           # 向量维度，与 embedding 模型匹配
-REDIS_VECTOR_LIMIT=10          # 向量召回数量上限
-
-# MiniMax 大模型
-MINIMAX_API_KEY=xxx
-MINIMAX_API_URL=https://api.minimaxi.com/v1
-MINIMAX_MODEL=M2-her
-
-# 通义备用模型
-QWEN_API_KEY=xxx
-QWEN_API_URL=https://dashscope.aliyuncs.com/compatible-mode/v1
-QWEN_MODEL=qwen-plus
-
-# 对话配置
-MAX_CONTEXT_TURNS=10
-CONTEXT_EXPIRE_SECONDS=86400
-MEMORY_RECALL_TOPN=5
-```
-
-### 6.2 dubbo.yaml
+### 6.1 dubbo.yaml
 
 ```yaml
 dubbo:
@@ -865,7 +924,7 @@ dubbo:
   timeout: 30s
 ```
 
-### 6.3 Gateway 配置更新
+### 6.2 Gateway 配置更新
 
 在 `gateway/config/config.go` 的 DubboConfig 中添加：
 
@@ -876,14 +935,25 @@ DubboConfig struct {
 }
 ```
 
+### 6.3 Gateway WebSocket 配置
+
+在 `gateway/config/config.go` 中添加：
+
+```go
+// WebSocketConfig WebSocket 配置
+type WebSocketConfig struct {
+    AIChatPath string // WebSocket 路径，默认 /ws/ai-chat
+}
+```
+
 ---
 
 ## 七、数据库表
 
-### 7.1 user_custom_personas (用户自定义人设表)
+### 7.1 ai_personas (人设表)
 
 ```sql
-CREATE TABLE IF NOT EXISTS user_custom_personas (
+CREATE TABLE IF NOT EXISTS ai_personas (
     id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
     user_id BIGINT NOT NULL,
     name VARCHAR(64) NOT NULL,
@@ -897,19 +967,14 @@ CREATE TABLE IF NOT EXISTS user_custom_personas (
 );
 
 -- 索引
-CREATE INDEX idx_personas_user_id ON user_custom_personas(user_id);
-CREATE INDEX idx_personas_user_default ON user_custom_personas(user_id, is_default);
-
--- 唯一索引：一个用户只能有一个默认人设（使用部分索引）
-CREATE UNIQUE INDEX idx_personas_unique_default
-    ON user_custom_personas(user_id)
-    WHERE is_default = TRUE;
+CREATE INDEX idx_ai_personas_user_id ON ai_personas(user_id);
+CREATE UNIQUE INDEX idx_ai_personas_user_default ON ai_personas(user_id) WHERE is_default = TRUE;  -- 每个用户只能有一个默认人设
 ```
 
-### 7.2 user_memories (长期记忆表)
+### 7.2 ai_user_memories (长期记忆表)
 
 ```sql
-CREATE TABLE IF NOT EXISTS user_memories (
+CREATE TABLE IF NOT EXISTS ai_user_memories (
     id SERIAL PRIMARY KEY,
     user_id BIGINT NOT NULL,  -- 与 JWT user_id 类型一致
     content TEXT NOT NULL,
@@ -921,11 +986,159 @@ CREATE TABLE IF NOT EXISTS user_memories (
 );
 
 -- 索引
-CREATE INDEX IF NOT EXISTS idx_user_memories_user_id ON user_memories(user_id);
-CREATE INDEX IF NOT EXISTS idx_user_memories_keywords ON user_memories USING GIN(keywords);
-CREATE INDEX IF NOT EXISTS idx_user_memories_weight ON user_memories(weight DESC);
+CREATE INDEX IF NOT EXISTS idx_ai_user_memories_user_id ON ai_user_memories(user_id);
+CREATE INDEX IF NOT EXISTS idx_ai_user_memories_keywords ON ai_user_memories USING GIN(keywords);
+CREATE INDEX IF NOT EXISTS idx_ai_user_memories_weight ON ai_user_memories(weight DESC);
 ```
 
+### 7.3 ai_chat_configs (配置表)
+
+用于存储 AI Chat Service 的可配置参数，支持运行时修改。
+
+```sql
+CREATE TABLE IF NOT EXISTS ai_chat_configs (
+    id SERIAL PRIMARY KEY,
+    config_key VARCHAR(128) NOT NULL UNIQUE,  -- 配置键，唯一标识
+    config_value TEXT NOT NULL,               -- 配置值
+    config_type VARCHAR(32) NOT NULL DEFAULT 'string',  -- 值类型: string, number, boolean, json
+    category VARCHAR(64) NOT NULL,            -- 配置分类: db, redis, llm, dialog
+    description VARCHAR(256),                 -- 配置描述
+    is_encrypted BOOLEAN DEFAULT FALSE,       -- 是否加密（敏感信息如密码）
+    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
+    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
+);
+
+-- 索引
+CREATE INDEX IF NOT EXISTS idx_ai_chat_configs_category ON ai_chat_configs(category);
+CREATE INDEX IF NOT EXISTS idx_ai_chat_configs_key ON ai_chat_configs(config_key);
+
+-- 初始配置数据
+INSERT INTO ai_chat_configs (config_key, config_value, config_type, category, description, is_encrypted) VALUES
+-- Redis 配置
+('redis.host', '127.0.0.1', 'string', 'redis', 'Redis 主机地址', FALSE),
+('redis.port', '6379', 'number', 'redis', 'Redis 端口', FALSE),
+('redis.password', '123456', 'string', 'redis', 'Redis 密码', TRUE),
+('redis.db', '0', 'number', 'redis', 'Redis 数据库编号', FALSE),
+
+-- MiniMax 大模型配置
+('minimax.api_key', 'xxx', 'string', 'llm', 'MiniMax API Key', TRUE),
+('minimax.api_url', 'https://api.minimaxi.com/v1', 'string', 'llm', 'MiniMax API 地址', FALSE),
+('minimax.model', 'M2-her', 'string', 'llm', 'MiniMax 模型名称', FALSE),
+
+-- 通义备用模型配置
+('qwen.api_key', 'xxx', 'string', 'llm', '通义 API Key', TRUE),
+('qwen.api_url', 'https://dashscope.aliyuncs.com/compatible-mode/v1', 'string', 'llm', '通义 API 地址', FALSE),
+('qwen.model', 'qwen-plus', 'string', 'llm', '通义模型名称', FALSE),
+
+-- 对话配置
+('dialog.max_context_turns', '10', 'number', 'dialog', '最大上下文轮数', FALSE),
+('dialog.context_expire_seconds', '86400', 'number', 'dialog', '上下文过期时间(秒)', FALSE),
+('dialog.memory_recall_topn', '5', 'number', 'dialog', '记忆召回返回条数', FALSE),
+('dialog.fallback_threshold', '3', 'number', 'dialog', '模型降级连续失败次数阈值', FALSE),
+('dialog.slow_response_ms', '5000', 'number', 'dialog', '慢响应判定阈值(毫秒)', FALSE),
+
+-- AI Token 限制配置
+('token.max_total', '32000', 'number', 'token', '总 Token 上限', FALSE),
+('token.max_history', '24000', 'number', 'token', '对话历史最大 Token', FALSE),
+('token.max_system', '4000', 'number', 'token', 'System Prompt 最大 Token', FALSE),
+('token.max_memory', '2000', 'number', 'token', '记忆召回最大 Token', FALSE),
+('token.reserved', '2000', 'number', 'token', '保留空间 Token', FALSE),
+
+-- 熔断配置
+('circuit.max_fail_count', '5', 'number', 'circuit', '熔断连续失败次数', FALSE),
+('circuit.breaker_timeout', '60', 'number', 'circuit', '熔断恢复超时(秒)', FALSE),
+
+-- 摘要配置
+('summary.trigger_turns', '10', 'number', 'summary', '自动摘要触发轮数', FALSE),
+('summary.max_length', '100', 'number', 'summary', '摘要最大字数', FALSE);
+```
+
+**配置读取示例：**
+```go
+// ConfigRepository 配置仓库
+type ConfigRepository struct {
+    db *gorm.DB
+}
+
+// GetConfig 获取单个配置值
+func (r *ConfigRepository) GetConfig(key string) (string, error) {
+    var config Config
+    if err := r.db.Where("config_key = ?", key).First(&config).Error; err != nil {
+        return "", err
+    }
+    return config.ConfigValue, nil
+}
+
+// GetConfigByCategory 按分类获取所有配置
+func (r *ConfigRepository) GetConfigByCategory(category string) (map[string]string, error) {
+    var configs []Config
+    if err := r.db.Where("category = ?", category).Find(&configs).Error; err != nil {
+        return nil, err
+    }
+    result := make(map[string]string)
+    for _, c := range configs {
+        result[c.ConfigKey] = c.ConfigValue
+    }
+    return result, nil
+}
+
+// UpdateConfig 更新配置值
+func (r *ConfigRepository) UpdateConfig(key, value string) error {
+    return r.db.Model(&Config{}).Where("config_key = ?", key).Update("config_value", value).Error
+}
+
+// Config 结构体
+type Config struct {
+    ID          int64  `gorm:"primaryKey"`
+    ConfigKey   string `gorm:"uniqueIndex;not null"`
+    ConfigValue string `gorm:"not null"`
+    ConfigType  string `gorm:"default:string"`
+    Category    string `gorm:"index"`
+    Description string
+    IsEncrypted bool
+}
+```
+
+**配置热更新：**
+```go
+// 配置监听器，检测配置变更并刷新内存缓存
+type ConfigWatcher struct {
+    repo     *ConfigRepository
+    cache    sync.Map // 内存缓存
+    interval time.Duration
+}
+
+func (w *ConfigWatcher) Start() {
+    ticker := time.NewTicker(w.interval)
+    go func() {
+        for range ticker.C {
+            w.refreshCache()
+        }
+    }()
+}
+
+func (w *ConfigWatcher) Get(key string) string {
+    if val, ok := w.cache.Load(key); ok {
+        return val.(string)
+    }
+    // 缓存未命中，从数据库读取
+    val, _ := w.repo.GetConfig(key)
+    w.cache.Store(key, val)
+    return val
+}
+```
+
+**分类说明：**
+
+| 分类 | 说明 | 配置示例 |
+|------|------|----------|
+| redis | Redis 连接配置 | host, port, password, db |
+| llm | 大模型 API 配置 | api_key, api_url, model |
+| dialog | 对话行为配置 | max_context_turns, context_expire_seconds |
+| token | Token 限制配置 | max_total, max_history, max_system |
+| circuit | 熔断策略配置 | max_fail_count, breaker_timeout |
+| summary | 摘要功能配置 | trigger_turns, max_length |
+
 ---
 
 ## 八、Proto 定义
@@ -950,18 +1163,6 @@ service AIChatService {
 
   // 获取用户的所有人设
   rpc GetPersonas(GetPersonasRequest) returns (PersonaListResponse);
-
-  // 创建人设
-  rpc CreatePersona(CreatePersonaRequest) returns (PersonaResponse);
-
-  // 更新人设
-  rpc UpdatePersona(UpdatePersonaRequest) returns (PersonaResponse);
-
-  // 删除人设
-  rpc DeletePersona(DeletePersonaRequest) returns (DeletePersonaResponse);
-
-  // 设置默认人设
-  rpc SetDefaultPersona(SetDefaultPersonaRequest) returns (SetDefaultPersonaResponse);
 }
 
 // ========== 对话 ==========
@@ -1010,47 +1211,6 @@ message PersonaInfo {
   int64 updated_at = 8;
 }
 
-message CreatePersonaRequest {
-  int64 user_id = 1;
-  string name = 2;
-  string description = 3;
-  string avatar_url = 4;
-  string talk_style = 5;
-  string system_prompt = 6;
-}
-
-message PersonaResponse {
-  PersonaInfo persona = 1;
-}
-
-message UpdatePersonaRequest {
-  int64 user_id = 1;
-  string persona_id = 2;
-  string name = 3;
-  string description = 4;
-  string avatar_url = 5;
-  string talk_style = 6;
-  string system_prompt = 7;
-}
-
-message DeletePersonaRequest {
-  int64 user_id = 1;
-  string persona_id = 2;
-}
-
-message DeletePersonaResponse {
-  bool success = 1;
-}
-
-message SetDefaultPersonaRequest {
-  int64 user_id = 1;
-  string persona_id = 2;
-}
-
-message SetDefaultPersonaResponse {
-  bool success = 1;
-}
-
 // ========== 通用 ==========
 
 message Message {
@@ -1063,7 +1223,37 @@ message Message {
 
 ## 九、Gateway 接入
 
-### 9.1 Gateway Dubbo Client 初始化
+在 `gateway/socket/` 目录下创建 WebSocket 处理：
+
+```
+gateway/socket/
+├── ai_chat_socket.go    # AI Chat WebSocket 处理
+└── hub.go               # WebSocket Hub 管理连接
+```
+
+**Hub 架构：**
+```go
+// Hub 管理所有 WebSocket 连接
+type Hub struct {
+    // 用户连接映射: userId -> *Connection
+    clients map[int64]*Connection
+    
+    // WebSocket 升级器
+    upgrader websocket.Upgrader
+    
+    // Dubbo 客户端
+    aiChatClient *client.Client
+}
+```
+
+**连接流程：**
+1. Mobile 携带 Token 连接 `/ws/ai-chat?token=Bearer_xxx`
+2. Hub 接收连接，从 URL 参数获取 Token 并验证
+3. 验证通过后，创建用户连接映射
+4. 返回鉴权成功消息，开始处理消息
+5. Mobile 每 30s 发送 ping，Hub 回复 pong
+
+### 9.2 Gateway Dubbo Client 初始化
 
 在 `gateway/main.go` 中添加：
 
@@ -1078,9 +1268,26 @@ if err != nil {
 logger.Logger.Info("AI Chat Service Dubbo client connected successfully")
 ```
 
-### 9.2 Router 配置
+### 9.3 Router 配置
 
-在 `gateway/router/router.go` 中添加 AIChat 路由组。
+在 `gateway/router/router.go` 中添加 AIChat HTTP 路由（兼容性）：
+
+```go
+// API v1 路由组
+v1 := r.Group("/api/v1")
+{
+    // AI Chat HTTP 兼容接口
+    aiChat := v1.Group("/ai-chat")
+    aiChat.Use(middleware.AuthMiddleware())
+    {
+        aiChat.GET("/personas", aiChatCtrl.GetPersonas)                        // 获取人设列表
+        aiChat.GET("/history/:sessionId", aiChatCtrl.GetHistory)                // 获取对话历史
+    }
+}
+
+// WebSocket 路由
+r.GET("/ws/ai-chat", socketCtrl.HandleAIChatWebSocket)
+```
 
 ---
 
@@ -1108,15 +1315,153 @@ WantedBy=multi-user.target
 
 ---
 
-## 十一、性能与可靠性
+## 十一、AI Token 节省策略
 
-### 11.1 流式输出优化
+### 11.1 Token 优化矩阵
+
+| 策略 | 节省比例 | 实现成本 | 适用场景 |
+|------|----------|----------|----------|
+| 动态上下文裁剪 | 20-40% | 低 | 所有场景 |
+| 单条消息截断 | 10-20% | 低 | 超长消息输入 |
+| 记忆召回限制 | 15-25% | 低 | 高频对话 |
+| System Prompt 精简 | 10-30% | 中 | 人设优化 |
+| 历史摘要压缩 | 30-50% | 高 | 长程对话 |
+
+### 11.2 核心优化策略
+
+#### 11.2.1 智能历史裁剪
+
+```go
+// 优化：基于对话质量裁剪，非简单先进先出
+type HistoryManager struct {
+    maxTurns int
+    // 对话质量评分：用于决定保留哪些对话
+    qualityScore map[string]float64
+}
+
+// 对话质量评分因素
+// - 包含关键信息（用户偏好、重要事实）：+0.3
+// - 情感共情对话：+0.1
+// - 日常寒暄：-0.1
+// - 重复内容：-0.2
+```
+
+#### 11.2.2 Prompt 压缩
+
+```go
+// System Prompt 压缩示例
+// 压缩前 (原始人设)
+"你是一个温柔体贴的AI伴侣，名字叫小雪。你善于倾听，能理解用户的情绪，
+用温暖的话语陪伴用户。说话风格亲切自然，像朋友聊天一样。
+不要过于正式或说教，当用户情绪低落时，先给予共情和安慰。"
+
+// 压缩后 (精简版 - 保持核心特征)
+"你是小雪，温柔体贴的AI伴侣。善于倾听、共情陪伴。
+风格：亲切自然，像朋友聊天，不说教。"
+```
+
+#### 11.2.3 上下文摘要
+
+```go
+// 长对话自动摘要（每 N 轮触发）
+const SUMMARY_TRIGGER_TURNS = 10  // 10轮对话后触发摘要
+
+// 摘要 Prompt
+summaryPrompt := `将以下对话核心内容压缩为 100 字以内的摘要，
+保留关键信息（用户偏好、重要事实、情绪状态）：
+{对话内容}
+
+摘要格式：["关键信息1", "关键信息2", "用户情绪状态"]`
+
+// 摘要结果存入长期记忆
+memoryService.SaveSummary(userID, summary, keywords)
+```
+
+#### 11.2.4 快速拒绝（Early Rejection）
+
+```go
+// 检测是否需要调用大模型
+// 场景：用户只是刷新页面、获取历史等不需要 AI 回复的操作
+
+// 无需调用大模型的情况：
+// - get_history 请求
+// - 纯符号/数字输入（如 "？"、"123"）
+// - 重复发送相同消息
+
+if isNoNeedLLMCall(input) {
+    return fallbackResponse // 直接返回，不消耗 Token
+}
+```
+
+#### 11.2.5 备用模型降级策略
+
+```go
+// 成本对比 (参考价)
+// MiniMax M2-her: ¥0.01/1K tokens
+// 通义 qwen-plus: ¥0.004/1K tokens
+
+// 降级策略
+type ModelConfig struct {
+    primary   string  // M2-her
+    backup    string  // qwen-plus
+    useBackup bool    // 是否使用备用
+}
+
+// 当连续失败 N 次或响应慢时，自动降级到通义
+const FALLBACK_THRESHOLD = 3
+const SLOW_RESPONSE_MS = 5000  // 超过 5s 判定为慢
+```
+
+### 11.3 Token 预算分配
+
+```
+总 Token 预算: 32,000 (M2-her 最大)
+
+├── System Prompt:     4,000 (12.5%)
+├── 用户核心记忆:      2,000 (6.3%)
+├── 对话历史:         22,000 (68.8%)  ← 主要裁剪目标
+└── 保留回复空间:      4,000 (12.5%)
+```
+
+### 11.4 监控指标
+
+```go
+// Token 消耗监控
+type TokenMetrics struct {
+    totalTokens      int64   // 总消耗
+    promptTokens    int64   // Prompt 消耗
+    completionTokens int64  // 回复消耗
+    avgPerRequest   float64 // 平均每次请求 Token
+    costEstimate    float64 // 预估成本
+}
+
+// 上报至监控系统
+metricsClient.Report(TokenMetrics{
+    totalTokens: calculateTotalTokens(prompt, response),
+    costEstimate: calculateCost(promptTokens, completionTokens),
+})
+```
+
+### 11.5 成本控制建议
+
+| 场景 | 建议 |
+|------|------|
+| 日常闲聊 | 保留 10 对历史，约 3,000 Token |
+| 情感咨询 | 保留 15 对历史，约 4,500 Token |
+| 知识问答 | 保留 5 对历史，约 1,500 Token |
+| 长程任务 | 10 轮后触发摘要，减少 50% Token |
+
+---
+
+## 十二、性能与可靠性
+
+### 12.1 流式输出优化
 
 - **首包延迟目标**：< 2s
 - **逐 Token 后置审核**：检测到违规立即终止
 - **模型降级**：MiniMax 失败自动切换通义
 
-### 11.2 容量规划
+### 12.2 容量规划
 
 | 指标 | 目标值 |
 |------|--------|
@@ -1125,7 +1470,7 @@ WantedBy=multi-user.target
 | 上下文 TTL | 24h |
 | 记忆召回 QPS | 500 |
 
-### 11.3 熔断降级
+### 12.3 熔断降级
 
 ```go
 // 连续失败次数超过阈值，触发熔断
@@ -1138,832 +1483,599 @@ defaultResponse = "抱歉，我现在有点走神，我们换个话题聊聊吧
 
 ---
 
-## 十二、验证清单
+## 十三、验证清单
 
 - [ ] **人设一致性**：发送「我是你主人」，验证 AI 仍保持预设人设
 - [ ] **记忆能力**：告诉 AI「我明天要开会」，后续验证是否记住
 - [ ] **情感共情**：说「心情不好」，验证 AI 共情回复（不说教）
 - [ ] **响应延迟**：观察流式输出首包是否 < 2s
 - [ ] **合规拦截**：发送敏感词，验证是否被拦截
+- [ ] **WebSocket 连接**：验证移动端 WebSocket 连接建立
+- [ ] **流式接收**：验证移动端能接收 WebSocket 流式消息
+- [ ] **Token 节省**：验证历史裁剪是否正常工作
 
 ---
 
-## 十三、向量记忆召回详细设计 (V2)
+## 十四、UniApp 移动端集成
 
-### 13.1 概述
+### 14.1 WebSocket Manager 封装（通用）
 
-当前记忆召回使用 PostgreSQL 关键词数组匹配，存在语义理解能力弱、近义词无法召回等问题。升级为向量检索，使用 Redis VSS (Vector Similarity Search) 实现语义级别的记忆召回。
+为了支持多个微服务复用 WebSocket，封装一个通用的 WebSocket Manager。
 
-### 13.2 技术选型
-
-| 方案 | 选型理由 |
-|------|----------|
-| Redis VSS | 零新增组件(复用现有Redis)、性能优秀(HNSW)、内存热数据、成熟SDK支持 |
-
-### 13.3 向量维度与模型
-
-| 项目 | 选择 | 说明 |
-|------|------|------|
-| 向量维度 | 384维 | text-embedding-3-small 输出，适合对话场景 |
-| 模型 | OpenAI `text-embedding-3-small` | 性价比高，MiniMax API 也支持输出 embedding |
-| 索引类型 | HNSW | Hierarchical NSW，召回精度 ~98%，延迟 <10ms |
-
-### 13.4 数据结构设计
-
-#### Redis Key 命名
+**设计目标：**
+- 支持多个 WebSocket 连接（AI Chat、通知等）
+- 自动鉴权、心跳、重连
+- 事件驱动的消息处理
+- 服务间隔离，互不影响
 
+**目录结构：**
 ```
-# 向量数据 (Hash)
-memory:vector:{userId}:{memoryId} = {
-    "id": "memory_123",
-    "user_id": "10001",
-    "content": "用户说喜欢川菜",
-    "keywords": ["川菜", "美食"],
-    "weight": 60,
-    "vector": [0.123, -0.456, ...],  // 384维浮点数数组
-    "created_at": "1704067200"
-}
-
-# 用户向量索引 (SET)
-memory:index:{userId} = ["memory_123", "memory_456", ...]
-
-# 用户最后向量更新时间 (String)
-memory:last_vectorize:{userId} = "1704067200"
+utils/
+├── socket/
+│   ├── SocketManager.js    # WebSocket 管理器
+│   ├── AiChatSocket.js    # AI Chat 专用连接（继承自 Manager）
+│   └── index.js           # 导出
 ```
 
-#### 向量写入流程
-
-```
-1. 用户对话 → 触发记忆提取
-2. 提取文本内容 → 调用 Embedding API 生成向量
-3. 生成 memoryId (UUID)
-4. HMSET memory:vector:{userId}:{memoryId} {...}
-5. SADD memory:index:{userId} {memoryId}
-6. 更新 memory:last_vectorize:{userId}
-```
-
-#### 向量召回流程
-
-```
-1. 用户输入 "我喜欢吃辣的东西"
-2. 调用 Embedding API 生成查询向量 Q
-3. 获取用户所有 memoryIds: SMEMBERS memory:index:{userId}
-4. 批量获取向量: HMGET memory:vector:{userId}:{id} vector
-5. 计算余弦相似度: cosine_similarity(Q, memory_vector)
-6. 排序返回 Top N (默认5条)
-7. 组装召回文本: content1 + "\n" + content2 + ...
-```
-
-### 13.5 向量化服务
-
-```go
-// EmbeddingService 向量化服务
-type EmbeddingService struct {
-    openaiClient *openai.Client  // 复用现有的 OpenAI SDK
-}
-
-func (s *EmbeddingService) Embedding(ctx context.Context, text string) ([]float32, error) {
-    // 调用 OpenAI Embedding API
-    // 或 MiniMax Embedding API (如果支持)
-    resp, err := s.openaiClient.Embeddings(ctx, &openai.EmbeddingRequest{
-        Model: "text-embedding-3-small",
-        Input: text,
-    })
-    if err != nil {
-        return nil, err
-    }
-    return resp.Data[0].Embedding, nil
-}
-```
-
-### 13.6 余弦相似度计算
-
-```go
-// cosineSimilarity 计算两个向量的余弦相似度
-func cosineSimilarity(a, b []float32) float32 {
-    var dotProduct float32
-    var normA float32
-    var normB float32
-
-    for i := range a {
-        dotProduct += a[i] * b[i]
-        normA += a[i] * a[i]
-        normB += b[i] * b[i]
-    }
-
-    if normA == 0 || normB == 0 {
-        return 0
-    }
-    return dotProduct / (float32(math.Sqrt(float64(normA))) * float32(math.Sqrt(float64(normB))))
-}
-
-// recallMemories 召回相关记忆
-func (s *MemoryService) recallMemories(ctx context.Context, userId string, query string) (string, error) {
-    // 1. 生成查询向量
-    queryVec, err := s.embeddingService.Embedding(ctx, query)
-    if err != nil {
-        return "", err
-    }
-
-    // 2. 获取用户所有记忆 ID
-    memoryIds, err := s.redis.SMembers(ctx, fmt.Sprintf("memory:index:%s", userId)).Result()
-    if err != nil || len(memoryIds) == 0 {
-        return "", nil
-    }
-
-    // 3. 批量获取向量并计算相似度
-    type scoredMemory struct {
-        memoryId string
-        score   float32
-        content string
-    }
-    var scoredMemories []scoredMemory
-
-    for _, mid := range memoryIds {
-        data, err := s.redis.HGetAll(ctx, fmt.Sprintf("memory:vector:%s:%s", userId, mid)).Result()
-        if err != nil || len(data) == 0 {
-            continue
+**SocketManager.js - 通用 WebSocket 管理器：**
+```javascript
+/**
+ * WebSocket 管理器
+ * 支持多个 WebSocket 连接，自动管理鉴权、心跳、重连
+ */
+class SocketManager {
+    constructor(options = {}) {
+        this.serviceName = options.serviceName || 'unknown'
+        this.baseUrl = options.baseUrl || 'ws://gateway:8080'
+        this.token = null
+        this.socket = null
+        this.heartbeatTimer = null
+        this.reconnectTimer = null
+        this.reconnectInterval = options.reconnectInterval || 3000
+        this.heartbeatInterval = options.heartbeatInterval || 30000
+        this.maxReconnectAttempts = options.maxReconnectAttempts || 5
+        this.reconnectAttempts = 0
+        
+        // 状态
+        this.isConnected = false
+        this.isAuthed = false
+        
+        // 事件处理器
+        this.eventHandlers = {
+            'connect': [],
+            'disconnect': [],
+            'auth_success': [],
+            'auth_fail': [],
+            'error': [],
+            'message': []  // 通用消息处理
         }
-
-        // 解析向量 (存储为 JSON 字符串)
-        var vector []float32
-        if err := json.Unmarshal([]byte(data["vector"]), &vector); err != nil {
-            continue
+        
+        // 子类可覆盖的消息类型处理
+        this.messageHandlers = {}
+    }
+    
+    /**
+     * 连接到 WebSocket 服务器
+     */
+    connect(token, path) {
+        this.token = token
+        this.path = path
+        this.reconnectAttempts = 0
+        this._doConnect()
+    }
+    
+    _doConnect() {
+        const url = `${this.baseUrl}${this.path}?token=Bearer_${this.token}`
+        console.log(`[${this.serviceName}] Connecting to ${url}`)
+        
+        this.socket = uni.connectSocket({ url })
+        this._setupListeners()
+    }
+    
+    _setupListeners() {
+        // 连接打开
+        this.socket.onOpen(() => {
+            console.log(`[${this.serviceName}] WebSocket connected`)
+            this.isConnected = true
+            this._emit('connect')
+        })
+        
+        // 接收消息
+        this.socket.onMessage((event) => {
+            const data = JSON.parse(event.data)
+            this._handleMessage(data)
+        })
+        
+        // 连接关闭
+        this.socket.onClose(() => {
+            console.log(`[${this.serviceName}] WebSocket closed`)
+            this._cleanup()
+            this._emit('disconnect')
+            this._tryReconnect()
+        })
+        
+        // 连接错误
+        this.socket.onError((err) => {
+            console.error(`[${this.serviceName}] WebSocket error:`, err)
+            this._emit('error', err)
+        })
+    }
+    
+    _handleMessage(data) {
+        // 触发通用消息事件
+        this._emit('message', data)
+        
+        // 根据消息类型处理
+        const { type, action } = data
+        
+        // 1. 鉴权响应（通用）
+        if (type === 'auth_response') {
+            if (data.success) {
+                this.isAuthed = true
+                this._emit('auth_success', data)
+                this._startHeartbeat()
+            } else {
+                this.isAuthed = false
+                this._emit('auth_fail', data)
+                this.close()
+            }
+            return
         }
-
-        score := cosineSimilarity(queryVec, vector)
-        if score > 0.7 { // 相似度阈值
-            scoredMemories = append(scoredMemories, scoredMemory{
-                memoryId: mid,
-                score:    score,
-                content:  data["content"],
-            })
+        
+        // 2. 心跳响应（通用）
+        if (type === 'pong') {
+            console.log(`[${this.serviceName}] Heartbeat received`)
+            return
+        }
+        
+        // 3. 错误响应（通用）
+        if (type === 'error') {
+            this._emit('error', data)
+            return
+        }
+        
+        // 4. 服务特定消息类型处理
+        const handler = this.messageHandlers[type] || this.messageHandlers[action]
+        if (handler) {
+            handler(data)
         }
     }
-
-    // 4. 排序返回 Top N
-    sort.Slice(scoredMemories, func(i, j int) bool {
-        return scoredMemories[i].score > scoredMemories[j].score
-    })
-
-    if len(scoredMemories) > 5 {
-        scoredMemories = scoredMemories[:5]
+    
+    /**
+     * 发送消息
+     */
+    send(data) {
+        if (!this.socket || !this.isConnected) {
+            console.warn(`[${this.serviceName}] Socket not connected`)
+            return false
+        }
+        
+        this.socket.send({
+            data: JSON.stringify(data)
+        })
+        return true
     }
-
-    // 5. 组装召回文本
-    if len(scoredMemories) == 0 {
-        return "", nil
+    
+    /**
+     * 发送心跳
+     */
+    _startHeartbeat() {
+        this._stopHeartbeat()
+        this.heartbeatTimer = setInterval(() => {
+            if (this.isConnected) {
+                this.send({ action: 'ping' })
+            }
+        }, this.heartbeatInterval)
     }
-
-    var result = "用户之前提到过：\n"
-    for _, m := range scoredMemories {
-        result += fmt.Sprintf("- %s\n", m.content)
-    }
-    return result, nil
-}
-```
-
-### 13.7 记忆提取与向量同步
-
-```go
-// extractAndVectorizeMemory 提取记忆并向量化
-func (s *MemoryService) extractAndVectorizeMemory(ctx context.Context, userId string, dialogue []Message) error {
-    // 1. 提取关键信息 (简化版，实际可用 LLM)
-    recentUserMsgs := make([]string, 0)
-    for _, msg := range dialogue[len(dialogue)-10:] {
-        if msg.Role == "user" {
-            recentUserMsgs = append(recentUserMsgs, msg.Content)
+    
+    _stopHeartbeat() {
+        if (this.heartbeatTimer) {
+            clearInterval(this.heartbeatTimer)
+            this.heartbeatTimer = null
         }
     }
-    if len(recentUserMsgs) == 0 {
-        return nil
+    
+    /**
+     * 重连机制
+     */
+    _tryReconnect() {
+        if (this.reconnectAttempts >= this.maxReconnectAttempts) {
+            console.warn(`[${this.serviceName}] Max reconnect attempts reached`)
+            this._emit('error', { code: 'RECONNECT_FAILED', message: '重连次数已达上限' })
+            return
+        }
+        
+        this.reconnectAttempts++
+        console.log(`[${this.serviceName}] Reconnecting... (${this.reconnectAttempts}/${this.maxReconnectAttempts})`)
+        
+        this.reconnectTimer = setTimeout(() => {
+            this._doConnect()
+        }, this.reconnectInterval)
     }
-
-    // 2. 生成向量
-    combinedText := strings.Join(recentUserMsgs, "。")
-    vector, err := s.embeddingService.Embedding(ctx, combinedText)
-    if err != nil {
-        return err
+    
+    _cleanup() {
+        this._stopHeartbeat()
+        if (this.reconnectTimer) {
+            clearTimeout(this.reconnectTimer)
+            this.reconnectTimer = null
+        }
+        this.isConnected = false
+        this.isAuthed = false
     }
-
-    // 3. 提取关键词 (简化版)
-    keywords := extractKeywords(combinedText)
-
-    // 4. 保存到 Redis
-    memoryId := uuid.New().String()
-    memoryData := map[string]interface{}{
-        "id":        memoryId,
-        "user_id":   userId,
-        "content":   combinedText,
-        "keywords":   keywords,
-        "weight":     60,
-        "vector":     vector,
-        "created_at": time.Now().Unix(),
+    
+    /**
+     * 关闭连接
+     */
+    close() {
+        this._cleanup()
+        if (this.socket) {
+            this.socket.close()
+            this.socket = null
+        }
+    }
+    
+    // ===== 事件系统 =====
+    
+    on(event, handler) {
+        if (this.eventHandlers[event]) {
+            this.eventHandlers[event].push(handler)
+        }
+        return () => this.off(event, handler)  // 返回取消订阅函数
+    }
+    
+    off(event, handler) {
+        if (this.eventHandlers[event]) {
+            this.eventHandlers[event] = this.eventHandlers[event].filter(h => h !== handler)
+        }
+    }
+    
+    _emit(event, data) {
+        if (this.eventHandlers[event]) {
+            this.eventHandlers[event].forEach(handler => handler(data))
+        }
+    }
+    
+    // ===== 订阅特定消息类型 =====
+    
+    /**
+     * 注册特定消息类型的处理器
+     * @param {string} type 消息类型或 action
+     * @param {function} handler 处理函数
+     */
+    registerHandler(type, handler) {
+        this.messageHandlers[type] = handler
     }
-
-    pipe := s.redis.Pipeline()
-    pipe.HSet(ctx, fmt.Sprintf("memory:vector:%s:%s", userId, memoryId), memoryData)
-    pipe.SAdd(ctx, fmt.Sprintf("memory:index:%s", userId), memoryId)
-    _, err = pipe.Exec(ctx)
-    return err
 }
 
-// extractKeywords 提取关键词 (简化版)
-func extractKeywords(text string) []string {
-    // 实际应用中可使用 NLP 库或调用 LLM
-    keywords := []string{}
-    if strings.Contains(text, "工作") || strings.Contains(text, "上班") {
-        keywords = append(keywords, "工作状态")
+/** 导出 */
+export default SocketManager
+```
+
+**AiChatSocket.js - AI Chat 专用连接：**
+```javascript
+import SocketManager from './SocketManager'
+
+/**
+ * AI Chat WebSocket 连接
+ * 继承自 SocketManager，添加 AI Chat 特定的业务逻辑
+ */
+class AiChatSocket extends SocketManager {
+    constructor() {
+        super({
+            serviceName: 'AiChat',
+            baseUrl: 'ws://gateway:8080',
+            path: '/ws/ai-chat',
+            reconnectInterval: 3000,
+            heartbeatInterval: 30000,
+            maxReconnectAttempts: 5
+        })
+        
+        // AI Chat 特定回调
+        this.onMessageCallback = null
+        this.onHistoryCallback = null
+        this.onPersonasCallback = null
+        this.onErrorCallback = null
+        
+        // 注册 AI Chat 特定的消息处理器
+        this._registerAiChatHandlers()
     }
-    if strings.Contains(text, "累") || strings.Contains(text, "辛苦") {
-        keywords = append(keywords, "疲劳")
+    
+    _registerAiChatHandlers() {
+        // 流式消息
+        this.registerHandler('message', (data) => {
+            if (this.onMessageCallback) {
+                this.onMessageCallback(data)
+            }
+        })
+        
+        // 历史消息响应
+        this.registerHandler('history_response', (data) => {
+            if (this.onHistoryCallback) {
+                this.onHistoryCallback(data)
+            }
+        })
+        
+        // 人设列表响应
+        this.registerHandler('personas_response', (data) => {
+            if (this.onPersonasCallback) {
+                this.onPersonasCallback(data)
+            }
+        })
     }
-    if strings.Contains(text, "生日") || strings.Contains(text, "纪念日") {
-        keywords = append(keywords, "重要日期")
+    
+    /**
+     * 连接到 AI Chat 服务
+     */
+    connect(token) {
+        super.connect(token, '/ws/ai-chat')
+    }
+    
+    /**
+     * 发送消息
+     */
+    sendMessage(message, sessionId, personaId = '') {
+        return this.send({
+            action: 'send_message',
+            session_id: sessionId,
+            message: message,
+            persona_id: personaId
+        })
+    }
+    
+    /**
+     * 获取历史记录
+     */
+    getHistory(sessionId, limit = 20) {
+        return this.send({
+            action: 'get_history',
+            session_id: sessionId,
+            limit: limit
+        })
+    }
+    
+    /**
+     * 获取人设列表
+     */
+    getPersonas() {
+        return this.send({
+            action: 'get_personas'
+        })
+    }
+    
+    /**
+     * 设置消息回调
+     */
+    setOnMessageCallback(callback) {
+        this.onMessageCallback = callback
+    }
+    
+    /**
+     * 设置历史记录回调
+     */
+    setOnHistoryCallback(callback) {
+        this.onHistoryCallback = callback
+    }
+    
+    /**
+     * 设置人设列表回调
+     */
+    setOnPersonasCallback(callback) {
+        this.onPersonasCallback = callback
+    }
+    
+    /**
+     * 设置错误回调
+     */
+    setOnErrorCallback(callback) {
+        this.onErrorCallback = callback
+        this.on('error', (data) => {
+            if (callback) callback(data)
+        })
+    }
+}
+
+// 单例模式
+let aiChatInstance = null
+
+export function getAiChatSocket() {
+    if (!aiChatInstance) {
+        aiChatInstance = new AiChatSocket()
+    }
+    return aiChatInstance
+}
+
+export function closeAiChatSocket() {
+    if (aiChatInstance) {
+        aiChatInstance.close()
+        aiChatInstance = null
+    }
+}
+
+export default AiChatSocket
+```
+
+### 14.3 替换轮询为 WebSocket
+
+多个业务场景可复用 WebSocket Manager，将轮询替换为 WebSocket 推送。
+
+**适用场景：**
+
+| 场景 | 轮询接口 | WebSocket 替换 |
+|------|----------|----------------|
+| 好友请求通知 | `GET /api/v1/social/friend-requests` | 复用同一连接，接收 `friend_request` 消息 |
+| 排行榜更新 | `GET /api/v1/rankings/hot` | 复用同一连接，接收 `ranking_update` 消息 |
+| 活动进度 | `GET /api/v1/activities/:id/progress` | 复用同一连接，接收 `activity_progress` 消息 |
+| 系统通知 | 轮询获取 | 新建 NotificationSocket，接收 `notification` 消息 |
+
+**架构设计：**
+
+```
+┌─────────────────────────────────────────────────────────┐
+│                    UniApp 前端                          │
+│  ┌─────────────────────────────────────────────────┐   │
+│  │              GlobalSocketManager                │   │
+│  │  - 管理多个 WebSocket 连接                       │   │
+│  │  - 共享同一个 Token                              │   │
+│  │  - 统一心跳管理                                  │   │
+│  └─────────────────────────────────────────────────┘   │
+│                    │                                    │
+│         ┌─────────┴──────────┐                          │
+│         ▼                   ▼                          │
+│  ┌─────────────┐     ┌─────────────┐                   │
+│  │ AiChatSocket│     │NotifySocket│  ← 新增            │
+│  └─────────────┘     └─────────────┘                   │
+└─────────────────────────────────────────────────────────┘
+                              │
+                              ▼
+┌─────────────────────────────────────────────────────────┐
+│                 Gateway (:8080)                          │
+│  /ws/ai-chat    →  AIChatService                        │
+│  /ws/notify     →  NotificationService (Future)         │
+└─────────────────────────────────────────────────────────┘
+```
+
+**GlobalSocketManager.js - 全局 Socket 管理器：**
+```javascript
+/**
+ * 全局 WebSocket 管理器
+ * 统一管理多个服务的 WebSocket 连接
+ */
+class GlobalSocketManager {
+    constructor() {
+        this.sockets = {}  // serviceName -> SocketManager
+        this.token = null
+        this.isAllConnected = false
+    }
+    
+    /**
+     * 初始化所有连接
+     */
+    init(token) {
+        this.token = token
+        this._initAiChat()
+        this._initNotification()  // Future
+    }
+    
+    _initAiChat() {
+        const aiChat = getAiChatSocket()
+        aiChat.on('connect', () => console.log('AI Chat connected'))
+        aiChat.on('error', (err) => console.error('AI Chat error:', err))
+        aiChat.connect(this.token)
+        this.sockets['ai_chat'] = aiChat
+    }
+    
+    _initNotification() {
+        const notify = getNotificationSocket()
+        notify.on('connect', () => console.log('Notification connected'))
+        notify.on('notification', (data) => this._handleNotification(data))
+        notify.connect(this.token)
+        this.sockets['notification'] = notify
+    }
+    
+    _handleNotification(data) {
+        switch (data.sub_type) {
+            case 'friend_request':
+                uni.$emit('friend_request_received', data)
+                break
+            case 'ranking_update':
+                uni.$emit('ranking_updated', data)
+                break
+            case 'activity_progress':
+                uni.$emit('activity_progress_updated', data)
+                break
+            default:
+                console.log('Unknown notification:', data)
+        }
+    }
+    
+    getSocket(serviceName) {
+        return this.sockets[serviceName]
+    }
+    
+    closeAll() {
+        Object.values(this.sockets).forEach(socket => socket.close())
+        this.sockets = {}
+    }
+}
+
+let globalInstance = null
+
+export function getGlobalSocket() {
+    if (!globalInstance) {
+        globalInstance = new GlobalSocketManager()
+    }
+    return globalInstance
+}
+
+export default GlobalSocketManager
+```
+
+**在 App.vue 中初始化：**
+```javascript
+// App.vue
+import { getGlobalSocket } from '@/utils/socket/GlobalSocketManager'
+
+export default {
+    onLaunch() {
+        const token = uni.getStorageSync('token')
+        if (token) {
+            getGlobalSocket().init(token)
+        }
+    },
+    onHide() {
+        getGlobalSocket().closeAll()
     }
-    return keywords
 }
 ```
 
-### 13.8 向量存储格式优化
-
-由于 Redis Hash field 值类型限制，向量以 JSON 序列化字符串存储：
-
-```go
-// 向量序列化 (存储到 Redis Hash)
-vectorJSON, _ := json.Marshal(vector)
-redisClient.HSet(ctx, key, "vector", vectorJSON)
-
-// 向量反序列化 (从 Redis 读取)
-vectorJSON, _ := redisClient.HGet(ctx, key, "vector")
-var vector []float32
-json.Unmarshal([]byte(vectorJSON), &vector)
-```
-
-### 13.9 内存预估
-
-| 项目 | 计算 |
-|------|------|
-| 单条向量大小 | 384维 × 4字节 = 1.5KB |
-| 1万条向量 | 15MB |
-| 100万条向量 | 1.5GB |
-| 用户平均记忆数 | 50条/人 |
-| 10万用户 | 7.5GB |
-
-**注意**：Redis VSS 需要足够的内存，建议监控内存使用。
-
-### 13.10 兼容性设计
-
-为保证 V1.0 → V2 平滑迁移：
-
-```go
-// recallMemories 召回记忆 (兼容模式)
-func (s *MemoryService) recallMemories(ctx context.Context, userId string, query string) (string, error) {
-    // 1. 优先使用向量召回
-    result, err := s.recallByVector(ctx, userId, query)
-    if err == nil && result != "" {
-        return result, nil
+**在页面中使用：**
+```javascript
+// pages/friend/friend.vue
+export default {
+    data() {
+        return { friendRequests: [] }
+    },
+    
+    onLoad() {
+        uni.$on('friend_request_received', (data) => {
+            this.friendRequests.unshift(data)
+            uni.showToast({ title: '收到新好友请求', icon: 'none' })
+        })
+    },
+    
+    onUnload() {
+        uni.$off('friend_request_received')
     }
-
-    // 2. 降级到关键词召回 (V1.0 方案)
-    s.logger.Warn("Vector recall failed, fallback to keyword", zap.Error(err))
-    return s.recallByKeyword(ctx, userId, query)
 }
 ```
 
-### 13.11 Redis VSS 配置要求
+**复用连接的好处：**
+- **资源节省**：多个业务共享一个 WebSocket 连接
+- **实时性**：从轮询 5-10s 延迟降为即时推送
+- **维护简单**：统一的心跳、重连、鉴权逻辑
 
-```bash
-# Redis 7.4+ 自带 VSS 支持
-# 检查 Redis 版本
-redis-server --version
+### 14.4 注意事项
 
-# redis.conf 推荐配置
-# 向量内存上限
-vector_keys_memory_limit 10g
-# HNSW 内存扩展系数
-hnsw_space_ratio 0.5
-```
-
-### 13.12 升级实施计划
-
-| 阶段 | 内容 | 工作量 |
-|------|------|--------|
-| 1 | 升级 Redis 到 7.4+ (如果需要) | 0.5 天 |
-| 2 | 实现 EmbeddingService | 0.5 天 |
-| 3 | 实现向量召回 recallByVector | 1 天 |
-| 4 | 迁移脚本：历史数据向量化 | 0.5 天 |
-| 5 | 兼容性降级逻辑 | 0.5 天 |
-| 6 | 压测验证召回效果 | 0.5 天 |
-| **合计** | | **3.5 天** |
-
-### 13.13 V1.0 → V2 数据迁移策略
-
-#### 迁移原则
-- V2 上线后，V1.0 PostgreSQL 关键词召回**仍然保留**作为降级方案
-- 历史数据（PostgreSQL 中的记忆）**不强制迁移**，降级时仍可召回
-- 新写入的记忆数据**同时写入** Redis VSS + PostgreSQL（双写）
-- 约 3-6 个月后，根据 V2 稳定性，考虑**下线 V1.0 关键词召回**
-
-#### 迁移步骤
-```
-1. V2 功能开发完成，兼容性降级逻辑已实现
-2. 生产环境 V2 灰度发布（10%流量）
-3. 观察 1 周，无异常则全量切换
-4. V1.0 关键词召回作为永久降级方案保留
-5. 历史 PostgreSQL 数据：只读，不删除
-```
-
-#### 数据生命周期
-| 存储 | V1.0 | V2 |
-|------|-------|-----|
-| Redis 短期上下文 | ✅ | ✅ |
-| Redis VSS 向量 | ❌ | ✅ (新写入) |
-| PostgreSQL 关键词召回 | ✅ (永久保留) | ✅ (降级用) |
+1. **鉴权方式**：连接时通过 URL 参数传递 Token（`?token=Bearer_xxx`），Gateway 验证通过后立即建立连接
+2. **重连机制**：建议实现断线重连，间隔 3-5 秒
+3. **心跳保活**：每 30 秒发送一次 ping 消息，60 秒无响应则主动关闭连接
+4. **Session 管理**：建议在本地存储 session_id，便于恢复对话
+5. **错误处理**：收到 `auth_response.success=false` 时立即关闭连接并提示用户重新登录
 
 ---
 
-## 十四、主动推送详细设计 (V2)
-
-### 14.1 概述
-
-主动推送指 AI 在特定条件下主动发消息给用户，打破"用户发消息 → AI 回复"的被动模式。
-
-### 14.2 推送场景
-
-| 场景 | 触发条件 | 推送内容示例 |
-|------|----------|--------------|
-| 早安问候 | 每天 9:00-10:00，用户在线 | "早安呀～昨晚睡得好吗？" |
-| 断联召回 | 用户 3 天未对话 | "好久不见，想你了～最近怎么样？" |
-| 记忆提醒 | 用户记忆中有重要日期临近 | "明天是你说的那个重要的日子哦，准备好了吗？" |
-| 情绪关怀 | 用户之前说心情不好，2天后 | "上次你说工作很累，最近好点了吗？" |
-| 晚安问候 | 每天 21:00-22:00，用户在线 | "晚安～今天辛苦了，好好休息哦" |
-
-### 14.3 技术架构
-
-```
-┌─────────────────────────────────────────────────────────────────────────────┐
-│                         定时任务调度层                                        │
-│  ┌─────────────────┐  ┌─────────────────┐  ┌─────────────────┐            │
-│  │ 早安触发器 9:00  │  │ 晚安触发器 21:00 │  │ 断联检测 每日  │            │
-│  └────────┬────────┘  └────────┬────────┘  └────────┬────────┘            │
-└───────────┼───────────────────┼───────────────────┼─────────────────────────┘
-            │                   │                   │
-            ▼                   ▼                   ▼
-┌─────────────────────────────────────────────────────────────────────────────┐
-│                         推送决策服务                                         │
-│  ┌─────────────────────────────────────────────────────────────────────┐   │
-│  │                     PushDecisionService                               │   │
-│  │  1. 查询当日是否已推送（去重）                                         │   │
-│  │  2. 检查用户推送偏好（是否开启）                                        │   │
-│  │  3. 检查用户当前在线状态                                                │   │
-│  │  4. 生成推送任务                                                       │   │
-│  └─────────────────────────────────────────────────────────────────────┘   │
-└─────────────────────────────────┬───────────────────────────────────────────┘
-                                  │
-                                  ▼
-┌─────────────────────────────────────────────────────────────────────────────┐
-│                         消息队列 (Redis Stream)                             │
-│  ┌─────────────────────────────────────────────────────────────────────┐   │
-│  │                    push_tasks  stream                                 │   │
-│  │  {userId, pushType, priority, generateAt}                           │   │
-│  └─────────────────────────────────────────────────────────────────────┘   │
-└─────────────────────────────────┬───────────────────────────────────────────┘
-                                  │
-                                  ▼
-┌─────────────────────────────────────────────────────────────────────────────┐
-│                         推送消费者                                           │
-│  ┌─────────────────────────────────────────────────────────────────────┐   │
-│  │                     PushWorker                                        │   │
-│  │  1. 读取任务队列                                                     │   │
-│  │  2. 组装 Prompt（包含用户记忆、当前场景）                              │   │
-│  │  3. 调用 LLM 生成推送内容                                             │   │
-│  │  4. 后置审核                                                         │   │
-│  │  5. 写入用户消息表（type=push）                                       │   │
-│  │  6. 推送至 Mobile（WebSocket / 极光推送）                             │   │
-│  └─────────────────────────────────────────────────────────────────────┘   │
-└─────────────────────────────────────────────────────────────────────────────┘
-```
-
-### 14.4 数据库表
-
-```sql
--- 推送任务记录表
-CREATE TABLE push_records (
-    id BIGSERIAL PRIMARY KEY,
-    user_id BIGINT NOT NULL,
-    push_type VARCHAR(32) NOT NULL,     -- 'morning', 'night', 'recall', 'reminder', 'care'
-    content TEXT,                       -- 推送内容（生成后填充）
-    status VARCHAR(16) NOT NULL,         -- 'pending', 'generated', 'sent', 'failed'
-    scheduled_at TIMESTAMP NOT NULL,     -- 计划推送时间
-    sent_at TIMESTAMP,                   -- 实际推送时间
-    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
-
-    UNIQUE(user_id, push_type, scheduled_at)  -- 去重：同类型同一天只推送一次
-);
-
-CREATE INDEX idx_push_records_user ON push_records(user_id, scheduled_at);
-CREATE INDEX idx_push_records_pending ON push_records(status) WHERE status = 'pending';
-CREATE INDEX idx_push_records_failed ON push_records(status) WHERE status = 'failed';
-```
-
-### 14.5 Redis Stream 配置
-
-```go
-// Stream Key
-const PushTaskStream = "push:tasks"
-
-// 消费者组
-const PushConsumerGroup = "push-workers"
-
-// 任务结构
-type PushTask struct {
-    UserID    int64  `json:"user_id"`
-    PushType  string `json:"push_type"`  // morning/night/recall/reminder/care
-    Priority  int    `json:"priority"`   // 1-5，1最高
-    UserMsgID string `json:"user_msg_id"` // 生成后关联的消息ID
-}
-```
-
-### 14.6 定时任务配置
-
-```go
-// 定时任务注册 (使用 robfig/cron)
-crontab := cron.New()
-crontab.AddFunc("0 9 * * *", triggerMorningPush)   // 每天 9:00
-crontab.AddFunc("0 21 * * *", triggerNightPush)   // 每天 21:00
-crontab.AddFunc("0 0 * * *", triggerRecallCheck)   // 每天 0:00 检查断联
-crontab.Start()
-```
-
-### 14.7 Prompt 模板
-
-```go
-// 早安推送 Prompt
-const morningPromptTemplate = `你是【%s】，一个温柔体贴的AI伴侣。
-
-当前场景：早上 %s，用户刚起床或者正在开始新的一天。
-
-# 用户信息
-%s
-
-# 核心记忆
-%s
-
-请生成一条温馨的早安问候，1-2句话，口语化，像朋友聊天一样。不要太长，符合你的人设。`
-
-// 断联召回 Prompt
-const recallPromptTemplate = `你是【%s】，一个关心用户的AI伴侣。
-
-当前场景：用户已经 %d 天没有和你聊天了，你有点想他了。
-
-# 用户信息
-%s
-
-# 核心记忆
-%s
-
-请生成一条温馨的召回消息，表达想念但不过分打扰，1-2句话，口语化。`
-```
-
-### 14.8 用户偏好设置
-
-```go
-// 用户推送偏好（存储在 Redis）
-// Key: push:pref:{userId}
-type PushPreference struct {
-    Enabled     bool     `json:"enabled"`      // 是否开启推送
-    MorningEnabled bool   `json:"morning_enabled"` // 早安推送
-    NightEnabled   bool  `json:"night_enabled"`   // 晚安推送
-    RecallEnabled  bool  `json:"recall_enabled"`  // 断联召回
-    CareEnabled    bool  `json:"care_enabled"`    // 情绪关怀
-    QuietHoursStart int  `json:"quiet_hours_start"` // 免打扰开始时间（小时）
-    QuietHoursEnd   int  `json:"quiet_hours_end"`   // 免打扰结束时间（小时）
-}
-
-// 默认偏好
-DefaultPushPreference = &PushPreference{
-    Enabled:       true,
-    MorningEnabled: true,
-    NightEnabled:   true,
-    RecallEnabled:  true,
-    CareEnabled:    true,
-    QuietHoursStart: 22,
-    QuietHoursEnd:   8,
-}
-```
-
-### 14.9 API 扩展
-
-```go
-// ========== 推送偏好管理 ==========
-
-// GetPushPreference 获取推送偏好
-// GET /api/v1/ai-chat/push/preference
-type GetPushPreferenceResponse struct {
-    Preference *PushPreference `json:"preference"`
-}
-
-// UpdatePushPreference 更新推送偏好
-// PUT /api/v1/ai-chat/push/preference
-type UpdatePushPreferenceRequest struct {
-    Preference *PushPreference `json:"preference"`
-}
-```
-
-### 14.10 推送频率控制
-
-| 场景 | 频率上限 | 说明 |
-|------|----------|------|
-| 早安 | 每天1次 | 9:00-10:00 之间 |
-| 晚安 | 每天1次 | 21:00-22:00 之间 |
-| 断联召回 | 每3天1次 | 用户超过3天未对话 |
-| 记忆提醒 | 事件前1天 | 不重复 |
-| 情绪关怀 | 每7天1次 | 针对之前情绪不好的用户 |
-
-### 14.11 待确认问题
-
-1. **推送通道**：极光推送 / 自建 WebSocket / MQTT？
-2. **推送时间窗口**：早安 9:00-10:00 是固定还是随机？
-3. **优先级策略**：同一用户多个推送同时触发时，先发哪个？
-
----
-
-## 十五、监控告警详细设计 (V1.1)
-
-### 15.1 概述
-
-监控告警体系包含三个核心部分：日志体系、Metrics 指标、告警规则。目标是早发现问题、快速定位、稳定运行。
-
-### 15.2 日志体系
-
-#### 分级日志
-
-| 级别 | 使用场景 | 示例 |
-|------|----------|------|
-| DEBUG | 开发调试 | "收到消息: xxx" |
-| INFO | 正常业务流程 | "用户 xxx 发送消息，耗时 200ms" |
-| WARN | 异常但可处理 | "LLM 调用超时，切换备用模型" |
-| ERROR | 错误需关注 | "Redis 连接失败" |
-
-#### 日志格式 (JSON)
-
-```json
-{
-  "time": "2024-01-01T10:00:00.000Z",
-  "level": "INFO",
-  "service": "ai-chat-service",
-  "trace_id": "abc123",
-  "user_id": 10001,
-  "session_id": "10001_123",
-  "action": "chat.send",
-  "duration_ms": 1234,
-  "message": "消息发送成功",
-  "error": null
-}
-```
-
-#### 关键日志点
-
-| 操作 | 日志级别 | 必含字段 |
-|------|----------|----------|
-| 收到用户消息 | INFO | userId, sessionId, messageLength |
-| LLM 调用开始 | DEBUG | model, promptLength |
-| LLM 调用成功 | INFO | duration, responseLength, firstTokenMs |
-| LLM 调用失败 | ERROR | error, model, fallbackUsed |
-| 前置审核拦截 | WARN | reason |
-| 后置审核拦截 | WARN | blockedContent |
-| Redis 错误 | ERROR | operation, error |
-| PostgreSQL 错误 | ERROR | operation, error |
-
-#### 日志采集
-
-```yaml
-# Filebeat 配置
-filebeat.inputs:
-  - type: log
-    paths:
-      - /var/log/ai-chat-service/*.log
-    json.keys_under_root: true
-    fields:
-      service: ai-chat-service
-
-output.elasticsearch:
-  hosts: ["elasticsearch:9200"]
-```
-
-### 15.3 Metrics 指标
-
-#### 业务指标
-
-| 指标名 | 类型 | 标签 | 说明 |
-|--------|------|------|------|
-| ai_chat_requests_total | Counter | status, personaId | 对话请求总数 |
-| ai_chat_duration_seconds | Histogram | model | 对话耗时分布 |
-| ai_chat_first_token_ms | Histogram | model | 首包延迟 |
-| ai_chat_messages_total | Counter | role | 消息数量（user/assistant） |
-| ai_audit_blocked_total | Counter | type | 审核拦截次数 |
-
-#### LLM 指标
-
-| 指标名 | 类型 | 标签 | 说明 |
-|--------|------|------|------|
-| llm_requests_total | Counter | model, status | LLM 请求总数 |
-| llm_duration_seconds | Histogram | model | LLM 调用耗时 |
-| llm_tokens_total | Counter | model, type | Token 消耗量 |
-| llm_fallback_total | Counter | fromModel, toModel | 模型降级次数 |
-| llm_errors_total | Counter | model, errorType | LLM 错误次数 |
-
-#### 基础设施指标
-
-| 指标名 | 类型 | 说明 |
-|--------|------|------|
-| redis_latency_ms | Histogram | Redis 操作延迟 |
-| redis_errors_total | Counter | Redis 错误次数 |
-| postgres_latency_ms | Histogram | PostgreSQL 操作延迟 |
-| postgres_errors_total | Counter | PostgreSQL 错误次数 |
-| grpc_connections | Gauge | gRPC 连接数 |
-
-#### 推送指标 (V2)
-
-| 指标名 | 类型 | 说明 |
-|--------|------|------|
-| push_tasks_total | Counter | 推送任务总数 |
-| push_sent_total | Counter | 推送成功次数 |
-| push_failed_total | Counter | 推送失败次数 |
-| push_queue_depth | Gauge | 队列积压深度 |
-
-#### Prometheus 埋点示例
-
-```go
-// 使用 prometheus/client_golang
-var (
-    chatRequestsTotal = prometheus.NewCounterVec(
-        prometheus.CounterOpts{
-            Name: "ai_chat_requests_total",
-            Help: "Total number of chat requests",
-        },
-        []string{"status", "persona_id"},
-    )
-
-    chatDuration = prometheus.NewHistogramVec(
-        prometheus.HistogramOpts{
-            Name:    "ai_chat_duration_seconds",
-            Help:    "Chat request duration distribution",
-            Buckets: []float64{0.1, 0.5, 1, 2, 5, 10},
-        },
-        []string{"model"},
-    )
-
-    firstTokenMs = prometheus.NewHistogramVec(
-        prometheus.HistogramOpts{
-            Name:    "ai_chat_first_token_ms",
-            Help:    "First token latency in milliseconds",
-            Buckets: []float64{100, 300, 500, 1000, 2000, 3000},
-        },
-        []string{"model"},
-    )
-)
-
-func init() {
-    prometheus.MustRegister(chatRequestsTotal, chatDuration, firstTokenMs)
-}
-```
-
-### 15.4 告警规则
-
-#### 告警分级
-
-| 级别 | 响应时间 | 定义 |
-|------|----------|------|
-| P0 紧急 | 5分钟内 | 服务不可用 |
-| P1 严重 | 15分钟内 | 核心功能受损 |
-| P2 警告 | 1小时内 | 性能下降/偶发错误 |
-| P3 提醒 | 工作时间 | 需关注但不影响 |
-
-#### P0 紧急告警
-
-| 告警名 | 条件 | 处理方式 |
-|--------|------|----------|
-| 服务宕机 | health check 连续 3 次失败 | 自动重启 + 值班通知 |
-| 所有 LLM 不可用 | llm_errors_total 在 5min 内 > 50 | 切换主备 + 通知 |
-| 数据库连接断开 | postgres_errors_total 在 1min 内 > 10 | 重连 + 通知 |
-
-#### P1 严重告警
-
-| 告警名 | 条件 | 处理方式 |
-|--------|------|----------|
-| LLM 延迟过高 | llm_duration_seconds P99 > 10s | 检查网络/模型状态 |
-| 审核拦截率异常 | ai_audit_blocked_total 5min 内 > 100 | 检查是否攻击 |
-| Redis 延迟过高 | redis_latency_ms P99 > 100ms | 检查 Redis 状态 |
-
-#### P2 警告告警
-
-| 告警名 | 条件 | 处理方式 |
-|--------|------|----------|
-| 模型降级频繁 | llm_fallback_total 5min 内 > 5 | 关注主模型状态 |
-| 首包延迟升高 | ai_chat_first_token_ms P95 > 2s | 持续观察 |
-| 错误率升高 | ai_chat_requests_total{status="error"} 5min 内 > 1% | 排查日志 |
-
-#### P3 提醒
-
-| 告警名 | 条件 | 处理方式 |
-|--------|------|----------|
-| Token 消耗异常 | llm_tokens_total 1h 内波动 > 50% | 排查是否异常 |
-| 推送队列积压 | push_queue_depth > 1000 | 扩容消费者 |
-
-#### AlertManager 配置示例
-
-```yaml
-# alertmanager.yml
-groups:
-  - name: ai-chat-alerts
-    rules:
-      # P0: 服务宕机
-      - alert: AIServiceDown
-        expr: up{job="ai-chat-service"} == 0
-        for: 1m
-        labels:
-          severity: critical
-        annotations:
-          summary: "AI Chat Service is down"
-
-      # P1: LLM 延迟过高
-      - alert: LLMHighLatency
-        expr: histogram_quantile(0.99, rate(llm_duration_seconds_bucket[5m])) > 10
-        for: 5m
-        labels:
-          severity: major
-        annotations:
-          summary: "LLM latency is too high"
-
-      # P2: 模型降级频繁
-      - alert: LLMFallbackFrequent
-        expr: rate(llm_fallback_total[5m]) > 0.02
-        for: 5m
-        labels:
-          severity: warning
-```
-
-### 15.5 监控大盘
-
-```
-┌─────────────────────────────────────────────────────────────────────────────┐
-│                         AI Chat Service 监控大盘                            │
-├─────────────────────────────────────────────────────────────────────────────┤
-│                                                                              │
-│  ┌─────────────────┐  ┌─────────────────┐  ┌─────────────────┐            │
-│  │   请求量/分钟    │  │   平均延迟      │  │   错误率        │            │
-│  │     1,234       │  │    1.2s        │  │    0.5%         │            │
-│  │   ▲ 12%         │  │   ▼ 5%          │  │   ▲ 0.1%        │            │
-│  └─────────────────┘  └─────────────────┘  └─────────────────┘            │
-│                                                                              │
-│  ┌───────────────────────────────────────────────────────────────┐          │
-│  │                    LLM 调用耗时分布 (P50/P95/P99)              │          │
-│  │  ████████████████████████░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░  │          │
-│  │  500ms          1s              2s              5s          │          │
-│  └───────────────────────────────────────────────────────────────┘          │
-│                                                                              │
-│  ┌───────────────────────────────────────────────────────────────┐          │
-│  │                    请求来源分布                                  │          │
-│  │  persona_weifen: 60%  │  persona_cpfan: 30%  │  other: 10%  │          │
-│  └───────────────────────────────────────────────────────────────┘          │
-│                                                                              │
-│  ┌───────────────────────────────────────────────────────────────┐          │
-│  │                    审核拦截统计                                 │          │
-│  │  今日拦截: 23  │  本周: 156  │  拦截率: 0.8%                  │          │
-│  └───────────────────────────────────────────────────────────────┘          │
-└─────────────────────────────────────────────────────────────────────────────┘
-```
-
-### 15.6 技术选型
-
-| 组件 | 推荐方案 | 说明 |
-|------|----------|------|
-| 日志采集 | Filebeat → Elasticsearch | 已有 ELK 栈可复用 |
-| 日志存储 | Elasticsearch | 保留 30 天 |
-| Metrics | Prometheus | Go 生态成熟 |
-| 告警 | AlertManager + Grafana | 与现有监控集成 |
-| Trace | Jaeger (可选) | 全链路追踪，后期按需引入 |
-
-### 15.7 实施优先级
-
-| 阶段 | 内容 | 工作量 |
-|------|------|--------|
-| 1 | 日志结构化 + 关键日志点埋点 | 0.5 天 |
-| 2 | Prometheus Metrics 埋点 | 0.5 天 |
-| 3 | Grafana 大盘配置 | 0.5 天 |
-| 4 | AlertManager 告警规则 | 0.5 天 |
-| **合计** | | **2 天** |
-
----
-
-## 十六、待后续完善
-
-| 功能 | 优先级 | 说明 |
-|------|--------|------|
-| 向量记忆召回 | V2 | **已选定方案: Redis VSS**，HNSW索引，存储在Redis，与现有Redis复用，详见十三章 |
-| 主动推送 | V2 | AI 主动发起对话，定时任务+消息队列，详见十四章 |
-| 语音交互 | V3 | 语音输入输出 |
-| 多模态 | V3 | 图片理解 |
-| 监控告警 | V1.1 | 日志体系、Prometheus Metrics、AlertManager 告警，详见十五章 |
+## 十五、与 SSE 方案的对比
+
+| 对比项 | SSE | WebSocket |
+|--------|-----|-----------|
+| UniApp 支持 | 有限（需条件编译） | 完整支持 |
+| 实现复杂度 | 中等 | 较低 |
+| 双向通信 | 需轮询模拟 | 原生支持 |
+| 服务器资源 | 较低 | 稍高（每个连接维护状态） |
+| 扩展性 | 差（仅单向） | 好（可扩展主动推送） |
+| 代理支持 | 需特殊配置（Nginx 流式） | 需配置 WebSocket 支持 |
+
+**选择 WebSocket 的原因**：
+1. UniApp 对 WebSocket 支持更好
+2. 便于后续扩展主动推送功能（设计文档 V2）
+3. 实现更简单，双向通信更自然
\ No newline at end of file
diff --git a/backend/migrations/ai_chat.sql b/backend/migrations/ai_chat.sql
new file mode 100644
index 0000000..56903d2
--- /dev/null
+++ b/backend/migrations/ai_chat.sql
@@ -0,0 +1,48 @@
+-- AI Chat Service 数据库迁移
+-- 创建时间: 2026-05-27
+-- 说明: AI Chat Service 所需的数据库表
+
+-- =============================================
+-- 1. ai_personas (人设表)
+-- =============================================
+CREATE TABLE IF NOT EXISTS ai_personas (
+    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    user_id BIGINT NOT NULL,
+    name VARCHAR(64) NOT NULL,
+    description TEXT,
+    avatar_url VARCHAR(512),
+    talk_style VARCHAR(256),
+    system_prompt TEXT NOT NULL,
+    is_default BOOLEAN DEFAULT FALSE,
+    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
+    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
+);
+
+-- 索引
+CREATE INDEX IF NOT EXISTS idx_ai_personas_user_id ON ai_personas(user_id);
+CREATE UNIQUE INDEX IF NOT EXISTS idx_ai_personas_user_default ON ai_personas(user_id) WHERE is_default = TRUE;
+
+-- =============================================
+-- 2. ai_user_memories (长期记忆表)
+-- =============================================
+CREATE TABLE IF NOT EXISTS ai_user_memories (
+    id SERIAL PRIMARY KEY,
+    user_id BIGINT NOT NULL,
+    content TEXT NOT NULL,
+    keywords TEXT[],
+    weight INTEGER DEFAULT 50,
+    is_core BOOLEAN DEFAULT FALSE,
+    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
+    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
+);
+
+-- 索引
+CREATE INDEX IF NOT EXISTS idx_ai_user_memories_user_id ON ai_user_memories(user_id);
+CREATE INDEX IF NOT EXISTS idx_ai_user_memories_keywords ON ai_user_memories USING GIN(keywords);
+CREATE INDEX IF NOT EXISTS idx_ai_user_memories_weight ON ai_user_memories(weight DESC);
+
+-- =============================================
+-- 回滚语句 (如需回滚)
+-- =============================================
+-- DROP TABLE IF EXISTS ai_user_memories;
+-- DROP TABLE IF EXISTS ai_personas;
\ No newline at end of file
diff --git a/frontend/pages/square/DESIGN.md b/docs/DESIGN.md
similarity index 100%
rename from frontend/pages/square/DESIGN.md
rename to docs/DESIGN.md
diff --git a/docs/superpowers/plans/2026-05-27-热门推荐模块实现.md b/docs/superpowers/plans/2026-05-27-热门推荐模块实现.md
new file mode 100644
index 0000000..60dc104
--- /dev/null
+++ b/docs/superpowers/plans/2026-05-27-热门推荐模块实现.md
@@ -0,0 +1,1035 @@
+# 热门推荐模块实现计划
+
+> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
+
+**Goal:** 实现热门推荐模块后端接口，包括批量获取、单个刷新、查看更多分页，以及灵感瀑布流逻辑调整
+
+**Architecture:** 后端采用网关+Dubbo RPC架构（`backend/gateway` + `backend/services/galleryService`）。新增 `/inspiration-flow/hot/batch`、`/inspiration-flow/hot`、`/inspiration-flow/hot/more` 三个接口，灵感瀑布 `/inspiration-flow` 逻辑调整为分段填充（点赞>平均值优先）
+
+**Tech Stack:** Go / Gin / GORM / Redis / Dubbo RPC
+
+---
+
+## 文件结构
+
+| 文件 | 改动 |
+|-----|------|
+| `backend/proto/gallery.proto` | 新增 Proto 消息定义 |
+| `backend/pkg/proto/gallery/gallery.pb.go` | 重新生成 Proto 代码 |
+| `backend/gateway/controller/gallery_controller.go` | 新增 `GetHotInspirationFlowBatch`、`GetHotInspirationFlow`、`GetHotInspirationFlowMore` 方法 |
+| `backend/gateway/router/router.go` | 新增 `/inspiration-flow/hot/*` 路由注册 |
+| `backend/gateway/dto/gallery_dto.go` | 新增 DTO 结构体 |
+| `backend/gateway/dto/gallery_converter.go` | 新增 DTO 转换函数 |
+| `backend/services/galleryService/service/gallery_service.go` | 新增 `GetHotInspirationFlowBatch`、`GetHotInspirationFlow`、`GetHotInspirationFlowMore` Service 方法 |
+| `backend/services/galleryService/repository/gallery_repository.go` | 新增 Repository 方法 |
+| `backend/pkg/database/redis.go` | 新增热门推荐缓存辅助函数 |
+
+---
+
+## Task 1: 更新 Proto 文件
+
+**Files:**
+- Modify: `backend/proto/gallery.proto`
+
+- [ ] **Step 1: 添加新的 Proto 消息**
+
+在 `backend/proto/gallery.proto` 末尾添加：
+
+```protobuf
+// 空请求（用于批量接口）
+message GetEmpty {}
+
+// 热门类型请求
+message GetHotTypeRequest {
+  string type = 1;  // hot_recommend, hot_star_card, hot_badge, hot_poster
+}
+
+// 热门分类条目
+message HotCategoryItem {
+  string type = 1;           // 分类类型：hot_recommend, hot_star_card, hot_badge, hot_poster
+  string title = 2;          // 展示标题：热门推荐、热门星卡、热门吧唧、热门海报
+  repeated InspirationFlowItem items = 3;  // 作品列表
+}
+
+// 热门批量响应（无参数，返回所有分类）
+message GetHotInspirationFlowBatchResponse {
+  BaseResponse base = 1;
+  repeated HotCategoryItem categories = 2;  // 动态分类列表
+}
+
+// 热门单个分类响应
+message GetHotInspirationFlowResponse {
+  BaseResponse base = 1;
+  HotCategoryItem data = 2;
+}
+
+// 热门查看更多请求
+message GetHotInspirationFlowMoreRequest {
+  string type = 1;            // 分类类型
+  string cursor = 2;          // 翻页游标（Base64 编码的 JSON）
+  int32 limit = 3;           // 每页数量，默认20
+}
+
+// 热门查看更多响应
+message GetHotInspirationFlowMoreResponse {
+  BaseResponse base = 1;
+  InspirationFlowData data = 2;
+}
+```
+
+- [ ] **Step 2: 在 GalleryService 添加新方法声明**
+
+在 `service GalleryService` 中添加：
+
+```protobuf
+// 批量获取热门分类
+rpc GetHotInspirationFlowBatch(GetEmpty) returns (GetHotInspirationFlowBatchResponse);
+
+// 单个分类刷新
+rpc GetHotInspirationFlow(GetHotTypeRequest) returns (GetHotInspirationFlowResponse);
+
+// 查看更多分页
+rpc GetHotInspirationFlowMore(GetHotInspirationFlowMoreRequest) returns (GetHotInspirationFlowMoreResponse);
+```
+
+- [ ] **Step 3: 运行 Proto 生成**
+
+```bash
+cd /Users/liulujian/Documents/code/TopFansByGithub/backend
+./proto/gen.sh
+```
+
+---
+
+## Task 2: 新增 Redis 缓存辅助函数
+
+**Files:**
+- Modify: `backend/pkg/database/redis.go`
+
+- [ ] **Step 1: 添加热门推荐缓存常量和结构体**
+
+```go
+const (
+    // ... 现有常量 ...
+    HotAvgLikesKeyPrefix = "hot_avg_likes:"  // 热门分类点赞均值缓存
+    HotBatchKeyPrefix    = "hot_batch:"      // 热门批量结果缓存
+)
+
+// HotAvgLikesCache 点赞均值缓存（用于减少重复计算平均值）
+type HotAvgLikesCache struct {
+    AvgLikes  float64 `json:"avg_likes"`
+    Total     int64   `json:"total"`
+    UpdatedAt int64   `json:"updated_at"`
+}
+```
+
+- [ ] **Step 2: 添加缓存操作函数**
+
+```go
+// GetHotAvgLikesCache 获取热门分类点赞均值缓存
+// 返回缓存的均值和该分类的作品总数，如果缓存不存在则返回 nil
+func GetHotAvgLikesCache(ctx context.Context, starID int64, assetType string) (*HotAvgLikesCache, error) {
+    if RedisClient == nil {
+        return nil, nil
+    }
+    key := fmt.Sprintf("%s%d:%s", HotAvgLikesKeyPrefix, starID, assetType)
+    data, err := RedisClient.Get(ctx, key).Bytes()
+    if err != nil {
+        if err == redis.Nil {
+            return nil, nil
+        }
+        return nil, err
+    }
+    var cache HotAvgLikesCache
+    if json.Unmarshal(data, &cache) != nil {
+        return nil, nil
+    }
+    return &cache, nil
+}
+
+// SetHotAvgLikesCache 设置热门分类点赞均值缓存（TTL: 5分钟）
+func SetHotAvgLikesCache(ctx context.Context, starID int64, assetType string, cache *HotAvgLikesCache) error {
+    if RedisClient == nil {
+        return nil
+    }
+    key := fmt.Sprintf("%s%d:%s", HotAvgLikesKeyPrefix, starID, assetType)
+    data, err := json.Marshal(cache)
+    if err != nil {
+        return err
+    }
+    return RedisClient.Set(ctx, key, data, 5*time.Minute).Err()
+}
+
+// GetHotBatchCache 获取热门批量结果缓存
+func GetHotBatchCache(ctx context.Context, starID int64) ([]byte, error) {
+    if RedisClient == nil {
+        return nil, nil
+    }
+    key := fmt.Sprintf("%s%d", HotBatchKeyPrefix, starID)
+    return RedisClient.Get(ctx, key).Bytes()
+}
+
+// SetHotBatchCache 设置热门批量结果缓存（TTL: 30秒）
+func SetHotBatchCache(ctx context.Context, starID int64, data []byte) error {
+    if RedisClient == nil {
+        return nil
+    }
+    key := fmt.Sprintf("%s%d", HotBatchKeyPrefix, starID)
+    return RedisClient.Set(ctx, key, data, 30*time.Second).Err()
+}
+```
+
+- [ ] **Step 3: 提交**
+
+```bash
+git add pkg/database/redis.go
+git commit -m "feat: add redis cache helpers for hot inspiration flow"
+```
+
+---
+
+## Task 3: 新增 Repository 方法
+
+**Files:**
+- Modify: `backend/services/galleryService/repository/gallery_repository.go`
+
+- [ ] **Step 1: 在 GalleryRepository 接口添加方法声明**
+
+```go
+// ========== 热门推荐相关 ==========
+
+// CountHotAssetsAboveAvg 统计点赞数高于平均值的作品数量
+// assetType: 资产类型（star_card/badge/poster），空字符串表示所有类型
+CountHotAssetsAboveAvg(starID int64, assetType string) (int64, float64, error)
+
+// GetHotAssetsByAvg 获取点赞数>=平均值的作品（基于时间窗口伪随机排序）
+// assetType: 资产类型，空字符串表示所有类型（用于 hot_recommend）
+// limit: 返回数量
+GetHotAssetsByAvg(starID int64, assetType string, limit int) ([]*InspirationFlowItem, error)
+
+// GetHotAssetsByAvgWithCursor 获取点赞数>=平均值的作品（分页，按点赞数DESC + asset_id DESC排序）
+GetHotAssetsByAvgWithCursor(starID int64, assetType string, cursorLikeCount int64, cursorAssetID int64, limit int) ([]*InspirationFlowItem, int64, error)
+```
+
+- [ ] **Step 2: 实现 GetHotAssetsByAvg（基于时间窗口伪随机）**
+
+```go
+// GetHotAssetsByAvg 获取点赞数>=平均值的作品（基于时间窗口伪随机排序）
+func (r *galleryRepository) GetHotAssetsByAvg(starID int64, assetType string, limit int) ([]*InspirationFlowItem, error) {
+    var items []*InspirationFlowItem
+    now := time.Now().UnixMilli()
+
+    // 构建基础查询
+    baseQuery := r.db.Model(&models.Exhibition{}).
+        Where("exhibitions.occupier_star_id = ? AND exhibitions.expire_at > ? AND exhibitions.deleted_at IS NULL", starID, now).
+        Joins("JOIN assets a ON a.id = exhibitions.asset_id").
+        Where("a.status = 1 AND a.is_active = true")
+
+    if assetType != "" {
+        baseQuery = baseQuery.Where("a.asset_type = ?", assetType)
+    }
+
+    // 计算平均值
+    var avgLikes float64
+    err := baseQuery.Select("COALESCE(AVG(a.like_count), 0)").Scan(&avgLikes).Error
+    if err != nil {
+        return nil, err
+    }
+
+    // 使用基于时间窗口的伪随机排序
+    // 避免 ORDER BY RANDOM() 在大表上的性能问题
+    windowSeed := now / 30000 % 10 // 每30秒变化一次
+    err = baseQuery.
+        Select(`exhibitions.id as exhibition_id, exhibitions.asset_id, a.name, a.cover_url, a.like_count,
+                COALESCE(alr.current_level, 'N') as level, fp.nickname as owner_nickname, fp.avatar_url as owner_avatar,
+                a.material_type, a.created_at`).
+        Joins("LEFT JOIN asset_level_records alr ON alr.asset_id = a.id").
+        Joins("JOIN fan_profiles fp ON exhibitions.occupier_uid = fp.user_id AND exhibitions.occupier_star_id = fp.star_id").
+        Where("a.like_count >= ?", avgLikes).
+        Where("exhibitions.id % 10 = ?", windowSeed).
+        Order("exhibitions.id").
+        Limit(limit).
+        Scan(&items).Error
+
+    if err != nil {
+        return nil, err
+    }
+
+    // 如果数量不够，补充查询（不加随机过滤）
+    if len(items) < limit {
+        remaining := limit - len(items)
+        var extraItems []*InspirationFlowItem
+        extraQuery := r.db.Model(&models.Exhibition{}).
+            Where("exhibitions.occupier_star_id = ? AND exhibitions.expire_at > ? AND exhibitions.deleted_at IS NULL", starID, now).
+            Joins("JOIN assets a ON a.id = exhibitions.asset_id").
+            Where("a.status = 1 AND a.is_active = true").
+            Where("a.like_count >= ?", avgLikes).
+            Where("exhibitions.id % 10 != ?", windowSeed)
+
+        if assetType != "" {
+            extraQuery = extraQuery.Where("a.asset_type = ?", assetType)
+        }
+
+        err = extraQuery.
+            Select(`exhibitions.id as exhibition_id, exhibitions.asset_id, a.name, a.cover_url, a.like_count,
+                    COALESCE(alr.current_level, 'N') as level, fp.nickname as owner_nickname, fp.avatar_url as owner_avatar,
+                    a.material_type, a.created_at`).
+            Joins("LEFT JOIN asset_level_records alr ON alr.asset_id = a.id").
+            Joins("JOIN fan_profiles fp ON exhibitions.occupier_uid = fp.user_id AND exhibitions.occupier_star_id = fp.star_id").
+            Order("exhibitions.id").
+            Limit(remaining).
+            Scan(&extraItems).Error
+
+        if err != nil {
+            return nil, err
+        }
+        items = append(items, extraItems...)
+    }
+
+    // 填充 Span
+    for _, item := range items {
+        item.Span = calcSpanByLevel(item.Level)
+    }
+
+    return items, nil
+}
+```
+
+- [ ] **Step 3: 实现 GetHotAssetsByAvgWithCursor（查看更多分页用）**
+
+```go
+// GetHotAssetsByAvgWithCursor 获取点赞数>=平均值的作品（分页，按点赞数DESC + asset_id DESC排序）
+func (r *galleryRepository) GetHotAssetsByAvgWithCursor(starID int64, assetType string, cursorLikeCount int64, cursorAssetID int64, limit int) ([]*InspirationFlowItem, int64, error) {
+    var items []*InspirationFlowItem
+    var total int64
+    now := time.Now().UnixMilli()
+
+    baseQuery := r.db.Model(&models.Exhibition{}).
+        Where("exhibitions.occupier_star_id = ? AND exhibitions.expire_at > ? AND exhibitions.deleted_at IS NULL", starID, now).
+        Joins("JOIN assets a ON a.id = exhibitions.asset_id").
+        Where("a.status = 1 AND a.is_active = true")
+
+    if assetType != "" {
+        baseQuery = baseQuery.Where("a.asset_type = ?", assetType)
+    }
+
+    // 统计总数（用于 has_more）
+    err := baseQuery.Count(&total).Error
+    if err != nil {
+        return nil, 0, err
+    }
+
+    // 先计算该分类的平均值（基于 baseQuery 确保 assetType 过滤生效）
+    var avgLikes float64
+    err = baseQuery.Select("COALESCE(AVG(a.like_count), 0)").Scan(&avgLikes).Error
+    if err != nil {
+        return nil, 0, err
+    }
+
+    // 应用游标条件
+    if cursorLikeCount > 0 || cursorAssetID > 0 {
+        baseQuery = baseQuery.Where("(a.like_count, a.id) < (?, ?)", cursorLikeCount, cursorAssetID)
+    }
+
+    // 查询数据（点赞数 >= 平均值，按点赞数 DESC + id DESC 排序）
+    err = baseQuery.
+        Select(`exhibitions.id as exhibition_id, exhibitions.asset_id, a.name, a.cover_url, a.like_count,
+                COALESCE(alr.current_level, 'N') as level, fp.nickname as owner_nickname, fp.avatar_url as owner_avatar,
+                a.material_type, a.created_at`).
+        Joins("LEFT JOIN asset_level_records alr ON alr.asset_id = a.id").
+        Joins("JOIN fan_profiles fp ON exhibitions.occupier_uid = fp.user_id AND exhibitions.occupier_star_id = fp.star_id").
+        Where("a.like_count >= ?", avgLikes).
+        Order("a.like_count DESC, a.id DESC").
+        Limit(limit).
+        Scan(&items).Error
+
+    if err != nil {
+        return nil, 0, err
+    }
+
+    // 填充 Span
+    for _, item := range items {
+        item.Span = calcSpanByLevel(item.Level)
+    }
+
+    return items, total, nil
+}
+```
+
+- [ ] **Step 4: 提交**
+
+```bash
+git add services/galleryService/repository/gallery_repository.go
+git commit -m "feat: add repository methods for hot inspiration flow"
+```
+
+---
+
+## Task 4: 新增 Service 层方法
+
+**Files:**
+- Modify: `backend/services/galleryService/service/gallery_service.go`
+
+- [ ] **Step 1: 在 GalleryService 接口添加方法声明**
+
+```go
+import (
+    "context"
+    "encoding/base64"
+    "encoding/json"
+    "fmt"
+    "strconv"
+    "sync"
+    "time"
+
+    "dubbo.apache.org/dubbo-go/v3/common/constant"
+    pb "github.com/topfans/backend/pkg/proto/gallery"
+    "github.com/topfans/backend/pkg/models"
+    "github.com/topfans/backend/pkg/database"
+    "github.com/topfans/backend/services/galleryService/repository"
+)
+
+type GalleryService interface {
+    // ... 现有方法 ...
+
+    // ========== 热门推荐相关 ==========
+    GetHotInspirationFlowBatch(ctx context.Context, req *pb.GetEmpty) (*pb.GetHotInspirationFlowBatchResponse, error)
+    GetHotInspirationFlow(ctx context.Context, req *pb.GetHotTypeRequest) (*pb.GetHotInspirationFlowResponse, error)
+    GetHotInspirationFlowMore(ctx context.Context, req *pb.GetHotInspirationFlowMoreRequest) (*pb.GetHotInspirationFlowMoreResponse, error)
+}
+```
+
+- [ ] **Step 2: 实现 categoryType 到 assetType 的映射函数**
+
+```go
+// categoryTypeToAssetType 热门分类 type 转 assetType
+func categoryTypeToAssetType(categoryType string) (assetType string, title string) {
+    switch categoryType {
+    case "hot_star_card":
+        return "star_card", "热门星卡"
+    case "hot_badge":
+        return "badge", "热门吧唧"
+    case "hot_poster":
+        return "poster", "热门海报"
+    default: // hot_recommend
+        return "", "热门推荐"
+    }
+}
+```
+
+- [ ] **Step 3: 实现 GetHotInspirationFlowBatch（并行查询 + 缓存点赞均值）**
+
+```go
+// GetHotInspirationFlowBatch 批量获取热门分类（使用 starID 进行缓存分区和查询）
+func (s *galleryService) GetHotInspirationFlowBatch(ctx context.Context, req *pb.GetEmpty) (*pb.GetHotInspirationFlowBatchResponse, error) {
+    // 从 context 中获取 starID（Dubbo 协议通过 attachments 传递）
+    starIDStr := ctx.Value(constant.AttachmentKey).(map[string]interface{})["star_id"].(string)
+    starID, _ := strconv.ParseInt(starIDStr, 10, 64)
+
+    // 尝试从缓存获取批量结果（TTL 30秒）
+    cached, err := database.GetHotBatchCache(ctx, starID)
+    if err == nil && cached != nil && len(cached) > 0 {
+        var resp pb.GetHotInspirationFlowBatchResponse
+        if json.Unmarshal(cached, &resp) == nil {
+            return &resp, nil
+        }
+    }
+
+    // 缓存未命中，查询数据库
+    // 定义4个分类的配置（使用 starID 进行缓存分区）
+    categories := []struct {
+        Type      string
+        Title     string
+        AssetType string // 空字符串表示所有类型
+    }{
+        {"hot_recommend", "热门推荐", ""},
+        {"hot_star_card", "热门星卡", "star_card"},
+        {"hot_badge", "热门吧唧", "badge"},
+        {"hot_poster", "热门海报", "poster"},
+    }
+
+    // 使用 goroutine 并行查询
+    var wg sync.WaitGroup
+    mu sync.Mutex
+    results := make([]*pb.HotCategoryItem, len(categories))
+    errors := make([]error, len(categories))
+
+    for i, cat := range categories {
+        wg.Add(1)
+        go func(idx int, c struct {
+            Type, Title, AssetType string
+        }) {
+            defer wg.Done()
+            items, err := s.getHotItemsByType(ctx, starID, c.AssetType, 8)
+            if err != nil {
+                errors[idx] = err
+                return
+            }
+            pbItems := s.convertToPbItems(items)
+            mu.Lock()
+            results[idx] = &pb.HotCategoryItem{
+                Type:  c.Type,
+                Title: c.Title,
+                Items: pbItems,
+            }
+            mu.Unlock()
+        }(i, cat)
+    }
+    wg.Wait()
+
+    // 检查是否有错误
+    for _, err := range errors {
+        if err != nil {
+            return nil, err
+        }
+    }
+
+    // 过滤掉空分类
+    filtered := make([]*pb.HotCategoryItem, 0, len(results))
+    for _, r := range results {
+        if r != nil && len(r.Items) > 0 {
+            filtered = append(filtered, r)
+        }
+    }
+
+    resp := &pb.GetHotInspirationFlowBatchResponse{
+        Categories: filtered,
+    }
+
+    // 更新批量结果缓存（TTL 30秒）
+    if data, err := json.Marshal(resp); err == nil {
+        database.SetHotBatchCache(ctx, starID, data)
+    }
+
+    return resp, nil
+}
+
+// getHotItemsByType 获取热门作品（带缓存点赞均值）
+func (s *galleryService) getHotItemsByType(ctx context.Context, starID int64, assetType string, limit int) ([]*InspirationFlowItem, error) {
+    // 先尝试从缓存获取点赞均值
+    cached, err := database.GetHotAvgLikesCache(ctx, starID, assetType)
+    if err == nil && cached != nil && cached.Total > 0 {
+        // 缓存命中，查询数据库
+        return s.repo.GetHotAssetsByAvg(starID, assetType, limit)
+    }
+
+    // 缓存未命中，查询数据库并更新缓存
+    items, err := s.repo.GetHotAssetsByAvg(starID, assetType, limit)
+    if err != nil {
+        return nil, err
+    }
+
+    // 更新点赞均值缓存
+    if len(items) > 0 {
+        total := int64(len(items))
+        avgLikes := float64(0)
+        for _, item := range items {
+            avgLikes += float64(item.LikeCount)
+        }
+        avgLikes = avgLikes / float64(total)
+        database.SetHotAvgLikesCache(ctx, starID, assetType, &database.HotAvgLikesCache{
+            AvgLikes:  avgLikes,
+            Total:     total,
+            UpdatedAt: time.Now().UnixMilli(),
+        })
+    }
+
+    return items, nil
+}
+```
+
+- [ ] **Step 4: 实现 GetHotInspirationFlow（单个刷新，不缓存结果）**
+
+```go
+// GetHotInspirationFlow 单个分类刷新（不缓存结果，用户主动刷新应获取新数据）
+func (s *galleryService) GetHotInspirationFlow(ctx context.Context, req *pb.GetHotTypeRequest) (*pb.GetHotInspirationFlowResponse, error) {
+    // 从 context 中获取 starID
+    starIDStr := ctx.Value(constant.AttachmentKey).(map[string]interface{})["star_id"].(string)
+    starID, _ := strconv.ParseInt(starIDStr, 10, 64)
+
+    assetType, title := categoryTypeToAssetType(req.Type)
+
+    items, err := s.repo.GetHotAssetsByAvg(starID, assetType, 8)
+    if err != nil {
+        return nil, err
+    }
+
+    pbItems := s.convertToPbItems(items)
+
+    return &pb.GetHotInspirationFlowResponse{
+        Data: &pb.HotCategoryItem{
+            Type:  req.Type,
+            Title: title,
+            Items: pbItems,
+        },
+    }, nil
+}
+```
+
+- [ ] **Step 5: 实现 GetHotInspirationFlowMore（查看更多分页）**
+
+```go
+// GetHotInspirationFlowMore 查看更多分页
+func (s *galleryService) GetHotInspirationFlowMore(ctx context.Context, req *pb.GetHotInspirationFlowMoreRequest) (*pb.GetHotInspirationFlowMoreResponse, error) {
+    // 从 context 中获取 starID
+    starIDStr := ctx.Value(constant.AttachmentKey).(map[string]interface{})["star_id"].(string)
+    starID, _ := strconv.ParseInt(starIDStr, 10, 64)
+
+    // 默认值处理
+    limit := req.Limit
+    if limit <= 0 {
+        limit = 20
+    }
+    if limit > 50 {
+        limit = 50
+    }
+
+    assetType, _ := categoryTypeToAssetType(req.Type)
+
+    // 解析 cursor（Base64 编码的 JSON，包含 like_count 和 asset_id）
+    var cursorLikeCount int64 = 0
+    var cursorAssetID int64 = 0
+    if req.Cursor != "" {
+        decoded, err := base64.StdEncoding.DecodeString(req.Cursor)
+        if err == nil {
+            var cursorData map[string]interface{}
+            if json.Unmarshal(decoded, &cursorData) == nil {
+                if lc, ok := cursorData["like_count"].(float64); ok {
+                    cursorLikeCount = int64(lc)
+                }
+                if id, ok := cursorData["asset_id"].(float64); ok {
+                    cursorAssetID = int64(id)
+                }
+            }
+        }
+    }
+
+    // 查询数据（使用正确的 starID）
+    items, total, err := s.repo.GetHotAssetsByAvgWithCursor(starID, assetType, cursorLikeCount, cursorAssetID, int(limit))
+    if err != nil {
+        return nil, err
+    }
+
+    // 转换为 pb
+    pbItems := s.convertToPbItems(items)
+
+    // 生成新 cursor（编码 like_count 和 asset_id）
+    newCursor := ""
+    if len(items) > 0 {
+        lastItem := items[len(items)-1]
+        newCursor = base64.StdEncoding.EncodeToString([]byte(fmt.Sprintf(`{"like_count":%d,"asset_id":%d}`, lastItem.LikeCount, lastItem.AssetID)))
+    }
+
+    // 判断是否还有更多
+    hasMore := int64(len(items)) < total
+
+    return &pb.GetHotInspirationFlowMoreResponse{
+        Data: &pb.InspirationFlowData{
+            Items:   pbItems,
+            Cursor:  newCursor,
+            HasMore: hasMore,
+        },
+    }, nil
+}
+
+// convertToPbItems 转换为 pb 列表
+func (s *galleryService) convertToPbItems(items []*InspirationFlowItem) []*pb.InspirationFlowItem {
+    pbItems := make([]*pb.InspirationFlowItem, 0, len(items))
+    for _, item := range items {
+        pbItems = append(pbItems, &pb.InspirationFlowItem{
+            AssetId:       item.AssetID,
+            Name:          item.Name,
+            CoverUrl:      item.CoverURL,
+            LikeCount:     item.LikeCount,
+            OwnerNickname: item.OwnerNickname,
+            OwnerAvatar:   item.OwnerAvatar,
+            Span:          item.Span,
+            MaterialType:  item.MaterialType,
+        })
+    }
+    return pbItems
+}
+```
+
+- [ ] **Step 6: 提交**
+
+```bash
+git add services/galleryService/service/gallery_service.go
+git commit -m "feat: add hot inspiration flow service methods"
+```
+
+---
+
+## Task 5: 新增 Gateway Controller 方法
+
+**Files:**
+- Modify: `backend/gateway/controller/gallery_controller.go`
+
+- [ ] **Step 1: 添加新的 Controller 方法**
+
+```go
+// GetHotInspirationFlowBatch 批量获取热门分类
+// @Summary 批量获取热门分类
+// @Description 页面加载时一次性获取所有热门分类（无参数）
+// @Tags galleries
+// @Accept json
+// @Produce json
+// @Security BearerAuth
+// @Success 200 {object} response.Response{data=[]dto.HotCategoryItemDTO}
+// @Router /api/v1/inspiration-flow/hot/batch [get]
+func (ctrl *GalleryController) GetHotInspirationFlowBatch(c *gin.Context) {
+    userID, exists := c.Get("user_id")
+    if !exists {
+        response.Error(c, http.StatusUnauthorized, "用户未认证")
+        return
+    }
+    starID, exists := c.Get("star_id")
+    if !exists {
+        response.Error(c, http.StatusUnauthorized, "明星身份未设置")
+        return
+    }
+
+    ctx := context.WithValue(context.Background(), constant.AttachmentKey, map[string]interface{}{
+        "user_id": strconv.FormatInt(userID.(int64), 10),
+        "star_id": strconv.FormatInt(starID.(int64), 10),
+    })
+
+    resp, err := ctrl.galleryService.GetHotInspirationFlowBatch(ctx, &pb.GetEmpty{})
+    if err != nil {
+        logger.Logger.Error("GetHotInspirationFlowBatch failed", zap.Error(err))
+        response.Error(c, http.StatusInternalServerError, "获取热门分类失败")
+        return
+    }
+
+    response.Success(c, resp.Categories)
+}
+
+// GetHotInspirationFlow 单个分类刷新
+// @Summary 单个分类刷新
+// @Description 点击刷新按钮时调用
+// @Tags galleries
+// @Accept json
+// @Produce json
+// @Security BearerAuth
+// @Param type query string true "分类类型：hot_recommend/hot_star_card/hot_badge/hot_poster"
+// @Success 200 {object} response.Response{data=dto.HotCategoryItemDTO}
+// @Router /api/v1/inspiration-flow/hot [get]
+func (ctrl *GalleryController) GetHotInspirationFlow(c *gin.Context) {
+    userID, exists := c.Get("user_id")
+    if !exists {
+        response.Error(c, http.StatusUnauthorized, "用户未认证")
+        return
+    }
+    starID, exists := c.Get("star_id")
+    if !exists {
+        response.Error(c, http.StatusUnauthorized, "明星身份未设置")
+        return
+    }
+
+    categoryType := c.Query("type")
+    if categoryType == "" {
+        response.Error(c, http.StatusBadRequest, "缺少 type 参数")
+        return
+    }
+
+    ctx := context.WithValue(context.Background(), constant.AttachmentKey, map[string]interface{}{
+        "user_id": strconv.FormatInt(userID.(int64), 10),
+        "star_id": strconv.FormatInt(starID.(int64), 10),
+    })
+
+    resp, err := ctrl.galleryService.GetHotInspirationFlow(ctx, &pb.GetHotTypeRequest{Type: categoryType})
+    if err != nil {
+        logger.Logger.Error("GetHotInspirationFlow failed", zap.Error(err))
+        response.Error(c, http.StatusInternalServerError, "刷新热门分类失败")
+        return
+    }
+
+    response.Success(c, resp.Data)
+}
+
+// GetHotInspirationFlowMore 查看更多分页
+// @Summary 查看更多分页
+// @Description 热门分类查看更多
+// @Tags galleries
+// @Accept json
+// @Produce json
+// @Security BearerAuth
+// @Param type query string true "分类类型：hot_recommend/hot_star_card/hot_badge/hot_poster"
+// @Param cursor query string false "翻页游标（Base64 编码的 JSON）"
+// @Param limit query int false "每页数量" default(20)
+// @Success 200 {object} response.Response{data=dto.InspirationFlowDataDTO}
+// @Router /api/v1/inspiration-flow/hot/more [get]
+func (ctrl *GalleryController) GetHotInspirationFlowMore(c *gin.Context) {
+    userID, exists := c.Get("user_id")
+    if !exists {
+        response.Error(c, http.StatusUnauthorized, "用户未认证")
+        return
+    }
+    starID, exists := c.Get("star_id")
+    if !exists {
+        response.Error(c, http.StatusUnauthorized, "明星身份未设置")
+        return
+    }
+
+    categoryType := c.Query("type")
+    if categoryType == "" {
+        response.Error(c, http.StatusBadRequest, "缺少 type 参数")
+        return
+    }
+
+    cursor := c.Query("cursor")
+    limitStr := c.DefaultQuery("limit", "20")
+    limit, _ := strconv.Atoi(limitStr)
+
+    ctx := context.WithValue(context.Background(), constant.AttachmentKey, map[string]interface{}{
+        "user_id": strconv.FormatInt(userID.(int64), 10),
+        "star_id": strconv.FormatInt(starID.(int64), 10),
+    })
+
+    resp, err := ctrl.galleryService.GetHotInspirationFlowMore(ctx, &pb.GetHotInspirationFlowMoreRequest{
+        Type:   categoryType,
+        Cursor: cursor,
+        Limit:  int32(limit),
+    })
+    if err != nil {
+        logger.Logger.Error("GetHotInspirationFlowMore failed", zap.Error(err))
+        response.Error(c, http.StatusInternalServerError, "获取更多热门分类失败")
+        return
+    }
+
+    response.Success(c, resp.Data)
+}
+```
+
+- [ ] **Step 2: 提交**
+
+```bash
+git add gateway/controller/gallery_controller.go
+git commit -m "feat: add hot inspiration flow controller methods"
+```
+
+---
+
+## Task 6: 更新 Router 注册路由
+
+**Files:**
+- Modify: `backend/gateway/router/router.go`
+
+- [ ] **Step 1: 在 inspiration-flow 路由组添加新路由**
+
+找到现有的 inspiration-flow 路由组，修改为：
+
+```go
+// 灵感瀑布相关路由（需要认证）
+inspirationFlow := v1.Group("/inspiration-flow")
+inspirationFlow.Use(middleware.AuthMiddleware())
+{
+    inspirationFlow.GET("", galleryCtrl.GetInspirationFlow)                              // 灵感瀑布流
+    inspirationFlow.GET("/hot/batch", galleryCtrl.GetHotInspirationFlowBatch)           // 批量获取热门分类（必须在 /hot 之前）
+    inspirationFlow.GET("/hot", galleryCtrl.GetHotInspirationFlow)                     // 单个分类刷新
+    inspirationFlow.GET("/hot/more", galleryCtrl.GetHotInspirationFlowMore)             // 查看更多分页
+}
+```
+
+**注意**：`/hot/batch` 路由必须在 `/hot` 之前注册，否则 `/hot/batch` 会被 `/hot` 匹配到。
+
+- [ ] **Step 2: 提交**
+
+```bash
+git add gateway/router/router.go
+git commit -m "feat: register hot inspiration flow routes"
+```
+
+---
+
+## Task 7: 更新 DTO（Data Transfer Object）
+
+**Files:**
+- Modify: `backend/gateway/dto/gallery_dto.go`
+
+- [ ] **Step 1: 添加 DTO 结构体**
+
+```go
+// HotCategoryItemDTO 热门分类条目
+type HotCategoryItemDTO struct {
+    Type  string                   `json:"type"`
+    Title string                  `json:"title"`
+    Items []*InspirationFlowItemDTO `json:"items"`
+}
+
+// GetHotInspirationFlowBatchResponseDTO 批量获取热门分类响应
+type GetHotInspirationFlowBatchResponseDTO struct {
+    Categories []*HotCategoryItemDTO `json:"categories"`
+}
+```
+
+- [ ] **Step 2: 添加转换函数（gallery_converter.go）**
+
+```go
+// ConvertHotCategoryItem 转换热门分类条目
+func ConvertHotCategoryItem(pbItem *pb.HotCategoryItem) *HotCategoryItemDTO {
+    if pbItem == nil {
+        return nil
+    }
+    items := make([]*InspirationFlowItemDTO, 0, len(pbItem.Items))
+    for _, item := range pbItem.Items {
+        items = append(items, ConvertInspirationFlowItem(item))
+    }
+    return &HotCategoryItemDTO{
+        Type:  pbItem.Type,
+        Title: pbItem.Title,
+        Items: items,
+    }
+}
+
+// ConvertHotCategoryItems 批量转换
+func ConvertHotCategoryItems(pbItems []*pb.HotCategoryItem) []*HotCategoryItemDTO {
+    if pbItems == nil {
+        return nil
+    }
+    result := make([]*HotCategoryItemDTO, 0, len(pbItems))
+    for _, item := range pbItems {
+        if dto := ConvertHotCategoryItem(item); dto != nil {
+            result = append(result, dto)
+        }
+    }
+    return result
+}
+```
+
+- [ ] **Step 3: 提交**
+
+```bash
+git add gateway/dto/gallery_dto.go gateway/dto/gallery_converter.go
+git commit -m "feat: add DTO for hot inspiration flow"
+```
+
+---
+
+## Task 8: 调整现有 GetInspirationFlow 逻辑（灵感瀑布流）
+
+**Files:**
+- Modify: `backend/services/galleryService/service/gallery_service.go`
+- Modify: `backend/services/galleryService/repository/gallery_repository.go`
+
+- [ ] **Step 1: 在 Repository 添加分段查询方法 GetInspirationFlowByAvgSegments**
+
+```go
+// GetInspirationFlowByAvgSegments 分段获取灵感瀑布作品
+// 第一段：点赞 > 平均值，基于时间窗口伪随机排列，取前 limit 条
+// 第二段：如果不够，从 >= 平均值的作品中（排除第一段已选的）补充
+func (r *galleryRepository) GetInspirationFlowByAvgSegments(starID int64, materialType string, excludeIDs []int64, limit int) ([]*InspirationFlowItem, error) {
+    var items []*InspirationFlowItem
+    now := time.Now().UnixMilli()
+
+    // 构建基础查询
+    baseQuery := r.db.Model(&models.Exhibition{}).
+        Where("exhibitions.occupier_star_id = ? AND exhibitions.expire_at > ? AND exhibitions.deleted_at IS NULL", starID, now).
+        Joins("JOIN assets a ON a.id = exhibitions.asset_id").
+        Where("a.status = 1 AND a.is_active = true")
+
+    if materialType != "" && materialType != "all" {
+        baseQuery = baseQuery.Where("a.material_type LIKE ?", "%"+materialType+"%")
+    }
+
+    // 排除已展示
+    if len(excludeIDs) > 0 {
+        baseQuery = baseQuery.Where("exhibitions.id NOT IN ?", excludeIDs)
+    }
+
+    // 计算平均值
+    var avgLikes float64
+    err := baseQuery.Select("COALESCE(AVG(a.like_count), 0)").Scan(&avgLikes).Error
+    if err != nil {
+        return nil, err
+    }
+
+    // 第一段：点赞 > 平均值，基于时间窗口伪随机
+    windowSeed := now / 30000 % 10
+    err = baseQuery.
+        Select(`exhibitions.id as exhibition_id, exhibitions.asset_id, a.name, a.cover_url, a.like_count,
+                COALESCE(alr.current_level, 'N') as level, fp.nickname as owner_nickname, fp.avatar_url as owner_avatar,
+                a.material_type, a.created_at`).
+        Joins("LEFT JOIN asset_level_records alr ON alr.asset_id = a.id").
+        Joins("JOIN fan_profiles fp ON exhibitions.occupier_uid = fp.user_id AND exhibitions.occupier_star_id = fp.star_id").
+        Where("a.like_count > ?", avgLikes).
+        Where("exhibitions.id % 10 = ?", windowSeed).
+        Order("exhibitions.id").
+        Limit(limit).
+        Scan(&items).Error
+
+    if err != nil {
+        return nil, err
+    }
+
+    // 如果第一段不够，补充第二段（点赞数 >= 平均值，排除已选）
+    if len(items) < limit {
+        remaining := limit - len(items)
+        var extraItems []*InspirationFlowItem
+        err = baseQuery.
+            Where("a.like_count >= ?", avgLikes).
+            Where("exhibitions.id % 10 != ?", windowSeed).
+            Select(`exhibitions.id as exhibition_id, exhibitions.asset_id, a.name, a.cover_url, a.like_count,
+                    COALESCE(alr.current_level, 'N') as level, fp.nickname as owner_nickname, fp.avatar_url as owner_avatar,
+                    a.material_type, a.created_at`).
+            Joins("LEFT JOIN asset_level_records alr ON alr.asset_id = a.id").
+            Joins("JOIN fan_profiles fp ON exhibitions.occupier_uid = fp.user_id AND exhibitions.occupier_star_id = fp.star_id").
+            Order("exhibitions.id").
+            Limit(remaining).
+            Scan(&extraItems).Error
+
+        if err != nil {
+            return nil, err
+        }
+        items = append(items, extraItems...)
+    }
+
+    // 填充 Span
+    for _, item := range items {
+        item.Span = calcSpanByLevel(item.Level)
+    }
+
+    return items, nil
+}
+```
+
+- [ ] **Step 2: 修改 Service 层的 GetInspirationFlow 方法**
+
+将 `GetInspirationFlow` 方法中的 `GetRandomExhibitions` 调用替换为 `GetInspirationFlowByAvgSegments`。
+
+- [ ] **Step 3: 提交**
+
+```bash
+git add services/galleryService/service/gallery_service.go services/galleryService/repository/gallery_repository.go
+git commit -m "refactor: adjust inspiration flow to use avg-based segment query"
+```
+
+---
+
+## 自检清单
+
+1. **Spec 覆盖检查**:
+   - [ ] 批量接口 `/hot/batch` - Task 1, 2, 3, 4, 5, 6
+   - [ ] 单个刷新 `/hot` - Task 1, 4, 5, 6
+   - [ ] 查看更多 `/hot/more` - Task 1, 3, 4, 5, 6
+   - [ ] 灵感瀑布调整 - Task 8
+   - [ ] 并行查询 - Task 4
+   - [ ] 缓存优化（点赞均值缓存 5min，批量结果缓存 30s，单个刷新不缓存） - Task 2, 4
+   - [ ] 伪随机算法 - Task 3
+   - [ ] Cursor 编码 (like_count, asset_id) - Task 3, 4
+
+2. **Placeholder 扫描**: 无 "TBD"、"TODO" 占位符
+
+3. **类型一致性**: Proto 消息与 Go 结构体匹配，Cursor 编码格式正确
+
+---
+
+**Plan complete and saved to `docs/superpowers/plans/2026-05-27-热门推荐模块实现.md`.**
+
+**Two execution options:**
+
+**1. Subagent-Driven (recommended)** - I dispatch a fresh subagent per task, review between tasks, fast iteration
+
+**2. Inline Execution** - Execute tasks in this session using executing-plans, batch execution with checkpoints
+
+**Which approach?**
\ No newline at end of file
diff --git a/docs/superpowers/specs/2026-05-27-热门推荐模块设计.md b/docs/superpowers/specs/2026-05-27-热门推荐模块设计.md
new file mode 100644
index 0000000..196cc3c
--- /dev/null
+++ b/docs/superpowers/specs/2026-05-27-热门推荐模块设计.md
@@ -0,0 +1,476 @@
+# 热门推荐模块设计方案
+
+## 一、需求概述
+
+在广场页面顶部新增 4 个热门分类区块，每个分类显示 8 张高点赞作品，支持单个分类刷新和查看更多分页。
+
+### 页面结构
+
+```
+┌─────────────────────────────────┐
+│         BannerCarousel          │
+├─────────────────────────────────┤
+│  热门推荐 (8张图)                │
+│  [刷新]              [查看更多 >]  │
+├─────────────────────────────────┤
+│  热门星卡 (8张图)                │
+│  [刷新]              [查看更多 >]  │
+├─────────────────────────────────┤
+│  热门吧唧 (8张图)                │
+│  [刷新]              [查看更多 >]  │
+├─────────────────────────────────┤
+│  热门海报 (8张图)                │
+│  [刷新]              [查看更多 >]  │
+├─────────────────────────────────┤
+│  (CreationGrid 组件 - 不变)       │
+│  [热门作品] [最新作品] [星卡] ...   │
+└─────────────────────────────────┘
+```
+
+### 分类配置
+
+| 区块 | type值 | 说明 |
+|-----|-------|------|
+| 热门推荐 | `hot_recommend` | 混合所有类型，高点赞作品 |
+| 热门星卡 | `hot_star_card` | 只展示星卡 |
+| 热门吧唧 | `hot_badge` | 只展示吧唧 |
+| 热门海报 | `hot_poster` | 只展示海报 |
+
+---
+
+## 二、接口设计
+
+### 2.1 接口清单
+
+| 接口 | 方法 | 参数 | 说明 |
+|-----|------|------|------|
+| `/api/v1/inspiration-flow/hot/batch` | GET | 无 | 页面加载批量获取 |
+| `/api/v1/inspiration-flow/hot` | GET | `type` | 单个分类刷新 |
+| `/api/v1/inspiration-flow/hot/more` | GET | `type`, `cursor`, `limit` | 查看更多分页 |
+| `/api/v1/inspiration-flow` | GET | `type`, `cursor`, `limit`, `direction` | 灵感瀑布流（逻辑调整） |
+
+### 2.2 调用时序
+
+```
+页面加载 → 批量请求（1次）
+     └── GET /api/v1/inspiration-flow/hot/batch
+
+刷新 → 单个分类刷新（点击哪个刷新请求哪个）
+     └── GET /api/v1/inspiration-flow/hot?type=hot_star_card
+
+查看更多 → 新页面分页
+     └── GET /api/v1/inspiration-flow/hot/more?type=hot_star_card&cursor=xxx&limit=20
+```
+
+### 2.3 批量获取热门分类
+
+**接口**: `GET /api/v1/inspiration-flow/hot/batch`
+
+**请求参数**: 无
+
+**业务逻辑**:
+1. 计算各分类作品的点赞平均值
+2. 筛选点赞数 ≥ 平均值的作品
+3. 基于时间窗口伪随机排序取 8 条
+4. 后端动态返回分类数据，前端根据返回的分类动态渲染
+5. **排序说明**：使用随机排序，同一分类每次请求返回的作品可能不同
+
+**响应结构**:
+```json
+{
+  "code": 200,
+  "data": {
+    "categories": [
+      {
+        "type": "hot_recommend",
+        "title": "热门推荐",
+        "items": [
+          {
+            "asset_id": "xxx",
+            "cover_url": "https://xxx.jpg",
+            "owner_nickname": "用户名",
+            "owner_avatar": "https://xxx.jpg",
+            "likes": 1234
+          }
+        ]
+      },
+      {
+        "type": "hot_star_card",
+        "title": "热门星卡",
+        "items": [...]
+      },
+      {
+        "type": "hot_badge",
+        "title": "热门吧唧",
+        "items": [...]
+      },
+      {
+        "type": "hot_poster",
+        "title": "热门海报",
+        "items": [...]
+      }
+    ]
+  }
+}
+```
+
+**空分类处理**: 如果某个分类没有作品或查询失败，该分类不返回。
+
+### 2.4 单个分类刷新
+
+**接口**: `GET /api/v1/inspiration-flow/hot`
+
+**请求参数**:
+
+| 参数 | 类型 | 必填 | 说明 |
+|-----|------|------|------|
+| `type` | string | 是 | `hot_recommend` / `hot_star_card` / `hot_badge` / `hot_poster` |
+
+**业务逻辑**:
+1. 计算该分类作品的点赞平均值
+2. 筛选点赞数 ≥ 平均值的作品
+3. 基于时间窗口伪随机排序取 8 条
+4. **排序说明**：批量和刷新接口使用随机排序，每次刷新可能看到不同作品
+
+**响应结构**:
+```json
+{
+  "code": 200,
+  "data": {
+    "type": "hot_star_card",
+    "title": "热门星卡",
+    "items": [...]
+  }
+}
+```
+
+### 2.5 查看更多分页
+
+**接口**: `GET /api/v1/inspiration-flow/hot/more`
+
+**请求参数**:
+
+| 参数 | 类型 | 必填 | 说明 |
+|-----|------|------|------|
+| `type` | string | 是 | 分类类型（`hot_recommend`/`hot_star_card`/`hot_badge`/`hot_poster`） |
+| `cursor` | string | 否 | 翻页游标（Base64 编码的 JSON，包含 `like_count` 和 `asset_id`） |
+| `limit` | int | 否 | 每页数量，默认 20 |
+
+**业务逻辑**:
+1. 计算该分类作品点赞平均值
+2. 筛选点赞数 ≥ 平均值的作品
+3. 按点赞数 DESC、asset_id DESC 排序
+4. Cursor 分页返回（游标编码 last_like_count 和 last_asset_id）
+
+**Cursor 编码格式**:
+```json
+// Base64 编码 {"like_count": 1234, "asset_id": 5678}
+```
+
+**响应结构**:
+```json
+{
+  "code": 200,
+  "data": {
+    "items": [
+      {
+        "asset_id": "xxx",
+        "cover_url": "https://xxx.jpg",
+        "owner_nickname": "用户名",
+        "owner_avatar": "https://xxx.jpg",
+        "likes": 1234,
+        "material_type": "star_card",
+        "sub_type": "raster"
+      }
+    ],
+    "cursor": "xxx",
+    "has_more": true
+  }
+}
+```
+
+**注意**：`material_type` 表示素材类型（`star_card`/`badge`/`poster`），不是热门分类类型。
+
+### 2.6 灵感瀑布流（逻辑调整）
+
+**接口**: `GET /api/v1/inspiration-flow`
+
+**请求参数**:
+
+| 参数 | 类型 | 必填 | 说明 |
+|-----|------|------|------|
+| `type` | string | 否 | 素材类型过滤：`badge`/`poster`/`original`/`all`，默认 `all` |
+| `cursor` | string | 否 | 翻页游标 |
+| `direction` | string | 否 | 滚动方向：`right`（加载新数据）/ `left`（加载历史） |
+| `limit` | int | 否 | 每页数量，默认 20 |
+| `session_id` | string | 否 | 会话 ID（用于去重，排除已展示的作品） |
+
+**业务逻辑（调整后）**:
+1. 计算该分类作品的点赞平均值（基于展出中的作品 JOIN assets）
+2. **第一段**：点赞 > 平均值的作品，基于时间窗口伪随机排列，取前 20 条
+3. **第二段**：如果不够 20 条，从点赞 ≥ 平均值的作品中（排除第一段已选的）补充，伪随机排列
+4. 两段组合返回
+5. 通过 `session_id` 排除已展示的作品（`excludeIDs`）
+
+**响应结构**:
+```json
+{
+  "code": 200,
+  "data": {
+    "items": [...],
+    "cursor": "xxx",
+    "has_more": true
+  }
+}
+```
+
+**注意**：灵感瀑布的 `type` 参数与热门推荐接口的 `type` 参数含义完全不同：
+- 灵感瀑布：`badge`/`poster`/`original`/`all`（素材类型过滤）
+- 热门推荐：`hot_recommend`/`hot_star_card`/`hot_badge`/`hot_poster`（分类类型）
+
+### 2.7 热门分类 type 与资产类型映射
+
+| 热门分类 type | 对应 assetType | 说明 |
+|-------------|---------------|------|
+| `hot_recommend` | 空字符串 `""` | 混合所有类型，不过滤 |
+| `hot_star_card` | `star_card` | 只展示星卡 |
+| `hot_badge` | `badge` | 只展示吧唧 |
+| `hot_poster` | `poster` | 只展示海报 |
+
+---
+
+## 三、数据模型
+
+### 3.1 数据库表结构
+
+参考现有 `assets` 表，无需新建表。
+
+### 3.2 伪 SQL（以 star_card 为例）
+
+**说明**：热门作品基于**展出中的作品**筛选，只有在 `exhibitions` 表中有有效展出记录且未过期的作品才能参与热门排名。
+
+```sql
+-- 计算该分类作品的点赞平均值（基于展出中的作品）
+AVG_LIKES := SELECT AVG(a.like_count)
+FROM exhibitions e
+JOIN assets a ON a.id = e.asset_id
+WHERE e.occupier_star_id = :star_id
+  AND e.expire_at > :now
+  AND e.deleted_at IS NULL
+  AND a.status = 1 AND a.is_active = true
+  AND a.asset_type = 'star_card';
+
+-- 随机取8条（基于时间窗口的伪随机，避免 ORDER BY RANDOM()）
+WINDOW_SEED := :now / 30000 % 10;  -- 每30秒变化一次
+
+SELECT e.id as exhibition_id, e.asset_id, a.name, a.cover_url, a.like_count,
+       fp.nickname as owner_nickname, fp.avatar_url as owner_avatar
+FROM exhibitions e
+JOIN assets a ON a.id = e.asset_id
+JOIN fan_profiles fp ON e.occupier_uid = fp.user_id AND e.occupier_star_id = fp.star_id
+WHERE e.occupier_star_id = :star_id
+  AND e.expire_at > :now
+  AND e.deleted_at IS NULL
+  AND a.status = 1 AND a.is_active = true
+  AND a.asset_type = 'star_card'
+  AND a.like_count >= :AVG_LIKES
+  AND e.id % 10 = :WINDOW_SEED
+ORDER BY e.id
+LIMIT 8;
+```
+
+### 3.3 随机算法说明
+
+避免 `ORDER BY RANDOM()`（大表性能差），改用基于时间窗口的伪随机：
+
+- `WINDOW_SEED = now / 30000 % 10` — 每 30 秒变化一次
+- 通过 `exhibition_id % 10 = WINDOW_SEED` 筛选作品
+- 同一时间窗口内结果固定，不同时间窗口返回不同作品
+- 如果数量不够，再补充查询（不加窗口过滤）
+
+---
+
+## 四、后端实现
+
+### 4.1 文件改动
+
+| 文件 | 改动内容 |
+|-----|---------|
+| `backend/proto/gallery.proto` | 新增 Proto 消息定义 |
+| `backend/pkg/proto/gallery/gallery.pb.go` | 重新生成 Proto 代码 |
+| `backend/gateway/controller/gallery_controller.go` | 新增 `GetHotInspirationFlowBatch`、`GetHotInspirationFlow`、`GetHotInspirationFlowMore` 方法 |
+| `backend/gateway/router/router.go` | 新增 `/inspiration-flow/hot/*` 路由注册 |
+| `backend/gateway/dto/gallery_dto.go` | 新增 DTO 结构体 |
+| `backend/gateway/dto/gallery_converter.go` | 新增 DTO 转换函数 |
+| `backend/services/galleryService/service/gallery_service.go` | 新增 `GetHotInspirationFlowBatch`、`GetHotInspirationFlow`、`GetHotInspirationFlowMore` Service 方法 |
+| `backend/services/galleryService/repository/gallery_repository.go` | 新增 Repository 方法 |
+| `backend/pkg/database/redis.go` | 新增热门推荐缓存辅助函数 |
+
+### 4.2 性能优化
+
+1. **并行查询** — 批量接口用 goroutine 并行查 4 个分类
+2. **缓存平均值** — 每个分类的点赞均值独立缓存（TTL: 5 分钟），避免重复计算
+3. **结果缓存** — 热门列表缓存 30-60 秒，降低数据库压力
+4. **随机优化** — 避免 `ORDER BY RANDOM()`，使用基于时间窗口的伪随机
+
+### 4.3 缓存策略
+
+| 缓存项 | TTL | 说明 |
+|-------|-----|------|
+| 分类点赞均值 | 5 分钟 | 每个分类单独缓存，减少重复计算平均值 |
+| 热门列表（批量） | 30 秒 | `/hot/batch` 结果缓存，降低数据库压力 |
+| 热门列表（单个刷新） | **不缓存** | 用户主动刷新应获取新数据，只缓存点赞均值 |
+
+---
+
+## 五、前端实现
+
+### 5.1 文件结构
+
+```
+frontend/pages/square/
+├── square.vue                    # 修改：集成4个热门分类区块
+├── components/
+│   └── HotCategoryBlock.vue      # 新增：单个热门分类区块组件
+└── hot-category-more.vue         # 新增：热门分类查看更多页面
+```
+
+### 5.2 新增文件
+
+| 文件 | 说明 |
+|-----|------|
+| `pages/square/components/HotCategoryBlock.vue` | 单个热门分类区块组件 |
+| `pages/square/hot-category-more.vue` | 热门分类查看更多页面 |
+
+### 5.3 修改文件
+
+| 文件 | 改动内容 |
+|-----|---------|
+| `frontend/pages/square/square.vue` | 顶部新增 4 个 HotCategoryBlock |
+| `frontend/utils/api.js` | 新增批量获取、单个刷新、查看更多 API |
+
+### 5.4 HotCategoryBlock 组件
+
+```
+HotCategoryBlock.vue
+├── props: { categoryType }  // 传入后端返回的 type（如 "hot_star_card"）
+├── 状态: items, loading, refreshing, title
+├── 方法: handleRefresh(), handleViewMore()
+└── 模板:
+    ├── 标题 (title)
+    ├── 4x2 网格 (items) + loading 骨架屏
+    ├── 刷新按钮 (loading 状态)
+    └── 查看更多按钮
+```
+
+**说明**：
+- `categoryType` 直接使用后端返回的 type（如 `hot_star_card`）
+- 刷新时调用 `getHotInspirationFlowApi(categoryType)`
+- 查看更多时调用 `getHotInspirationFlowMoreApi(categoryType, ...)`
+
+**刷新交互**:
+- 整个区块显示骨架屏 loading
+- 刷新按钮显示 loading 状态
+- 刷新完成后正常显示
+
+### 5.5 hot-category-more.vue 页面
+
+```
+hot-category-more.vue
+├── onLoad: 获取 type 参数
+├── 状态: items, cursor, loading, hasMore, title
+├── 方法: loadMore(), loadData()
+└── 模板:
+    ├── 返回按钮 + 标题
+    └── 分页网格列表
+```
+
+**星卡子标签**（仅星卡分类显示）:
+
+| 子标签 | value | 说明 |
+|-------|-------|------|
+| 全部 | `all` | 全部星卡 |
+| 光栅卡 | `raster` | 光栅卡类型 |
+| 镭射卡 | `holographic` | 镭射卡类型 |
+| 撕拉卡 | `tear_off` | 撕拉卡类型 |
+| 拍立得 | `polaroid` | 拍立得类型 |
+
+### 5.6 API 封装
+
+```javascript
+// 批量获取热门分类（页面加载用）
+export function getHotInspirationFlowBatchApi() {
+  return request({
+    url: '/api/v1/inspiration-flow/hot/batch',
+    method: 'GET'
+  })
+}
+
+// 单个分类刷新
+export function getHotInspirationFlowApi(type) {
+  return request({
+    url: '/api/v1/inspiration-flow/hot',
+    method: 'GET',
+    data: { type }
+  })
+}
+
+// 查看更多分页
+export function getHotInspirationFlowMoreApi(type, cursor = '', limit = 20) {
+  return request({
+    url: '/api/v1/inspiration-flow/hot/more',
+    method: 'GET',
+    data: { type, cursor, limit }
+  })
+}
+```
+
+**调用示例**：
+```javascript
+// 批量加载（页面初始化）
+const res = await getHotInspirationFlowBatchApi()
+res.data.categories.forEach(cat => {
+  // cat.type: "hot_recommend", cat.title: "热门推荐", cat.items: [...]
+  console.log(cat.type, cat.items.length)
+})
+
+// 单个刷新（点击刷新按钮）
+const refreshRes = await getHotInspirationFlowApi('hot_star_card')
+
+// 查看更多（跳转 hot-category-more.vue）
+uni.navigateTo({
+  url: `/pages/square/hot-category-more?type=${type}&title=${encodeURIComponent(title)}`
+})
+```
+
+---
+
+## 六、实现步骤
+
+### 后端
+
+- [ ] **Phase 1**: 新增 `/api/v1/inspiration-flow/hot/batch` 批量接口（并行查询 + 缓存）
+- [ ] **Phase 2**: 新增 `/api/v1/inspiration-flow/hot` 单个分类接口
+- [ ] **Phase 3**: 新增 `/api/v1/inspiration-flow/hot/more` 查看更多接口
+- [ ] **Phase 4**: 修改 `/api/v1/inspiration-flow` 逻辑（分段填充）
+
+### 前端
+
+- [ ] **Phase 5**: 新增 `utils/api.js` API 方法
+- [ ] **Phase 6**: 新增 `HotCategoryBlock.vue` 组件
+- [ ] **Phase 7**: 新增 `hot-category-more.vue` 页面
+- [ ] **Phase 8**: 修改 `square.vue` 集成 4 个热门分类区块
+- [ ] **Phase 9**: CreationGrid 保持不变
+
+### 联调
+
+- [ ] **Phase 10**: 前后端联调测试
+
+---
+
+## 七、注意事项
+
+1. **后端动态返回分类** — 前端无需硬编码分类，根据接口返回的 `categories` 动态渲染
+2. **空分类不显示** — 如果某个分类没有作品，该分类不返回或前端忽略
+3. **扩展性** — 后续增加新分类（如"热门贴纸"），只需后端调整，前端无需改动
+4. **性能优先** — 后端必须做好并行查询和缓存优化，确保批量接口响应时间 < 200ms
\ No newline at end of file