main:添加核心文件并初始化项目

新增内容： - 创建基础项目结构。 - 添加 `.gitignore` 和 `.dockerignore` 文件。 - 编写 `pyproject.toml` 和依赖文件。 - 添加算法模块及示例算法。 - 实现核心功能模块（日志、错误处理、指标）。 - 添加开发和运行所需的相关脚本文件及文档。
2026-02-02 10:46:01 +08:00
commit 31af5e2286
54 changed files with 5726 additions and 0 deletions
--- a/.dockerignore
+++ b/.dockerignore
@@ -0,0 +1,29 @@
 __pycache__/
 *.py[cod]
 *$py.class
 *.so
 .Python
 build/
 develop-eggs/
 dist/
 downloads/
 eggs/
 .eggs/
 lib/
 lib64/
 parts/
 sdist/
 var/
 wheels/
 *.egg-info/
 .installed.cfg
 *.egg
 .pytest_cache/
 .coverage
 htmlcov/
 .env
 .venv
 venv/
 ENV/
 *.log
 .DS_Store
--- a/.env.example
+++ b/.env.example
@@ -0,0 +1,32 @@
 # Environment Configuration
 # Copy this file to .env and fill in your values
 # Application
 APP_NAME=FunctionalScaffold
 APP_VERSION=1.0.0
 APP_ENV=development
 # Server
 HOST=0.0.0.0
 PORT=8000
 WORKERS=4
 # Logging
 LOG_LEVEL=INFO
 LOG_FORMAT=json
 # Metrics
 METRICS_ENABLED=true
 # Tracing
 TRACING_ENABLED=false
 JAEGER_ENDPOINT=http://localhost:14268/api/traces
 # External Services (examples)
 # OSS_ENDPOINT=https://oss-cn-hangzhou.aliyuncs.com
 # OSS_ACCESS_KEY_ID=your_access_key
 # OSS_ACCESS_KEY_SECRET=your_secret_key
 # OSS_BUCKET_NAME=your_bucket
 # Database (if needed)
 # DATABASE_URL=mysql://user:password@localhost:5432/dbname
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1,70 @@
 .claude
 docs/prompt
 .idea
 # Byte-compiled / optimized / DLL files
 __pycache__/
 *.py[cod]
 *$py.class
 # C extensions
 *.so
 # Distribution / packaging
 .Python
 build/
 develop-eggs/
 dist/
 downloads/
 eggs/
 .eggs/
 lib/
 lib64/
 parts/
 sdist/
 var/
 wheels/
 pip-wheel-metadata/
 share/python-wheels/
 *.egg-info/
 .installed.cfg
 *.egg
 MANIFEST
 # Unit test / coverage reports
 htmlcov/
 .tox/
 .nox/
 .coverage
 .coverage.*
 .cache
 nosetests.xml
 coverage.xml
 *.cover
 *.py,cover
 .hypothesis/
 .pytest_cache/
 # Virtual environments
 venv/
 env/
 ENV/
 env.bak/
 venv.bak/
 # IDEs
 .vscode/
 *.swp
 *.swo
 *~
 # Environment variables
 .env
 .env.local
 # Logs
 *.log
 # OS
 .DS_Store
 Thumbs.db
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -0,0 +1,399 @@
 # CLAUDE.md
 本文件为 Claude Code (claude.ai/code) 在此代码仓库中工作时提供指导。
 ## 项目概述
 **FunctionalScaffold（函数式脚手架）** 是一个算法工程化 Serverless 解决方案的脚手架生成器。
 - 为了方便团队交流，项目的自然语言使用中文，包括代码注释和文档等
 ### 核心目标
 解决三大痛点：
 1. **不确定的算力需求** - 需要动态扩缩容能力
 2. **算法同学工程化能力不足** - 降低工程化门槛
 3. **后端同学集成难度过高** - 标准化接口规范
 ## 技术架构
 采用 **Docker 封装的 Serverless API 服务**方案：
 - 算法代码 + 运行环境打包为 Docker 镜像
 - 部署到云厂商 Serverless 平台实现自动扩缩容
 - FastAPI 作为 HTTP 接口层
 - 算法逻辑保持独立和专注
 ### 架构流程
 ```
 用户请求 → API网关 → 容器实例（冷/热启动）→ FastAPI → 算法程序 → 返回结果
                                              ↓
                                        外部服务（OSS/数据库）
 ```
 ### 代码架构
 项目采用 **src layout** 结构（Python 最佳实践）：
 ```
 src/functional_scaffold/
 ├── algorithms/          # 算法层 - 所有算法必须继承 BaseAlgorithm
 │   ├── base.py         # 提供 execute() 包装器（埋点、错误处理）
 │   └── prime_checker.py # 示例：质数判断算法
 ├── api/                # API 层 - FastAPI 路由和模型
 │   ├── models.py       # Pydantic 数据模型（使用 ConfigDict）
 │   ├── routes.py       # 路由定义（/invoke, /healthz, /readyz, /jobs）
 │   └── dependencies.py # 依赖注入（request_id 生成）
 ├── core/               # 核心功能 - 横切关注点
 │   ├── errors.py       # 异常类层次结构
 │   ├── logging.py      # 结构化日志（JSON 格式）
 │   ├── metrics.py      # Prometheus 指标和装饰器
 │   └── tracing.py      # 分布式追踪（ContextVar）
 ├── utils/              # 工具函数
 │   └── validators.py   # 输入验证
 ├── config.py           # 配置管理（pydantic-settings）
 └── main.py             # FastAPI 应用入口
 ```
 **关键设计模式：**
 1. **算法抽象层**：所有算法继承 `BaseAlgorithm`，只需实现 `process()` 方法。`execute()` 方法自动处理埋点、日志和错误包装。
 2. **依赖注入**：使用 FastAPI 的 `Depends()` 机制注入 request_id，通过 `ContextVar` 在异步上下文中传递。
 3. **配置管理**：使用 `pydantic-settings` 从环境变量或 `.env` 文件加载配置，支持类型验证。
 4. **可观测性**：
   - 日志：结构化 JSON 日志（pythonjsonlogger）
   - 指标：Prometheus 格式（request_counter, request_latency, algorithm_counter）
   - 追踪：request_id 关联所有日志和指标
 ## 开发命令
 ### 环境设置
 ```bash
 # 创建虚拟环境并安装依赖（开发模式）
 python -m venv venv
 source venv/bin/activate  # Windows: venv\Scripts\activate
 pip install -e ".[dev]"
 ```
 ### 运行服务
 ```bash
 # 方式1：使用辅助脚本（推荐）
 ./scripts/run_dev.sh
 # 方式2：直接运行（开发模式，自动重载）
 uvicorn src.functional_scaffold.main:app --reload --port 8000
 # 方式3：生产模式
 uvicorn src.functional_scaffold.main:app --host 0.0.0.0 --port 8000 --workers 4
 ```
 访问地址：
 - Swagger UI: http://localhost:8000/docs
 - ReDoc: http://localhost:8000/redoc
 - Metrics: http://localhost:8000/metrics
 ### 测试
 ```bash
 # 运行所有测试
 pytest tests/ -v
 # 运行单个测试文件
 pytest tests/test_algorithms.py -v
 # 运行单个测试类
 pytest tests/test_algorithms.py::TestPrimeChecker -v
 # 运行单个测试方法
 pytest tests/test_algorithms.py::TestPrimeChecker::test_prime_numbers -v
 # 生成覆盖率报告
 pytest tests/ --cov=src/functional_scaffold --cov-report=html
 # 查看报告：open htmlcov/index.html
 # 使用辅助脚本（包含代码检查）
 ./scripts/run_tests.sh
 ```
 ### 代码质量
 ```bash
 # 代码格式化（自动修复）
 black src/ tests/
 # 代码检查（不修改文件）
 black --check src/ tests/
 # 代码检查
 ruff check src/ tests/
 # 自动修复可修复的问题
 ruff check --fix src/ tests/
 ```
 配置说明：
 - Black: 行长度 100，目标 Python 3.9+
 - Ruff: 行长度 100，目标 Python 3.9+
 ### Docker
 ```bash
 # 构建镜像
 docker build -f deployment/Dockerfile -t functional-scaffold:latest .
 # 运行容器
 docker run -p 8000:8000 functional-scaffold:latest
 # 使用 docker-compose（包含 Prometheus + Grafana）
 cd deployment
 docker-compose up
 # Grafana: http://localhost:3000 (admin/admin)
 # Prometheus: http://localhost:9090
 ```
 ### 文档
 ```bash
 # 导出 OpenAPI 规范到 docs/swagger/openapi.json
 python scripts/export_openapi.py
 ```
 ## 添加新算法
 ### 1. 创建算法类（继承 BaseAlgorithm）
 ```python
 # src/functional_scaffold/algorithms/my_algorithm.py
 from typing import Dict, Any
 from .base import BaseAlgorithm
 class MyAlgorithm(BaseAlgorithm):
    """我的算法类"""
    def process(self, input_data: Any) -> Dict[str, Any]:
        """
        算法处理逻辑
        Args:
            input_data: 输入数据
        Returns:
            Dict[str, Any]: 处理结果
        """
        # 实现算法逻辑
        result = do_something(input_data)
        return {"result": result}
 ```
 ### 2. 注册到 `__init__.py`
 ```python
 # src/functional_scaffold/algorithms/__init__.py
 from .my_algorithm import MyAlgorithm
 __all__ = [..., "MyAlgorithm"]
 ```
 ### 3. 添加 API 端点（在 `api/routes.py`）
 ```python
@router.post("/my-endpoint")
 async def my_endpoint(
    request: MyRequest,
    request_id: str = Depends(get_request_id)
 ):
    """我的算法端点"""
    algorithm = MyAlgorithm()
    result = algorithm.execute(request.data)
    return MyResponse(request_id=request_id, **result)
 ```
 ### 4. 定义数据模型（在 `api/models.py`）
 ```python
 class MyRequest(BaseModel):
    """我的请求模型"""
    model_config = ConfigDict(
        json_schema_extra={
            "example": {"data": "示例数据"}
        }
    )
    data: str = Field(..., description="输入数据")
 ```
 ### 5. 编写测试
 ```python
 # tests/test_my_algorithm.py
 def test_my_algorithm():
    """测试我的算法"""
    algo = MyAlgorithm()
    result = algo.process("测试数据")
    assert result["result"] == expected
 ```
 ## 配置管理
 配置通过 `src/functional_scaffold/config.py` 的 `Settings` 类管理：
 - 从环境变量读取（不区分大小写）
 - 支持 `.env` 文件
 - 使用 `pydantic-settings` 进行类型验证
 配置示例：
 ```bash
 # .env 文件
 APP_ENV=production
 LOG_LEVEL=INFO
 METRICS_ENABLED=true
 ```
 访问配置：
 ```python
 from functional_scaffold.config import settings
 print(settings.app_env)  # "production"
 ```
 ## 可观测性
 ### 日志
 使用 `core/logging.py` 的 `setup_logging()`：
 ```python
 from functional_scaffold.core.logging import setup_logging
 # 设置日志
 logger = setup_logging(level="INFO", format_type="json")
 # 记录日志
 logger.info("处理请求", extra={"user_id": "123"})
 ```
 ### 指标
 使用 `core/metrics.py` 的装饰器：
 ```python
 from functional_scaffold.core.metrics import track_algorithm_execution
@track_algorithm_execution("my_algorithm")
 def my_function():
    """我的函数"""
    pass
 ```
 可用指标：
 - `http_requests_total{method, endpoint, status}` - HTTP 请求总数
 - `http_request_duration_seconds{method, endpoint}` - HTTP 请求延迟
 - `algorithm_executions_total{algorithm, status}` - 算法执行总数
 - `algorithm_execution_duration_seconds{algorithm}` - 算法执行延迟
 ### 追踪
 Request ID 自动注入到所有请求：
 ```python
 from functional_scaffold.core.tracing import get_request_id
 # 在请求上下文中获取 request_id
 request_id = get_request_id()
 ```
 ## 部署
 ### Kubernetes
 ```bash
 kubectl apply -f deployment/kubernetes/deployment.yaml
 kubectl apply -f deployment/kubernetes/service.yaml
 ```
 配置说明：
 - 3 个副本
 - 资源限制：256Mi-512Mi 内存，250m-500m CPU
 - 健康检查：存活探针 (/healthz)，就绪探针 (/readyz)
 ### 阿里云函数计算
 ```bash
 fun deploy -t deployment/serverless/aliyun-fc.yaml
 ```
 ### AWS Lambda
 ```bash
 sam deploy --template-file deployment/serverless/aws-lambda.yaml
 ```
 ## 必须交付的三大组件
 ### 1. 接入规范
 **API 端点标准：**
 - `/invoke` - 同步调用接口
 - `/jobs` - 异步任务接口（当前返回 501）
 - `/healthz` - 存活检查
 - `/readyz` - 就绪检查
 - `/metrics` - Prometheus 指标
 **Schema 规范：**
 - 请求/响应 Schema（Pydantic 验证）
 - 错误响应格式（统一的 ErrorResponse）
 - 元数据和版本信息（每个响应包含 metadata）
 ### 2. Python SDK 运行时
 **已实现的能力：**
 - ✅ 参数校验（Pydantic + utils/validators.py）
 - ✅ 错误包装和标准化（core/errors.py）
 - ✅ 埋点（core/metrics.py - 延迟、失败率）
 - ✅ 分布式追踪的关联 ID（core/tracing.py）
 - ⏳ Worker 运行时（重试、超时、DLQ - 待实现）
 ### 3. 脚手架生成器
 **已包含的模板：**
 - ✅ 示例算法函数（algorithms/prime_checker.py）
 - ✅ Dockerfile（deployment/Dockerfile）
 - ✅ CI/CD 流水线配置（.github/workflows/）
 - ✅ Serverless 平台部署 YAML（deployment/serverless/）
 - ✅ Grafana 仪表板模板（monitoring/grafana/dashboard.json）
 - ✅ 告警规则配置（monitoring/alerts/rules.yaml）
 ## 开发理念
 **算法同学只需修改核心算法函数。** 所有基础设施、可观测性、部署相关的工作都由脚手架处理。
 算法开发者只需：
 1. 继承 `BaseAlgorithm`
 2. 实现 `process()` 方法
 3. 返回字典格式的结果
 框架自动提供：
 - HTTP 接口封装
 - 参数验证
 - 错误处理
 - 日志记录
 - 性能指标
 - 健康检查
 - 容器化部署
 ## 注意事项
 1. **Pydantic V2**：使用 `ConfigDict` 而非 `class Config`，使用 `model_config` 而非 `Config`。
 2. **异步上下文**：request_id 使用 `ContextVar` 存储，在异步函数中自动传递。
 3. **测试隔离**：每个测试使用 `TestClient`，不需要启动真实服务器。
 4. **Docker 构建**：Dockerfile 使用非 root 用户（appuser），包含健康检查。
 5. **配置优先级**：环境变量 > .env 文件 > 默认值。
--- a/README.md
+++ b/README.md
@@ -0,0 +1,259 @@
 # FunctionalScaffold
 **算法工程化 Serverless 解决方案脚手架**
 一个基于 FastAPI 和 Docker 的 Serverless 算法服务脚手架，帮助算法工程师快速构建生产级的算法服务。
 ## 特性
 - ✅ **标准化 API 接口** - 符合 RESTful 规范的 HTTP 接口
 - ✅ **开箱即用** - 完整的项目结构和配置
 - ✅ **自动文档** - Swagger/OpenAPI 自动生成
 - ✅ **监控指标** - Prometheus 指标和 Grafana 仪表板
 - ✅ **健康检查** - 存活和就绪探针
 - ✅ **容器化部署** - Docker 和 Kubernetes 支持
 - ✅ **Serverless 就绪** - 支持阿里云函数计算和 AWS Lambda
 - ✅ **完整测试** - 单元测试和集成测试
 - ✅ **CI/CD** - GitHub Actions 工作流
 ## 快速开始
 ### 前置要求
 - Python 3.9+
 - Docker (可选)
 ### 本地开发
 1. 克隆仓库
 ```bash
 git clone <repository-url>
 cd FunctionalScaffold
 ```
 2. 创建虚拟环境并安装依赖
 ```bash
 python -m venv venv
 source venv/bin/activate  # Windows: venv\Scripts\activate
 pip install -e ".[dev]"
 ```
 3. 启动开发服务器
 ```bash
 # 方式1：使用脚本
 ./scripts/run_dev.sh
 # 方式2：直接运行
 uvicorn src.functional_scaffold.main:app --reload --port 8000
 ```
 4. 访问 API 文档
 打开浏览器访问：
 - Swagger UI: http://localhost:8000/docs
 - ReDoc: http://localhost:8000/redoc
 - OpenAPI JSON: http://localhost:8000/openapi.json
 ### 使用 Docker
 ```bash
 # 构建镜像
 docker build -f deployment/Dockerfile -t functional-scaffold:latest .
 # 运行容器
 docker run -p 8000:8000 functional-scaffold:latest
 # 或使用 docker-compose
 cd deployment
 docker-compose up
 ```
 ## API 端点
 ### 核心接口
 - `POST /invoke` - 同步调用算法
 - `POST /jobs` - 异步任务接口（预留）
 ### 健康检查
 - `GET /healthz` - 存活检查
 - `GET /readyz` - 就绪检查
 ### 监控
 - `GET /metrics` - Prometheus 指标
 ## 示例请求
 ### 质数判断
 ```bash
 curl -X POST http://localhost:8000/invoke \
  -H "Content-Type: application/json" \
  -d '{"number": 17}'
 ```
 响应：
 ```json
 {
  "request_id": "550e8400-e29b-41d4-a716-446655440000",
  "status": "success",
  "result": {
    "number": 17,
    "is_prime": true,
    "factors": [],
    "algorithm": "trial_division"
  },
  "metadata": {
    "algorithm": "PrimeChecker",
    "version": "1.0.0",
    "elapsed_time": 0.001
  }
 }
 ```
 ## 项目结构
 ```
 FunctionalScaffold/
 ├── src/functional_scaffold/      # 核心代码
 │   ├── algorithms/               # 算法实现
 │   ├── api/                      # API 层
 │   ├── core/                     # 核心功能
 │   ├── utils/                    # 工具函数
 │   ├── config.py                 # 配置管理
 │   └── main.py                   # 应用入口
 ├── tests/                        # 测试
 ├── deployment/                   # 部署配置
 │   ├── Dockerfile
 │   ├── docker-compose.yml
 │   ├── kubernetes/
 │   └── serverless/
 ├── monitoring/                   # 监控配置
 ├── scripts/                      # 辅助脚本
 └── docs/                         # 文档
 ```
 ## 开发指南
 ### 添加新算法
 1. 在 `src/functional_scaffold/algorithms/` 创建新算法文件
 2. 继承 `BaseAlgorithm` 类并实现 `process` 方法
 3. 在 API 路由中注册新端点
 示例：
 ```python
 from .base import BaseAlgorithm
 class MyAlgorithm(BaseAlgorithm):
    def process(self, input_data):
        # 实现算法逻辑
        result = do_something(input_data)
        return {"result": result}
 ```
 ### 运行测试
 ```bash
 # 运行所有测试
 pytest tests/ -v
 # 运行测试并生成覆盖率报告
 pytest tests/ --cov=src/functional_scaffold --cov-report=html
 # 使用脚本
 ./scripts/run_tests.sh
 ```
 ### 代码质量
 ```bash
 # 代码格式化
 black src/ tests/
 # 代码检查
 ruff check src/ tests/
 ```
 ### 导出 OpenAPI 规范
 ```bash
 python scripts/export_openapi.py
 ```
 生成的文件位于 `docs/swagger/openapi.json`
 ## 部署
 ### Kubernetes
 ```bash
 kubectl apply -f deployment/kubernetes/
 ```
 ### 阿里云函数计算
 ```bash
 fun deploy -t deployment/serverless/aliyun-fc.yaml
 ```
 ### AWS Lambda
 ```bash
 sam deploy --template-file deployment/serverless/aws-lambda.yaml
 ```
 ## 监控
 ### Prometheus 指标
 访问 `/metrics` 端点查看可用指标：
 - `http_requests_total` - HTTP 请求总数
 - `http_request_duration_seconds` - HTTP 请求延迟
 - `algorithm_executions_total` - 算法执行总数
 - `algorithm_execution_duration_seconds` - 算法执行延迟
 ### Grafana 仪表板
 导入 `monitoring/grafana/dashboard.json` 到 Grafana
 ## 配置
 通过环境变量或 `.env` 文件配置：
 ```bash
 # 应用配置
 APP_NAME=FunctionalScaffold
 APP_VERSION=1.0.0
 APP_ENV=development
 # 服务器配置
 HOST=0.0.0.0
 PORT=8000
 WORKERS=4
 # 日志配置
 LOG_LEVEL=INFO
 LOG_FORMAT=json
 # 指标配置
 METRICS_ENABLED=true
 ```
 参考 `.env.example` 查看完整配置选项。
 ## 许可证
 MIT License
 ## 贡献
 欢迎提交 Issue 和 Pull Request！
--- a/deployment/Dockerfile
+++ b/deployment/Dockerfile
@@ -0,0 +1,31 @@
 FROM python:3.11-slim
 WORKDIR /app
 # 安装系统依赖
 RUN apt-get update && apt-get install -y --no-install-recommends \
    gcc \
    && rm -rf /var/lib/apt/lists/*
 # 复制依赖文件
 COPY requirements.txt .
 # 安装 Python 依赖
 RUN pip install --no-cache-dir -r requirements.txt
 # 复制应用代码
 COPY src/ ./src/
 # 创建非 root 用户
 RUN useradd -m -u 1000 appuser && chown -R appuser:appuser /app
 USER appuser
 # 暴露端口
 EXPOSE 8000
 # 健康检查
 HEALTHCHECK --interval=30s --timeout=3s --start-period=5s --retries=3 \
    CMD python -c "import urllib.request; urllib.request.urlopen('http://localhost:8000/healthz')"
 # 启动命令
 CMD ["uvicorn", "src.functional_scaffold.main:app", "--host", "0.0.0.0", "--port", "8000"]
--- a/deployment/Dockerfile.redis-exporter
+++ b/deployment/Dockerfile.redis-exporter
@@ -0,0 +1,33 @@
 # Redis Exporter Dockerfile
 FROM python:3.11-slim
 WORKDIR /app
 # 安装依赖
 COPY requirements.txt .
 RUN pip install --no-cache-dir redis prometheus-client
 # 复制 exporter 代码
 COPY src/functional_scaffold/core/metrics_redis_exporter.py .
 # 暴露端口
 EXPOSE 8001
 # 启动 HTTP 服务器提供指标
 CMD ["python", "-c", "\
 from http.server import HTTPServer, BaseHTTPRequestHandler; \
 from metrics_redis_exporter import get_metrics; \
 class MetricsHandler(BaseHTTPRequestHandler): \
    def do_GET(self): \
        if self.path == '/metrics': \
            self.send_response(200); \
            self.send_header('Content-Type', 'text/plain; version=0.0.4'); \
            self.end_headers(); \
            self.wfile.write(get_metrics()); \
        else: \
            self.send_response(404); \
            self.end_headers(); \
    def log_message(self, format, *args): pass; \
 server = HTTPServer(('0.0.0.0', 8001), MetricsHandler); \
 print('Redis Exporter 启动在端口 8001'); \
 server.serve_forever()"]
--- a/deployment/docker-compose.yml
+++ b/deployment/docker-compose.yml
@@ -0,0 +1,108 @@
 version: '3.8'
 services:
  app:
    build:
      context: ..
      dockerfile: deployment/Dockerfile
    ports:
      - "8111:8000"
    environment:
      - APP_ENV=development
      - LOG_LEVEL=INFO
      - METRICS_ENABLED=true
      # 方案1：Pushgateway 配置
      - PUSHGATEWAY_URL=pushgateway:9091
      - METRICS_JOB_NAME=functional_scaffold
      # 方案2：Redis 配置
      - REDIS_HOST=redis
      - REDIS_PORT=6379
      - REDIS_METRICS_DB=0
    volumes:
      - ../src:/app/src
    restart: unless-stopped
    depends_on:
      - redis
      - pushgateway
    healthcheck:
      test: ["CMD", "python", "-c", "import urllib.request; urllib.request.urlopen('http://localhost:8000/healthz')"]
      interval: 30s
      timeout: 3s
      retries: 3
      start_period: 5s
  # Redis - 用于集中式指标存储（方案2）
  redis:
    image: redis:7-alpine
    ports:
      - "6379:6379"
    volumes:
      - redis_data:/data
    command: redis-server --appendonly yes
    restart: unless-stopped
    healthcheck:
      test: ["CMD", "redis-cli", "ping"]
      interval: 10s
      timeout: 3s
      retries: 3
  # Pushgateway - 用于短生命周期任务的指标推送（方案1，推荐）
  pushgateway:
    image: prom/pushgateway:latest
    ports:
      - "9091:9091"
    restart: unless-stopped
    command:
      - '--persistence.file=/data/pushgateway.data'
      - '--persistence.interval=5m'
    volumes:
      - pushgateway_data:/data
  # Redis Exporter - 将 Redis 指标导出为 Prometheus 格式（方案2需要）
  redis-exporter:
    build:
      context: ..
      dockerfile: deployment/Dockerfile.redis-exporter
    ports:
      - "8001:8001"
    environment:
      - REDIS_HOST=redis
      - REDIS_PORT=6379
      - REDIS_METRICS_DB=0
    depends_on:
      - redis
    restart: unless-stopped
  prometheus:
    image: prom/prometheus:latest
    ports:
      - "9090:9090"
    volumes:
      - ../monitoring/prometheus.yml:/etc/prometheus/prometheus.yml
      - prometheus_data:/prometheus
    command:
      - '--config.file=/etc/prometheus/prometheus.yml'
      - '--storage.tsdb.path=/prometheus'
    restart: unless-stopped
    depends_on:
      - pushgateway
      - redis-exporter
  grafana:
    image: grafana/grafana:latest
    ports:
      - "3000:3000"
    environment:
      - GF_SECURITY_ADMIN_PASSWORD=admin
    volumes:
      - grafana_data:/var/lib/grafana
      - ../monitoring/grafana:/etc/grafana/provisioning
    restart: unless-stopped
    depends_on:
      - prometheus
 volumes:
  prometheus_data:
  grafana_data:
  redis_data:
  pushgateway_data:
--- a/deployment/kubernetes/deployment.yaml
+++ b/deployment/kubernetes/deployment.yaml
@@ -0,0 +1,53 @@
 apiVersion: apps/v1
 kind: Deployment
 metadata:
  name: functional-scaffold
  labels:
    app: functional-scaffold
 spec:
  replicas: 3
  selector:
    matchLabels:
      app: functional-scaffold
  template:
    metadata:
      labels:
        app: functional-scaffold
    spec:
      containers:
      - name: functional-scaffold
        image: functional-scaffold:latest
        imagePullPolicy: IfNotPresent
        ports:
        - containerPort: 8000
          name: http
        env:
        - name: APP_ENV
          value: "production"
        - name: LOG_LEVEL
          value: "INFO"
        - name: METRICS_ENABLED
          value: "true"
        resources:
          requests:
            memory: "256Mi"
            cpu: "250m"
          limits:
            memory: "512Mi"
            cpu: "500m"
        livenessProbe:
          httpGet:
            path: /healthz
            port: 8000
          initialDelaySeconds: 10
          periodSeconds: 30
          timeoutSeconds: 3
          failureThreshold: 3
        readinessProbe:
          httpGet:
            path: /readyz
            port: 8000
          initialDelaySeconds: 5
          periodSeconds: 10
          timeoutSeconds: 3
          failureThreshold: 3
--- a/deployment/kubernetes/service.yaml
+++ b/deployment/kubernetes/service.yaml
@@ -0,0 +1,31 @@
 apiVersion: v1
 kind: Service
 metadata:
  name: functional-scaffold
  labels:
    app: functional-scaffold
 spec:
  type: ClusterIP
  ports:
  - port: 80
    targetPort: 8000
    protocol: TCP
    name: http
  selector:
    app: functional-scaffold
 ---
 apiVersion: v1
 kind: Service
 metadata:
  name: functional-scaffold-metrics
  labels:
    app: functional-scaffold
 spec:
  type: ClusterIP
  ports:
  - port: 8000
    targetPort: 8000
    protocol: TCP
    name: metrics
  selector:
    app: functional-scaffold
--- a/deployment/serverless/aliyun-fc.yaml
+++ b/deployment/serverless/aliyun-fc.yaml
@@ -0,0 +1,40 @@
 # 阿里云函数计算配置
 ROSTemplateFormatVersion: '2015-09-01'
 Transform: 'Aliyun::Serverless-2018-04-03'
 Resources:
  functional-scaffold:
    Type: 'Aliyun::Serverless::Service'
    Properties:
      Description: '算法工程化 Serverless 脚手架'
      LogConfig:
        Project: functional-scaffold-logs
        Logstore: function-logs
      VpcConfig:
        VpcId: 'vpc-xxxxx'
        VSwitchIds:
          - 'vsw-xxxxx'
        SecurityGroupId: 'sg-xxxxx'
    prime-checker:
      Type: 'Aliyun::Serverless::Function'
      Properties:
        Description: '质数判断算法服务'
        Runtime: custom-container
        MemorySize: 512
        Timeout: 60
        InstanceConcurrency: 10
        CAPort: 8000
        CustomContainerConfig:
          Image: 'registry.cn-hangzhou.aliyuncs.com/your-namespace/functional-scaffold:latest'
          Command: '["uvicorn", "src.functional_scaffold.main:app", "--host", "0.0.0.0", "--port", "8000"]'
        EnvironmentVariables:
          APP_ENV: production
          LOG_LEVEL: INFO
          METRICS_ENABLED: 'true'
      Events:
        httpTrigger:
          Type: HTTP
          Properties:
            AuthType: ANONYMOUS
            Methods:
              - GET
              - POST
--- a/deployment/serverless/aws-lambda.yaml
+++ b/deployment/serverless/aws-lambda.yaml
@@ -0,0 +1,46 @@
 # AWS Lambda 配置（使用 Lambda Container Image）
 AWSTemplateFormatVersion: '2010-09-09'
 Transform: AWS::Serverless-2016-10-31
 Description: FunctionalScaffold Serverless Application
 Globals:
  Function:
    Timeout: 60
    MemorySize: 512
    Environment:
      Variables:
        APP_ENV: production
        LOG_LEVEL: INFO
        METRICS_ENABLED: 'true'
 Resources:
  FunctionalScaffoldFunction:
    Type: AWS::Serverless::Function
    Properties:
      PackageType: Image
      ImageUri: !Sub '${AWS::AccountId}.dkr.ecr.${AWS::Region}.amazonaws.com/functional-scaffold:latest'
      Events:
        ApiEvent:
          Type: Api
          Properties:
            Path: /{proxy+}
            Method: ANY
      Policies:
        - AWSLambdaBasicExecutionRole
  FunctionalScaffoldApi:
    Type: AWS::Serverless::Api
    Properties:
      StageName: prod
      Cors:
        AllowMethods: "'*'"
        AllowHeaders: "'*'"
        AllowOrigin: "'*'"
 Outputs:
  ApiUrl:
    Description: "API Gateway endpoint URL"
    Value: !Sub "https://${FunctionalScaffoldApi}.execute-api.${AWS::Region}.amazonaws.com/prod/"
  FunctionArn:
    Description: "Function ARN"
    Value: !GetAtt FunctionalScaffoldFunction.Arn
--- a/docs/grafana-dashboard-guide.md
+++ b/docs/grafana-dashboard-guide.md
@@ -0,0 +1,237 @@
 # Grafana Dashboard 导入和使用指南
 ## Dashboard 概述
 新的 dashboard 包含 10 个面板，全面展示应用的监控指标：
 ### 第一行：核心性能指标
 1. **HTTP 请求速率 (QPS)** - 每秒请求数，按端点和方法分组
 2. **HTTP 请求延迟 (P50/P95/P99)** - 请求响应时间的百分位数
 ### 第二行：关键指标
 3. **请求成功率** - 成功请求占比（仪表盘）
 4. **当前并发请求数** - 实时并发数（仪表盘）
 5. **HTTP 请求总数** - 累计请求数（统计卡片）
 6. **算法执行总数** - 累计算法调用数（统计卡片）
 ### 第三行：算法性能
 7. **算法执行速率** - 每秒算法执行次数
 8. **算法执行延迟 (P50/P95/P99)** - 算法执行时间的百分位数
 ### 第四行：分布分析
 9. **请求分布（按端点）** - 饼图展示各端点的请求占比
 10. **请求状态分布** - 饼图展示成功/失败请求占比
 ## 导入步骤
 ### 1. 配置 Prometheus 数据源
 首先确保 Prometheus 数据源已正确配置：
 1. 打开 Grafana：http://localhost:3000
 2. 登录（默认：admin/admin）
 3. 进入 **Configuration** → **Data Sources**
 4. 点击 **Add data source**
 5. 选择 **Prometheus**
 6. 配置：
   - **Name**: `Prometheus`（必须是这个名称）
   - **URL**: `http://prometheus:9090`（注意：使用服务名，不是 localhost）
   - **Access**: Server (default)
 7. 点击 **Save & Test**，确保显示绿色的成功提示
 ### 2. 导入 Dashboard
 有两种方式导入 dashboard：
 #### 方式 1：通过 JSON 文件导入（推荐）
 1. 在 Grafana 左侧菜单，点击 **Dashboards** → **Import**
 2. 点击 **Upload JSON file**
 3. 选择文件：`monitoring/grafana/dashboard.json`
 4. 在导入页面：
   - **Name**: FunctionalScaffold 监控仪表板
   - **Folder**: General（或创建新文件夹）
   - **Prometheus**: 选择刚才配置的 Prometheus 数据源
 5. 点击 **Import**
 #### 方式 2：通过 JSON 内容导入
 1. 在 Grafana 左侧菜单，点击 **Dashboards** → **Import**
 2. 复制 `monitoring/grafana/dashboard.json` 的全部内容
 3. 粘贴到 **Import via panel json** 文本框
 4. 点击 **Load**
 5. 配置数据源并点击 **Import**
 ### 3. 验证 Dashboard
 导入成功后，你应该看到：
 - ✅ 所有面板都正常显示
 - ✅ 有数据的面板显示图表和数值
 - ✅ 右上角显示自动刷新（5秒）
 - ✅ 时间范围默认为最近 1 小时
 ## 生成测试数据
 如果 dashboard 中没有数据或数据很少，运行流量生成脚本：
 ```bash
 # 启动流量生成器
 ./scripts/generate_traffic.sh
 ```
 这会持续发送请求到应用，生成监控数据。等待 1-2 分钟后，dashboard 中应该会显示丰富的图表。
 ## Dashboard 功能
 ### 自动刷新
 Dashboard 配置了自动刷新，默认每 5 秒更新一次。你可以在右上角修改刷新间隔：
 - 5s（默认）
 - 10s
 - 30s
 - 1m
 - 5m
 ### 时间范围
 默认显示最近 1 小时的数据。你可以在右上角修改时间范围：
 - Last 5 minutes
 - Last 15 minutes
 - Last 30 minutes
 - Last 1 hour（默认）
 - Last 3 hours
 - Last 6 hours
 - Last 12 hours
 - Last 24 hours
 - 或自定义时间范围
 ### 实时模式
 Dashboard 启用了 **Live** 模式（右上角的 Live 按钮），可以实时查看最新数据。
 ### 交互功能
 - **缩放**：在时间序列图表上拖动选择区域可以放大
 - **图例点击**：点击图例可以隐藏/显示对应的数据系列
 - **Tooltip**：鼠标悬停在图表上查看详细数值
 - **面板全屏**：点击面板标题旁的图标可以全屏查看
 ## 常见问题
 ### 问题 1：数据源连接失败
 **错误信息**：`dial tcp [::1]:9090: connect: connection refused`
 **解决方案**：
 - 确保 Prometheus URL 使用 `http://prometheus:9090`（服务名）
 - 不要使用 `http://localhost:9090`（在容器内部无法访问）
 ### 问题 2：面板显示 "No data"
 **可能原因**：
 1. 应用还没有收到任何请求
 2. Prometheus 还没有抓取到数据
 3. 时间范围选择不当
 **解决方案**：
 1. 发送一些测试请求：
   ```bash
   curl -X POST http://localhost:8111/invoke \
     -H "Content-Type: application/json" \
     -d '{"number": 17}'
   ```
 2. 等待 15-30 秒让 Prometheus 抓取数据
 3. 调整时间范围为 "Last 5 minutes"
 4. 运行流量生成脚本：`./scripts/generate_traffic.sh`
 ### 问题 3：延迟图表显示 "NaN" 或空值
 **原因**：直方图数据不足，无法计算百分位数
 **解决方案**：
 - 发送更多请求以积累足够的数据
 - 等待几分钟让数据积累
 - 使用流量生成脚本持续发送请求
 ### 问题 4：数据源变量未正确设置
 **错误信息**：面板显示 "Datasource not found"
 **解决方案**：
 1. 确保 Prometheus 数据源的名称是 `Prometheus`
 2. 或者在 dashboard 设置中重新选择数据源：
   - 点击右上角的齿轮图标（Dashboard settings）
   - 进入 **Variables** 标签
   - 编辑 `DS_PROMETHEUS` 变量
   - 选择正确的 Prometheus 数据源
 ## PromQL 查询说明
 Dashboard 使用的主要 PromQL 查询：
 ### HTTP 请求速率
 ```promql
 sum(rate(http_requests_total[1m])) by (endpoint, method)
 ```
 ### HTTP 请求延迟 P95
 ```promql
 histogram_quantile(0.95, sum(rate(http_request_duration_seconds_bucket[1m])) by (le, endpoint, method))
 ```
 ### 请求成功率
 ```promql
 sum(rate(http_requests_total{status="success"}[5m])) / sum(rate(http_requests_total[5m]))
 ```
 ### 算法执行速率
 ```promql
 sum(rate(algorithm_executions_total[1m])) by (algorithm, status)
 ```
 ## 自定义 Dashboard
 你可以根据需要自定义 dashboard：
 1. **添加新面板**：点击右上角的 "Add panel" 按钮
 2. **编辑面板**：点击面板标题 → Edit
 3. **调整布局**：拖动面板调整位置和大小
 4. **保存更改**：点击右上角的保存图标
 ## 导出和分享
 ### 导出 Dashboard
 1. 点击右上角的分享图标
 2. 选择 **Export** 标签
 3. 点击 **Save to file** 下载 JSON 文件
 ### 分享 Dashboard
 1. 点击右上角的分享图标
 2. 选择 **Link** 标签
 3. 复制链接分享给团队成员
 ## 告警配置（可选）
 你可以为面板配置告警规则：
 1. 编辑面板
 2. 切换到 **Alert** 标签
 3. 点击 **Create alert rule from this panel**
 4. 配置告警条件和通知渠道
 ## 相关资源
 - Grafana 官方文档：https://grafana.com/docs/
 - Prometheus 查询语言：https://prometheus.io/docs/prometheus/latest/querying/basics/
 - Dashboard 最佳实践：https://grafana.com/docs/grafana/latest/best-practices/
 ## 技术支持
 如果遇到问题：
 1. 检查 Prometheus 是否正常运行：http://localhost:9090
 2. 检查应用 metrics 端点：http://localhost:8111/metrics
 3. 查看 Grafana 日志：`docker-compose logs grafana`
 4. 查看 Prometheus 日志：`docker-compose logs prometheus`
--- a/docs/metrics-guide.md
+++ b/docs/metrics-guide.md
@@ -0,0 +1,346 @@
 # 指标记录方案对比与使用指南
 ## 问题背景
 在多实例部署场景下（Kubernetes、Serverless），原有的内存指标存储方案存在以下问题：
 1. **指标分散**：每个实例独立记录指标，无法聚合
 2. **数据丢失**：实例销毁后指标丢失
 3. **统计不准**：无法获得全局准确的指标视图
 ## 解决方案对比
 ### 方案1：Pushgateway（推荐）
 **原理：** 应用主动推送指标到 Pushgateway，Prometheus 从 Pushgateway 抓取
 **优点：**
 - ✅ Prometheus 官方支持，生态成熟
 - ✅ 实现简单，代码改动小
 - ✅ 适合短生命周期任务（Serverless、批处理）
 - ✅ 支持持久化，重启不丢失数据
 **缺点：**
 - ⚠️ 单点故障风险（可通过高可用部署解决）
 - ⚠️ 不适合超高频推送（每秒数千次）
 **适用场景：**
 - Serverless 函数
 - 批处理任务
 - 短生命周期容器
 - 实例数量动态变化的场景
 ### 方案2：Redis + 自定义 Exporter
 **原理：** 应用将指标写入 Redis，自定义 Exporter 从 Redis 读取并转换为 Prometheus 格式
 **优点：**
 - ✅ 灵活可控，支持复杂聚合逻辑
 - ✅ Redis 高性能，支持高并发写入
 - ✅ 可以实现自定义的指标计算
 **缺点：**
 - ⚠️ 需要自己实现 Exporter，维护成本高
 - ⚠️ 增加了系统复杂度
 - ⚠️ Redis 需要额外的运维成本
 **适用场景：**
 - 需要自定义指标聚合逻辑
 - 超高频指标写入（每秒数万次）
 - 需要实时查询指标数据
 ### 方案3：标准 Prometheus Pull 模式（不推荐）
 **原理：** Prometheus 从每个实例抓取指标，在查询时聚合
 **优点：**
 - ✅ Prometheus 标准做法
 - ✅ 无需额外组件
 **缺点：**
 - ❌ 需要服务发现机制（Kubernetes Service Discovery）
 - ❌ 短生命周期实例可能来不及抓取
 - ❌ 实例销毁后数据丢失
 **适用场景：**
 - 长生命周期服务
 - 实例数量相对固定
 - 有完善的服务发现机制
 ## 使用指南
 ### 方案1：Pushgateway（推荐）
 #### 1. 启动服务
 ```bash
 cd deployment
 docker-compose up -d redis pushgateway prometheus grafana
 ```
 #### 2. 修改代码
 在 `src/functional_scaffold/api/routes.py` 中：
 ```python
 # 替换导入
 from functional_scaffold.core.metrics_pushgateway import (
    track_request,
    track_algorithm_execution,
 )
 # 使用方式不变
@router.post("/invoke")
@track_request("POST", "/invoke")
 async def invoke_algorithm(request: InvokeRequest):
    # ... 业务逻辑
 ```
 #### 3. 配置环境变量
 在 `.env` 文件中：
 ```bash
 PUSHGATEWAY_URL=localhost:9091
 METRICS_JOB_NAME=functional_scaffold
 INSTANCE_ID=instance-1  # 可选，默认使用 HOSTNAME
 ```
 #### 4. 验证
 ```bash
 # 查看 Pushgateway 指标
 curl http://localhost:9091/metrics
 # 查看 Prometheus
 open http://localhost:9090
 # 查询示例
 http_requests_total{job="functional_scaffold"}
 ```
 ### 方案2：Redis + Exporter
 #### 1. 启动服务
 ```bash
 cd deployment
 docker-compose up -d redis redis-exporter prometheus grafana
 ```
 #### 2. 修改代码
 在 `src/functional_scaffold/api/routes.py` 中：
 ```python
 # 替换导入
 from functional_scaffold.core.metrics_redis import (
    track_request,
    track_algorithm_execution,
 )
 # 使用方式不变
@router.post("/invoke")
@track_request("POST", "/invoke")
 async def invoke_algorithm(request: InvokeRequest):
    # ... 业务逻辑
 ```
 #### 3. 配置环境变量
 在 `.env` 文件中：
 ```bash
 REDIS_HOST=localhost
 REDIS_PORT=6379
 REDIS_METRICS_DB=0
 REDIS_PASSWORD=  # 可选
 INSTANCE_ID=instance-1  # 可选
 ```
 #### 4. 安装 Redis 依赖
 ```bash
 pip install redis
 ```
 或在 `requirements.txt` 中添加：
 ```
 redis>=5.0.0
 ```
 #### 5. 验证
 ```bash
 # 查看 Redis 中的指标
 redis-cli
 > HGETALL metrics:request_counter
 # 查看 Exporter 输出
 curl http://localhost:8001/metrics
 # 查看 Prometheus
 open http://localhost:9090
 ```
 ## 性能对比
 | 指标 | Pushgateway | Redis + Exporter | 标准 Pull |
 |------|-------------|------------------|-----------|
 | 写入延迟 | ~5ms | ~1ms | N/A |
 | 查询延迟 | ~10ms | ~20ms | ~5ms |
 | 吞吐量 | ~1000 req/s | ~10000 req/s | ~500 req/s |
 | 内存占用 | 低 | 中 | 低 |
 | 复杂度 | 低 | 高 | 低 |
 ## 迁移步骤
 ### 从原有方案迁移到 Pushgateway
 1. **安装依赖**（如果需要）：
   ```bash
   pip install prometheus-client
   ```
 2. **替换导入**：
   ```python
   # 旧代码
   from functional_scaffold.core.metrics import track_request
   # 新代码
   from functional_scaffold.core.metrics_pushgateway import track_request
   ```
 3. **配置环境变量**：
   ```bash
   export PUSHGATEWAY_URL=localhost:9091
   ```
 4. **启动 Pushgateway**：
   ```bash
   docker-compose up -d pushgateway
   ```
 5. **更新 Prometheus 配置**（已包含在 `monitoring/prometheus.yml`）
 6. **测试验证**：
   ```bash
   # 发送请求
   curl -X POST http://localhost:8000/invoke -d '{"number": 17}'
   # 查看指标
   curl http://localhost:9091/metrics | grep http_requests_total
   ```
 ### 从原有方案迁移到 Redis
 1. **安装依赖**：
   ```bash
   pip install redis
   ```
 2. **替换导入**：
   ```python
   # 旧代码
   from functional_scaffold.core.metrics import track_request
   # 新代码
   from functional_scaffold.core.metrics_redis import track_request
   ```
 3. **配置环境变量**：
   ```bash
   export REDIS_HOST=localhost
   export REDIS_PORT=6379
   ```
 4. **启动 Redis 和 Exporter**：
   ```bash
   docker-compose up -d redis redis-exporter
   ```
 5. **测试验证**：
   ```bash
   # 发送请求
   curl -X POST http://localhost:8000/invoke -d '{"number": 17}'
   # 查看 Redis
   redis-cli HGETALL metrics:request_counter
   # 查看 Exporter
   curl http://localhost:8001/metrics
   ```
 ## 常见问题
 ### Q1: Pushgateway 会成为单点故障吗？
 A: 可以通过以下方式解决：
 - 部署多个 Pushgateway 实例（负载均衡）
 - 使用持久化存储（已配置）
 - 推送失败时降级到本地日志
 ### Q2: Redis 方案的性能如何？
 A: Redis 单实例可以支持 10万+ QPS，对于大多数场景足够。如果需要更高性能，可以：
 - 使用 Redis Cluster
 - 批量写入（减少网络往返）
 - 使用 Pipeline
 ### Q3: 如何在 Kubernetes 中使用？
 A:
 - **Pushgateway**: 部署为 Service，应用通过 Service 名称访问
 - **Redis**: 使用 StatefulSet 或托管 Redis 服务
 ### Q4: 指标数据会丢失吗？
 A:
 - **Pushgateway**: 支持持久化，重启不丢失
 - **Redis**: 配置了 AOF 持久化，重启不丢失
 - **标准 Pull**: 实例销毁后丢失
 ### Q5: 如何选择方案？
 建议：
 - **Serverless/短生命周期** → Pushgateway
 - **超高并发/自定义逻辑** → Redis
 - **长生命周期/K8s** → 标准 Pull（需配置服务发现）
 ## 监控和告警
 ### Grafana 仪表板
 访问 http://localhost:3000（admin/admin）
 已预配置的面板：
 - HTTP 请求总数
 - HTTP 请求延迟（P50/P95/P99）
 - 算法执行次数
 - 算法执行延迟
 - 错误率
 ### 告警规则
 在 `monitoring/alerts/rules.yaml` 中配置：
 ```yaml
 groups:
  - name: functional_scaffold
    rules:
      - alert: HighErrorRate
        expr: rate(http_requests_total{status="error"}[5m]) > 0.05
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "高错误率告警"
          description: "错误率超过 5%"
 ```
 ## 参考资料
 - [Prometheus Pushgateway 文档](https://github.com/prometheus/pushgateway)
 - [Prometheus 最佳实践](https://prometheus.io/docs/practices/)
 - [Redis 官方文档](https://redis.io/documentation)
--- a/docs/metrics-improvement-summary.md
+++ b/docs/metrics-improvement-summary.md
@@ -0,0 +1,227 @@
 # Prometheus 指标记录问题修复总结
 ## 问题描述
 Prometheus 中没有正常记录应用的访问数据。虽然 `/metrics` 端点可以访问，并且定义了所有指标类型，但这些指标都没有任何数据值。
 ## 根本原因
 1. **HTTP 请求指标未记录**：`api/routes.py` 中的路由处理函数没有使用 `@track_request` 装饰器来记录 HTTP 请求指标
 2. **算法执行指标未记录**：`algorithms/base.py` 中的 `execute()` 方法没有调用 metrics 模块来记录算法执行指标
 ## 解决方案
 ### 1. 添加 HTTP 请求指标跟踪中间件
 **文件**：`src/functional_scaffold/main.py`
 **修改内容**：
 - 导入 metrics 相关的对象：`request_counter`, `request_latency`, `in_progress_requests`
 - 添加 `track_metrics` 中间件，自动跟踪所有 HTTP 请求
 **优点**：
 - 自动化：不需要在每个路由上手动添加装饰器
 - 统一：所有端点的指标记录逻辑一致
 - 易维护：新增端点自动获得指标跟踪能力
 **实现代码**：
 ```python
@app.middleware("http")
 async def track_metrics(request: Request, call_next):
    """记录所有HTTP请求的指标"""
    if not settings.metrics_enabled:
        return await call_next(request)
    # 跳过 /metrics 端点本身，避免循环记录
    if request.url.path == "/metrics":
        return await call_next(request)
    in_progress_requests.inc()
    start_time = time.time()
    status = "success"
    try:
        response = await call_next(request)
        if response.status_code >= 400:
            status = "error"
        return response
    except Exception as e:
        status = "error"
        raise e
    finally:
        elapsed = time.time() - start_time
        request_counter.labels(
            method=request.method,
            endpoint=request.url.path,
            status=status
        ).inc()
        request_latency.labels(
            method=request.method,
            endpoint=request.url.path
        ).observe(elapsed)
        in_progress_requests.dec()
 ```
 ### 2. 添加算法执行指标记录
 **文件**：`src/functional_scaffold/algorithms/base.py`
 **修改内容**：
 - 在 `execute()` 方法中导入 `algorithm_counter` 和 `algorithm_latency`
 - 在 `finally` 块中记录算法执行指标
 **实现代码**：
 ```python
 def execute(self, *args, **kwargs) -> Dict[str, Any]:
    from ..core.metrics import algorithm_counter, algorithm_latency
    start_time = time.time()
    status = "success"
    try:
        # ... 算法执行逻辑 ...
    except Exception as e:
        status = "error"
        # ... 错误处理 ...
    finally:
        elapsed_time = time.time() - start_time
        algorithm_counter.labels(algorithm=self.name, status=status).inc()
        algorithm_latency.labels(algorithm=self.name).observe(elapsed_time)
 ```
 ## 验证结果
 ### 1. 应用 /metrics 端点
 修复后，`/metrics` 端点正常返回指标数据：
 ```
 # HTTP 请求指标
 http_requests_total{endpoint="/healthz",method="GET",status="success"} 3.0
 http_requests_total{endpoint="/invoke",method="POST",status="success"} 2.0
 http_requests_total{endpoint="/readyz",method="GET",status="success"} 1.0
 # HTTP 请求延迟
 http_request_duration_seconds_sum{endpoint="/invoke",method="POST"} 0.0065615177154541016
 http_request_duration_seconds_count{endpoint="/invoke",method="POST"} 2.0
 # 算法执行指标
 algorithm_executions_total{algorithm="PrimeChecker",status="success"} 2.0
 algorithm_execution_duration_seconds_sum{algorithm="PrimeChecker"} 0.00023603439331054688
 algorithm_execution_duration_seconds_count{algorithm="PrimeChecker"} 2.0
 # 当前进行中的请求
 http_requests_in_progress 0.0
 ```
 ### 2. Prometheus 查询
 Prometheus 成功抓取并存储了指标数据：
 ```bash
 # 查询 HTTP 请求总数
 curl 'http://localhost:9090/api/v1/query?query=http_requests_total'
 # 查询算法执行总数
 curl 'http://localhost:9090/api/v1/query?query=algorithm_executions_total'
 ```
 ## 可用指标
 修复后，以下指标可以在 Prometheus 和 Grafana 中使用：
 ### HTTP 请求指标
 1. **http_requests_total** (Counter)
   - 标签：`method`, `endpoint`, `status`
   - 描述：HTTP 请求总数
   - 用途：统计各端点的请求量、成功率
 2. **http_request_duration_seconds** (Histogram)
   - 标签：`method`, `endpoint`
   - 描述：HTTP 请求延迟分布
   - 用途：分析请求响应时间、P50/P95/P99 延迟
 3. **http_requests_in_progress** (Gauge)
   - 描述：当前正在处理的请求数
   - 用途：监控并发请求数、负载情况
 ### 算法执行指标
 1. **algorithm_executions_total** (Counter)
   - 标签：`algorithm`, `status`
   - 描述：算法执行总数
   - 用途：统计算法调用量、成功率
 2. **algorithm_execution_duration_seconds** (Histogram)
   - 标签：`algorithm`
   - 描述：算法执行延迟分布
   - 用途：分析算法性能、优化瓶颈
 ## 使用示例
 ### Prometheus 查询示例
 ```promql
 # 每秒请求数 (QPS)
 rate(http_requests_total[5m])
 # 请求成功率
 sum(rate(http_requests_total{status="success"}[5m])) / sum(rate(http_requests_total[5m]))
 # P95 延迟
 histogram_quantile(0.95, rate(http_request_duration_seconds_bucket[5m]))
 # 算法执行失败率
 sum(rate(algorithm_executions_total{status="error"}[5m])) / sum(rate(algorithm_executions_total[5m]))
 ```
 ### 生成测试流量
 使用提供的脚本生成测试流量：
 ```bash
 # 启动流量生成器
 ./scripts/generate_traffic.sh
 # 在另一个终端查看实时指标
 watch -n 1 'curl -s http://localhost:8111/metrics | grep http_requests_total'
 ```
 ## Grafana 仪表板
 访问 Grafana 查看可视化指标：
 1. 打开浏览器访问：http://localhost:3000
 2. 登录（默认用户名/密码：admin/admin）
 3. 导入仪表板：`monitoring/grafana/dashboard.json`
 仪表板包含以下面板：
 - 请求速率（QPS）
 - 请求延迟（P50/P95/P99）
 - 错误率
 - 算法执行统计
 - 并发请求数
 ## 注意事项
 1. **中间件顺序**：指标跟踪中间件应该在日志中间件之后注册，确保所有请求都被记录
 2. **/metrics 端点**：中间件会跳过 `/metrics` 端点本身，避免循环记录
 3. **错误状态**：HTTP 状态码 >= 400 会被标记为 `status="error"`
 4. **性能影响**：指标记录的性能开销极小（微秒级），不会影响应用性能
 ## 后续优化建议
 1. **添加更多维度**：可以添加 `user_id`、`region` 等标签进行更细粒度的分析
 2. **自定义指标**：根据业务需求添加自定义指标（如缓存命中率、外部 API 调用次数等）
 3. **告警规则**：配置 Prometheus 告警规则，在指标异常时发送通知
 4. **长期存储**：考虑使用 Thanos 或 Cortex 进行长期指标存储和查询
 ## 相关文件
 - `src/functional_scaffold/main.py` - HTTP 请求指标跟踪中间件
 - `src/functional_scaffold/algorithms/base.py` - 算法执行指标记录
 - `src/functional_scaffold/core/metrics.py` - 指标定义
 - `monitoring/prometheus.yml` - Prometheus 配置
 - `monitoring/grafana/dashboard.json` - Grafana 仪表板
 - `scripts/generate_traffic.sh` - 流量生成脚本
--- a/docs/swagger/README.md
+++ b/docs/swagger/README.md
@@ -0,0 +1,107 @@
 # Swagger 文档
 本目录包含自动生成的 OpenAPI 规范文档。
 ## 生成文档
 运行以下命令生成或更新 OpenAPI 规范：
 ```bash
 python scripts/export_openapi.py
 ```
 这将生成 `openapi.json` 文件，包含完整的 API 规范。
 ## 查看文档
 ### 在线查看
 启动应用后，访问以下 URL：
 - **Swagger UI**: http://localhost:8000/docs
 - **ReDoc**: http://localhost:8000/redoc
 ### 离线查看
 使用 Swagger Editor 或其他 OpenAPI 工具打开 `openapi.json` 文件。
 ## API 规范
 ### 端点列表
 #### 算法接口
 - `POST /invoke` - 同步调用算法
  - 请求体: `{"number": integer}`
  - 响应: 算法执行结果
 - `POST /jobs` - 异步任务接口（预留）
  - 当前返回 501 Not Implemented
 #### 健康检查
 - `GET /healthz` - 存活检查
  - 响应: `{"status": "healthy", "timestamp": float}`
 - `GET /readyz` - 就绪检查
  - 响应: `{"status": "ready", "timestamp": float, "checks": {...}}`
 #### 监控
 - `GET /metrics` - Prometheus 指标
  - 响应: Prometheus 文本格式
 ### 数据模型
 #### InvokeRequest
 ```json
 {
  "number": 17
 }
 ```
 #### InvokeResponse
 ```json
 {
  "request_id": "uuid",
  "status": "success",
  "result": {
    "number": 17,
    "is_prime": true,
    "factors": [],
    "algorithm": "trial_division"
  },
  "metadata": {
    "algorithm": "PrimeChecker",
    "version": "1.0.0",
    "elapsed_time": 0.001
  }
 }
 ```
 #### ErrorResponse
 ```json
 {
  "error": "ERROR_CODE",
  "message": "Error description",
  "details": {},
  "request_id": "uuid"
 }
 ```
 ## 更新文档
 当修改 API 接口后，需要重新生成文档：
 1. 修改代码（路由、模型等）
 2. 运行 `python scripts/export_openapi.py`
 3. 提交更新后的 `openapi.json`
 ## 注意事项
 - `openapi.json` 是自动生成的，不要手动编辑
 - 所有 API 变更都应该在代码中完成，然后重新生成文档
 - 确保 Pydantic 模型包含完整的文档字符串和示例
--- a/docs/swagger/openapi.json
+++ b/docs/swagger/openapi.json
@@ -0,0 +1,404 @@
 {
  "openapi": "3.1.0",
  "info": {
    "title": "FunctionalScaffold",
    "description": "算法工程化 Serverless 脚手架 - 提供标准化的算法服务接口",
    "version": "1.0.0"
  },
  "paths": {
    "/invoke": {
      "post": {
        "tags": [
          "Algorithm"
        ],
        "summary": "同步调用算法",
        "description": "同步调用质数判断算法，立即返回结果",
        "operationId": "invoke_algorithm_invoke_post",
        "parameters": [
          {
            "name": "x-request-id",
            "in": "header",
            "required": false,
            "schema": {
              "anyOf": [
                {
                  "type": "string"
                },
                {
                  "type": "null"
                }
              ],
              "title": "X-Request-Id"
            }
          }
        ],
        "requestBody": {
          "required": true,
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/InvokeRequest"
              }
            }
          }
        },
        "responses": {
          "200": {
            "description": "成功",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/InvokeResponse"
                }
              }
            }
          },
          "400": {
            "description": "请求参数错误",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/ErrorResponse"
                }
              }
            }
          },
          "500": {
            "description": "服务器内部错误",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/ErrorResponse"
                }
              }
            }
          },
          "422": {
            "description": "Validation Error",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/HTTPValidationError"
                }
              }
            }
          }
        }
      }
    },
    "/healthz": {
      "get": {
        "tags": [
          "Algorithm"
        ],
        "summary": "健康检查",
        "description": "检查服务是否存活",
        "operationId": "health_check_healthz_get",
        "responses": {
          "200": {
            "description": "Successful Response",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/HealthResponse"
                }
              }
            }
          }
        }
      }
    },
    "/readyz": {
      "get": {
        "tags": [
          "Algorithm"
        ],
        "summary": "就绪检查",
        "description": "检查服务是否就绪",
        "operationId": "readiness_check_readyz_get",
        "responses": {
          "200": {
            "description": "Successful Response",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/ReadinessResponse"
                }
              }
            }
          }
        }
      }
    },
    "/jobs": {
      "post": {
        "tags": [
          "Algorithm"
        ],
        "summary": "异步任务接口（预留）",
        "description": "异步任务接口，当前版本未实现",
        "operationId": "create_job_jobs_post",
        "responses": {
          "501": {
            "description": "Successful Response",
            "content": {
              "application/json": {
                "schema": {}
              }
            }
          }
        }
      }
    },
    "/metrics": {
      "get": {
        "tags": [
          "Monitoring"
        ],
        "summary": "Prometheus 指标",
        "description": "导出 Prometheus 格式的监控指标",
        "operationId": "metrics_metrics_get",
        "responses": {
          "200": {
            "description": "Successful Response",
            "content": {
              "application/json": {
                "schema": {}
              }
            }
          }
        }
      }
    }
  },
  "components": {
    "schemas": {
      "ErrorResponse": {
        "properties": {
          "error": {
            "type": "string",
            "title": "Error",
            "description": "错误代码"
          },
          "message": {
            "type": "string",
            "title": "Message",
            "description": "错误消息"
          },
          "details": {
            "anyOf": [
              {
                "additionalProperties": true,
                "type": "object"
              },
              {
                "type": "null"
              }
            ],
            "title": "Details",
            "description": "错误详情"
          },
          "request_id": {
            "anyOf": [
              {
                "type": "string"
              },
              {
                "type": "null"
              }
            ],
            "title": "Request Id",
            "description": "请求ID"
          }
        },
        "type": "object",
        "required": [
          "error",
          "message"
        ],
        "title": "ErrorResponse",
        "description": "错误响应",
        "example": {
          "details": {
            "field": "number",
            "value": "abc"
          },
          "error": "VALIDATION_ERROR",
          "message": "number must be an integer",
          "request_id": "550e8400-e29b-41d4-a716-446655440000"
        }
      },
      "HTTPValidationError": {
        "properties": {
          "detail": {
            "items": {
              "$ref": "#/components/schemas/ValidationError"
            },
            "type": "array",
            "title": "Detail"
          }
        },
        "type": "object",
        "title": "HTTPValidationError"
      },
      "HealthResponse": {
        "properties": {
          "status": {
            "type": "string",
            "title": "Status",
            "description": "健康状态"
          },
          "timestamp": {
            "type": "number",
            "title": "Timestamp",
            "description": "时间戳"
          }
        },
        "type": "object",
        "required": [
          "status",
          "timestamp"
        ],
        "title": "HealthResponse",
        "description": "健康检查响应"
      },
      "InvokeRequest": {
        "properties": {
          "number": {
            "type": "integer",
            "title": "Number",
            "description": "待判断的整数"
          }
        },
        "type": "object",
        "required": [
          "number"
        ],
        "title": "InvokeRequest",
        "description": "同步调用请求",
        "example": {
          "number": 17
        }
      },
      "InvokeResponse": {
        "properties": {
          "request_id": {
            "type": "string",
            "title": "Request Id",
            "description": "请求唯一标识"
          },
          "status": {
            "type": "string",
            "title": "Status",
            "description": "处理状态"
          },
          "result": {
            "additionalProperties": true,
            "type": "object",
            "title": "Result",
            "description": "算法执行结果"
          },
          "metadata": {
            "additionalProperties": true,
            "type": "object",
            "title": "Metadata",
            "description": "元数据信息"
          }
        },
        "type": "object",
        "required": [
          "request_id",
          "status",
          "result",
          "metadata"
        ],
        "title": "InvokeResponse",
        "description": "同步调用响应",
        "example": {
          "metadata": {
            "algorithm": "PrimeChecker",
            "elapsed_time": 0.001,
            "version": "1.0.0"
          },
          "request_id": "550e8400-e29b-41d4-a716-446655440000",
          "result": {
            "algorithm": "trial_division",
            "factors": [],
            "is_prime": true,
            "number": 17
          },
          "status": "success"
        }
      },
      "ReadinessResponse": {
        "properties": {
          "status": {
            "type": "string",
            "title": "Status",
            "description": "就绪状态"
          },
          "timestamp": {
            "type": "number",
            "title": "Timestamp",
            "description": "时间戳"
          },
          "checks": {
            "anyOf": [
              {
                "additionalProperties": {
                  "type": "boolean"
                },
                "type": "object"
              },
              {
                "type": "null"
              }
            ],
            "title": "Checks",
            "description": "各项检查结果"
          }
        },
        "type": "object",
        "required": [
          "status",
          "timestamp"
        ],
        "title": "ReadinessResponse",
        "description": "就绪检查响应"
      },
      "ValidationError": {
        "properties": {
          "loc": {
            "items": {
              "anyOf": [
                {
                  "type": "string"
                },
                {
                  "type": "integer"
                }
              ]
            },
            "type": "array",
            "title": "Location"
          },
          "msg": {
            "type": "string",
            "title": "Message"
          },
          "type": {
            "type": "string",
            "title": "Error Type"
          }
        },
        "type": "object",
        "required": [
          "loc",
          "msg",
          "type"
        ],
        "title": "ValidationError"
      }
    }
  }
 }
--- a/main.py
+++ b/main.py
@@ -0,0 +1,16 @@
 # 这是一个示例 Python 脚本。
 # 按 ⌃R 执行或将其替换为您的代码。
 # 按 双击 ⇧ 在所有地方搜索类、文件、工具窗口、操作和设置。
 def print_hi(name):
    # 在下面的代码行中使用断点来调试脚本。
    print(f'Hi, {name}')  # 按 ⌘F8 切换断点。
 # 按装订区域中的绿色按钮以运行脚本。
 if __name__ == '__main__':
    print_hi('PyCharm')
 # 访问 https://www.jetbrains.com/help/pycharm/ 获取 PyCharm 帮助
--- a/monitoring/alerts/rules.yaml
+++ b/monitoring/alerts/rules.yaml
@@ -0,0 +1,39 @@
 groups:
  - name: functional_scaffold_alerts
    interval: 30s
    rules:
      - alert: HighErrorRate
        expr: rate(http_requests_total{status="error"}[5m]) > 0.05
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "High error rate detected"
          description: "Error rate is {{ $value }} requests/sec for {{ $labels.endpoint }}"
      - alert: HighLatency
        expr: histogram_quantile(0.95, rate(http_request_duration_seconds_bucket[5m])) > 1
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "High latency detected"
          description: "P95 latency is {{ $value }}s for {{ $labels.endpoint }}"
      - alert: ServiceDown
        expr: up{job="functional-scaffold"} == 0
        for: 1m
        labels:
          severity: critical
        annotations:
          summary: "Service is down"
          description: "FunctionalScaffold service has been down for more than 1 minute"
      - alert: HighMemoryUsage
        expr: container_memory_usage_bytes{container="functional-scaffold"} / container_spec_memory_limit_bytes{container="functional-scaffold"} > 0.9
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "High memory usage"
          description: "Memory usage is {{ $value | humanizePercentage }} of limit"
--- a/monitoring/grafana/dashboard.json
+++ b/monitoring/grafana/dashboard.json
@@ -0,0 +1,808 @@
 {
  "annotations": {
    "list": [
      {
        "builtIn": 1,
        "datasource": {
          "type": "datasource",
          "uid": "grafana"
        },
        "enable": true,
        "hide": true,
        "iconColor": "rgba(0, 211, 255, 1)",
        "name": "Annotations & Alerts",
        "type": "dashboard"
      }
    ]
  },
  "editable": true,
  "fiscalYearStartMonth": 0,
  "graphTooltip": 1,
  "id": null,
  "links": [],
  "liveNow": true,
  "panels": [
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "palette-classic"
          },
          "custom": {
            "axisCenteredZero": false,
            "axisColorMode": "text",
            "axisLabel": "请求/秒",
            "axisPlacement": "auto",
            "barAlignment": 0,
            "drawStyle": "line",
            "fillOpacity": 20,
            "gradientMode": "opacity",
            "hideFrom": {
              "tooltip": false,
              "viz": false,
              "legend": false
            },
            "lineInterpolation": "smooth",
            "lineWidth": 2,
            "pointSize": 5,
            "scaleDistribution": {
              "type": "linear"
            },
            "showPoints": "never",
            "spanNulls": true,
            "stacking": {
              "group": "A",
              "mode": "none"
            },
            "thresholdsStyle": {
              "mode": "off"
            }
          },
          "mappings": [],
          "thresholds": {
            "mode": "absolute",
            "steps": [
              {
                "color": "green",
                "value": null
              }
            ]
          },
          "unit": "reqps"
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 12,
        "x": 0,
        "y": 0
      },
      "id": 1,
      "options": {
        "legend": {
          "calcs": ["mean", "lastNotNull"],
          "displayMode": "table",
          "placement": "bottom",
          "showLegend": true
        },
        "tooltip": {
          "mode": "multi",
          "sort": "desc"
        }
      },
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "sum(rate(http_requests_total[1m])) by (endpoint, method)",
          "legendFormat": "{{method}} {{endpoint}}",
          "refId": "A"
        }
      ],
      "title": "HTTP 请求速率 (QPS)",
      "type": "timeseries"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "palette-classic"
          },
          "custom": {
            "axisCenteredZero": false,
            "axisColorMode": "text",
            "axisLabel": "延迟",
            "axisPlacement": "auto",
            "barAlignment": 0,
            "drawStyle": "line",
            "fillOpacity": 10,
            "gradientMode": "none",
            "hideFrom": {
              "tooltip": false,
              "viz": false,
              "legend": false
            },
            "lineInterpolation": "smooth",
            "lineWidth": 2,
            "pointSize": 5,
            "scaleDistribution": {
              "type": "linear"
            },
            "showPoints": "never",
            "spanNulls": true,
            "stacking": {
              "group": "A",
              "mode": "none"
            },
            "thresholdsStyle": {
              "mode": "off"
            }
          },
          "mappings": [],
          "thresholds": {
            "mode": "absolute",
            "steps": [
              {
                "color": "green",
                "value": null
              },
              {
                "color": "yellow",
                "value": 0.1
              },
              {
                "color": "red",
                "value": 0.5
              }
            ]
          },
          "unit": "s"
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 12,
        "x": 12,
        "y": 0
      },
      "id": 2,
      "options": {
        "legend": {
          "calcs": ["mean", "max"],
          "displayMode": "table",
          "placement": "bottom",
          "showLegend": true
        },
        "tooltip": {
          "mode": "multi",
          "sort": "desc"
        }
      },
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "histogram_quantile(0.50, sum(rate(http_request_duration_seconds_bucket[1m])) by (le, endpoint, method))",
          "legendFormat": "P50 - {{method}} {{endpoint}}",
          "refId": "A"
        },
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "histogram_quantile(0.95, sum(rate(http_request_duration_seconds_bucket[1m])) by (le, endpoint, method))",
          "legendFormat": "P95 - {{method}} {{endpoint}}",
          "refId": "B"
        },
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "histogram_quantile(0.99, sum(rate(http_request_duration_seconds_bucket[1m])) by (le, endpoint, method))",
          "legendFormat": "P99 - {{method}} {{endpoint}}",
          "refId": "C"
        }
      ],
      "title": "HTTP 请求延迟 (P50/P95/P99)",
      "type": "timeseries"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "thresholds"
          },
          "mappings": [],
          "max": 1,
          "min": 0,
          "thresholds": {
            "mode": "absolute",
            "steps": [
              {
                "color": "red",
                "value": null
              },
              {
                "color": "yellow",
                "value": 0.95
              },
              {
                "color": "green",
                "value": 0.99
              }
            ]
          },
          "unit": "percentunit"
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 6,
        "x": 0,
        "y": 8
      },
      "id": 3,
      "options": {
        "orientation": "auto",
        "reduceOptions": {
          "values": false,
          "calcs": ["lastNotNull"],
          "fields": ""
        },
        "showThresholdLabels": false,
        "showThresholdMarkers": true
      },
      "pluginVersion": "9.0.0",
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "sum(rate(http_requests_total{status=\"success\"}[5m])) / sum(rate(http_requests_total[5m]))",
          "refId": "A"
        }
      ],
      "title": "请求成功率",
      "type": "gauge"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "thresholds"
          },
          "mappings": [],
          "thresholds": {
            "mode": "absolute",
            "steps": [
              {
                "color": "green",
                "value": null
              },
              {
                "color": "yellow",
                "value": 5
              },
              {
                "color": "red",
                "value": 10
              }
            ]
          },
          "unit": "short"
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 6,
        "x": 6,
        "y": 8
      },
      "id": 4,
      "options": {
        "orientation": "auto",
        "reduceOptions": {
          "values": false,
          "calcs": ["lastNotNull"],
          "fields": ""
        },
        "showThresholdLabels": false,
        "showThresholdMarkers": true
      },
      "pluginVersion": "9.0.0",
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "http_requests_in_progress",
          "refId": "A"
        }
      ],
      "title": "当前并发请求数",
      "type": "gauge"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "thresholds"
          },
          "mappings": [],
          "thresholds": {
            "mode": "absolute",
            "steps": [
              {
                "color": "green",
                "value": null
              }
            ]
          },
          "unit": "short"
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 6,
        "x": 12,
        "y": 8
      },
      "id": 5,
      "options": {
        "colorMode": "value",
        "graphMode": "area",
        "justifyMode": "auto",
        "orientation": "auto",
        "reduceOptions": {
          "values": false,
          "calcs": ["lastNotNull"],
          "fields": ""
        },
        "textMode": "auto"
      },
      "pluginVersion": "9.0.0",
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "sum(http_requests_total)",
          "refId": "A"
        }
      ],
      "title": "HTTP 请求总数",
      "type": "stat"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "thresholds"
          },
          "mappings": [],
          "thresholds": {
            "mode": "absolute",
            "steps": [
              {
                "color": "green",
                "value": null
              }
            ]
          },
          "unit": "short"
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 6,
        "x": 18,
        "y": 8
      },
      "id": 6,
      "options": {
        "colorMode": "value",
        "graphMode": "area",
        "justifyMode": "auto",
        "orientation": "auto",
        "reduceOptions": {
          "values": false,
          "calcs": ["lastNotNull"],
          "fields": ""
        },
        "textMode": "auto"
      },
      "pluginVersion": "9.0.0",
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "sum(algorithm_executions_total)",
          "refId": "A"
        }
      ],
      "title": "算法执行总数",
      "type": "stat"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "palette-classic"
          },
          "custom": {
            "axisCenteredZero": false,
            "axisColorMode": "text",
            "axisLabel": "执行/秒",
            "axisPlacement": "auto",
            "barAlignment": 0,
            "drawStyle": "line",
            "fillOpacity": 20,
            "gradientMode": "opacity",
            "hideFrom": {
              "tooltip": false,
              "viz": false,
              "legend": false
            },
            "lineInterpolation": "smooth",
            "lineWidth": 2,
            "pointSize": 5,
            "scaleDistribution": {
              "type": "linear"
            },
            "showPoints": "never",
            "spanNulls": true,
            "stacking": {
              "group": "A",
              "mode": "none"
            },
            "thresholdsStyle": {
              "mode": "off"
            }
          },
          "mappings": [],
          "thresholds": {
            "mode": "absolute",
            "steps": [
              {
                "color": "green",
                "value": null
              }
            ]
          },
          "unit": "ops"
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 12,
        "x": 0,
        "y": 16
      },
      "id": 7,
      "options": {
        "legend": {
          "calcs": ["mean", "lastNotNull"],
          "displayMode": "table",
          "placement": "bottom",
          "showLegend": true
        },
        "tooltip": {
          "mode": "multi",
          "sort": "desc"
        }
      },
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "sum(rate(algorithm_executions_total[1m])) by (algorithm, status)",
          "legendFormat": "{{algorithm}} - {{status}}",
          "refId": "A"
        }
      ],
      "title": "算法执行速率",
      "type": "timeseries"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "palette-classic"
          },
          "custom": {
            "axisCenteredZero": false,
            "axisColorMode": "text",
            "axisLabel": "延迟",
            "axisPlacement": "auto",
            "barAlignment": 0,
            "drawStyle": "line",
            "fillOpacity": 10,
            "gradientMode": "none",
            "hideFrom": {
              "tooltip": false,
              "viz": false,
              "legend": false
            },
            "lineInterpolation": "smooth",
            "lineWidth": 2,
            "pointSize": 5,
            "scaleDistribution": {
              "type": "linear"
            },
            "showPoints": "never",
            "spanNulls": true,
            "stacking": {
              "group": "A",
              "mode": "none"
            },
            "thresholdsStyle": {
              "mode": "off"
            }
          },
          "mappings": [],
          "thresholds": {
            "mode": "absolute",
            "steps": [
              {
                "color": "green",
                "value": null
              }
            ]
          },
          "unit": "s"
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 12,
        "x": 12,
        "y": 16
      },
      "id": 8,
      "options": {
        "legend": {
          "calcs": ["mean", "max"],
          "displayMode": "table",
          "placement": "bottom",
          "showLegend": true
        },
        "tooltip": {
          "mode": "multi",
          "sort": "desc"
        }
      },
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "histogram_quantile(0.50, sum(rate(algorithm_execution_duration_seconds_bucket[1m])) by (le, algorithm))",
          "legendFormat": "P50 - {{algorithm}}",
          "refId": "A"
        },
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "histogram_quantile(0.95, sum(rate(algorithm_execution_duration_seconds_bucket[1m])) by (le, algorithm))",
          "legendFormat": "P95 - {{algorithm}}",
          "refId": "B"
        },
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "histogram_quantile(0.99, sum(rate(algorithm_execution_duration_seconds_bucket[1m])) by (le, algorithm))",
          "legendFormat": "P99 - {{algorithm}}",
          "refId": "C"
        }
      ],
      "title": "算法执行延迟 (P50/P95/P99)",
      "type": "timeseries"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "palette-classic"
          },
          "custom": {
            "hideFrom": {
              "tooltip": false,
              "viz": false,
              "legend": false
            }
          },
          "mappings": []
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 12,
        "x": 0,
        "y": 24
      },
      "id": 9,
      "options": {
        "legend": {
          "displayMode": "table",
          "placement": "right",
          "showLegend": true,
          "values": ["value"]
        },
        "pieType": "pie",
        "tooltip": {
          "mode": "single",
          "sort": "none"
        }
      },
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "sum(http_requests_total) by (endpoint)",
          "legendFormat": "{{endpoint}}",
          "refId": "A"
        }
      ],
      "title": "请求分布（按端点）",
      "type": "piechart"
    },
    {
      "datasource": {
        "type": "prometheus",
        "uid": "${DS_PROMETHEUS}"
      },
      "fieldConfig": {
        "defaults": {
          "color": {
            "mode": "palette-classic"
          },
          "custom": {
            "hideFrom": {
              "tooltip": false,
              "viz": false,
              "legend": false
            }
          },
          "mappings": []
        },
        "overrides": []
      },
      "gridPos": {
        "h": 8,
        "w": 12,
        "x": 12,
        "y": 24
      },
      "id": 10,
      "options": {
        "legend": {
          "displayMode": "table",
          "placement": "right",
          "showLegend": true,
          "values": ["value"]
        },
        "pieType": "pie",
        "tooltip": {
          "mode": "single",
          "sort": "none"
        }
      },
      "targets": [
        {
          "datasource": {
            "type": "prometheus",
            "uid": "${DS_PROMETHEUS}"
          },
          "expr": "sum(http_requests_total) by (status)",
          "legendFormat": "{{status}}",
          "refId": "A"
        }
      ],
      "title": "请求状态分布",
      "type": "piechart"
    }
  ],
  "refresh": "5s",
  "schemaVersion": 38,
  "style": "dark",
  "tags": ["functional-scaffold", "monitoring"],
  "templating": {
    "list": [
      {
        "current": {
          "selected": false,
          "text": "Prometheus",
          "value": "Prometheus"
        },
        "hide": 0,
        "includeAll": false,
        "label": "数据源",
        "multi": false,
        "name": "DS_PROMETHEUS",
        "options": [],
        "query": "prometheus",
        "refresh": 1,
        "regex": "",
        "skipUrlSync": false,
        "type": "datasource"
      }
    ]
  },
  "time": {
    "from": "now-1h",
    "to": "now"
  },
  "timepicker": {
    "refresh_intervals": ["5s", "10s", "30s", "1m", "5m"]
  },
  "timezone": "browser",
  "title": "FunctionalScaffold 监控仪表板",
  "uid": "functional-scaffold",
  "version": 1,
  "weekStart": ""
 }
--- a/monitoring/prometheus.yml
+++ b/monitoring/prometheus.yml
@@ -0,0 +1,46 @@
 # Prometheus 配置文件
 global:
  scrape_interval: 15s
  evaluation_interval: 15s
  external_labels:
    cluster: 'functional-scaffold'
    environment: 'development'
 # 抓取配置
 scrape_configs:
  # 方案1：从 Pushgateway 抓取指标（推荐）
  - job_name: 'pushgateway'
    honor_labels: true
    static_configs:
      - targets: ['pushgateway:9091']
    metric_relabel_configs:
      # 保留 instance 标签
      - source_labels: [instance]
        target_label: instance
        action: replace
  # 方案2：从 Redis Exporter 抓取指标
  - job_name: 'redis-exporter'
    static_configs:
      - targets: ['redis-exporter:8001']
  # 直接从应用实例抓取（如果有多个实例，需要配置服务发现）
  - job_name: 'app'
    static_configs:
      - targets: ['app:8000']
    metrics_path: '/metrics'
  # Prometheus 自身监控
  - job_name: 'prometheus'
    static_configs:
      - targets: ['localhost:9090']
 # 告警规则文件
 rule_files:
  - '/etc/prometheus/rules/*.yml'
 # Alertmanager 配置（可选）
 # alerting:
 #   alertmanagers:
 #     - static_configs:
 #         - targets: ['alertmanager:9093']
--- a/pyproject.toml
+++ b/pyproject.toml
@@ -0,0 +1,50 @@
 [build-system]
 requires = ["setuptools>=65.0", "wheel"]
 build-backend = "setuptools.build_meta"
 [project]
 name = "functional-scaffold"
 version = "1.0.0"
 description = "算法工程化 Serverless 脚手架"
 requires-python = ">=3.9"
 authors = [
    {name = "FunctionalScaffold Team"}
 ]
 readme = "README.md"
 dependencies = [
    "fastapi>=0.109.0",
    "uvicorn[standard]>=0.27.0",
    "pydantic>=2.5.0",
    "pydantic-settings>=2.0.0",
    "prometheus-client>=0.19.0",
    "python-json-logger>=2.0.7",
 ]
 [project.optional-dependencies]
 dev = [
    "pytest>=7.4.0",
    "pytest-asyncio>=0.21.0",
    "pytest-cov>=4.1.0",
    "httpx>=0.26.0",
    "black>=23.12.0",
    "ruff>=0.1.0",
 ]
 [tool.setuptools.packages.find]
 where = ["src"]
 [tool.black]
 line-length = 100
 target-version = ['py39']
 [tool.ruff]
 line-length = 100
 target-version = "py39"
 [tool.pytest.ini_options]
 testpaths = ["tests"]
 python_files = ["test_*.py"]
 python_classes = ["Test*"]
 python_functions = ["test_*"]
 addopts = "-v --strict-markers"
--- a/requirements-dev.txt
+++ b/requirements-dev.txt
@@ -0,0 +1,6 @@
 pytest>=7.4.0
 pytest-asyncio>=0.21.0
 pytest-cov>=4.1.0
 httpx>=0.26.0
 black>=23.12.0
 ruff>=0.1.0
--- a/requirements.txt
+++ b/requirements.txt
@@ -0,0 +1,10 @@
 fastapi>=0.109.0
 uvicorn[standard]>=0.27.0
 pydantic>=2.5.0
 pydantic-settings>=2.0.0
 prometheus-client>=0.19.0
 python-json-logger>=2.0.7
 # 指标存储方案（可选，根据选择的方案安装）
 # 方案2：Redis 方案需要
 redis>=5.0.0
--- a/scripts/export_openapi.py
+++ b/scripts/export_openapi.py
@@ -0,0 +1,35 @@
 #!/usr/bin/env python3
 """导出 OpenAPI 规范到 JSON 文件"""
 import json
 import sys
 from pathlib import Path
 # 添加 src 到路径
 sys.path.insert(0, str(Path(__file__).parent.parent / "src"))
 from functional_scaffold.main import app
 def export_openapi():
    """导出 OpenAPI 规范"""
    openapi_schema = app.openapi()
    # 确保输出目录存在
    output_dir = Path(__file__).parent.parent / "docs" / "swagger"
    output_dir.mkdir(parents=True, exist_ok=True)
    # 写入文件
    output_file = output_dir / "openapi.json"
    with open(output_file, "w", encoding="utf-8") as f:
        json.dump(openapi_schema, f, indent=2, ensure_ascii=False)
    print(f"OpenAPI schema exported to: {output_file}")
    print(f"Schema version: {openapi_schema.get('openapi')}")
    print(f"API title: {openapi_schema.get('info', {}).get('title')}")
    print(f"API version: {openapi_schema.get('info', {}).get('version')}")
    print(f"Endpoints: {len(openapi_schema.get('paths', {}))}")
 if __name__ == "__main__":
    export_openapi()
--- a/scripts/generate_traffic.sh
+++ b/scripts/generate_traffic.sh
@@ -0,0 +1,22 @@
 #!/bin/bash
 # 生成测试流量脚本
 echo "开始生成测试流量..."
 echo "按 Ctrl+C 停止"
 count=0
 while true; do
    # 随机生成一个数字
    number=$((RANDOM % 1000 + 1))
    # 发送请求
    curl -s -X POST http://localhost:8111/invoke \
        -H "Content-Type: application/json" \
        -d "{\"number\": $number}" > /dev/null
    count=$((count + 1))
    echo "[$count] 已发送请求: number=$number"
    # 随机延迟 0.5-2 秒
    sleep $(awk -v min=0.5 -v max=2 'BEGIN{srand(); print min+rand()*(max-min)}')
 done
--- a/scripts/run_dev.sh
+++ b/scripts/run_dev.sh
@@ -0,0 +1,24 @@
 #!/bin/bash
 # 开发环境启动脚本
 set -e
 echo "Starting FunctionalScaffold in development mode..."
 # 检查虚拟环境
 if [ ! -d "venv" ]; then
    echo "Creating virtual environment..."
    python3 -m venv venv
 fi
 # 激活虚拟环境
 source venv/bin/activate
 # 安装依赖
 echo "Installing dependencies..."
 pip install -e ".[dev]"
 # 启动服务
 echo "Starting server on http://localhost:8000"
 echo "API docs available at http://localhost:8000/docs"
 uvicorn src.functional_scaffold.main:app --reload --host 0.0.0.0 --port 8000
--- a/scripts/run_tests.sh
+++ b/scripts/run_tests.sh
@@ -0,0 +1,28 @@
 #!/bin/bash
 # 测试运行脚本
 set -e
 echo "Running tests for FunctionalScaffold..."
 # 激活虚拟环境（如果存在）
 if [ -d "venv" ]; then
    source venv/bin/activate
 fi
 # 运行代码检查
 echo "Running code quality checks..."
 echo "- Checking with ruff..."
 ruff check src/ tests/ || true
 echo "- Checking formatting with black..."
 black --check src/ tests/ || true
 # 运行测试
 echo ""
 echo "Running tests..."
 pytest tests/ -v --cov=src/functional_scaffold --cov-report=term --cov-report=html
 echo ""
 echo "Tests completed!"
 echo "Coverage report available at: htmlcov/index.html"
--- a/scripts/start_metrics.sh
+++ b/scripts/start_metrics.sh
@@ -0,0 +1,114 @@
 #!/bin/bash
 # 指标方案快速启动脚本
 set -e
 # 颜色定义
 RED='\033[0;31m'
 GREEN='\033[0;32m'
 YELLOW='\033[1;33m'
 NC='\033[0m' # No Color
 echo "=========================================="
 echo "FunctionalScaffold 指标方案启动脚本"
 echo "=========================================="
 # 检查 docker-compose
 if ! command -v docker-compose &> /dev/null; then
    echo -e "${RED}错误: docker-compose 未安装${NC}"
    exit 1
 fi
 # 选择方案
 echo ""
 echo "请选择指标方案："
 echo "1. Pushgateway（推荐，适合 Serverless）"
 echo "2. Redis + Exporter（适合高并发）"
 echo "3. 两者都启动（用于对比测试）"
 echo ""
 read -p "输入选项 (1/2/3): " choice
 cd "$(dirname "$0")/../deployment"
 case $choice in
    1)
        echo -e "${GREEN}启动 Pushgateway 方案...${NC}"
        docker-compose up -d redis pushgateway prometheus grafana
        echo ""
        echo -e "${GREEN}✓ Pushgateway 方案已启动${NC}"
        echo ""
        echo "服务地址："
        echo "  - Pushgateway: http://localhost:9091"
        echo "  - Prometheus:  http://localhost:9090"
        echo "  - Grafana:     http://localhost:3000 (admin/admin)"
        echo ""
        echo "下一步："
        echo "  1. 修改代码导入: from functional_scaffold.core.metrics_pushgateway import ..."
        echo "  2. 配置环境变量: PUSHGATEWAY_URL=localhost:9091"
        echo "  3. 启动应用: ./scripts/run_dev.sh"
        echo "  4. 运行测试: python scripts/test_metrics.py pushgateway"
        ;;
    2)
        echo -e "${GREEN}启动 Redis 方案...${NC}"
        # 检查 redis 依赖
        if ! python -c "import redis" 2>/dev/null; then
            echo -e "${YELLOW}警告: redis 库未安装${NC}"
            echo "正在安装 redis..."
            pip install redis
        fi
        docker-compose up -d redis redis-exporter prometheus grafana
        echo ""
        echo -e "${GREEN}✓ Redis 方案已启动${NC}"
        echo ""
        echo "服务地址："
        echo "  - Redis:        localhost:6379"
        echo "  - Redis Exporter: http://localhost:8001/metrics"
        echo "  - Prometheus:   http://localhost:9090"
        echo "  - Grafana:      http://localhost:3000 (admin/admin)"
        echo ""
        echo "下一步："
        echo "  1. 修改代码导入: from functional_scaffold.core.metrics_redis import ..."
        echo "  2. 配置环境变量: REDIS_HOST=localhost REDIS_PORT=6379"
        echo "  3. 启动应用: ./scripts/run_dev.sh"
        echo "  4. 运行测试: python scripts/test_metrics.py redis"
        ;;
    3)
        echo -e "${GREEN}启动所有服务...${NC}"
        # 检查 redis 依赖
        if ! python -c "import redis" 2>/dev/null; then
            echo -e "${YELLOW}警告: redis 库未安装${NC}"
            echo "正在安装 redis..."
            pip install redis
        fi
        docker-compose up -d
        echo ""
        echo -e "${GREEN}✓ 所有服务已启动${NC}"
        echo ""
        echo "服务地址："
        echo "  - 应用:         http://localhost:8000"
        echo "  - Pushgateway:  http://localhost:9091"
        echo "  - Redis:        localhost:6379"
        echo "  - Redis Exporter: http://localhost:8001/metrics"
        echo "  - Prometheus:   http://localhost:9090"
        echo "  - Grafana:      http://localhost:3000 (admin/admin)"
        echo ""
        echo "下一步："
        echo "  1. 查看文档: cat docs/metrics-guide.md"
        echo "  2. 运行测试: python scripts/test_metrics.py"
        ;;
    *)
        echo -e "${RED}无效的选项${NC}"
        exit 1
        ;;
 esac
 echo ""
 echo "=========================================="
 echo "查看日志: docker-compose logs -f"
 echo "停止服务: docker-compose down"
 echo "查看文档: cat ../docs/metrics-guide.md"
 echo "=========================================="
--- a/scripts/test_metrics.py
+++ b/scripts/test_metrics.py
@@ -0,0 +1,262 @@
 #!/usr/bin/env python3
 """指标方案测试脚本"""
 import requests
 import time
 import sys
 from typing import Literal
 MetricsBackend = Literal["pushgateway", "redis", "memory"]
 def test_pushgateway():
    """测试 Pushgateway 方案"""
    print("\n=== 测试 Pushgateway 方案 ===\n")
    # 1. 检查 Pushgateway 是否运行
    try:
        response = requests.get("http://localhost:9091/metrics", timeout=2)
        print(f"✓ Pushgateway 运行正常 (状态码: {response.status_code})")
    except Exception as e:
        print(f"✗ Pushgateway 未运行: {e}")
        return False
    # 2. 发送测试请求到应用
    print("\n发送测试请求...")
    for i in range(5):
        try:
            response = requests.post(
                "http://localhost:8000/invoke",
                json={"number": 17},
                timeout=5,
            )
            print(f"  请求 {i+1}: {response.status_code}")
            time.sleep(0.5)
        except Exception as e:
            print(f"  请求 {i+1} 失败: {e}")
    # 3. 等待指标推送
    print("\n等待指标推送...")
    time.sleep(2)
    # 4. 检查 Pushgateway 中的指标
    try:
        response = requests.get("http://localhost:9091/metrics", timeout=2)
        metrics = response.text
        # 查找关键指标
        if "http_requests_total" in metrics:
            print("✓ 找到 http_requests_total 指标")
            # 提取指标值
            for line in metrics.split("\n"):
                if "http_requests_total" in line and not line.startswith("#"):
                    print(f"  {line}")
        else:
            print("✗ 未找到 http_requests_total 指标")
        if "algorithm_executions_total" in metrics:
            print("✓ 找到 algorithm_executions_total 指标")
            for line in metrics.split("\n"):
                if "algorithm_executions_total" in line and not line.startswith("#"):
                    print(f"  {line}")
        else:
            print("✗ 未找到 algorithm_executions_total 指标")
    except Exception as e:
        print(f"✗ 获取指标失败: {e}")
        return False
    # 5. 检查 Prometheus 是否能抓取
    print("\n检查 Prometheus...")
    try:
        response = requests.get(
            "http://localhost:9090/api/v1/query",
            params={"query": "http_requests_total"},
            timeout=5,
        )
        data = response.json()
        if data["status"] == "success" and data["data"]["result"]:
            print(f"✓ Prometheus 成功抓取指标，找到 {len(data['data']['result'])} 条记录")
            for result in data["data"]["result"][:3]:
                print(f"  {result['metric']} = {result['value'][1]}")
        else:
            print("✗ Prometheus 未找到指标")
    except Exception as e:
        print(f"✗ Prometheus 查询失败: {e}")
    return True
 def test_redis():
    """测试 Redis 方案"""
    print("\n=== 测试 Redis 方案 ===\n")
    # 1. 检查 Redis 是否运行
    try:
        import redis
        client = redis.Redis(host="localhost", port=6379, db=0, decode_responses=True)
        client.ping()
        print("✓ Redis 运行正常")
    except ImportError:
        print("✗ Redis 库未安装，请运行: pip install redis")
        return False
    except Exception as e:
        print(f"✗ Redis 未运行: {e}")
        return False
    # 2. 清空测试数据
    print("\n清空旧数据...")
    try:
        keys = client.keys("metrics:*")
        if keys:
            client.delete(*keys)
            print(f"  删除了 {len(keys)} 个键")
    except Exception as e:
        print(f"  清空失败: {e}")
    # 3. 发送测试请求
    print("\n发送测试请求...")
    for i in range(5):
        try:
            response = requests.post(
                "http://localhost:8000/invoke",
                json={"number": 17},
                timeout=5,
            )
            print(f"  请求 {i+1}: {response.status_code}")
            time.sleep(0.5)
        except Exception as e:
            print(f"  请求 {i+1} 失败: {e}")
    # 4. 检查 Redis 中的指标
    print("\n检查 Redis 指标...")
    try:
        # 检查计数器
        counter_data = client.hgetall("metrics:request_counter")
        if counter_data:
            print(f"✓ 找到 {len(counter_data)} 个请求计数器指标")
            for key, value in list(counter_data.items())[:5]:
                if not key.endswith(":timestamp"):
                    print(f"  {key} = {value}")
        else:
            print("✗ 未找到请求计数器指标")
        # 检查算法计数器
        algo_data = client.hgetall("metrics:algorithm_counter")
        if algo_data:
            print(f"✓ 找到 {len(algo_data)} 个算法计数器指标")
            for key, value in list(algo_data.items())[:5]:
                if not key.endswith(":timestamp"):
                    print(f"  {key} = {value}")
        else:
            print("✗ 未找到算法计数器指标")
    except Exception as e:
        print(f"✗ 检查 Redis 失败: {e}")
        return False
    # 5. 检查 Redis Exporter
    print("\n检查 Redis Exporter...")
    try:
        response = requests.get("http://localhost:8001/metrics", timeout=2)
        metrics = response.text
        if "http_requests_total" in metrics:
            print("✓ Exporter 成功导出 http_requests_total")
            for line in metrics.split("\n"):
                if "http_requests_total" in line and not line.startswith("#"):
                    print(f"  {line}")
                    break
        else:
            print("✗ Exporter 未导出 http_requests_total")
    except Exception as e:
        print(f"✗ Redis Exporter 未运行: {e}")
    return True
 def test_memory():
    """测试原有的内存方案"""
    print("\n=== 测试内存方案（原有方案）===\n")
    # 发送测试请求
    print("发送测试请求...")
    for i in range(5):
        try:
            response = requests.post(
                "http://localhost:8000/invoke",
                json={"number": 17},
                timeout=5,
            )
            print(f"  请求 {i+1}: {response.status_code}")
            time.sleep(0.5)
        except Exception as e:
            print(f"  请求 {i+1} 失败: {e}")
    # 检查应用的 /metrics 端点
    print("\n检查应用 /metrics 端点...")
    try:
        response = requests.get("http://localhost:8000/metrics", timeout=2)
        metrics = response.text
        if "http_requests_total" in metrics:
            print("✓ 找到 http_requests_total 指标")
            for line in metrics.split("\n"):
                if "http_requests_total" in line and not line.startswith("#"):
                    print(f"  {line}")
                    break
        else:
            print("✗ 未找到指标")
    except Exception as e:
        print(f"✗ 获取指标失败: {e}")
        return False
    print("\n⚠️  注意：内存方案在多实例部署时，每个实例的指标是独立的")
    return True
 def main():
    """主函数"""
    print("=" * 60)
    print("FunctionalScaffold 指标方案测试")
    print("=" * 60)
    if len(sys.argv) > 1:
        backend = sys.argv[1]
    else:
        print("\n请选择要测试的方案：")
        print("1. Pushgateway（推荐）")
        print("2. Redis + Exporter")
        print("3. Memory（原有方案）")
        choice = input("\n输入选项 (1/2/3): ").strip()
        backend_map = {"1": "pushgateway", "2": "redis", "3": "memory"}
        backend = backend_map.get(choice, "pushgateway")
    print(f"\n选择的方案: {backend}")
    # 运行测试
    if backend == "pushgateway":
        success = test_pushgateway()
    elif backend == "redis":
        success = test_redis()
    elif backend == "memory":
        success = test_memory()
    else:
        print(f"未知的方案: {backend}")
        sys.exit(1)
    # 输出结果
    print("\n" + "=" * 60)
    if success:
        print("✓ 测试通过")
    else:
        print("✗ 测试失败")
    print("=" * 60)
 if __name__ == "__main__":
    main()
--- a/src/functional_scaffold/init.py
+++ b/src/functional_scaffold/init.py
@@ -0,0 +1,3 @@
 """FunctionalScaffold - 算法工程化 Serverless 脚手架"""
 __version__ = "1.0.0"
--- a/src/functional_scaffold/algorithms/init.py
+++ b/src/functional_scaffold/algorithms/init.py
@@ -0,0 +1,6 @@
 """算法模块"""
 from .base import BaseAlgorithm
 from .prime_checker import PrimeChecker
 __all__ = ["BaseAlgorithm", "PrimeChecker"]
--- a/src/functional_scaffold/algorithms/base.py
+++ b/src/functional_scaffold/algorithms/base.py
@@ -0,0 +1,77 @@
 """算法基类"""
 from abc import ABC, abstractmethod
 from typing import Any, Dict
 import time
 import logging
 logger = logging.getLogger(__name__)
 class BaseAlgorithm(ABC):
    """算法基类，所有算法必须继承此类"""
    def __init__(self):
        self.name = self.__class__.__name__
        self.version = "1.0.0"
    @abstractmethod
    def process(self, *args, **kwargs) -> Dict[str, Any]:
        """
        算法处理逻辑，子类必须实现此方法
        Returns:
            Dict[str, Any]: 算法处理结果
        """
        pass
    def execute(self, *args, **kwargs) -> Dict[str, Any]:
        """
        执行算法，包含埋点和错误处理
        Returns:
            Dict[str, Any]: 包含结果和元数据的字典
        """
        from ..core.metrics import algorithm_counter, algorithm_latency
        start_time = time.time()
        status = "success"
        try:
            logger.info(f"Starting algorithm: {self.name}")
            result = self.process(*args, **kwargs)
            elapsed_time = time.time() - start_time
            logger.info(
                f"Algorithm {self.name} completed successfully in {elapsed_time:.3f}s"
            )
            return {
                "success": True,
                "result": result,
                "metadata": {
                    "algorithm": self.name,
                    "version": self.version,
                    "elapsed_time": elapsed_time,
                },
            }
        except Exception as e:
            status = "error"
            elapsed_time = time.time() - start_time
            logger.error(f"Algorithm {self.name} failed: {str(e)}", exc_info=True)
            return {
                "success": False,
                "error": str(e),
                "metadata": {
                    "algorithm": self.name,
                    "version": self.version,
                    "elapsed_time": elapsed_time,
                },
            }
        finally:
            # 记录算法执行指标
            elapsed_time = time.time() - start_time
            algorithm_counter.labels(algorithm=self.name, status=status).inc()
            algorithm_latency.labels(algorithm=self.name).observe(elapsed_time)
--- a/src/functional_scaffold/algorithms/prime_checker.py
+++ b/src/functional_scaffold/algorithms/prime_checker.py
@@ -0,0 +1,94 @@
 """质数判断算法"""
 from typing import Dict, Any, List
 from .base import BaseAlgorithm
 class PrimeChecker(BaseAlgorithm):
    """
    质数判断算法
    使用试除法判断一个整数是否为质数，并返回因数分解结果
    """
    def process(self, number: int) -> Dict[str, Any]:
        """
        判断给定数字是否为质数
        Args:
            number: 待判断的整数
        Returns:
            Dict[str, Any]: 包含判断结果的字典
                - number: 输入的数字
                - is_prime: 是否为质数
                - factors: 因数列表（如果不是质数）
                - reason: 说明（如果适用）
                - algorithm: 使用的算法名称
        Raises:
            ValueError: 如果输入不是整数
        """
        if not isinstance(number, int):
            raise ValueError(f"Input must be an integer, got {type(number).__name__}")
        # 小于2的数不是质数
        if number < 2:
            return {
                "number": number,
                "is_prime": False,
                "reason": "Numbers less than 2 are not prime",
                "factors": [],
                "algorithm": "trial_division",
            }
        # 判断是否为质数
        is_prime = self._is_prime(number)
        # 如果不是质数，计算因数
        factors = [] if is_prime else self._get_factors(number)
        return {
            "number": number,
            "is_prime": is_prime,
            "factors": factors,
            "algorithm": "trial_division",
        }
    def _is_prime(self, n: int) -> bool:
        """
        使用试除法判断是否为质数
        Args:
            n: 待判断的正整数
        Returns:
            bool: 是否为质数
        """
        if n == 2:
            return True
        if n % 2 == 0:
            return False
        # 只需检查到sqrt(n)
        for i in range(3, int(n**0.5) + 1, 2):
            if n % i == 0:
                return False
        return True
    def _get_factors(self, n: int) -> List[int]:
        """
        获取一个数的所有因数（不包括1和自身）
        Args:
            n: 待分解的正整数
        Returns:
            List[int]: 因数列表
        """
        factors = []
        for i in range(2, n):
            if n % i == 0:
                factors.append(i)
        return factors
--- a/src/functional_scaffold/api/init.py
+++ b/src/functional_scaffold/api/init.py
@@ -0,0 +1,6 @@
 """API 模块"""
 from .routes import router
 from .models import InvokeRequest, InvokeResponse, HealthResponse, ErrorResponse
 __all__ = ["router", "InvokeRequest", "InvokeResponse", "HealthResponse", "ErrorResponse"]
--- a/src/functional_scaffold/api/dependencies.py
+++ b/src/functional_scaffold/api/dependencies.py
@@ -0,0 +1,20 @@
 """API 依赖注入"""
 from fastapi import Header, HTTPException
 from typing import Optional
 from ..core.tracing import set_request_id, generate_request_id
 async def get_request_id(x_request_id: Optional[str] = Header(None)) -> str:
    """
    获取或生成请求ID
    Args:
        x_request_id: 从请求头获取的请求ID
    Returns:
        str: 请求ID
    """
    request_id = x_request_id or generate_request_id()
    set_request_id(request_id)
    return request_id
--- a/src/functional_scaffold/api/models.py
+++ b/src/functional_scaffold/api/models.py
@@ -0,0 +1,82 @@
 """API 数据模型"""
 from pydantic import BaseModel, Field, ConfigDict
 from typing import Any, Dict, Optional
 class InvokeRequest(BaseModel):
    """同步调用请求"""
    model_config = ConfigDict(
        json_schema_extra={
            "example": {
                "number": 17
            }
        }
    )
    number: int = Field(..., description="待判断的整数")
 class InvokeResponse(BaseModel):
    """同步调用响应"""
    model_config = ConfigDict(
        json_schema_extra={
            "example": {
                "request_id": "550e8400-e29b-41d4-a716-446655440000",
                "status": "success",
                "result": {
                    "number": 17,
                    "is_prime": True,
                    "factors": [],
                    "algorithm": "trial_division"
                },
                "metadata": {
                    "algorithm": "PrimeChecker",
                    "version": "1.0.0",
                    "elapsed_time": 0.001
                }
            }
        }
    )
    request_id: str = Field(..., description="请求唯一标识")
    status: str = Field(..., description="处理状态")
    result: Dict[str, Any] = Field(..., description="算法执行结果")
    metadata: Dict[str, Any] = Field(..., description="元数据信息")
 class HealthResponse(BaseModel):
    """健康检查响应"""
    status: str = Field(..., description="健康状态")
    timestamp: float = Field(..., description="时间戳")
 class ReadinessResponse(BaseModel):
    """就绪检查响应"""
    status: str = Field(..., description="就绪状态")
    timestamp: float = Field(..., description="时间戳")
    checks: Optional[Dict[str, bool]] = Field(None, description="各项检查结果")
 class ErrorResponse(BaseModel):
    """错误响应"""
    model_config = ConfigDict(
        json_schema_extra={
            "example": {
                "error": "VALIDATION_ERROR",
                "message": "number must be an integer",
                "details": {"field": "number", "value": "abc"},
                "request_id": "550e8400-e29b-41d4-a716-446655440000"
            }
        }
    )
    error: str = Field(..., description="错误代码")
    message: str = Field(..., description="错误消息")
    details: Optional[Dict[str, Any]] = Field(None, description="错误详情")
    request_id: Optional[str] = Field(None, description="请求ID")
--- a/src/functional_scaffold/api/routes.py
+++ b/src/functional_scaffold/api/routes.py
@@ -0,0 +1,150 @@
 """API 路由"""
 from fastapi import APIRouter, HTTPException, Depends, status
 from fastapi.responses import JSONResponse
 import time
 import logging
 from .models import (
    InvokeRequest,
    InvokeResponse,
    HealthResponse,
    ReadinessResponse,
    ErrorResponse,
 )
 from .dependencies import get_request_id
 from ..algorithms.prime_checker import PrimeChecker
 from ..core.errors import FunctionalScaffoldError, ValidationError, AlgorithmError
 logger = logging.getLogger(__name__)
 router = APIRouter()
@router.post(
    "/invoke",
    response_model=InvokeResponse,
    status_code=status.HTTP_200_OK,
    summary="同步调用算法",
    description="同步调用质数判断算法，立即返回结果",
    responses={
        200: {"description": "成功", "model": InvokeResponse},
        400: {"description": "请求参数错误", "model": ErrorResponse},
        500: {"description": "服务器内部错误", "model": ErrorResponse},
    },
 )
 async def invoke_algorithm(
    request: InvokeRequest,
    request_id: str = Depends(get_request_id),
 ):
    """
    同步调用质数判断算法
    - **number**: 待判断的整数
    """
    try:
        logger.info(f"Processing request {request_id} with number={request.number}")
        # 创建算法实例并执行
        checker = PrimeChecker()
        execution_result = checker.execute(request.number)
        if not execution_result["success"]:
            raise AlgorithmError(
                execution_result.get("error", "Algorithm execution failed"),
                details=execution_result.get("metadata", {}),
            )
        return InvokeResponse(
            request_id=request_id,
            status="success",
            result=execution_result["result"],
            metadata=execution_result["metadata"],
        )
    except ValidationError as e:
        logger.warning(f"Validation error for request {request_id}: {e.message}")
        raise HTTPException(
            status_code=status.HTTP_400_BAD_REQUEST,
            detail=e.to_dict(),
        )
    except AlgorithmError as e:
        logger.error(f"Algorithm error for request {request_id}: {e.message}")
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=e.to_dict(),
        )
    except Exception as e:
        logger.error(f"Unexpected error for request {request_id}: {str(e)}", exc_info=True)
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail={
                "error": "INTERNAL_ERROR",
                "message": str(e),
                "request_id": request_id,
            },
        )
@router.get(
    "/healthz",
    response_model=HealthResponse,
    summary="健康检查",
    description="检查服务是否存活",
 )
 async def health_check():
    """
    健康检查端点
    返回服务的健康状态，用于存活探针
    """
    return HealthResponse(
        status="healthy",
        timestamp=time.time(),
    )
@router.get(
    "/readyz",
    response_model=ReadinessResponse,
    summary="就绪检查",
    description="检查服务是否就绪",
 )
 async def readiness_check():
    """
    就绪检查端点
    返回服务的就绪状态，用于就绪探针
    """
    # 这里可以添加更多检查，例如数据库连接、外部服务等
    checks = {
        "algorithm": True,  # 算法模块可用
    }
    all_ready = all(checks.values())
    return ReadinessResponse(
        status="ready" if all_ready else "not_ready",
        timestamp=time.time(),
        checks=checks,
    )
@router.post(
    "/jobs",
    status_code=status.HTTP_501_NOT_IMPLEMENTED,
    summary="异步任务接口（预留）",
    description="异步任务接口，当前版本未实现",
 )
 async def create_job():
    """
    异步任务接口（预留）
    用于提交长时间运行的任务
    """
    raise HTTPException(
        status_code=status.HTTP_501_NOT_IMPLEMENTED,
        detail={"error": "NOT_IMPLEMENTED", "message": "Async jobs not implemented yet"},
    )
--- a/src/functional_scaffold/config.py
+++ b/src/functional_scaffold/config.py
@@ -0,0 +1,47 @@
 """配置管理模块"""
 from pydantic_settings import BaseSettings
 from pydantic import ConfigDict
 from typing import Optional
 class Settings(BaseSettings):
    """应用配置"""
    model_config = ConfigDict(
        env_file=".env",
        case_sensitive=False
    )
    # 应用信息
    app_name: str = "FunctionalScaffold"
    app_version: str = "1.0.0"
    app_env: str = "development"
    # 服务器配置
    host: str = "0.0.0.0"
    port: int = 8000
    workers: int = 4
    # 日志配置
    log_level: str = "INFO"
    log_format: str = "json"
    # 指标配置
    metrics_enabled: bool = True
    # 追踪配置
    tracing_enabled: bool = False
    jaeger_endpoint: Optional[str] = None
    # 外部服务配置（示例）
    oss_endpoint: Optional[str] = None
    oss_access_key_id: Optional[str] = None
    oss_access_key_secret: Optional[str] = None
    oss_bucket_name: Optional[str] = None
    database_url: Optional[str] = None
 # 全局配置实例
 settings = Settings()
--- a/src/functional_scaffold/core/init.py
+++ b/src/functional_scaffold/core/init.py
@@ -0,0 +1,21 @@
 """核心功能模块"""
 from .errors import (
    FunctionalScaffoldError,
    ValidationError,
    AlgorithmError,
    ConfigurationError,
 )
 from .logging import setup_logging
 from .metrics import metrics_registry, track_request, track_algorithm_execution
 __all__ = [
    "FunctionalScaffoldError",
    "ValidationError",
    "AlgorithmError",
    "ConfigurationError",
    "setup_logging",
    "metrics_registry",
    "track_request",
    "track_algorithm_execution",
 ]
--- a/src/functional_scaffold/core/errors.py
+++ b/src/functional_scaffold/core/errors.py
@@ -0,0 +1,47 @@
 """错误处理模块"""
 from typing import Any, Dict, Optional
 class FunctionalScaffoldError(Exception):
    """基础异常类"""
    def __init__(
        self,
        message: str,
        error_code: Optional[str] = None,
        details: Optional[Dict[str, Any]] = None,
    ):
        self.message = message
        self.error_code = error_code or "INTERNAL_ERROR"
        self.details = details or {}
        super().__init__(self.message)
    def to_dict(self) -> Dict[str, Any]:
        """转换为字典格式"""
        return {
            "error": self.error_code,
            "message": self.message,
            "details": self.details,
        }
 class ValidationError(FunctionalScaffoldError):
    """参数验证错误"""
    def __init__(self, message: str, details: Optional[Dict[str, Any]] = None):
        super().__init__(message, error_code="VALIDATION_ERROR", details=details)
 class AlgorithmError(FunctionalScaffoldError):
    """算法执行错误"""
    def __init__(self, message: str, details: Optional[Dict[str, Any]] = None):
        super().__init__(message, error_code="ALGORITHM_ERROR", details=details)
 class ConfigurationError(FunctionalScaffoldError):
    """配置错误"""
    def __init__(self, message: str, details: Optional[Dict[str, Any]] = None):
        super().__init__(message, error_code="CONFIGURATION_ERROR", details=details)
--- a/src/functional_scaffold/core/logging.py
+++ b/src/functional_scaffold/core/logging.py
@@ -0,0 +1,50 @@
 """日志配置模块"""
 import logging
 import sys
 from typing import Optional
 from pythonjsonlogger.json import JsonFormatter
 def setup_logging(
    level: str = "INFO",
    format_type: str = "json",
    logger_name: Optional[str] = None,
 ) -> logging.Logger:
    """
    配置日志系统
    Args:
        level: 日志级别 (DEBUG, INFO, WARNING, ERROR, CRITICAL)
        format_type: 日志格式 ('json' 或 'text')
        logger_name: 日志器名称，None表示根日志器
    Returns:
        logging.Logger: 配置好的日志器
    """
    logger = logging.getLogger(logger_name)
    logger.setLevel(getattr(logging, level.upper()))
    # 清除现有处理器
    logger.handlers.clear()
    # 创建控制台处理器
    handler = logging.StreamHandler(sys.stdout)
    handler.setLevel(getattr(logging, level.upper()))
    # 设置格式
    if format_type == "json":
        formatter = JsonFormatter(
            "%(asctime)s %(name)s %(levelname)s %(message)s",
            timestamp=True,
        )
    else:
        formatter = logging.Formatter(
            "%(asctime)s - %(name)s - %(levelname)s - %(message)s",
            datefmt="%Y-%m-%d %H:%M:%S",
        )
    handler.setFormatter(formatter)
    logger.addHandler(handler)
    return logger
--- a/src/functional_scaffold/core/metrics.py
+++ b/src/functional_scaffold/core/metrics.py
@@ -0,0 +1,111 @@
 """Prometheus 指标模块"""
 from prometheus_client import Counter, Histogram, Gauge, CollectorRegistry
 from functools import wraps
 import time
 from typing import Callable
 # 创建指标注册表
 metrics_registry = CollectorRegistry()
 # 请求计数器
 request_counter = Counter(
    "http_requests_total",
    "Total HTTP requests",
    ["method", "endpoint", "status"],
    registry=metrics_registry,
 )
 # 请求延迟直方图
 request_latency = Histogram(
    "http_request_duration_seconds",
    "HTTP request latency",
    ["method", "endpoint"],
    registry=metrics_registry,
 )
 # 算法执行计数器
 algorithm_counter = Counter(
    "algorithm_executions_total",
    "Total algorithm executions",
    ["algorithm", "status"],
    registry=metrics_registry,
 )
 # 算法执行延迟
 algorithm_latency = Histogram(
    "algorithm_execution_duration_seconds",
    "Algorithm execution latency",
    ["algorithm"],
    registry=metrics_registry,
 )
 # 当前处理中的请求数
 in_progress_requests = Gauge(
    "http_requests_in_progress",
    "Number of HTTP requests in progress",
    registry=metrics_registry,
 )
 def track_request(method: str, endpoint: str):
    """
    装饰器：跟踪HTTP请求指标
    Args:
        method: HTTP方法
        endpoint: 端点路径
    """
    def decorator(func: Callable):
        @wraps(func)
        async def wrapper(*args, **kwargs):
            in_progress_requests.inc()
            start_time = time.time()
            try:
                result = await func(*args, **kwargs)
                status = "success"
                return result
            except Exception as e:
                status = "error"
                raise e
            finally:
                elapsed = time.time() - start_time
                request_counter.labels(method=method, endpoint=endpoint, status=status).inc()
                request_latency.labels(method=method, endpoint=endpoint).observe(elapsed)
                in_progress_requests.dec()
        return wrapper
    return decorator
 def track_algorithm_execution(algorithm_name: str):
    """
    装饰器：跟踪算法执行指标
    Args:
        algorithm_name: 算法名称
    """
    def decorator(func: Callable):
        @wraps(func)
        def wrapper(*args, **kwargs):
            start_time = time.time()
            try:
                result = func(*args, **kwargs)
                status = "success"
                return result
            except Exception as e:
                status = "error"
                raise e
            finally:
                elapsed = time.time() - start_time
                algorithm_counter.labels(algorithm=algorithm_name, status=status).inc()
                algorithm_latency.labels(algorithm=algorithm_name).observe(elapsed)
        return wrapper
    return decorator
--- a/src/functional_scaffold/core/metrics_pushgateway.py
+++ b/src/functional_scaffold/core/metrics_pushgateway.py
@@ -0,0 +1,162 @@
 """基于 Pushgateway 的 Prometheus 指标模块"""
 from prometheus_client import Counter, Histogram, Gauge, CollectorRegistry, push_to_gateway
 from functools import wraps
 import time
 from typing import Callable, Optional
 import os
 import logging
 logger = logging.getLogger(__name__)
 # 创建指标注册表
 metrics_registry = CollectorRegistry()
 # Pushgateway 配置
 PUSHGATEWAY_URL = os.getenv("PUSHGATEWAY_URL", "localhost:9091")
 JOB_NAME = os.getenv("METRICS_JOB_NAME", "functional_scaffold")
 INSTANCE_ID = os.getenv("INSTANCE_ID", os.getenv("HOSTNAME", "unknown"))
 # 请求计数器
 request_counter = Counter(
    "http_requests_total",
    "Total HTTP requests",
    ["method", "endpoint", "status", "instance"],
    registry=metrics_registry,
 )
 # 请求延迟直方图
 request_latency = Histogram(
    "http_request_duration_seconds",
    "HTTP request latency",
    ["method", "endpoint", "instance"],
    registry=metrics_registry,
 )
 # 算法执行计数器
 algorithm_counter = Counter(
    "algorithm_executions_total",
    "Total algorithm executions",
    ["algorithm", "status", "instance"],
    registry=metrics_registry,
 )
 # 算法执行延迟
 algorithm_latency = Histogram(
    "algorithm_execution_duration_seconds",
    "Algorithm execution latency",
    ["algorithm", "instance"],
    registry=metrics_registry,
 )
 # 当前处理中的请求数
 in_progress_requests = Gauge(
    "http_requests_in_progress",
    "Number of HTTP requests in progress",
    ["instance"],
    registry=metrics_registry,
 )
 def push_metrics(grouping_key: Optional[dict] = None):
    """
    推送指标到 Pushgateway
    Args:
        grouping_key: 额外的分组键
    """
    try:
        grouping = {"instance": INSTANCE_ID}
        if grouping_key:
            grouping.update(grouping_key)
        push_to_gateway(
            PUSHGATEWAY_URL,
            job=JOB_NAME,
            registry=metrics_registry,
            grouping_key=grouping,
        )
        logger.debug(f"成功推送指标到 Pushgateway: {PUSHGATEWAY_URL}")
    except Exception as e:
        logger.error(f"推送指标到 Pushgateway 失败: {e}")
 def track_request(method: str, endpoint: str, auto_push: bool = True):
    """
    装饰器：跟踪HTTP请求指标
    Args:
        method: HTTP方法
        endpoint: 端点路径
        auto_push: 是否自动推送到 Pushgateway
    """
    def decorator(func: Callable):
        @wraps(func)
        async def wrapper(*args, **kwargs):
            in_progress_requests.labels(instance=INSTANCE_ID).inc()
            start_time = time.time()
            try:
                result = await func(*args, **kwargs)
                status = "success"
                return result
            except Exception as e:
                status = "error"
                raise e
            finally:
                elapsed = time.time() - start_time
                request_counter.labels(
                    method=method, endpoint=endpoint, status=status, instance=INSTANCE_ID
                ).inc()
                request_latency.labels(
                    method=method, endpoint=endpoint, instance=INSTANCE_ID
                ).observe(elapsed)
                in_progress_requests.labels(instance=INSTANCE_ID).dec()
                # 自动推送指标
                if auto_push:
                    push_metrics()
        return wrapper
    return decorator
 def track_algorithm_execution(algorithm_name: str, auto_push: bool = True):
    """
    装饰器：跟踪算法执行指标
    Args:
        algorithm_name: 算法名称
        auto_push: 是否自动推送到 Pushgateway
    """
    def decorator(func: Callable):
        @wraps(func)
        def wrapper(*args, **kwargs):
            start_time = time.time()
            try:
                result = func(*args, **kwargs)
                status = "success"
                return result
            except Exception as e:
                status = "error"
                raise e
            finally:
                elapsed = time.time() - start_time
                algorithm_counter.labels(
                    algorithm=algorithm_name, status=status, instance=INSTANCE_ID
                ).inc()
                algorithm_latency.labels(
                    algorithm=algorithm_name, instance=INSTANCE_ID
                ).observe(elapsed)
                # 自动推送指标
                if auto_push:
                    push_metrics()
        return wrapper
    return decorator
--- a/src/functional_scaffold/core/metrics_redis.py
+++ b/src/functional_scaffold/core/metrics_redis.py
@@ -0,0 +1,247 @@
 """基于 Redis 的指标记录模块"""
 from functools import wraps
 import time
 from typing import Callable, Optional
 import os
 import logging
 import json
 from datetime import datetime
 try:
    import redis
    REDIS_AVAILABLE = True
 except ImportError:
    REDIS_AVAILABLE = False
    logging.warning("Redis 未安装，指标将无法记录到 Redis")
 logger = logging.getLogger(__name__)
 # Redis 配置
 REDIS_HOST = os.getenv("REDIS_HOST", "localhost")
 REDIS_PORT = int(os.getenv("REDIS_PORT", "6379"))
 REDIS_DB = int(os.getenv("REDIS_METRICS_DB", "0"))
 REDIS_PASSWORD = os.getenv("REDIS_PASSWORD", None)
 INSTANCE_ID = os.getenv("INSTANCE_ID", os.getenv("HOSTNAME", "unknown"))
 # Redis 键前缀
 METRICS_PREFIX = "metrics:"
 REQUEST_COUNTER_KEY = f"{METRICS_PREFIX}request_counter"
 REQUEST_LATENCY_KEY = f"{METRICS_PREFIX}request_latency"
 ALGORITHM_COUNTER_KEY = f"{METRICS_PREFIX}algorithm_counter"
 ALGORITHM_LATENCY_KEY = f"{METRICS_PREFIX}algorithm_latency"
 IN_PROGRESS_KEY = f"{METRICS_PREFIX}in_progress"
 class RedisMetricsClient:
    """Redis 指标客户端"""
    def __init__(self):
        if not REDIS_AVAILABLE:
            raise ImportError("需要安装 redis 库: pip install redis")
        self.client = redis.Redis(
            host=REDIS_HOST,
            port=REDIS_PORT,
            db=REDIS_DB,
            password=REDIS_PASSWORD,
            decode_responses=True,
        )
        self.instance_id = INSTANCE_ID
    def increment_counter(self, key: str, labels: dict, value: int = 1):
        """
        增加计数器
        Args:
            key: 指标键
            labels: 标签字典
            value: 增加的值
        """
        try:
            # 使用 Hash 存储，键为标签组合
            label_key = self._make_label_key(labels)
            full_key = f"{key}:{label_key}"
            self.client.hincrby(key, full_key, value)
            # 记录最后更新时间
            self.client.hset(key, f"{full_key}:timestamp", int(time.time()))
        except Exception as e:
            logger.error(f"Redis 计数器增加失败: {e}")
    def observe_histogram(self, key: str, labels: dict, value: float):
        """
        记录直方图观测值
        Args:
            key: 指标键
            labels: 标签字典
            value: 观测值
        """
        try:
            label_key = self._make_label_key(labels)
            full_key = f"{key}:{label_key}"
            # 使用 Sorted Set 存储延迟数据（用于计算分位数）
            timestamp = time.time()
            self.client.zadd(full_key, {f"{timestamp}:{value}": timestamp})
            # 保留最近1小时的数据
            cutoff = timestamp - 3600
            self.client.zremrangebyscore(full_key, "-inf", cutoff)
            # 同时记录到 Hash 中用于快速统计
            self.client.hincrby(f"{key}:count", full_key, 1)
            self.client.hincrbyfloat(f"{key}:sum", full_key, value)
        except Exception as e:
            logger.error(f"Redis 直方图记录失败: {e}")
    def set_gauge(self, key: str, labels: dict, value: float):
        """
        设置仪表盘值
        Args:
            key: 指标键
            labels: 标签字典
            value: 值
        """
        try:
            label_key = self._make_label_key(labels)
            full_key = f"{key}:{label_key}"
            self.client.hset(key, full_key, value)
            self.client.hset(key, f"{full_key}:timestamp", int(time.time()))
        except Exception as e:
            logger.error(f"Redis 仪表盘设置失败: {e}")
    def increment_gauge(self, key: str, labels: dict, value: float = 1):
        """增加仪表盘值"""
        try:
            label_key = self._make_label_key(labels)
            full_key = f"{key}:{label_key}"
            self.client.hincrbyfloat(key, full_key, value)
        except Exception as e:
            logger.error(f"Redis 仪表盘增加失败: {e}")
    def decrement_gauge(self, key: str, labels: dict, value: float = 1):
        """减少仪表盘值"""
        self.increment_gauge(key, labels, -value)
    def _make_label_key(self, labels: dict) -> str:
        """
        从标签字典生成键
        Args:
            labels: 标签字典
        Returns:
            str: 标签键
        """
        # 添加实例ID
        labels_with_instance = {**labels, "instance": self.instance_id}
        # 按键排序确保一致性
        sorted_labels = sorted(labels_with_instance.items())
        return ",".join(f"{k}={v}" for k, v in sorted_labels)
    def get_metrics_summary(self) -> dict:
        """
        获取指标摘要（用于调试）
        Returns:
            dict: 指标摘要
        """
        try:
            return {
                "request_counter": self.client.hgetall(REQUEST_COUNTER_KEY),
                "algorithm_counter": self.client.hgetall(ALGORITHM_COUNTER_KEY),
                "in_progress": self.client.hgetall(IN_PROGRESS_KEY),
            }
        except Exception as e:
            logger.error(f"获取指标摘要失败: {e}")
            return {}
 # 全局客户端实例
 _redis_client: Optional[RedisMetricsClient] = None
 def get_redis_client() -> RedisMetricsClient:
    """获取 Redis 客户端单例"""
    global _redis_client
    if _redis_client is None:
        _redis_client = RedisMetricsClient()
    return _redis_client
 def track_request(method: str, endpoint: str):
    """
    装饰器：跟踪HTTP请求指标
    Args:
        method: HTTP方法
        endpoint: 端点路径
    """
    def decorator(func: Callable):
        @wraps(func)
        async def wrapper(*args, **kwargs):
            client = get_redis_client()
            labels = {"method": method, "endpoint": endpoint}
            # 增加进行中的请求数
            client.increment_gauge(IN_PROGRESS_KEY, labels)
            start_time = time.time()
            try:
                result = await func(*args, **kwargs)
                status = "success"
                return result
            except Exception as e:
                status = "error"
                raise e
            finally:
                elapsed = time.time() - start_time
                # 记录指标
                counter_labels = {**labels, "status": status}
                client.increment_counter(REQUEST_COUNTER_KEY, counter_labels)
                client.observe_histogram(REQUEST_LATENCY_KEY, labels, elapsed)
                client.decrement_gauge(IN_PROGRESS_KEY, labels)
        return wrapper
    return decorator
 def track_algorithm_execution(algorithm_name: str):
    """
    装饰器：跟踪算法执行指标
    Args:
        algorithm_name: 算法名称
    """
    def decorator(func: Callable):
        @wraps(func)
        def wrapper(*args, **kwargs):
            client = get_redis_client()
            labels = {"algorithm": algorithm_name}
            start_time = time.time()
            try:
                result = func(*args, **kwargs)
                status = "success"
                return result
            except Exception as e:
                status = "error"
                raise e
            finally:
                elapsed = time.time() - start_time
                # 记录指标
                counter_labels = {**labels, "status": status}
                client.increment_counter(ALGORITHM_COUNTER_KEY, counter_labels)
                client.observe_histogram(ALGORITHM_LATENCY_KEY, labels, elapsed)
        return wrapper
    return decorator
--- a/src/functional_scaffold/core/metrics_redis_exporter.py
+++ b/src/functional_scaffold/core/metrics_redis_exporter.py
@@ -0,0 +1,247 @@
 """Redis 指标 Exporter - 将 Redis 中的指标转换为 Prometheus 格式"""
 from prometheus_client import Counter, Histogram, Gauge, CollectorRegistry, generate_latest
 from prometheus_client.core import GaugeMetricFamily, CounterMetricFamily, HistogramMetricFamily
 import redis
 import os
 import logging
 from typing import Dict, List, Tuple
 import time
 logger = logging.getLogger(__name__)
 # Redis 配置
 REDIS_HOST = os.getenv("REDIS_HOST", "localhost")
 REDIS_PORT = int(os.getenv("REDIS_PORT", "6379"))
 REDIS_DB = int(os.getenv("REDIS_METRICS_DB", "0"))
 REDIS_PASSWORD = os.getenv("REDIS_PASSWORD", None)
 # Redis 键前缀
 METRICS_PREFIX = "metrics:"
 REQUEST_COUNTER_KEY = f"{METRICS_PREFIX}request_counter"
 REQUEST_LATENCY_KEY = f"{METRICS_PREFIX}request_latency"
 ALGORITHM_COUNTER_KEY = f"{METRICS_PREFIX}algorithm_counter"
 ALGORITHM_LATENCY_KEY = f"{METRICS_PREFIX}algorithm_latency"
 IN_PROGRESS_KEY = f"{METRICS_PREFIX}in_progress"
 class RedisMetricsCollector:
    """从 Redis 收集指标并转换为 Prometheus 格式"""
    def __init__(self):
        self.redis_client = redis.Redis(
            host=REDIS_HOST,
            port=REDIS_PORT,
            db=REDIS_DB,
            password=REDIS_PASSWORD,
            decode_responses=True,
        )
    def collect(self):
        """收集所有指标"""
        try:
            # 收集计数器指标
            yield from self._collect_counter(
                REQUEST_COUNTER_KEY,
                "http_requests_total",
                "Total HTTP requests",
            )
            yield from self._collect_counter(
                ALGORITHM_COUNTER_KEY,
                "algorithm_executions_total",
                "Total algorithm executions",
            )
            # 收集直方图指标
            yield from self._collect_histogram(
                REQUEST_LATENCY_KEY,
                "http_request_duration_seconds",
                "HTTP request latency",
            )
            yield from self._collect_histogram(
                ALGORITHM_LATENCY_KEY,
                "algorithm_execution_duration_seconds",
                "Algorithm execution latency",
            )
            # 收集仪表盘指标
            yield from self._collect_gauge(
                IN_PROGRESS_KEY,
                "http_requests_in_progress",
                "Number of HTTP requests in progress",
            )
        except Exception as e:
            logger.error(f"收集指标失败: {e}")
    def _collect_counter(self, redis_key: str, metric_name: str, description: str):
        """收集计数器指标"""
        try:
            data = self.redis_client.hgetall(redis_key)
            if not data:
                return
            # 解析标签和值
            metrics_data = []
            for key, value in data.items():
                if key.endswith(":timestamp"):
                    continue
                labels = self._parse_labels(key)
                metrics_data.append((labels, float(value)))
            # 创建 Prometheus 指标
            if metrics_data:
                label_names = list(metrics_data[0][0].keys())
                counter = CounterMetricFamily(metric_name, description, labels=label_names)
                for labels, value in metrics_data:
                    counter.add_metric(list(labels.values()), value)
                yield counter
        except Exception as e:
            logger.error(f"收集计数器 {redis_key} 失败: {e}")
    def _collect_histogram(self, redis_key: str, metric_name: str, description: str):
        """收集直方图指标"""
        try:
            # 获取计数和总和
            count_data = self.redis_client.hgetall(f"{redis_key}:count")
            sum_data = self.redis_client.hgetall(f"{redis_key}:sum")
            if not count_data:
                return
            metrics_data = []
            for key in count_data.keys():
                labels = self._parse_labels(key)
                count = float(count_data.get(key, 0))
                sum_value = float(sum_data.get(key, 0))
                # 计算分位数（从 Sorted Set 中）
                full_key = f"{redis_key}:{key}"
                latencies = self._get_latencies(full_key)
                buckets = self._calculate_buckets(latencies)
                metrics_data.append((labels, count, sum_value, buckets))
            # 创建 Prometheus 指标
            if metrics_data:
                label_names = list(metrics_data[0][0].keys())
                histogram = HistogramMetricFamily(
                    metric_name, description, labels=label_names
                )
                for labels, count, sum_value, buckets in metrics_data:
                    histogram.add_metric(
                        list(labels.values()),
                        buckets=buckets,
                        sum_value=sum_value,
                    )
                yield histogram
        except Exception as e:
            logger.error(f"收集直方图 {redis_key} 失败: {e}")
    def _collect_gauge(self, redis_key: str, metric_name: str, description: str):
        """收集仪表盘指标"""
        try:
            data = self.redis_client.hgetall(redis_key)
            if not data:
                return
            metrics_data = []
            for key, value in data.items():
                if key.endswith(":timestamp"):
                    continue
                labels = self._parse_labels(key)
                metrics_data.append((labels, float(value)))
            # 创建 Prometheus 指标
            if metrics_data:
                label_names = list(metrics_data[0][0].keys())
                gauge = GaugeMetricFamily(metric_name, description, labels=label_names)
                for labels, value in metrics_data:
                    gauge.add_metric(list(labels.values()), value)
                yield gauge
        except Exception as e:
            logger.error(f"收集仪表盘 {redis_key} 失败: {e}")
    def _parse_labels(self, label_key: str) -> Dict[str, str]:
        """
        解析标签键
        Args:
            label_key: 标签键字符串 (e.g., "method=GET,endpoint=/invoke,instance=host1")
        Returns:
            Dict[str, str]: 标签字典
        """
        labels = {}
        for pair in label_key.split(","):
            if "=" in pair:
                key, value = pair.split("=", 1)
                labels[key] = value
        return labels
    def _get_latencies(self, key: str) -> List[float]:
        """从 Sorted Set 获取延迟数据"""
        try:
            data = self.redis_client.zrange(key, 0, -1)
            latencies = []
            for item in data:
                # 格式: "timestamp:value"
                if ":" in item:
                    _, value = item.rsplit(":", 1)
                    latencies.append(float(value))
            return sorted(latencies)
        except Exception as e:
            logger.error(f"获取延迟数据失败: {e}")
            return []
    def _calculate_buckets(
        self, latencies: List[float]
    ) -> List[Tuple[str, float]]:
        """
        计算直方图桶
        Args:
            latencies: 延迟数据列表
        Returns:
            List[Tuple[str, float]]: 桶列表 [(上限, 计数), ...]
        """
        if not latencies:
            return [("+Inf", 0)]
        # 定义桶边界（秒）
        buckets_boundaries = [0.005, 0.01, 0.025, 0.05, 0.1, 0.25, 0.5, 1, 2.5, 5, 10]
        buckets = []
        for boundary in buckets_boundaries:
            count = sum(1 for lat in latencies if lat <= boundary)
            buckets.append((str(boundary), count))
        # +Inf 桶
        buckets.append(("+Inf", len(latencies)))
        return buckets
 # 创建全局收集器
 redis_collector = RedisMetricsCollector()
 def get_metrics() -> bytes:
    """
    获取 Prometheus 格式的指标
    Returns:
        bytes: Prometheus 格式的指标数据
    """
    registry = CollectorRegistry()
    registry.register(redis_collector)
    return generate_latest(registry)
 if __name__ == "__main__":
    # 测试
    print(get_metrics().decode("utf-8"))
--- a/src/functional_scaffold/core/tracing.py
+++ b/src/functional_scaffold/core/tracing.py
@@ -0,0 +1,39 @@
 """分布式追踪模块"""
 import uuid
 from contextvars import ContextVar
 from typing import Optional
 # 使用 ContextVar 存储请求ID，支持异步上下文
 request_id_var: ContextVar[Optional[str]] = ContextVar("request_id", default=None)
 def generate_request_id() -> str:
    """生成唯一的请求ID"""
    return str(uuid.uuid4())
 def get_request_id() -> Optional[str]:
    """获取当前请求ID"""
    return request_id_var.get()
 def set_request_id(request_id: str) -> None:
    """设置当前请求ID"""
    request_id_var.set(request_id)
 class TracingContext:
    """追踪上下文管理器"""
    def __init__(self, request_id: Optional[str] = None):
        self.request_id = request_id or generate_request_id()
        self.token = None
    def __enter__(self):
        self.token = request_id_var.set(self.request_id)
        return self.request_id
    def __exit__(self, exc_type, exc_val, exc_tb):
        if self.token:
            request_id_var.reset(self.token)
--- a/src/functional_scaffold/main.py
+++ b/src/functional_scaffold/main.py
@@ -0,0 +1,138 @@
 """FastAPI 应用入口"""
 from fastapi import FastAPI, Request
 from fastapi.middleware.cors import CORSMiddleware
 from fastapi.responses import Response
 from prometheus_client import generate_latest, CONTENT_TYPE_LATEST
 import logging
 import time
 from .api import router
 from .config import settings
 from .core.logging import setup_logging
 from .core.metrics import metrics_registry, request_counter, request_latency, in_progress_requests
 # 设置日志
 setup_logging(level=settings.log_level, format_type=settings.log_format)
 logger = logging.getLogger(__name__)
 # 创建 FastAPI 应用
 app = FastAPI(
    title=settings.app_name,
    description="算法工程化 Serverless 脚手架 - 提供标准化的算法服务接口",
    version=settings.app_version,
    docs_url="/docs",
    redoc_url="/redoc",
    openapi_url="/openapi.json",
 )
 # CORS 中间件
 app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],
    allow_credentials=True,
    allow_methods=["*"],
    allow_headers=["*"],
 )
 # 请求日志中间件
@app.middleware("http")
 async def log_requests(request: Request, call_next):
    """记录所有HTTP请求"""
    logger.info(f"Request: {request.method} {request.url.path}")
    response = await call_next(request)
    logger.info(f"Response: {response.status_code}")
    return response
 # 指标跟踪中间件
@app.middleware("http")
 async def track_metrics(request: Request, call_next):
    """记录所有HTTP请求的指标"""
    if not settings.metrics_enabled:
        return await call_next(request)
    # 跳过 /metrics 端点本身，避免循环记录
    if request.url.path == "/metrics":
        return await call_next(request)
    in_progress_requests.inc()
    start_time = time.time()
    status = "success"
    try:
        response = await call_next(request)
        # 根据 HTTP 状态码判断成功或失败
        if response.status_code >= 400:
            status = "error"
        return response
    except Exception as e:
        status = "error"
        raise e
    finally:
        elapsed = time.time() - start_time
        request_counter.labels(
            method=request.method,
            endpoint=request.url.path,
            status=status
        ).inc()
        request_latency.labels(
            method=request.method,
            endpoint=request.url.path
        ).observe(elapsed)
        in_progress_requests.dec()
 # 注册路由
 app.include_router(router, tags=["Algorithm"])
 # Prometheus 指标端点
@app.get(
    "/metrics",
    tags=["Monitoring"],
    summary="Prometheus 指标",
    description="导出 Prometheus 格式的监控指标",
 )
 async def metrics():
    """
    Prometheus 指标端点
    返回应用的监控指标，供 Prometheus 抓取
    """
    if not settings.metrics_enabled:
        return Response(content="Metrics disabled", status_code=404)
    return Response(
        content=generate_latest(metrics_registry),
        media_type=CONTENT_TYPE_LATEST,
    )
 # 启动事件
@app.on_event("startup")
 async def startup_event():
    """应用启动时执行"""
    logger.info(f"Starting {settings.app_name} v{settings.app_version}")
    logger.info(f"Environment: {settings.app_env}")
    logger.info(f"Metrics enabled: {settings.metrics_enabled}")
 # 关闭事件
@app.on_event("shutdown")
 async def shutdown_event():
    """应用关闭时执行"""
    logger.info(f"Shutting down {settings.app_name}")
 if __name__ == "__main__":
    import uvicorn
    uvicorn.run(
        "functional_scaffold.main:app",
        host=settings.host,
        port=settings.port,
        reload=settings.app_env == "development",
        log_level=settings.log_level.lower(),
    )
--- a/src/functional_scaffold/utils/init.py
+++ b/src/functional_scaffold/utils/init.py
@@ -0,0 +1,5 @@
 """工具函数模块"""
 from .validators import validate_integer, validate_positive_integer
 __all__ = ["validate_integer", "validate_positive_integer"]
--- a/src/functional_scaffold/utils/validators.py
+++ b/src/functional_scaffold/utils/validators.py
@@ -0,0 +1,51 @@
 """参数校验工具"""
 from typing import Any
 from ..core.errors import ValidationError
 def validate_integer(value: Any, field_name: str = "value") -> int:
    """
    验证值是否为整数
    Args:
        value: 待验证的值
        field_name: 字段名称（用于错误消息）
    Returns:
        int: 验证后的整数值
    Raises:
        ValidationError: 如果值不是整数
    """
    if not isinstance(value, int) or isinstance(value, bool):
        raise ValidationError(
            f"{field_name} must be an integer",
            details={"field": field_name, "value": value, "type": type(value).__name__},
        )
    return value
 def validate_positive_integer(value: Any, field_name: str = "value") -> int:
    """
    验证值是否为正整数
    Args:
        value: 待验证的值
        field_name: 字段名称（用于错误消息）
    Returns:
        int: 验证后的正整数值
    Raises:
        ValidationError: 如果值不是正整数
    """
    value = validate_integer(value, field_name)
    if value <= 0:
        raise ValidationError(
            f"{field_name} must be a positive integer",
            details={"field": field_name, "value": value},
        )
    return value
--- a/tests/init.py
+++ b/tests/init.py
@@ -0,0 +1 @@
 """测试模块"""
--- a/tests/conftest.py
+++ b/tests/conftest.py
@@ -0,0 +1,23 @@
 """pytest 配置"""
 import pytest
 from fastapi.testclient import TestClient
 from src.functional_scaffold.main import app
@pytest.fixture
 def client():
    """测试客户端"""
    return TestClient(app)
@pytest.fixture
 def sample_prime_numbers():
    """质数样本"""
    return [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47]
@pytest.fixture
 def sample_composite_numbers():
    """合数样本"""
    return [4, 6, 8, 9, 10, 12, 14, 15, 16, 18, 20, 21, 22, 24, 25]
--- a/tests/test_algorithms.py
+++ b/tests/test_algorithms.py
@@ -0,0 +1,77 @@
 """算法单元测试"""
 import pytest
 from src.functional_scaffold.algorithms.prime_checker import PrimeChecker
 class TestPrimeChecker:
    """质数判断算法测试"""
    def setup_method(self):
        """每个测试方法前执行"""
        self.checker = PrimeChecker()
    def test_prime_numbers(self, sample_prime_numbers):
        """测试质数判断"""
        for num in sample_prime_numbers:
            result = self.checker.process(num)
            assert result["is_prime"] is True
            assert result["number"] == num
            assert result["factors"] == []
            assert result["algorithm"] == "trial_division"
    def test_composite_numbers(self, sample_composite_numbers):
        """测试合数判断"""
        for num in sample_composite_numbers:
            result = self.checker.process(num)
            assert result["is_prime"] is False
            assert result["number"] == num
            assert len(result["factors"]) > 0
            assert result["algorithm"] == "trial_division"
    def test_edge_cases(self):
        """测试边界情况"""
        # 0 不是质数
        result = self.checker.process(0)
        assert result["is_prime"] is False
        assert "reason" in result
        # 1 不是质数
        result = self.checker.process(1)
        assert result["is_prime"] is False
        assert "reason" in result
        # 2 是质数
        result = self.checker.process(2)
        assert result["is_prime"] is True
        # 负数不是质数
        result = self.checker.process(-5)
        assert result["is_prime"] is False
    def test_large_prime(self):
        """测试大质数"""
        large_prime = 7919  # 第1000个质数
        result = self.checker.process(large_prime)
        assert result["is_prime"] is True
    def test_invalid_input(self):
        """测试无效输入"""
        with pytest.raises(ValueError):
            self.checker.process("not a number")
        with pytest.raises(ValueError):
            self.checker.process(3.14)
        with pytest.raises(ValueError):
            self.checker.process(None)
    def test_execute_method(self):
        """测试 execute 方法（包含埋点）"""
        result = self.checker.execute(17)
        assert result["success"] is True
        assert "result" in result
        assert "metadata" in result
        assert result["metadata"]["algorithm"] == "PrimeChecker"
        assert "elapsed_time" in result["metadata"]
--- a/tests/test_api.py
+++ b/tests/test_api.py
@@ -0,0 +1,110 @@
 """API 集成测试"""
 import pytest
 from fastapi import status
 class TestInvokeEndpoint:
    """测试 /invoke 端点"""
    def test_invoke_prime_number(self, client):
        """测试质数判断"""
        response = client.post("/invoke", json={"number": 17})
        assert response.status_code == status.HTTP_200_OK
        data = response.json()
        assert "request_id" in data
        assert data["status"] == "success"
        assert data["result"]["number"] == 17
        assert data["result"]["is_prime"] is True
        assert data["result"]["factors"] == []
    def test_invoke_composite_number(self, client):
        """测试合数判断"""
        response = client.post("/invoke", json={"number": 12})
        assert response.status_code == status.HTTP_200_OK
        data = response.json()
        assert data["status"] == "success"
        assert data["result"]["number"] == 12
        assert data["result"]["is_prime"] is False
        assert len(data["result"]["factors"]) > 0
    def test_invoke_edge_cases(self, client):
        """测试边界情况"""
        # 测试 0
        response = client.post("/invoke", json={"number": 0})
        assert response.status_code == status.HTTP_200_OK
        assert response.json()["result"]["is_prime"] is False
        # 测试 1
        response = client.post("/invoke", json={"number": 1})
        assert response.status_code == status.HTTP_200_OK
        assert response.json()["result"]["is_prime"] is False
        # 测试 2
        response = client.post("/invoke", json={"number": 2})
        assert response.status_code == status.HTTP_200_OK
        assert response.json()["result"]["is_prime"] is True
    def test_invoke_invalid_input(self, client):
        """测试无效输入"""
        # 缺少必需字段
        response = client.post("/invoke", json={})
        assert response.status_code == status.HTTP_422_UNPROCESSABLE_ENTITY
        # 错误的数据类型
        response = client.post("/invoke", json={"number": "not a number"})
        assert response.status_code == status.HTTP_422_UNPROCESSABLE_ENTITY
        # 浮点数
        response = client.post("/invoke", json={"number": 3.14})
        assert response.status_code == status.HTTP_422_UNPROCESSABLE_ENTITY
 class TestHealthEndpoints:
    """测试健康检查端点"""
    def test_healthz(self, client):
        """测试存活检查"""
        response = client.get("/healthz")
        assert response.status_code == status.HTTP_200_OK
        data = response.json()
        assert data["status"] == "healthy"
        assert "timestamp" in data
    def test_readyz(self, client):
        """测试就绪检查"""
        response = client.get("/readyz")
        assert response.status_code == status.HTTP_200_OK
        data = response.json()
        assert data["status"] == "ready"
        assert "timestamp" in data
        assert "checks" in data
 class TestMetricsEndpoint:
    """测试指标端点"""
    def test_metrics(self, client):
        """测试 Prometheus 指标"""
        response = client.get("/metrics")
        assert response.status_code == status.HTTP_200_OK
        assert "text/plain" in response.headers["content-type"]
 class TestJobsEndpoint:
    """测试异步任务端点"""
    def test_jobs_not_implemented(self, client):
        """测试异步任务接口（未实现）"""
        response = client.post("/jobs", json={"number": 17})
        assert response.status_code == status.HTTP_501_NOT_IMPLEMENTED
		`@@ -0,0 +1,3 @@`
							`"""FunctionalScaffold - 算法工程化 Serverless 脚手架"""`

							`__version__ = "1.0.0"`