跳转至

李若的代码笔记

选择

李若的代码笔记

index
3D
3D
- ThreeJS
  ThreeJS
  - 资源
AI
AI
Android
Android
- Backup
- 常识
ApplicationPlatform
ApplicationPlatform
- Google Play
- Paypal
- Stripe
- 支付
- 支付宝
- 转账
Automation
Automation
- App-Appium
- BrowserDebugProtocol
- 浏览器参数
- Playwright
  Playwright
  - Go
  - 技巧
BroswerExtension
BroswerExtension
- manifest.json
- Service Worker
Browser
Browser
- Firefox
- WebKit_Windows11编译
- 仿真emulation
- 指纹浏览器
- 缓存
- Webkit改造
  Webkit改造
BuildTools
BuildTools
- Gradle
  Gradle
- Maven
  Maven
  - Publish
  - 安装
CCPP
CCPP
- 构建
  构建
  - Cmake
  - Ninja
  - VCPKG
CICD
CICD
- SSH
- Windows优雅关闭
- Services
  Services
  - Windows
Computer DIY
Computer DIY
- 1. Base
Datebase
Datebase
- Select
- MySQL
  MySQL
DevTools
DevTools
- mise
- Postman
- Container
  Container
  - OrbStack
  - Docker
    Docker
    
    build
    
    hub
    
    Proxy
    
    runs
- IDE
  IDE
  - JetBrains
    JetBrains
    
    CertPathBuilder验证证书
    
    ckey.run
    
    Idea破解配置
    
    不同IDE的激活码
    
    看透激活码
    
    破解debug+需要的信息
  - VSCode
    
    VSCode
    
    VSCode功能
    
    VSCode插件推荐
    
    代理
    
    常用配置
    
    插件开发
    
    插件开发
    
    contributes
    
    Virtual
    
    配置案例
    配置案例
    
    Java
    
    MSVC
- PackageManager
  PackageManager
  - Chocolatey
  - Scoop
  - JS
    JS
    
    Pnpm
- RemoteDesktop
  RemoteDesktop
  - rustdesk
- VersionControl
  VersionControl
  - GIt
    GIt
    
    base
    
    Commit Specification
    
    merge
    
    patch
    
    Sparse Checkout
    
    名词与注意事项
    
    配置
  - Github
    
    Github
    
    Github Actions
    
    Github Pages
Golang
Golang
- Architecture
- Bug
- Command
- Context
- debug
- error
- gomod
- Problems
- Sys
- testing
- toolchain
- type
- 依赖管理
- 反射
- 接口
- 监控程序
- cgo
  cgo
  - 链接库
- Libs
  Libs
Ideas
Ideas
- 代码生成是语言的扩展
- 给可执行文件追加数据
- 应用
  应用
  - doccli
  - markdownToHtml
  - my_comic
  - markdownToHtmlDocs
    markdownToHtmlDocs
    
    deep-research-report
Javascript
Javascript
- package.json
- Typescript
  Typescript
  - module
JVM
JVM
- Java
  Java
  - BUG Record
  - 线程池
- MybatisPlus
  MybatisPlus
  - 自动填充
- Reactor
  Reactor
  - Flux
  - Index
- Server
  Server
  - Netty
    
    Netty
    
    Websocket
- Spring
  Spring
  - Task
  - MVC
    MVC
    
    注册Servlet组件
  - SpringBoot
    SpringBoot
    
    代理
  - Web
    Web
    
    Async
- 认证授权
  认证授权
  - Shiro
Methodologies
Methodologies
- 目录结构推荐
Multi-Platform-UI
Multi-Platform-UI
- egui
  egui
- Flutter
  Flutter
  - FVM
  - Platform
  - 环境
  - Libraries
    Libraries
    
    flutter_rust_bridge
    
    utils与extension
    
    webview
  - 常见问题
    常见问题
    
    Image
  - 构建
    构建
    
    IOS
- KMP
  KMP
- ReactNative
  ReactNative
  - HeadlessJS
  - TailwindCSS
  - WebView
  - 项目集
  - Animation
    Animation
    
    react-native-reanimated
  - Gesture
    
    Gesture
    
    react-native-gesture-handler
  - Native
    
    Native
    
    原生
- Robius
  Robius
- slint
  slint
Network
Network
- Forward
- Clash
  Clash
- SSE
  SSE
  - Gin
  - SpringWeb Stream
ORM
ORM
- Gorm
  Gorm
  - Gen
  - Insert
  - Select&Omit
  - Update
  - 关联
  - 扩展
OS
OS
- AppImage
- ArchLinux
- Android
  Android
  - WebView
  - 构建
- Desktop
  Desktop
  - ArchLinux
  - DE
  - Wayland
  - WM
  - 网络
  - 输入法
- Linux
  Linux
  - Proxy
- MacOS
  MacOS
  - 常见问题
- Windows
  Windows
  - Shortcuts
  - Win11编译异常慢
  - 多显示器
  - WSL
    WSL
    
    Network
Protocol
Protocol
- HTTPS
- RSocket
  RSocket
Python
Python
- Django
  Django
  - logging
  - Middleware
  - Model
  - Template
  - 信号Signal
  - 其他工具
  - Admin
    Admin
    
    1. Actions
    
    3.分页列表
    
    Base
    
    Index
    
    2.Forms
    2.Forms
    
    Add
    
    Change
    
    Index
- Libs
  Libs
  - uv
  - 常见
- PythonCore
  PythonCore
  - 字符串格式化
Resources
Resources
- Icon
- Shortcuts
- UI
- 创意园
- 工具网站
- 管理后台模板
- 资源
Rust
Rust
- 0.变量
- Cargo
- Full-Stack
- Rustup
- 全局配置
- 抑制编译器警告
- 断言
- 规范
- 1.类型
  1.类型
  - 其他
  - 函数
  - 数字
- 2.复合类型
  2.复合类型
  - 切片
  - 字符串
- RustGUI
  RustGUI
  - Rust UI对比
  - Leptos
    Leptos
    
    资源
Server
Server
- Gin
  Gin
  - route
- Nginx
  Nginx
  - 单页应用
  - 基本
  - module
    module
    
    core functionality
    
    rewrite
    
    http
    http
    
    core
    
    log
Service
Service
- Email
  Email
  - 配置
Shell
Shell
- WindowsCmd
- 常见开发命令
- Bash
  Bash
  - 坑点
- NuShell
  NuShell
- Powershell
  Powershell
- Somethings
  Somethings
  - Problems
  - 环境
Software Engineering
Software Engineering
- Version
- Architectural Pattern
  Architectural Pattern
- Design Pattern
  Design Pattern
  - Observer Pattern
- Messaging Pattern
  Messaging Pattern
- Programming Paradigm
  Programming Paradigm
SystemDesign
SystemDesign
- 命名风格NameStyle
- OpenAPI
  OpenAPI
- RateLimiting
  RateLimiting
  - Guava
- Security
  Security
  - ID伪装
  - TOTP
Web
Web
- npm
- SEO
- VS
- 项目模版搭建
- CSS
  CSS
  - CSSCore
    
    CSSCore
    
    css priority
    
    grid
    
    img
    
    position
  - Samples
    Samples
    
    玻璃与拟物
  - Sass
    Sass
    
    Node-Sass
  - TailwindCSS
    
    TailwindCSS
    
    preflight
- Jquery
  Jquery
  - Ajax
- NetAPI
  NetAPI
  - Axios
- NextJS
  NextJS
  - 渲染模式
- React
  React
  - 0-Basic
  - 1-Hooks
  - BUG
  - 表单
  - 高级
- SPA单页应用
  SPA单页应用
  - 路由
    路由
    
    Hash模式
    
    History模式
- VanillaJS
  VanillaJS
  - 模板字符串
- Vue
  Vue
  - css
  - keepalive
- WebAPI
  WebAPI
  - Event
  - Session
  - BOM
    BOM
    
    history
    
    location
    
    navigation
坑点
坑点

选择

一、先定死一个前提¶

👉 DeepSeek-R1 原版

❌ 个人无法部署
需要多机 H100/A100 集群

👉 个人能玩的，只有：

DeepSeek-R1 的蒸馏模型（distill）

二、个人部署能覆盖的参数范围¶

🎯 实际可行范围（重点）¶

参数规模	是否推荐	说明
1B ~ 3B	✔	轻量、本地工具
7B ~ 8B	✔✔✔	主力模型（最优解）
13B ~ 14B	✔（有条件）	需要好一点 GPU
32B	⚠（极限）	勉强可玩
70B+	❌	基本不用想

三、精度怎么选（关键）¶

精度 = 你用 FP16 / INT8 / INT4

🎯 推荐策略（直接抄就行）¶

模型大小	推荐精度
7B / 8B	INT4（首选） / INT8
13B	INT4
32B	必须 INT4

👉 为什么不用 FP16？¶

举个例子：

7B × 2 bytes ≈ 14GB

👉 一张 3060（12GB）直接爆

四、按显卡给你推荐（最实用部分）¶

🧠 1️⃣ 入门级（无 GPU / 低端）¶

CPU / 核显
内存 ≥ 16GB

👉 可跑：

1B / 3B
7B（INT4，极慢）

👉 工具：

llama.cpp

🧠 2️⃣ 主流玩家（最常见）¶

👉 GPU：

RTX 3060（12GB）
RTX 4060（8GB）

👉 可跑：

7B / 8B（INT4 ✔）
13B（勉强）

👉 推荐模型：

deepseek-r1-distill-qwen-7b
deepseek-r1-distill-llama-8b

🧠 3️⃣ 高端单卡（最佳个人体验）¶

👉 GPU：

RTX 3090 / 4090（24GB）

👉 可跑：

7B / 8B（飞快）
13B（很好）
32B（INT4 ✔ 可用）

👉 推荐：

主力：7B
高级任务：32B

🧠 4️⃣ 发烧级（多卡）¶

👉 GPU：

2×3090 / 4090

👉 可跑：

32B（流畅）
70B（勉强）

五、别忽略这个：上下文长度（隐形杀手）¶

👉 KV cache 会吃显存（很多人翻车）

例如：

模型	context	额外显存
7B	8K	~2GB
32B	8K	~10GB

👉 所以：

“能跑模型 ≠ 能开长上下文”