所有文章

第 2 页 - 当前显示 8 篇，共 75 篇文章

按年份查看所有文章 →

Linux 学习笔记 3 —— 核心命令与系统管理

首次发布: 2026-04-01

... 次访问

Linux系统的一个核心设计哲学是”一切皆文件”，这一理念在设备管理上体现得淋漓尽致。本章将带你深入了解Linux如何将硬件设备抽象为文件，以及如何与这些设备进行高效、安全的交互。

4124 个字词

|

21 分钟

从零开始学 AI - 第一章：图像分类

首次发布: 2026-03-30

... 次访问

图像分类是计算机视觉中的基本任务之一，目标是将输入的图像分配到预定义的类别中。给定一个图像 x \in \mathbb{R}^{H \times W \times C}，其中 H、W 和 C 分别表示图像的高度、宽度和通道数，图像分类的目标是学习出一个图像到类别的映射

6095 个字词

|

30 分钟

Linux 学习笔记 2 —— 核心命令与系统管理

首次发布: 2026-03-23

... 次访问

你可能觉得 Linux 命令枯燥，但想象一下：

3847 个字词

|

19 分钟

Linux 学习笔记 1 —— 系统架构与核心机制

首次发布: 2026-03-22

... 次访问

Linux 系统可以分成三个主要层级：

2342 个字词

|

12 分钟

Foundational Models 论文阅读合集 1

首次发布: 2026-03-20 | 最后更新:2026-04-10

... 次访问

Explorations & Insights

Foundational Models

14538 个字词

|

73 分钟

大模型技术知识点 3

首次发布: 2026-03-12

... 次访问

/

大模型的一些应知知识点。

1934 个字词

|

10 分钟

变分法基础

首次发布: 2026-02-23

... 次访问

变分学是研究泛函极值（以及更一般的临界值）的一个数学分支，在机器学习、物理学等领域有广泛的应用。

4571 个字词

|

23 分钟

FlowRL - Matching Reward Distributions for LLM Reasoning

首次发布: 2026-02-09

... 次访问

Explorations & Insights

/

Large language model (LLM) reasoning is typically formulated as a conditional generation problem: given a question \mathbf{x} \in \mathcal{X}, a policy model \pi_{\theta}(\mathbf{y}|\mathbf{x}) generates an answer \mathbf{y} \in \mathcal{Y}. The quality of the answer is evaluated by a task-specific reward signal r(\mathbf{x}, \mathbf{y}). In reasoning tasks, the reward is usually sparse and terminal (e.g., correctness of the final answer), which means we consider one-step reward instead of returns (i.e., discounted sum of rewards over time steps).

1536 个字词

|

8 分钟

2