Posts
Page 3 - Showing 8 of 75 posts
t-SNE
The core objective of t-SNE (t-distributed Stochastic Neighbor Embedding) is to reduce the dimensionality of high-dimensional data while preserving local neighborhood structure.
828 words | 4 min
Self-Distillation
This paper proposes DINO, a self-distillation framework with no labels, for pretraining ViTs. Beyond the fact that DINO works quite well on this kind of architecture, two interesting properties emerge from the learned features:
1005 words | 5 min
On-Policy Distillation
Currently, large models are post‑trained via RLHF, making them powerful but expensive to train and deploy, while smaller models are usually fine‑tuned with SFT or KD methods and are easier to deploy and adapt but often lack the performance of larger models.
944 words | 5 min
Fourier and Wavelets for Deep Learning
Let f\in L^2(\mathbb{R}). The Fourier transform (in the L^2 sense) represents a signal as a superposition of global sinusoidal basis functions:
3373 words | 17 min
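Flow Models Chapter 4: Score-Based Generative Models
Denoising score matching with Langevin dynamics (SMLD) and continuous-time score-based models (VP-SDE / VE-SDE) are two score-matching approaches to generative modeling. Both generate new samples by learning the score function of the data distribution, that is, the gradient of the log probability density.
4675 words | 23 min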
Collections
Tech Notes
Be a big fan of rising technologies.
Algorithm Notes
Woo. Time-tested wisdom.
Reinforcement Learning Collection
Notes on reading papers in the field of reinforcement learning.
Maths Laboratory
Does mathematics have a lab as well? Absolutely, with computer simulations.
Generative Models Collection
Generative Models, modeling complex data distributions.
Tags

