Time |
Speaker |
Talk Title |
Paper |
Video |
(Special) 05/26 |
张景昭(清华大学) |
Two Phases of Scaling Laws for Nearest Neighbor Classifiers |
/ |
B站 |
03/03 |
张鼎怀(Mila) |
GFlowNets: Exploration for Probabilistic Inference |
[1],[2],[3],[4] |
B站 |
03/10 |
顾欣然(清华大学) |
Why (and When) does Local SGD Generalize Better than SGD |
[1] |
B站 |
03/17 |
王博涵(中国科学技术大学) |
Provable Benefit of Adaptivity in ADAM |
[1] |
B站 |
03/24 |
温凯越(清华大学) |
How Does Sharpness-Aware Minimization Minimize Sharpness? |
[1] |
B站 |
03/31 |
张博航(北京大学) |
Rethinking the Expressive Power of GNNs via Graph Biconnectivity |
[1] (ICLR 2023 Outstanding Paper) |
B站 |
04/07 |
马鉴昊(UMich) |
Escaping Saddle Points Or Not? |
[1], [2] |
B站 |
04/14 |
陈乐偲(复旦大学) |
On Bilevel Optimization without Lower-level Strong Convexity |
[1] |
B站 |
04/21 |
黄凯旋(Princeton) |
Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data |
[1] |
B站 |
04/28 |
戴言(清华大学) |
Variance-Aware Sparse Linear Bandits |
[1] |
B站 |