posts/how-to-scale/ #7

2024-08-19T11:11:01Z

giscus[bot]
bot Aug 19, 2024

posts/how-to-scale/

My colleagues and I always get excited when, every once in a while, deep learning research throws up a fun little maths problem. Our recent work on u-μP does just this, and in a reasonably systematic way, since we need to work out how to compensate for changes in scale (standard deviation) through deep learning ops. In this post and the accompanying notebook, we explore this problem.

https://graphcore-research.github.io/posts/how-to-scale/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

posts/how-to-scale/ #7

{{title}}

Replies: 0 comments

Select a reply

posts/how-to-scale/ #7

giscus[bot] bot Aug 19, 2024

posts/how-to-scale/

Replies: 0 comments

giscus[bot]
bot Aug 19, 2024