Discussion about this post

Chinar Dankhara

Hey Sarah, brilliant post as always.

I do want to bring this to your attention: https://www.theverge.com/2023/4/14/23683084/openai-gpt-5-rumors-training-sam-altman. GPT-5 is not in the works as of 4-14-23.

I am also curious about how model architectures will adapt to available compute power. For example, while GPU power might only scale so far for technical or commercial reasons, we have much further to go with non-GPU accelerator chips such as Trainium: https://towardsdatascience.com/a-first-look-at-aws-trainium-1e0605071970. If you have thoughts on this, I would love to know more!
