📰 Dev.to · Samyak Jain
Articles from Dev.to · Samyak Jain · 3 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (10450)
ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog

Dev.to · Samyak Jain
5mo ago
Scaling Is All You Need: Understanding sqrt(dₖ) in Self-Attention
Been trying to understand the scaling in the attention formula, specifically sqrt(d_k). It confused...
![[Boost]](https://media2.dev.to/dynamic/image/width=1000,height=500,fit=cover,gravity=auto,format=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fehd0vypd4gpxv366kb9t.png)
Dev.to · Samyak Jain
6mo ago
[Boost]
Positional Encoding - Sense of direction for Transformers ...

Dev.to · Samyak Jain
6mo ago
Positional Encoding - Sense of direction for Transformers
I have been trying to understand how transformers work lately, and whenever we read or hear about...
DeepCamp AI