Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
attention
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
A Classic Efficiency Trick Just Moved Into a New Part of the AI
Breach Protocol
Breach Protocol
Breach Protocol
Follow
Jul 1
A Classic Efficiency Trick Just Moved Into a New Part of the AI
#
architecture
#
mixtureofexperts
#
attention
#
efficiency
Comments
1
comment
3 min read
DeepSeek's new open models give everyone a million-word memory by default
Breach Protocol
Breach Protocol
Breach Protocol
Follow
Jul 1
DeepSeek's new open models give everyone a million-word memory by default
#
openweights
#
longcontext
#
deepseek
#
attention
Comments
Add Comment
3 min read
MiniMax M3 大模型注意力机制上所做的重大颠覆与优化
cognitalk
cognitalk
cognitalk
Follow
Jun 9
MiniMax M3 大模型注意力机制上所做的重大颠覆与优化
#
ai
#
podcast
#
algorithms
#
attention
Comments
Add Comment
2 min read
A Looming Crisis of AI Generated Text
Nathan Epstein
Nathan Epstein
Nathan Epstein
Follow
Apr 22
A Looming Crisis of AI Generated Text
#
ai
#
llms
#
writing
#
attention
Comments
Add Comment
4 min read
TurboQuant: How a Simple Spin Saves Gigabytes of GPU Memory
Bharath Kadaluri
Bharath Kadaluri
Bharath Kadaluri
Follow
Apr 8
TurboQuant: How a Simple Spin Saves Gigabytes of GPU Memory
#
turboquant
#
attention
#
transformers
#
llm
Comments
Add Comment
6 min read
Attention Residuals: How Kimi Is Rethinking Transformer Depth
Guatu
Guatu
Guatu
Follow
Apr 7
Attention Residuals: How Kimi Is Rethinking Transformer Depth
#
ai
#
transformers
#
llmarchitecture
#
attention
Comments
Add Comment
3 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account