Attention - DEV Community

Skip to content

DEV Community

👋 Sign in for the ability to sort posts by relevant, latest, or top.

Breach Protocol

Jul 1

A Classic Efficiency Trick Just Moved Into a New Part of the AI

#architecture #mixtureofexperts #attention #efficiency

3 min read

cognitalk

Jun 9

MiniMax M3 大模型注意力机制上所做的重大颠覆与优化

#ai #podcast #algorithms #attention

2 min read

Breach Protocol

Jul 1

DeepSeek's new open models give everyone a million-word memory by default

#openweights #longcontext #deepseek #attention

3 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.