Coverage (30d)
0 vs 0
This Week
0 vs 0
Evidence
1 article
Relationships
0
Timeline
Mixture-of-Depths Attention (2026-03-22)
Research paper introducing a mixture-of-depths attention mechanism to mitigate feature dilution in deep LLMs.
FlashAttention-4 (2026-03-08)
Research paper introducing FlashAttention-4, an algorithm and kernel-pipelining co-design for asymmetric hardware scaling.
Ecosystem
FlashAttention-4
uses transformer model (2 sources)
uses Blackwell (1 source)
uses Hopper (1 source)
Mixture-of-Depths Attention
uses large language models (1 source)