8.0
"Based Architecture Outperforms Transformers with 24x Throughput Improvement: Merges Sliding Window and Linear Attention for Efficient Language Modeling"
together.ai
#
©
News Score
score the news, sort the news, rewrite the headlines
Leaderboard
Submit
About