Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
75
Mex Ivanov
MexIvanov
Follow
skatzR's profile picture
21world's profile picture
evilfreelancer's profile picture
3 followers
ยท
12 following
MexIvanov
AI & ML interests
NLP, Coding, Quantum Computing and more.
Recent Activity
reacted
to
marksverdhei
's
post
with ๐
about 17 hours ago
Poll: Will 2026 be the year of subquadratic attention? The transformer architecture is cursed by its computational complexity. It is why you run out of tokens and have to compact. But some would argue that this is a feature not a bug and that this is also why these models are so good. We've been doing a lot of research on trying to make equally good models that are computationally cheaper, But so far, none of the approaches have stood the test of time. Or so it seems. Please vote, don't be shy. Remember that the Dunning-Kruger effect is very real, so the person who knows less about transformers than you is going to vote. We want everyone's opinion, no matter confidence. ๐ if you think at least one frontier model* will have no O(n^2) attention by the end of 2026 ๐ฅ If you disagree * Frontier models - models that match / outperform the flagship claude, gemini or chatgpt at the time on multiple popular benchmarks
liked
a model
24 days ago
google/translategemma-27b-it
liked
a model
24 days ago
google/translategemma-12b-it
View all activity
Organizations
None yet
MexIvanov
's datasets
4
Sort:ย Recently updated
MexIvanov/RAG-v1-ruen
Viewer
โข
Updated
Nov 11, 2024
โข
51.4k
โข
9
โข
2
MexIvanov/image-gen-vector-consistency
Viewer
โข
Updated
Aug 30, 2024
โข
184
โข
8
MexIvanov/CodeExercise-Python-27k-ru
Viewer
โข
Updated
Dec 19, 2023
โข
27.2k
โข
195
โข
3
MexIvanov/Vezora-Tested-22k-Python-Alpaca-ru
Viewer
โข
Updated
Dec 19, 2023
โข
22.6k
โข
66
โข
2