Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shahnawaz Ahmed's picture
2

Shahnawaz Ahmed

swaze
StefanAK's profile picture salomons's profile picture tommulder's profile picture
ยท

AI & ML interests

None yet

Recent Activity

reacted to JonnaMat's post with ๐Ÿ‘€ about 6 hours ago
๐Ÿš€ FlashHead: Efficient Drop-In Replacement for the Classification Head in Language Model Inference ๐Ÿ”Ž Check out our latest FlashHead-enabled model: https://huggingface.co/embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead ๐Ÿงฉ Seamless integration with vllm: ``` docker run --rm -it \ --network host \ --shm-size=8g \ --ulimit memlock=-1 \ --ulimit stack=67108864 \ --runtime=nvidia \ --name=vllm-serve \ -e HF_TOKEN=hf_*** \ -e HF_HOME=/root/.cache/huggingface \ embedl/vllm:latest-jetson-orin-flashhead \ vllm serve "embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead" \ --max-model-len 8192 \ --gpu-memory-utilization 0.75 \ --max-num-seqs 2 \ --trust-remote-code ```
liked a Space 4 days ago
jane-street/droppedaneuralnet
updated a Space 3 months ago
embedl/README
View all activity

Organizations

Embedl's profile picture

swaze 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs