Browse HuggingFace datasets directly from your AI assistant.
- Search & filter datasets
- View rows & stats
- SQL queries & Parquet export
efecelik/dataview-mcp
Thanks for the detailed feedback! You're right that v1 has its quirks and we've experienced the repetition issues too.
Great to hear v1.5 is coming soon. We actually built a platform called AceSteps on top of this model (v1). You can create music, mint it as an NFT, tokenize it into tradeable shares, and earn from ad revenue. It's a Farcaster Mini-App on Base Network.
Planning to integrate v1.5 once it drops.
Didn't know they had a Discord server, thanks for the info.
This is a very good study. It reminded me of a few years ago, when I dismissed things like few-shot prompting as ridiculous; that was a big mistake.
The biggest gap in open-source datasets is high-quality, diverse data for AI, especially in scientific reasoning, multilingual, and multimodal domains.
Open Interpreter is my favorite because it runs locally, with no cloud dependency, and speeds up coding from the terminal.
import gensim
from sklearn.decomposition import PCA
import matplotlib.pyplot as plt

# Load the Word2Vec model trained on White's moves
model = gensim.models.Word2Vec.load('white_moves.model')

# Keep only the keys of interest (those starting with '->');
# gensim 4.x uses wv.key_to_index instead of the removed wv.vocab
moves = [k for k in model.wv.key_to_index if k.startswith('->')]

# Collect the embedding vectors for exactly those moves, so the
# plotted points and their labels stay aligned
X = model.wv[moves]

# Project the vectors down to 2D with PCA
pca = PCA(n_components=2)
result = pca.fit_transform(X)

# Scatter plot of the projected moves, annotated with move names
fig, ax = plt.subplots()
ax.plot(result[:, 0], result[:, 1], 'o')
ax.set_title('White moves')
for i, lb in enumerate(moves):
    ax.annotate(lb, xy=(result[i, 0], result[i, 1]))
plt.show()