Quasar Research

non-profit
Activity Feed

AI & ML interests

Achieving AGI through broad, domain expert datasets.

Locutusqueย 
posted an update 4 days ago
view post
Post
148
๐Ÿš€ Introducing Esmeralda-Llama-3.1-8B-control
The first release in the Esmeralda model family by Locutusque.

This model is intentionally small and experimental โ€” a control/baseline proof-of-concept designed to answer one question:

ยซโ€œHow strong is my new "Locutusque/esmeralda-agentic" dataset before scaling to larger runs?โ€ยป

Training Details

- Base: Llama 3.1 8B
- Training precision: bf16 mixed precision
- Chat template: modified ChatML
- Dataset size: ~37k examples
- Examples actually used for this run: ~5k

The dataset includes:

- multi-turn agentic traces
- reasoning traces
- structured assistant behavior
- generalist instruction data

Benchmark Results

Compared against:

- Llama 3.1 8B Instruct
- Hermes-3-Llama-3.1-8B

HumanEval

57.3 โ€” Esmeralda
56.1 โ€” Llama 3.1 Instruct
52.4 โ€” Hermes-3

MBPP

53.2 โ€” Esmeralda
56.8 โ€” Llama 3.1 Instruct
48.2 โ€” Hermes-3

GPQA Diamond

15.7 โ€” Esmeralda
15.7 โ€” Llama 3.1 Instruct
18.2 โ€” Hermes-3

EQ-Bench

59.2 โ€” Esmeralda
61.1 โ€” Llama 3.1 Instruct
63.1 โ€” Hermes-3

EQ-Bench Parseable (Syntax Stability)

๐Ÿ”ฅ 100.0% โ€” Esmeralda
92.4% โ€” Llama 3.1 Instruct
91.2% โ€” Hermes-3

Here Be Dragons ๐Ÿ‰

I also experimented with a new TruthfulQA free-generation evaluation setup.

- Responses were judged by Gemma 4 26B A4B
- The judge compared generations directly against ground-truth answers
- Models were evaluated in 8-bit quantized form to speed up inference

TruthfulQA (LLM Judge)

0.682 โ€” Esmeralda-Llama-3.1-8B-control
0.587 โ€” Hermes-3-Llama-3.1-8B (reported MC2 score; methodology differs)

For a lightweight control run trained on only a fraction of the dataset, Iโ€™m pretty encouraged by the results.

The model is released under the standard Llama 3.1 license, and Iโ€™d genuinely love feedback from people testing it in real workflows.

Model: Locutusque/Esmeralda-Llama-3.1-8B-control

Dataset: Locutusque/esmeralda-agentic

Tonicย 
posted an update 15 days ago
view post
Post
2740
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ Hey there folks ,

Turns out : if we predict ๐ŸŒ earth we can save a lot of time looking for interesting things and less time looking at things that we expect to see.

Sentinel-2 imagery ๐Ÿ›ฐ๏ธbasically takes a long time to download towards earth. so our "near real time" systems are quite far from that in practical terms.

meanwhile , if we "predict" what we will see , based on what we do see , we can send down much less data in a timely way , and prioritize ๐Ÿ“กearth-bound response .

I'm talking about illegal fishing , logging , mining or building in nature reserves , the more of that we predict early the more we're able to stop it on time.

At least that's the concept !

check out the blog : https://huggingface.co/blog/Tonic/save-patagonia-by-predicting-earth


- Collection: https://huggingface.co/collections/NuTonic/earth-observation-with-temporal-and-general-understanding
- Code: https://github.com/Josephrp/Nutonic
- Dataset: NuTonic/sat-vl-sft-training-ready-v1
- Model: NuTonic/lspace
- Training: NuTonic/lspace-trackio
- Evals: NuTonic/Patagonia_Eval
  • 2 replies
ยท
Tonicย 
posted an update about 1 month ago
view post
Post
4268
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ Hey there folks,

since everyone liked my previous announcement post ( https://huggingface.co/posts/Tonic/338509028435394 ) so much , i'm back with more high quality proceedural datasets in the Geospacial domain for SFT training !

Check this one out :
NuTonic/sat-bbox-metadata-sft-v1

the goal is to be able to train vision models on multiple images for remote sensing analysis with one shot .

hope you like it ! ๐Ÿš€
  • 2 replies
ยท
Tonicย 
posted an update about 1 month ago
view post
Post
3621
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ Hey there folks ,

I'm sharing huggingface's largest dataset of annotated statelite images today.

check it out here : NuTonic/sat-image-boundingbox-sft-full

I hope you like it , the idea is to be able to use this with small vision models ๐Ÿš€
Tonicย 
posted an update 3 months ago
view post
Post
3757
๐Ÿค” Who would win ?

- a fully subsidized ai lab
OR
- 3 random students named
kurakurai
?

demo : Tonic/fr-on-device

if you like it give the demo a little star and send a shoutout to : @MaxLSB @jddqd and @GAD-cell for absolutely obliterating the pareto frontier of the french language understanding .
  • 4 replies
ยท
Tonicย 
posted an update 3 months ago
view post
Post
3446
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธhello my lovelies ,

it is with great pleasure i present to you my working one-click deploy 16GB ram completely free huggingface spaces deployment.

repo : Tonic/hugging-claw (use git clone to inspect)
literally the one-click link : Tonic/hugging-claw

you can also run it locally and see for yourself :

docker run -it -p 7860:7860 --platform=linux/amd64 \
-e HF_TOKEN="YOUR_VALUE_HERE" \
-e OPENCLAW_GATEWAY_TRUSTED_PROXIES="YOUR_VALUE_HERE" \
-e OPENCLAW_GATEWAY_PASSWORD="YOUR_VALUE_HERE" \
-e OPENCLAW_CONTROL_UI_ALLOWED_ORIGINS="YOUR_VALUE_HERE" \
registry.hf.space/tonic-hugging-claw:latest


just a few quite minor details i'll take care of but i wanted to share here first
  • 2 replies
ยท
Locutusqueย 
posted an update 7 months ago
view post
Post
2866
๐Ÿš€ AutoXLA - Accelerating Large Models on TPU
AutoXLA is an experimental library that automates the distribution, optimization, and quantization of large language models for TPUs using PyTorch/XLA. It extends the Hugging Face Transformers interface with TPU-aware features such as automatic sharding, custom attention kernels, and quantization-aware loading, making large-scale deployment and training both simpler and faster.
With quantization and Splash Attention kernels, AutoXLA achieves up to 4ร— speedups over standard Flash Attention implementations, significantly improving throughput for both inference and training workloads.
Whether youโ€™re experimenting with distributed setups (FSDP, 2D, or 3D sharding) or optimizing memory via LanguageModelQuantizer, AutoXLA is built to make scaling LLMs on TPU seamless.
โš ๏ธ Note: This is an experimental repository. Expect rough edges! Please report bugs or unexpected behavior through GitHub issues.
๐Ÿ”— GitHub Repository: https://github.com/Locutusque/AutoXLA

adamm-hfย 
posted an update 7 months ago
adamm-hfย 
posted an update 7 months ago
view post
Post
831
The new King ๐Ÿ‘‘has arrived!

Moonshot AI now the top model on Hugging Face ๐Ÿ”ฅ
moonshotai/Kimi-K2-Thinking
adamm-hfย 
posted an update 7 months ago
view post
Post
2886
๐Ÿ’ธ๐Ÿค‘You donโ€™t need 100 GPUs to train something amazing!

Our Smol Training Playbook teaches you a better path to world-class LLMs, for free!

Check out the #1 trending space on ๐Ÿค— :
HuggingFaceTB/smol-training-playbook
adamm-hfย 
posted an update 8 months ago
view post
Post
2361
Cool stuff these past weeks on huggingface! ๐Ÿค— ๐Ÿš€ !
โ€ข ๐Ÿ“ˆTrackio, local-first W&B alternative
https://github.com/gradio-app/trackio/issues
โ€ข ๐ŸŒEmbeddingGemma, 300M-param, multilingual embeddings, on-device
https://huggingface.co/blog/embeddinggemma
โ€ข ๐Ÿ’ปOpen LLMs in VS Code (Inference Providers)
https://x.com/reach_vb/status/1966185427582497171
โ€ข ๐Ÿค–Smol2Operator GUI agents
https://huggingface.co/blog/smol2operator
โ€ข ๐Ÿ–ผ๏ธGradio visible watermarking
https://huggingface.co/blog/watermarking-with-gradio
Tonicย 
posted an update 8 months ago
Tonicย 
posted an update 9 months ago
view post
Post
859
COMPUTER CONTROL IS ON-DEVICE !

๐Ÿก๐Ÿค– 78 % of EU smart-home owners DONโ€™T trust cloud voice assistants.

So we killed the cloud.

Meet Extรฉ: a palm-sized Android device that sees, hears & speaks your language - 100 % offline, 0 % data sent anywhere.

๐Ÿ”“ We submitted our technologies for consideration to the Liquid AI hackathon.

๐Ÿ“Š Dataset: 79 k UI-action pairs on Hugging Face (largest Android-control corpus ever) Tonic/android-operator-episodes

โšก Model: 98 % task accuracy, 678MB compressed , fits on existing android devices ! Tonic/l-android-control

๐Ÿ›ค๏ธ Experiment Tracker : check out the training on our TrackioApp Tonic/l-android-control

๐ŸŽฎ Live Model Demo: Upload an Android Screenshot and instructions to see the model in action ! Tonic/l-operator-demo



Built in a garage, funded by pre-orders, no VC. Now weโ€™re scaling to 1 k installer units.

Weโ€™re giving 50 limited-edition prototypes to investors , installers & researchers who want to co-design the sovereign smart home.

๐Ÿ‘‡ Drop โ€œEUSKERAโ€ in the comments if you want an invite, tag a friend who still thinks Alexa is โ€œconvenient,โ€ and smash โ™ฅ๏ธ if AI should belong to people - not servers.
  • 4 replies
ยท
Tonicย 
posted an update 9 months ago
view post
Post
781
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ Hey there folks ,

Just wanted to annouce ๐ŸญSmolFactory : it's the quickest and best way to finetune SmolLM3 and GPT-OSS-20B on huggingface !

Basicaly it's an app you can run on huggingface by duplicating the space and running your training directly on huggingface GPUs .

It will help you basically select datasets and models, fine tune your model , make an experiment tracker you can use on your mobile phone , push all your model card and even automatically make a demo for you on huggingface so you can directly test it out when it's done !

check out the blog to learn more : https://huggingface.co/blog/Tonic/smolfactory

or just try the app directly :
Tonic/SmolFactory

you can vibe check the cool models I made :
French SmolLM3 : Tonic/Petite-LLM-3
Medical GPT-OSS : Tonic/med-gpt-oss-20b-demo

check out the model cards :
multilingual reasoner (gpt-oss) - Tonic/gpt-oss-20b-multilingual-reasoner
med-gpt-oss : Tonic/med-gpt-oss-20b
petite-elle-l-aime : Tonic/petite-elle-L-aime-3-sft

github repo if you like command line more than gradio : https://github.com/josephrp/smolfactory

drop some likes on these links it's really much appreciated !

feedback and PRs are welcome !
Locutusqueย 
posted an update 9 months ago
view post
Post
7263
๐ŸŒฒ๐Ÿ„ LLM Forest Orchestra: Turning Hidden States into Music

Hello everyone! I'm excited to introduce a new Space I've been developing called LLM Forest Orchestra. This project converts the hidden states and attention patterns of transformer models into layered MIDI compositions. The concept draws inspiration from mushrooms and mycelial networks in forests. Fungi create underground connections linking plants and trees, establishing what some call a "wood-wide web" where signals and nutrients travel. Researchers have discovered that these exchanges form patterns resembling rhythms and pulses. When translated appropriately, these patterns can become music.

Transformers operate through remarkably similar principles: tokens share signals via hidden states and attention heads. This Space transforms those invisible information flows into notes, chords, and rhythms, treating the model as a digital forest orchestra.

๐ŸŽ› Features

* Two compute modes:
- Full model operates on a Hugging Face model (defaulting to unsloth/Qwen3-14B-Base).
- Mock latents provides a CPU-friendly option that simulates tensors for immediate experimentation.
* Musical controls: You can adjust scale selection, tempo grid, velocity range, instrument/role presets, and seed randomization.
* Output: The system generates .mid files compatible with DAWs and remixing workflows.

๐ŸŒŒ Why?

Neural networks already resemble unusual musical instruments: signals flow through them, patterns emerge organically, and careful observation reveals hidden melodies. This is analogous to the forest's secret orchestra of mushrooms and trees.

๐Ÿ‘‰ Try it

Try the Space here: Locutusque/LLM-Forest-Orchestra. I'm excited to hear the sounds you can generate. Please share your created MIDIs or remixes in the comments. Let's explore how this hidden forest of transformers can sound together. ๐ŸŒณ๐ŸŽถ
Tonicย 
posted an update 10 months ago
Tonicย 
posted an update 10 months ago
view post
Post
850
๐Ÿ‘‹ Hey there folks,

just submitted my plugin idea to the G-Assist Plugin Hackathon by @nvidia . Check it out, it's a great way to use a local SLA model on a windows machine to easily and locally get things done ! https://github.com/NVIDIA/G-Assist
Tonicย 
posted an update 10 months ago
view post
Post
669
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ Hey there folks ,

Yesterday , Nvidia released a reasoning model that beats o3 on science, math and coding !

Today you can try it out here : Tonic/Nvidia-OpenReasoning

hope you like it !
Tonicย 
posted an update 11 months ago
view post
Post
3426
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ Normalize adding compute & runtime traces to your model cards
  • 2 replies
ยท
Tonicย 
posted an update 11 months ago
view post
Post
563
Who's going to Raise Summit in Paris Tomorrow ?

If you're around , I would love to meet you :-)