GGUF Files for stage2_twitter

These are the GGUF files for ilijalichkovski/stage2_twitter.

Note: this model has only been quantized to Q2_K, Q4_K_M, and Q8_0. Other quantizations may become available later.

Downloads

GGUF Link   Quantization   Description
Download    Q2_K           Lowest quality, smallest file
Download    Q4_K_M         Recommended: good balance of speed and quality
Download    Q8_0           Best quality among the quants
Download    f16            Full precision; prefer one of the quants instead
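As a rough guide to how the quants above trade file size for quality, on-disk size can be estimated from each scheme's effective bits per weight. The figures below are approximate llama.cpp averages for these schemes (an assumption for illustration, not measured from these files):

```python
# Rough GGUF file-size estimate for an 8B-parameter model.
# Effective bits-per-weight values are approximate llama.cpp
# averages (assumption), since k-quants mix block types.
BITS_PER_WEIGHT = {
    "Q2_K": 2.6,     # lowest quality, smallest file
    "Q4_K_M": 4.85,  # recommended balance
    "Q8_0": 8.5,     # near-lossless
    "f16": 16.0,     # full precision
}

def estimated_size_gb(n_params: float, quant: str) -> float:
    """Approximate on-disk size in gigabytes: params * bits / 8 bits-per-byte."""
    return n_params * BITS_PER_WEIGHT[quant] / 8 / 1e9

for quant in BITS_PER_WEIGHT:
    print(f"{quant:7s} ~{estimated_size_gb(8e9, quant):.1f} GB")
```

For the 8B model here this suggests roughly 2.6 GB for Q2_K versus about 16 GB for f16, which is why the full-precision file is rarely worth downloading for inference.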

Note from Flexan

I provide GGUFs and quantizations of publicly available models that do not have a GGUF equivalent available yet. This process is not yet automated and I download, convert, quantize, and upload them by hand, usually for models I deem interesting and wish to try out.

If there are quants missing that you'd like me to add, or if you'd like another public model converted, you can request either in the community tab. If you have questions about the model itself, please refer to the original model repo.

Model Card for stage2_twitter

This model is a fine-tuned version of None (the base model was not recorded in the training config). It has been trained using TRL.

Quick start

from transformers import pipeline

# NOTE: model="None" below is a placeholder inherited from the training
# config; replace it with the actual model id or a local checkpoint path.
question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
generator = pipeline("text-generation", model="None", device="cuda")
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
print(output["generated_text"])

Training procedure

Visualize in Weights & Biases

This model was trained with Distil.

Framework versions

  • TRL: 0.24.0
  • Transformers: 4.57.1
  • PyTorch: 2.9.0
  • Datasets: 4.3.0
  • Tokenizers: 0.22.2

Citations

Cite TRL as:

@misc{vonwerra2022trl,
    title        = {{TRL: Transformer Reinforcement Learning}},
    author       = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
    year         = 2020,
    journal      = {GitHub repository},
    publisher    = {GitHub},
    howpublished = {\url{https://github.com/huggingface/trl}}
}
Downloads last month: 24
Format: GGUF
Model size: 8B params
Architecture: qwen2
Model tree for Flexan/ilijalichkovski-stage2_twitter-GGUF