Anvaya-Rabbit 2.7B

India's first sovereign SSM-based language model.

Non-transformer architecture. No attention mechanism. Constitutional training via Gurukul. 7 patents filed at IP India.

What's in this repo

Three model tiers are available, each built on the same 2.7B parameter base:

Tier	File	Use this when…
Base	`base/Anvaya-Rabbit-2.7B-0.5-alpha-base.pt`	You want raw pretrained weights for your own fine-tuning
Instruct	`instruct/Anvaya-Rabbit-2.7B-0.5-alpha-instruct.pt`	You want a general-purpose assistant that follows instructions
Imprint	`imprint/Anvaya-Rabbit-2.7B-0.5-alpha-imprint.pt`	You want the full Rabbit persona — opinionated, constitutional, identity-aware

If you're not sure which to use, start with Instruct.

Quickstart

pip install rtaforge transformers

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")
tokenizer.add_special_tokens({"additional_special_tokens": ["<|im_start|>", "<|im_end|>"]})

model = AutoModelForCausalLM.from_pretrained(
    "RtaForge/Anvaya-Rabbit-2.7B",
    trust_remote_code=True,
    torch_dtype="bfloat16",
    device_map="auto",
)

# v0.5-alpha uses raw completion format
prompt = "Rabbit is a helpful and honest assistant.\n\nUser: Who are you?\nRabbit:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=60, repetition_penalty=1.3)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

v0.5-alpha uses raw completion format. Chat template support (ChatML) coming in v0.9.

The rtaforge runtime package provides the compiled architecture. Source is not distributed.

Why SSM?

Transformers scale quadratically with context length because every token attends to every other token. SSMs replace attention with a fixed-size recurrent state: inference cost stays constant per token regardless of context length, VRAM footprint shrinks dramatically, and long-document throughput improves by orders of magnitude — all at the same parameter count.

Architecture

Rabbit is built on RtaSSM v7.2.2-FU "Fortress Unbroken", a custom state-space model developed at RtaForge:

No attention mechanism — purely recurrent SSM layers with learned state dynamics
64 layers, 2560 hidden dimensions, 2.7B parameters, bfloat16
Constitutional training — Gurukul curriculum with wiki pretraining → instruct SFT → persona imprint
Vocabulary 50,280 tokens (GPT-NeoX tokenizer)

Training

Stage	Data	Notes
Wiki pretraining	Wikipedia (en)	732 constitutional proposals via Gurukul
Instruct SFT	ChatML instruction pairs	`gate_only` trainable strategy
Persona imprint	Rabbit constitutional corpus	Identity and value alignment

Evaluation Access

Weights are publicly available. Runtime package is live:

pip install rtaforge

To evaluate Rabbit or discuss deployment: 📧 guha@rtaforge.in 🌐 rtaforge.in

Runtime documentation coming soon.

Maturity and Roadmap

v0.5-alpha is a proof of concept. It demonstrates that the RtaSSM architecture trains end-to-end, the Gurukul constitutional pipeline works, and the weights are real.

Usable conversational behaviour is targeted at v0.8–v0.9, currently in training.

Evaluating for deployment? Wait for v0.9.
Evaluating the architecture or training methodology? v0.5-alpha is exactly what you need.

Limitations

v0.5-alpha has not been evaluated on standard benchmarks. She is small, she is new, and she is learning. Feedback welcome at guha@rtaforge.in.

Citation

@misc{anvaya-rabbit-2026,
  title  = {Anvaya-Rabbit: A Sovereign SSM Language Model},
  author = {RtaForge},
  year   = {2026},
  url    = {https://huggingface.co/RtaForge/Anvaya-Rabbit-2.7B}
}

Anvaya (अन्वय) — logical connection, coherence. Rabbit — the fast runner.

Downloads last month: 199