legacy-datasets/wikipedia
Updated • 125k • 630
How to use SzegedAI/hubertusz-tiny-wiki with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForPreTraining
tokenizer = AutoTokenizer.from_pretrained("SzegedAI/hubertusz-tiny-wiki")
model = AutoModelForPreTraining.from_pretrained("SzegedAI/hubertusz-tiny-wiki")This model was trained from scratch on the Wikipedia subset of Hungarian Webcorpus 2.0 with MLM and SOP tasks.
First phase:
Second phase: