HuggingFaceFW/fineweb
Viewer • Updated • 52.5B • 967k • 2.8k
How to use deliciouscat/deberta-v3-base-encoder-decoder-v0.3 with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("deliciouscat/deberta-v3-base-encoder-decoder-v0.3")
model = AutoModelForSeq2SeqLM.from_pretrained("deliciouscat/deberta-v3-base-encoder-decoder-v0.3")deliciouscat/deberta-v3-base-encoder-decoder-v0.2-> 297511524(298M) params
HuggingFaceFW/fineweb
AiHub ko-en translation corpus (English part)
Some papers that I kept
optimizer: AdamW, lr=3e-5, betas=(0.875, 0.997)
batch size: 12
-> training on denoising objective (BART), 29523 step
from transformers import AutoTokenizer, EncoderDecoderModel
model = EncoderDecoderModel.from_pretrained("deliciouscat/deberta-v3-base-encoder-decoder-v0.3")
tokenizer = AutoTokenizer.from_pretrained("deliciouscat/deberta-v3-base-encoder-decoder-v0.3")
train more scientific data
fine-tune on keyword extraction task