Instructions to use LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints
- SGLang
How to use LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints with Docker Model Runner:
docker model run hf.co/LLM-OS-Models/KoHRM-Text-1.4B-raw-checkpoints
Ctrl+K
- epoch2-final-stage4b-korean-tool-finance-repeat-globalstep-470077
- stage1-gbs180
- stage1b-hrm-fastcap-repeat-step240000
- stage1b-hrm-fastcap-repeat-step250000
- stage1b-hrm-fastcap-repeat-step260000
- stage1b-hrm-fastcap-repeat-step270000
- stage1b-hrm-fastcap-repeat-step280000
- stage1b-hrm-fastcap-repeat-step290000
- stage1b-hrm-fastcap-repeat-step300000
- stage1b-hrm-fastcap-repeat-step310000
- stage1b-hrm-fastcap-repeat
- stage1c-hrm-fastcap-repeat2-step510000
- stage1c-hrm-fastcap-repeat2-step520000
- stage1c-hrm-fastcap-repeat2-step530000
- stage1c-hrm-fastcap-repeat2-step540000
- stage1c-hrm-fastcap-repeat2-step550000
- stage1c-hrm-fastcap-repeat2
- stage1d-hrm-fastcap-repeat3-step710000
- stage1d-hrm-fastcap-repeat3-step720000
- stage1d-hrm-fastcap-repeat3-step730000
- stage1d-hrm-fastcap-repeat3-step740000
- stage1d-hrm-fastcap-repeat3-step750000
- stage1d-hrm-fastcap-repeat3-step760000
- stage1d-hrm-fastcap-repeat3-step770000
- stage1d-hrm-fastcap-repeat3-step780000
- stage1d-hrm-fastcap-repeat3
- stage2-hrm-full-nocap-current
- stage2-hrm-full-nocap-final-epoch1
- stage2-hrm-full-nocap-step150000-160000
- stage2b-hrm-full-nocap-extra-epoch1-step320000
- stage2b-hrm-full-nocap-extra-epoch1-step330000
- stage2b-hrm-full-nocap-extra-epoch1-step340000
- stage2b-hrm-full-nocap-extra-epoch1-step350000
- stage2b-hrm-full-nocap-extra-epoch1-step360000
- stage2b-hrm-full-nocap-extra-epoch1-step370000
- stage2b-hrm-full-nocap-extra-epoch1-step380000
- stage2b-hrm-full-nocap-extra-epoch1-step390000
- stage2b-hrm-full-nocap-extra-epoch1
- stage2c-hrm-full-nocap-repeat2-step560000
- stage2c-hrm-full-nocap-repeat2-step570000
- stage2c-hrm-full-nocap-repeat2-step580000
- stage2c-hrm-full-nocap-repeat2-step590000
- stage2c-hrm-full-nocap-repeat2-step600000
- stage2c-hrm-full-nocap-repeat2-step610000
- stage2c-hrm-full-nocap-repeat2-step620000
- stage2c-hrm-full-nocap-repeat2-step630000
- stage2c-hrm-full-nocap-repeat2
- stage2d-hrm-full-nocap-repeat3-step790000
- stage2d-hrm-full-nocap-repeat3-step800000
- stage2d-hrm-full-nocap-repeat3-step810000