Remove .gguf files (keep Q2_K.gguf)
- SpecAI-Q3_K_L.gguf +0 -3
- SpecAI-Q3_K_M.gguf +0 -3
- SpecAI-Q3_K_S.gguf +0 -3
- SpecAI-Q4_0.gguf +0 -3
- SpecAI-Q4_K_M.gguf +0 -3
- SpecAI-Q4_K_S.gguf +0 -3
- SpecAI-Q5_0.gguf +0 -3
- SpecAI-Q5_K_M.gguf +0 -3
- SpecAI-Q5_K_S.gguf +0 -3
- SpecAI-Q6_K.gguf +0 -3
- SpecAI-Q8_0.gguf +0 -3
SpecAI-Q3_K_L.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:0b9c967d026a485db7e4942b4bed85e7852e13311ffe9d9efd0b834be37c6e86
-size 2087597856
SpecAI-Q3_K_M.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:c2a8b5029a5c56a445bac2984a1b3de718d45ad4644ae2b4b10ad64b96adb1f9
-size 1955477280
SpecAI-Q3_K_S.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:6de71a947194d0ea0be696a796c9b453c658c517a51cc6afd760caadc6e8bfbb
-size 1681798944
SpecAI-Q4_0.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:c00e8472dea6029ff8b1281fc92b4b54af7b4c37e40feb29b851fa39591d129a
-size 2176177440
SpecAI-Q4_K_M.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:0ac474e9f19411f2df25c3bcbf062cebc3dd903fdc5a1a7c1183392288af260d
-size 2393232672
SpecAI-Q4_K_S.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:9e4640ba67f36990daa6608d52a20771116f10be0a5d9932e552773f06da0958
-size 2188760352
SpecAI-Q5_0.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:ee86b0e4856411570a9d37c0b8267fdff739c316b087109985cd385f69df232a
-size 2641474848
SpecAI-Q5_K_M.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:b77e75d17fcb2af640e5d3a2126748da9dd1e7c3e1529ccdaddb02ed91f5c37c
-size 2815276320
SpecAI-Q5_K_S.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:419594957692fa92b348e1b332cd0940d3aeec494053b6b69bb1927944f66b16
-size 2641474848
SpecAI-Q6_K.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:022de3b3df248d8a2b5e61524de68c6956e3ab7abec1267110a5d3be04da2dbf
-size 3135853344
SpecAI-Q8_0.gguf
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:739aafec53d81ced91f18cab1611afc62191755846bce3dbc386fdcb4a782a8d
-size 4061222688
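Each deleted file in this commit is a three-line Git LFS pointer (version, oid, size) rather than the model weights themselves. As a rough sketch of how those pointers could be read programmatically, the Python below parses one pointer and sums the `size` fields of several. The function names `parse_lfs_pointer` and `total_size` are hypothetical helpers introduced here for illustration, not part of any Git LFS or Hugging Face tooling, and the handling of a leading `-` assumes the text was copied from the removed lines of a diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer (hypothetical helper) into its fields.

    Accepts either a raw pointer file or removed diff lines that
    start with '-', as shown in the hunks above.
    """
    fields = {}
    for line in text.strip().splitlines():
        # Assumption: a leading '-' is a diff marker, not pointer content.
        if line.startswith("-"):
            line = line[1:]
        key, _, value = line.strip().partition(" ")
        if key:
            fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size in bytes
    }


def total_size(pointers: list[str]) -> int:
    """Sum the byte sizes recorded in several pointer texts."""
    return sum(parse_lfs_pointer(p)["size"] for p in pointers)


# Two of the pointers deleted in this commit, copied from the diff.
q3_k_l = """-version https://git-lfs.github.com/spec/v1
-oid sha256:0b9c967d026a485db7e4942b4bed85e7852e13311ffe9d9efd0b834be37c6e86
-size 2087597856"""

q8_0 = """-version https://git-lfs.github.com/spec/v1
-oid sha256:739aafec53d81ced91f18cab1611afc62191755846bce3dbc386fdcb4a782a8d
-size 4061222688"""

if __name__ == "__main__":
    info = parse_lfs_pointer(q3_k_l)
    print(info["hash_algo"], info["size"])
    print(total_size([q3_k_l, q8_0]))
```

Passing all eleven pointer texts to `total_size` would give the combined size, in bytes, of the files this commit removes.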