Instructions to use OddTheGreat/Meteor_4B_V.1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OddTheGreat/Meteor_4B_V.1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="OddTheGreat/Meteor_4B_V.1") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("OddTheGreat/Meteor_4B_V.1") model = AutoModelForImageTextToText.from_pretrained("OddTheGreat/Meteor_4B_V.1") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] inputs = processor.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use OddTheGreat/Meteor_4B_V.1 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "OddTheGreat/Meteor_4B_V.1" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OddTheGreat/Meteor_4B_V.1", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/OddTheGreat/Meteor_4B_V.1
- SGLang
How to use OddTheGreat/Meteor_4B_V.1 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "OddTheGreat/Meteor_4B_V.1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OddTheGreat/Meteor_4B_V.1", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "OddTheGreat/Meteor_4B_V.1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OddTheGreat/Meteor_4B_V.1", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use OddTheGreat/Meteor_4B_V.1 with Docker Model Runner:
docker model run hf.co/OddTheGreat/Meteor_4B_V.1
merge
This is a merge of pretrained gemma3 language models
Some circumstances made me remember what real low-end computer is, so idea of this merge was born.
Goal of this merge is to create uncensored, somewhat smart all-round model, small enough to run on really ancient hardware.
This model takes main good features from its components, it is not censored (to a reasonable degree, extremely stuff wasn't tested), creative, able to multilang AND have not bad uncensored vision.
This model tested on core2duo E8400 with 8gb ddr2 ram, partially offloaded on 2gb gt630. It was more an experiment, but speed was bearable. On normal hardware model is FAST.
Of course, it is 4B after all, so don't expect 32 or 24b performance, but for it's size it is really good. For vision i used mmproj from SicariusSicariiStuff/X-Ray_Alpha
tested both q5_k_m and q8, both stable, but i reccomend to use q8 if possible.
- Downloads last month
- 7