Text Generation
MLX
Safetensors
PyTorch
English
llama4_text
facebook
meta
mobilellm
mlx - apple-mlx - runtime
conversational
Instructions to use robbiemu/MobileLLM-R1-950M-MLX with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use robbiemu/MobileLLM-R1-950M-MLX with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("robbiemu/MobileLLM-R1-950M-MLX") prompt = "Write a story about Einstein" messages = [{"role": "user", "content": prompt}] prompt = tokenizer.apply_chat_template( messages, add_generation_prompt=True ) text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
- Pi new
How to use robbiemu/MobileLLM-R1-950M-MLX with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "robbiemu/MobileLLM-R1-950M-MLX"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "robbiemu/MobileLLM-R1-950M-MLX" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use robbiemu/MobileLLM-R1-950M-MLX with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "robbiemu/MobileLLM-R1-950M-MLX"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default robbiemu/MobileLLM-R1-950M-MLX
Run Hermes
hermes
- MLX LM
How to use robbiemu/MobileLLM-R1-950M-MLX with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Interactive chat REPL mlx_lm.chat --model "robbiemu/MobileLLM-R1-950M-MLX"
Run an OpenAI-compatible server
# Install MLX LM uv tool install mlx-lm # Start the server mlx_lm.server --model "robbiemu/MobileLLM-R1-950M-MLX" # Calling the OpenAI-compatible server with curl curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "robbiemu/MobileLLM-R1-950M-MLX", "messages": [ {"role": "user", "content": "Hello"} ] }'
| {{- bos_token }} | |
| {%- if custom_tools is defined %} | |
| {%- set tools = custom_tools %} | |
| {%- endif %} | |
| {%- if not tools_in_user_message is defined %} | |
| {%- set tools_in_user_message = true %} | |
| {%- endif %} | |
| {%- if not date_string is defined %} | |
| {%- if strftime_now is defined %} | |
| {%- set date_string = strftime_now("%d %b %Y") %} | |
| {%- else %} | |
| {%- set date_string = "26 Jul 2024" %} | |
| {%- endif %} | |
| {%- endif %} | |
| {%- if not tools is defined %} | |
| {%- set tools = none %} | |
| {%- endif %} | |
| {#- This block extracts the system message, so we can slot it into the right place. #} | |
| {%- if messages[0]['role'] == 'system' %} | |
| {%- set system_message = messages[0]['content']|trim %} | |
| {%- set messages = messages[1:] %} | |
| {%- else %} | |
| {%- set system_message = "Please reason step by step, and put your final answer within \\boxed{}." %} | |
| {%- endif %} | |
| {#- System message #} | |
| {{- "<|start_header_id|>system<|end_header_id|>\n\n" }} | |
| {%- if tools is not none %} | |
| {{- "Environment: ipython\n" }} | |
| {%- endif %} | |
| {%- if tools is not none and not tools_in_user_message %} | |
| {{- "You have access to the following functions. To call a function, please respond with JSON for a function call." }} | |
| {{- 'Respond in the format {"name": function name, "parameters": dictionary of argument name and its value}.' }} | |
| {{- "Do not use variables.\n\n" }} | |
| {%- for t in tools %} | |
| {{- t | tojson(indent=4) }} | |
| {{- "\n\n" }} | |
| {%- endfor %} | |
| {%- endif %} | |
| {{- system_message }} | |
| {{- "<|eot_id|>" }} | |
| {#- Custom tools are passed in a user message with some extra guidance #} | |
| {%- if tools_in_user_message and not tools is none %} | |
| {#- Extract the first user message so we can plug it in here #} | |
| {%- if messages | length != 0 %} | |
| {%- set first_user_message = messages[0]['content']|trim %} | |
| {%- set messages = messages[1:] %} | |
| {%- else %} | |
| {{- raise_exception("Cannot put tools in the first user message when there's no first user message!") }} | |
| {%- endif %} | |
| {{- '<|start_header_id|>user<|end_header_id|>\n\n' -}} | |
| {{- "Given the following functions, please respond with a JSON for a function call " }} | |
| {{- "with its proper arguments that best answers the given prompt.\n\n" }} | |
| {{- 'Respond in the format {"name": function name, "parameters": dictionary of argument name and its value}.' }} | |
| {{- "Do not use variables.\n\n" }} | |
| {%- for t in tools %} | |
| {{- t | tojson(indent=4) }} | |
| {{- "\n\n" }} | |
| {%- endfor %} | |
| {{- first_user_message + "<|eot_id|>"}} | |
| {%- endif %} | |
| {%- for message in messages %} | |
| {%- if not (message.role == 'ipython' or message.role == 'tool' or 'tool_calls' in message) %} | |
| {{- '<|start_header_id|>' + message['role'] + '<|end_header_id|>\n\n'+ message['content'] | trim + '<|eot_id|>' }} | |
| {%- elif 'tool_calls' in message %} | |
| {%- if not message.tool_calls|length == 1 %} | |
| {{- raise_exception("This model only supports single tool-calls at once!") }} | |
| {%- endif %} | |
| {%- set tool_call = message.tool_calls[0].function %} | |
| {{- '<|start_header_id|>assistant<|end_header_id|>\n\n' -}} | |
| {{- '{"name": "' + tool_call.name + '", ' }} | |
| {{- '"parameters": ' }} | |
| {{- tool_call.arguments | tojson }} | |
| {{- "}" }} | |
| {{- "<|eot_id|>" }} | |
| {%- elif message.role == "tool" or message.role == "ipython" %} | |
| {{- "<|start_header_id|>ipython<|end_header_id|>\n\n" }} | |
| {%- if message.content is mapping or message.content is iterable %} | |
| {{- message.content | tojson }} | |
| {%- else %} | |
| {{- message.content }} | |
| {%- endif %} | |
| {{- "<|eot_id|>" }} | |
| {%- endif %} | |
| {%- endfor %} | |
| {%- if add_generation_prompt %} | |
| {{- '<|start_header_id|>assistant<|end_header_id|>\n\n' }} | |
| {%- endif %} | |