Instructions to use hebertgo/knowledgebase-intent-llm with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use hebertgo/knowledgebase-intent-llm with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir knowledgebase-intent-llm hebertgo/knowledgebase-intent-llm
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
| language: en | |
| license: mit | |
| base_model: mlx-community/Qwen2.5-1.5B-Instruct-4bit | |
| tags: | |
| - mlx | |
| - intent-detection | |
| - fine-tuned | |
| - knowledge-management | |
| - ios | |
| library_name: mlx | |
| # knowledgebase-intent-llm | |
| Fine-tuned model for intent detection in a knowledge management iOS app. | |
| ## Model Details | |
| - **Base Model**: mlx-community/Qwen2.5-1.5B-Instruct-4bit | |
| - **Model Type**: intent_detection | |
| - **Format**: complete_merged | |
| - **Framework**: MLX | |
| - **Training Examples**: 5000 | |
| - **Training Iterations**: 100 | |
| ## Usage | |
| ```python | |
| from mlx_lm import load, generate | |
| # Load the model | |
| model, tokenizer = load("hebertgo/knowledgebase-intent-llm") | |
| # Generate intent classification | |
| prompt = '''You are a helpful AI assistant for a knowledge-management app on an iPhone. Analyze the user's request and respond with JSON in this format: | |
| { | |
| "action": "Search|Create|Clarify|Conversation", | |
| "response": "User-friendly response message", | |
| "contentType": "videos|bookmarks|todos", | |
| "topic": "extracted topic or null" | |
| } | |
| User query: find videos about python''' | |
| response = generate(model, tokenizer, prompt=prompt, max_tokens=256) | |
| print(response) | |
| ``` | |
| ## iOS Integration | |
| This model is designed for use in iOS apps with MLX Swift: | |
| ```swift | |
| let config = ModelConfiguration( | |
| id: "hebertgo/knowledgebase-intent-llm", | |
| defaultPrompt: "" | |
| ) | |
| let model = try await LLMModelFactory.shared.loadContainer( | |
| configuration: config | |
| ) | |
| ``` | |
| ## Training Details | |
| - **Fine-tuning Method**: LoRA with model fusion | |
| - **Export Date**: 2025-06-24T17:12:38.111419 | |
| - **Fusion Completed**: True | |
| ## Expected Outputs | |
| The model generates JSON responses with these action types: | |
| - **Search**: Find existing content (videos, bookmarks, todos) | |
| - **Create**: Add new content | |
| - **Clarify**: Request more information | |
| - **Conversation**: General chat responses | |
| Content types supported: | |
| - videos | |
| - bookmarks | |
| - todos | |
| ## Performance | |
| Optimized for Apple Silicon devices with MLX framework for efficient on-device inference. | |