Note: AI Models may make mistakes. Please use with discretion.

List of Model HQ Supported Models (for Intel)

Note: AI Models may make mistakes. Please use with discretion.

List of Model HQ Supported Models (for Intel)

Note: AI Models may make mistakes. Please use with discretion.

List of Model HQ Supported Models (for Intel)

Intel Optimized AI Models

Complete reference of 225+ Intel-optimized models across all major AI families

Model Type	Models
Embedding Models26 models	industry-bert-contracts-ov industry-bert-insurance-ov industry-bert-asset-management-ov industry-bert-sec-ov industry-bert-loans-ov all-mini-lm-l6-v2-ov all-mpnet-base-v2-ov paraphrase-multilingual-MiniLM-L12-v2-ov gte-small-ov gte-base-ov gte-large-ov bge-small-en-v1.5-ov bge-base-en-v1.5-ov bge-large-en-v1.5-ov protectai-prompt-injection-ov malicious-url-detector-ov xlm-roberta-language-detector-ov valurank-bias-ov unitary-toxic-roberta-ov jina-reranker-v1-tiny-en-ov jina-reranker-v1-turbo-en-ov jina-reranker-tiny-onnx jina-reranker-turbo-onnx protectai-prompt-injection-onnx valurank-bias-onnx unitary-toxic-roberta-onnx
Qwen Models38 models	bling-qwen-1.5b-ov bling-qwen-500m-ov qwen2-0.5b-chat-ov qwen2-1.5b-instruct-ov qwen2-7b-instruct-ov qwen2-vl-2b-instruct-ov qwen2-vl-7b-instruct-ov qwen2.5-0.5b-instruct-ov qwen2.5-1.5b-instruct-ov qwen2.5-3b-instruct-ov qwen2.5-14b-instruct-ov qwen2.5-32b-instruct-ov qwen2.5-72b-instruct-ov qwen2.5-coder-7b-instruct-ov qwen3-8b-ov qwen3-1.7b-ov qwen3-4b-ov qwen3-14b-ov dragon-qwen-7b-ov slim-extract-qwen-0.5b-ov slim-extract-qwen-1.5b-ov bling-qwen-mini-tool bling-qwen-0.5b-gguf dragon-qwen-7b-gguf qwen2-7B-instruct-gguf qwen3-1.7b-gguf qwen3-4b-instruct-gguf qwen3-8b-gguf qwen3-14b-gguf qwen2-1.5b-instruct-gguf qwen2-0.5b-instruct-gguf slim-extract-qwen-1.5b-gguf slim-extract-qwen-nano-gguf qwen-2.5-7b-coder-gguf qwen-2.5-14b-instruct-gguf deepseek-qwen-14b-gguf deepseek-qwen-7b-gguf qwen2.5-32b-gguf
Llama-Based Models29 models	bling-tiny-llama-ov dolphin-2.9.4-llama3.1-8b-ov llama-11b-vision-instruct-ov llama-2-13b-chat-ov llama-2-chat-ov llama-3.1-instruct-ov llama-3.1-8b-instruct-npu-ov llama-3.2-1b-instruct-ov llama-3.2-1b-instruct-npu-ov llama-3.2-3b-instruct-ov llama-3.2-3b-instruct-npu-ov tiny-llama-chat-ov nvidia-llama3-chatqa-1.5-8b-ov dragon-llama2-ov bling-tiny-llama-npu-ov bling-tiny-llama-onnx llama-3.2-3b-onnx-qnn llama-2-chat-onnx llama-3.1-instruct-onnx llama-3.2-1b-instruct-onnx llama-3.2-3b-instruct-onnx dragon-llama-3.1-gguf dragon-llama-answer-tool llama-3.1-instruct-gguf llama-2-7b-chat-gguf llama-3-8b-instruct-gguf tiny-llama-chat-gguf llama-3.2-1b-instruct-gguf llama-3.2-3b-instruct-gguf
Phi Models33 models	phi-3-ov phi-3-npu-ov bling-phi-3-ov phi-4-ov phi-4-mini-ov phi-4-mini-npu-ov phi-4-npu-ov slim-xsum-phi-3-ov slim-boolean-phi-3-ov slim-sa-ner-phi-3-ov slim-summary-phi-3-ov slim-sql-phi-3-ov slim-extract-phi-3-ov bling-phi-3-onnx phi-3-onnx phi-3.5-onnx-qnn phi-3-vision-onnx slim-summary-phi-3-onnx slim-extract-phi-3-onnx slim-boolean-phi-3-onnx bling-phi-3-gguf bling-phi-3.5-gguf phi-3.5-gguf phi-4-gguf phi-4-mini-gguf phi-4-mini-reasoning-gguf phi-3-gguf slim-extract-phi-3-gguf slim-xsum-phi-3-gguf slim-boolean-phi-3-gguf slim-sa-ner-phi-3-gguf slim-q-gen-phi-3-tool slim-qa-gen-phi-3-tool
Mistral Models19 models	dolphin-2.9.3-mistral-7b-32k-ov mistral-7b-instruct-v0.2-ov mistral-7b-instruct-v0.3-ov mistral-7b-v0.3-npu-ov mistral-nemo-instruct-2407-ov mistral-small-instruct-2409-ov zephyr-mistral-7b-chat-ov teknium-open-hermes-2.5-mistral-ov dragon-mistral-ov dragon-mistral-0.3-ov dragon-mistral-0.3-onnx mistral-7b-instruct-v0.3-onnx dragon-mistral-0.3-gguf mistral-3.2-24b-gguf openhermes-2.5-mistral-7b-gguf zephyr-7b-beta-gguf starling-lm-7b-alpha-gguf dragon-mistral-answer-tool mistral-7b-instruct-v0.3-gguf
Yi Models7 models	yi-6b-1.5v-chat-ov yi-9b-chat-ov yi-9b-npu-ov dragon-yi-6b-ov dragon-yi-9b-ov dragon-yi-9b-gguf dragon-yi-answer-tool
DRAGON Models14 models	dragon-llama2-ov dragon-mistral-0.3-ov dragon-mistral-ov dragon-qwen-7b-ov dragon-yi-6b-ov dragon-yi-9b-ov dragon-mistral-0.3-onnx dragon-llama-3.1-gguf dragon-mistral-0.3-gguf dragon-yi-9b-gguf dragon-qwen-7b-gguf dragon-yi-answer-tool dragon-llama-answer-tool dragon-mistral-answer-tool
Slim Models75 models	slim-boolean-phi-3-ov slim-emotions-ov slim-emotions-npu-ov slim-extract-tiny-ov slim-extract-tiny-npu-ov slim-intent-ov slim-intent-npu-ov slim-ner-ov slim-ner-npu-ov slim-q-gen-tiny-ov slim-qa-gen-tiny-ov slim-ratings-ov slim-ratings-npu-ov slim-sentiment-ov slim-sentiment-npu-ov slim-sql-ov slim-sql-npu-ov slim-sql-qwen-base-ov slim-summary-phi-3-ov slim-summary-tiny-ov slim-summary-tiny-npu-ov slim-tags-ov slim-tags-npu-ov slim-topics-ov slim-topics-npu-ov slim-xsum-phi-3-ov slim-extract-qwen-0.5b-ov slim-extract-qwen-1.5b-ov slim-extract-phi-3-ov slim-sa-ner-phi-3-ov slim-sql-phi-3-ov slim-category-ov slim-sentiment-onnx slim-extract-tiny-onnx slim-summary-tiny-onnx slim-sql-onnx slim-emotions-onnx slim-topics-onnx slim-ner-onnx slim-intent-onnx slim-tags-onnx slim-ratings-onnx slim-summary-phi-3-onnx slim-extract-phi-3-onnx slim-boolean-phi-3-onnx slim-ner-tool slim-sentiment-tool slim-emotions-tool slim-ratings-tool slim-intent-tool slim-nli-tool slim-topics-tool slim-tags-tool slim-sql-tool bling-answer-tool slim-category-tool slim-xsum-tool slim-extract-tool slim-extract-phi-3-gguf slim-extract-qwen-1.5b-gguf slim-extract-qwen-nano-gguf slim-extract-tiny-tool slim-summary-tiny-tool slim-summary-phi-3-gguf slim-xsum-phi-3-gguf slim-boolean-tool slim-boolean-phi-3-gguf slim-sa-ner-phi-3-gguf slim-sa-ner-tool slim-tags-3b-tool slim-summary-tool slim-q-gen-phi-3-tool slim-q-gen-tiny-tool slim-qa-gen-tiny-tool slim-qa-gen-phi-3-tool
StableLM Models4 models	stablelm-2-zephyr-1_6b-ov stablelm-zephyr-3b-ov stablelm-2-12b-chat-ov bling-stablelm-3b-gguf
Gemma Models8 models	gemma-7b-it-ov codegemma-7b-it-ov gemma-2b-it-ov gemma-2b-it-onnx gemma-3-4b-gguf gemma-3-12b-gguf gemma-2-9b-instruct-gguf gemma-2-27b-instruct-gguf
Specialized Models6 models	intel-neural-chat-7b-v3-2-ov openchat-3.6-8b-20240522-ov tiny-dolphin-2.8-1.1b-ov dreamgen-wizardlm-2-7b-ov mathstral-7b-ov whisper-cpp-base-english
Multimodal Models2 models	speech-t5-tts-ov lcm-dreamshaper-ov

Intel Optimized AI Models

Complete reference of 225+ Intel-optimized models across all major AI families

Model Type	Models
Embedding Models26 models	industry-bert-contracts-ov industry-bert-insurance-ov industry-bert-asset-management-ov industry-bert-sec-ov industry-bert-loans-ov all-mini-lm-l6-v2-ov all-mpnet-base-v2-ov paraphrase-multilingual-MiniLM-L12-v2-ov gte-small-ov gte-base-ov gte-large-ov bge-small-en-v1.5-ov bge-base-en-v1.5-ov bge-large-en-v1.5-ov protectai-prompt-injection-ov malicious-url-detector-ov xlm-roberta-language-detector-ov valurank-bias-ov unitary-toxic-roberta-ov jina-reranker-v1-tiny-en-ov jina-reranker-v1-turbo-en-ov jina-reranker-tiny-onnx jina-reranker-turbo-onnx protectai-prompt-injection-onnx valurank-bias-onnx unitary-toxic-roberta-onnx
Qwen Models38 models	bling-qwen-1.5b-ov bling-qwen-500m-ov qwen2-0.5b-chat-ov qwen2-1.5b-instruct-ov qwen2-7b-instruct-ov qwen2-vl-2b-instruct-ov qwen2-vl-7b-instruct-ov qwen2.5-0.5b-instruct-ov qwen2.5-1.5b-instruct-ov qwen2.5-3b-instruct-ov qwen2.5-14b-instruct-ov qwen2.5-32b-instruct-ov qwen2.5-72b-instruct-ov qwen2.5-coder-7b-instruct-ov qwen3-8b-ov qwen3-1.7b-ov qwen3-4b-ov qwen3-14b-ov dragon-qwen-7b-ov slim-extract-qwen-0.5b-ov slim-extract-qwen-1.5b-ov bling-qwen-mini-tool bling-qwen-0.5b-gguf dragon-qwen-7b-gguf qwen2-7B-instruct-gguf qwen3-1.7b-gguf qwen3-4b-instruct-gguf qwen3-8b-gguf qwen3-14b-gguf qwen2-1.5b-instruct-gguf qwen2-0.5b-instruct-gguf slim-extract-qwen-1.5b-gguf slim-extract-qwen-nano-gguf qwen-2.5-7b-coder-gguf qwen-2.5-14b-instruct-gguf deepseek-qwen-14b-gguf deepseek-qwen-7b-gguf qwen2.5-32b-gguf
Llama-Based Models29 models	bling-tiny-llama-ov dolphin-2.9.4-llama3.1-8b-ov llama-11b-vision-instruct-ov llama-2-13b-chat-ov llama-2-chat-ov llama-3.1-instruct-ov llama-3.1-8b-instruct-npu-ov llama-3.2-1b-instruct-ov llama-3.2-1b-instruct-npu-ov llama-3.2-3b-instruct-ov llama-3.2-3b-instruct-npu-ov tiny-llama-chat-ov nvidia-llama3-chatqa-1.5-8b-ov dragon-llama2-ov bling-tiny-llama-npu-ov bling-tiny-llama-onnx llama-3.2-3b-onnx-qnn llama-2-chat-onnx llama-3.1-instruct-onnx llama-3.2-1b-instruct-onnx llama-3.2-3b-instruct-onnx dragon-llama-3.1-gguf dragon-llama-answer-tool llama-3.1-instruct-gguf llama-2-7b-chat-gguf llama-3-8b-instruct-gguf tiny-llama-chat-gguf llama-3.2-1b-instruct-gguf llama-3.2-3b-instruct-gguf
Phi Models33 models	phi-3-ov phi-3-npu-ov bling-phi-3-ov phi-4-ov phi-4-mini-ov phi-4-mini-npu-ov phi-4-npu-ov slim-xsum-phi-3-ov slim-boolean-phi-3-ov slim-sa-ner-phi-3-ov slim-summary-phi-3-ov slim-sql-phi-3-ov slim-extract-phi-3-ov bling-phi-3-onnx phi-3-onnx phi-3.5-onnx-qnn phi-3-vision-onnx slim-summary-phi-3-onnx slim-extract-phi-3-onnx slim-boolean-phi-3-onnx bling-phi-3-gguf bling-phi-3.5-gguf phi-3.5-gguf phi-4-gguf phi-4-mini-gguf phi-4-mini-reasoning-gguf phi-3-gguf slim-extract-phi-3-gguf slim-xsum-phi-3-gguf slim-boolean-phi-3-gguf slim-sa-ner-phi-3-gguf slim-q-gen-phi-3-tool slim-qa-gen-phi-3-tool
Mistral Models19 models	dolphin-2.9.3-mistral-7b-32k-ov mistral-7b-instruct-v0.2-ov mistral-7b-instruct-v0.3-ov mistral-7b-v0.3-npu-ov mistral-nemo-instruct-2407-ov mistral-small-instruct-2409-ov zephyr-mistral-7b-chat-ov teknium-open-hermes-2.5-mistral-ov dragon-mistral-ov dragon-mistral-0.3-ov dragon-mistral-0.3-onnx mistral-7b-instruct-v0.3-onnx dragon-mistral-0.3-gguf mistral-3.2-24b-gguf openhermes-2.5-mistral-7b-gguf zephyr-7b-beta-gguf starling-lm-7b-alpha-gguf dragon-mistral-answer-tool mistral-7b-instruct-v0.3-gguf
Yi Models7 models	yi-6b-1.5v-chat-ov yi-9b-chat-ov yi-9b-npu-ov dragon-yi-6b-ov dragon-yi-9b-ov dragon-yi-9b-gguf dragon-yi-answer-tool
DRAGON Models14 models	dragon-llama2-ov dragon-mistral-0.3-ov dragon-mistral-ov dragon-qwen-7b-ov dragon-yi-6b-ov dragon-yi-9b-ov dragon-mistral-0.3-onnx dragon-llama-3.1-gguf dragon-mistral-0.3-gguf dragon-yi-9b-gguf dragon-qwen-7b-gguf dragon-yi-answer-tool dragon-llama-answer-tool dragon-mistral-answer-tool
Slim Models75 models	slim-boolean-phi-3-ov slim-emotions-ov slim-emotions-npu-ov slim-extract-tiny-ov slim-extract-tiny-npu-ov slim-intent-ov slim-intent-npu-ov slim-ner-ov slim-ner-npu-ov slim-q-gen-tiny-ov slim-qa-gen-tiny-ov slim-ratings-ov slim-ratings-npu-ov slim-sentiment-ov slim-sentiment-npu-ov slim-sql-ov slim-sql-npu-ov slim-sql-qwen-base-ov slim-summary-phi-3-ov slim-summary-tiny-ov slim-summary-tiny-npu-ov slim-tags-ov slim-tags-npu-ov slim-topics-ov slim-topics-npu-ov slim-xsum-phi-3-ov slim-extract-qwen-0.5b-ov slim-extract-qwen-1.5b-ov slim-extract-phi-3-ov slim-sa-ner-phi-3-ov slim-sql-phi-3-ov slim-category-ov slim-sentiment-onnx slim-extract-tiny-onnx slim-summary-tiny-onnx slim-sql-onnx slim-emotions-onnx slim-topics-onnx slim-ner-onnx slim-intent-onnx slim-tags-onnx slim-ratings-onnx slim-summary-phi-3-onnx slim-extract-phi-3-onnx slim-boolean-phi-3-onnx slim-ner-tool slim-sentiment-tool slim-emotions-tool slim-ratings-tool slim-intent-tool slim-nli-tool slim-topics-tool slim-tags-tool slim-sql-tool bling-answer-tool slim-category-tool slim-xsum-tool slim-extract-tool slim-extract-phi-3-gguf slim-extract-qwen-1.5b-gguf slim-extract-qwen-nano-gguf slim-extract-tiny-tool slim-summary-tiny-tool slim-summary-phi-3-gguf slim-xsum-phi-3-gguf slim-boolean-tool slim-boolean-phi-3-gguf slim-sa-ner-phi-3-gguf slim-sa-ner-tool slim-tags-3b-tool slim-summary-tool slim-q-gen-phi-3-tool slim-q-gen-tiny-tool slim-qa-gen-tiny-tool slim-qa-gen-phi-3-tool
StableLM Models4 models	stablelm-2-zephyr-1_6b-ov stablelm-zephyr-3b-ov stablelm-2-12b-chat-ov bling-stablelm-3b-gguf
Gemma Models8 models	gemma-7b-it-ov codegemma-7b-it-ov gemma-2b-it-ov gemma-2b-it-onnx gemma-3-4b-gguf gemma-3-12b-gguf gemma-2-9b-instruct-gguf gemma-2-27b-instruct-gguf
Specialized Models6 models	intel-neural-chat-7b-v3-2-ov openchat-3.6-8b-20240522-ov tiny-dolphin-2.8-1.1b-ov dreamgen-wizardlm-2-7b-ov mathstral-7b-ov whisper-cpp-base-english
Multimodal Models2 models	speech-t5-tts-ov lcm-dreamshaper-ov

MODELS THAT PROVES US

Supported Model Families of Model HQ

Qwen 2.5 Instruct 14B

Qwen 2 Based Models

Llama 3 Based Models

Phi-3 Based Models

Google Gemma 2 Based Models

Mistral Small Model 22B

Mistral 7B Based Models

StableLM 3B Based Models

Yi 6B Based Models

Yi 9B Based Models

Dragon RAG Model

SLIM Function Calling Models

LLMWare Models

Here's the list of our models

LLMWare Models

Here's the list of our models

LLMWare Models

Here's the list of our models

DRAGON Models

Production-grade RAG-optimized 6-7B parameter models - "Delivering RAG on ...". Fine-tuned for question-answering, fact-based uses in RAG workflows. Uses the context provided to answer yes/no and multiple-choice questions. Trained to reduce hallucinations.

SLIM Models

Function-calling, structured output models for classifying and clustering tasks. Designed to be used in an AI Agent workflow either alone or stacked together. 10+ Structured Language Instruction Models (SLIMs) for almost every classifying task.

Custom Models

Expert custom model training services for your company and your domain. Full-service custom model fine-tuning from datasets to training (and beyond). Services include small specialized models (7B and under) and embedding models.

Industry BERT Models

Industry and specialized domain finetuned BERT embedding models. Specialized domains include: Insurance, SEC documents, Contracts & Asset Management.

SLIM GGUF

Quantized GGUF "tool" implementations of SLIM Models. Provide "gguf" and "tool" versions of many SLIM, DRAGON, and BLING models, optimized for CPU deployment. GGUF Generative Model class - support for Stable-LM-3B, CUDA build options, and better control over sampling strategies.

BLING Models

Small CPU-based RAG-optimized, instruct-following 1B-3B parameter models. Great for quickly prototyping POCs on a laptop. Fast inference time.

Explore Our Hugging Face Models

Try MODEL HQ by LLMWare.ai and start using AI models on your AI PCs today

If you need any assistance, feel free to reach out to us!

Try MODEL HQ by LLMWare.ai and start using AI models on your AI PCs today

If you need any assistance, feel free to reach out to us!

Try MODEL HQ by LLMWare.ai and start using AI models on your AI PCs today

If you need any assistance, feel free to reach out to us!

MODELS THAT PROVES US

Supported Model Families

Qwen 2.5 Instruct 14B

Qwen 2 Based Models

Llama 3 Based Models

Phi-3 Based Models

Google Gemma 2 Based Models

Mistral Small Model 22B

Mistral 7B Based Models

StableLM 3B Based Models

Yi 6B Based Models

Yi 9B Based Models

Dragon RAG Model

SLIM Function Calling Models