Note: AI Models may make mistakes. Please use with discretion.

List of Model HQ Supported Models (for Intel)

Intel Optimized AI Models

Complete reference of 225+ Intel-optimized models across all major AI families

Model Type | Models
Embedding Models (26 models)
industry-bert-contracts-ov
industry-bert-insurance-ov
industry-bert-asset-management-ov
industry-bert-sec-ov
industry-bert-loans-ov
all-mini-lm-l6-v2-ov
all-mpnet-base-v2-ov
paraphrase-multilingual-MiniLM-L12-v2-ov
gte-small-ov
gte-base-ov
gte-large-ov
bge-small-en-v1.5-ov
bge-base-en-v1.5-ov
bge-large-en-v1.5-ov
protectai-prompt-injection-ov
malicious-url-detector-ov
xlm-roberta-language-detector-ov
valurank-bias-ov
unitary-toxic-roberta-ov
jina-reranker-v1-tiny-en-ov
jina-reranker-v1-turbo-en-ov
jina-reranker-tiny-onnx
jina-reranker-turbo-onnx
protectai-prompt-injection-onnx
valurank-bias-onnx
unitary-toxic-roberta-onnx
Qwen Models (38 models)
bling-qwen-1.5b-ov
bling-qwen-500m-ov
qwen2-0.5b-chat-ov
qwen2-1.5b-instruct-ov
qwen2-7b-instruct-ov
qwen2-vl-2b-instruct-ov
qwen2-vl-7b-instruct-ov
qwen2.5-0.5b-instruct-ov
qwen2.5-1.5b-instruct-ov
qwen2.5-3b-instruct-ov
qwen2.5-14b-instruct-ov
qwen2.5-32b-instruct-ov
qwen2.5-72b-instruct-ov
qwen2.5-coder-7b-instruct-ov
qwen3-8b-ov
qwen3-1.7b-ov
qwen3-4b-ov
qwen3-14b-ov
dragon-qwen-7b-ov
slim-extract-qwen-0.5b-ov
slim-extract-qwen-1.5b-ov
bling-qwen-mini-tool
bling-qwen-0.5b-gguf
dragon-qwen-7b-gguf
qwen2-7B-instruct-gguf
qwen3-1.7b-gguf
qwen3-4b-instruct-gguf
qwen3-8b-gguf
qwen3-14b-gguf
qwen2-1.5b-instruct-gguf
qwen2-0.5b-instruct-gguf
slim-extract-qwen-1.5b-gguf
slim-extract-qwen-nano-gguf
qwen-2.5-7b-coder-gguf
qwen-2.5-14b-instruct-gguf
deepseek-qwen-14b-gguf
deepseek-qwen-7b-gguf
qwen2.5-32b-gguf
Llama-Based Models (29 models)
bling-tiny-llama-ov
dolphin-2.9.4-llama3.1-8b-ov
llama-11b-vision-instruct-ov
llama-2-13b-chat-ov
llama-2-chat-ov
llama-3.1-instruct-ov
llama-3.1-8b-instruct-npu-ov
llama-3.2-1b-instruct-ov
llama-3.2-1b-instruct-npu-ov
llama-3.2-3b-instruct-ov
llama-3.2-3b-instruct-npu-ov
tiny-llama-chat-ov
nvidia-llama3-chatqa-1.5-8b-ov
dragon-llama2-ov
bling-tiny-llama-npu-ov
bling-tiny-llama-onnx
llama-3.2-3b-onnx-qnn
llama-2-chat-onnx
llama-3.1-instruct-onnx
llama-3.2-1b-instruct-onnx
llama-3.2-3b-instruct-onnx
dragon-llama-3.1-gguf
dragon-llama-answer-tool
llama-3.1-instruct-gguf
llama-2-7b-chat-gguf
llama-3-8b-instruct-gguf
tiny-llama-chat-gguf
llama-3.2-1b-instruct-gguf
llama-3.2-3b-instruct-gguf
Phi Models (33 models)
phi-3-ov
phi-3-npu-ov
bling-phi-3-ov
phi-4-ov
phi-4-mini-ov
phi-4-mini-npu-ov
phi-4-npu-ov
slim-xsum-phi-3-ov
slim-boolean-phi-3-ov
slim-sa-ner-phi-3-ov
slim-summary-phi-3-ov
slim-sql-phi-3-ov
slim-extract-phi-3-ov
bling-phi-3-onnx
phi-3-onnx
phi-3.5-onnx-qnn
phi-3-vision-onnx
slim-summary-phi-3-onnx
slim-extract-phi-3-onnx
slim-boolean-phi-3-onnx
bling-phi-3-gguf
bling-phi-3.5-gguf
phi-3.5-gguf
phi-4-gguf
phi-4-mini-gguf
phi-4-mini-reasoning-gguf
phi-3-gguf
slim-extract-phi-3-gguf
slim-xsum-phi-3-gguf
slim-boolean-phi-3-gguf
slim-sa-ner-phi-3-gguf
slim-q-gen-phi-3-tool
slim-qa-gen-phi-3-tool
Mistral Models (19 models)
dolphin-2.9.3-mistral-7b-32k-ov
mistral-7b-instruct-v0.2-ov
mistral-7b-instruct-v0.3-ov
mistral-7b-v0.3-npu-ov
mistral-nemo-instruct-2407-ov
mistral-small-instruct-2409-ov
zephyr-mistral-7b-chat-ov
teknium-open-hermes-2.5-mistral-ov
dragon-mistral-ov
dragon-mistral-0.3-ov
dragon-mistral-0.3-onnx
mistral-7b-instruct-v0.3-onnx
dragon-mistral-0.3-gguf
mistral-3.2-24b-gguf
openhermes-2.5-mistral-7b-gguf
zephyr-7b-beta-gguf
starling-lm-7b-alpha-gguf
dragon-mistral-answer-tool
mistral-7b-instruct-v0.3-gguf
Yi Models (7 models)
yi-6b-1.5v-chat-ov
yi-9b-chat-ov
yi-9b-npu-ov
dragon-yi-6b-ov
dragon-yi-9b-ov
dragon-yi-9b-gguf
dragon-yi-answer-tool
DRAGON Models (14 models)
dragon-llama2-ov
dragon-mistral-0.3-ov
dragon-mistral-ov
dragon-qwen-7b-ov
dragon-yi-6b-ov
dragon-yi-9b-ov
dragon-mistral-0.3-onnx
dragon-llama-3.1-gguf
dragon-mistral-0.3-gguf
dragon-yi-9b-gguf
dragon-qwen-7b-gguf
dragon-yi-answer-tool
dragon-llama-answer-tool
dragon-mistral-answer-tool
Slim Models (75 models)
slim-boolean-phi-3-ov
slim-emotions-ov
slim-emotions-npu-ov
slim-extract-tiny-ov
slim-extract-tiny-npu-ov
slim-intent-ov
slim-intent-npu-ov
slim-ner-ov
slim-ner-npu-ov
slim-q-gen-tiny-ov
slim-qa-gen-tiny-ov
slim-ratings-ov
slim-ratings-npu-ov
slim-sentiment-ov
slim-sentiment-npu-ov
slim-sql-ov
slim-sql-npu-ov
slim-sql-qwen-base-ov
slim-summary-phi-3-ov
slim-summary-tiny-ov
slim-summary-tiny-npu-ov
slim-tags-ov
slim-tags-npu-ov
slim-topics-ov
slim-topics-npu-ov
slim-xsum-phi-3-ov
slim-extract-qwen-0.5b-ov
slim-extract-qwen-1.5b-ov
slim-extract-phi-3-ov
slim-sa-ner-phi-3-ov
slim-sql-phi-3-ov
slim-category-ov
slim-sentiment-onnx
slim-extract-tiny-onnx
slim-summary-tiny-onnx
slim-sql-onnx
slim-emotions-onnx
slim-topics-onnx
slim-ner-onnx
slim-intent-onnx
slim-tags-onnx
slim-ratings-onnx
slim-summary-phi-3-onnx
slim-extract-phi-3-onnx
slim-boolean-phi-3-onnx
slim-ner-tool
slim-sentiment-tool
slim-emotions-tool
slim-ratings-tool
slim-intent-tool
slim-nli-tool
slim-topics-tool
slim-tags-tool
slim-sql-tool
bling-answer-tool
slim-category-tool
slim-xsum-tool
slim-extract-tool
slim-extract-phi-3-gguf
slim-extract-qwen-1.5b-gguf
slim-extract-qwen-nano-gguf
slim-extract-tiny-tool
slim-summary-tiny-tool
slim-summary-phi-3-gguf
slim-xsum-phi-3-gguf
slim-boolean-tool
slim-boolean-phi-3-gguf
slim-sa-ner-phi-3-gguf
slim-sa-ner-tool
slim-tags-3b-tool
slim-summary-tool
slim-q-gen-phi-3-tool
slim-q-gen-tiny-tool
slim-qa-gen-tiny-tool
slim-qa-gen-phi-3-tool
StableLM Models (4 models)
stablelm-2-zephyr-1_6b-ov
stablelm-zephyr-3b-ov
stablelm-2-12b-chat-ov
bling-stablelm-3b-gguf
Gemma Models (8 models)
gemma-7b-it-ov
codegemma-7b-it-ov
gemma-2b-it-ov
gemma-2b-it-onnx
gemma-3-4b-gguf
gemma-3-12b-gguf
gemma-2-9b-instruct-gguf
gemma-2-27b-instruct-gguf
Specialized Models (6 models)
intel-neural-chat-7b-v3-2-ov
openchat-3.6-8b-20240522-ov
tiny-dolphin-2.8-1.1b-ov
dreamgen-wizardlm-2-7b-ov
mathstral-7b-ov
whisper-cpp-base-english
Multimodal Models (2 models)
speech-t5-tts-ov
lcm-dreamshaper-ov
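
The model names in the list above end in a small set of suffixes (-ov, -npu-ov, -onnx, -onnx-qnn, -gguf, -tool). As a minimal sketch, the names can be grouped by packaging format from the suffix alone. The format labels below are assumptions inferred from the names (for example, reading -ov as OpenVINO), not an official mapping, and `model_format` is a hypothetical helper rather than part of any Model HQ API.

```python
from collections import Counter

# Suffix -> format label. The labels are assumptions inferred from the
# model names in the list; they are not an official Model HQ mapping.
# Longer suffixes come first so "-npu-ov" is matched before "-ov".
SUFFIXES = [
    ("-npu-ov", "OpenVINO (NPU variant)"),
    ("-onnx-qnn", "ONNX (QNN variant)"),
    ("-ov", "OpenVINO"),
    ("-onnx", "ONNX"),
    ("-gguf", "GGUF"),
    ("-tool", "GGUF tool"),
]


def model_format(name: str) -> str:
    """Return the assumed format label for a model name, or 'other'."""
    for suffix, label in SUFFIXES:
        if name.endswith(suffix):
            return label
    return "other"


# A few names copied from the list above.
sample = [
    "llama-3.2-3b-instruct-npu-ov",
    "llama-3.2-3b-onnx-qnn",
    "bge-small-en-v1.5-ov",
    "phi-3-onnx",
    "qwen3-8b-gguf",
    "slim-ner-tool",
    "whisper-cpp-base-english",
]

# Tally how many sample names fall under each assumed format.
counts = Counter(model_format(n) for n in sample)
for label, n in sorted(counts.items()):
    print(f"{label}: {n}")
```

Checking the longer suffixes first matters because every -npu-ov name also ends in -ov. Names with no recognized suffix, such as whisper-cpp-base-english, fall through to "other".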

MODELS THAT PROVE US

Supported Model Families of Model HQ

Qwen 2.5 Instruct 14B

Qwen 2 Based Models

Llama 3 Based Models

Phi-3 Based Models

Google Gemma 2 Based Models

Mistral Small Model 22B

Mistral 7B Based Models

StableLM 3B Based Models

Yi 6B Based Models

Yi 9B Based Models

Dragon RAG Model

SLIM Function Calling Models

LLMWare Models

Here's the list of our models

DRAGON Models

Production-grade, RAG-optimized 6-7B parameter models - "Delivering RAG on ...". Fine-tuned for fact-based question answering in RAG workflows. They use the provided context to answer yes/no and multiple-choice questions and are trained to reduce hallucinations.

SLIM Models

Function-calling, structured-output models for classification and clustering tasks. Designed for use in an AI agent workflow, either alone or stacked together. 10+ Structured Language Instruction Models (SLIMs) cover almost every classification task.

Custom Models

Expert custom model training services for your company and your domain. Full-service custom model fine-tuning from datasets to training (and beyond). Services include small specialized models (7B and under) and embedding models.

Industry BERT Models

BERT embedding models fine-tuned for specific industries and specialized domains. Domains include Insurance, SEC documents, Contracts, and Asset Management.

SLIM GGUF

Quantized GGUF "tool" implementations of SLIM models. "gguf" and "tool" versions of many SLIM, DRAGON, and BLING models are provided, optimized for CPU deployment. The GGUF Generative Model class supports Stable-LM-3B, CUDA build options, and better control over sampling strategies.

BLING Models

Small, CPU-based, RAG-optimized, instruction-following 1B-3B parameter models. Great for quickly prototyping POCs on a laptop, with fast inference.

Try MODEL HQ by LLMWare.ai and start using AI models on your AI PCs today

If you need any assistance, feel free to reach out to us!
