🧠 Supported Models

Smaller Can Be Better!

Bigger isn't always better! Smaller models (0.5B-7B) can be surprisingly powerful and offer significant advantages: - ⚡ Faster inference times - 💰 Lower hosting costs - 🚀 Quicker deployment - 🔄 Easier testing and iteration

How to read this

Please note that language support, performance, and other specifications may vary based on your specific use case, data, and fine-tuning process. This information is intended as general guidance—your results might differ significantly!

Model Overview

Qwen General Models

Qwen models excel at general text generation, understanding, and dialogue. They are particularly strong in Asian languages while still performing well in many Western languages.

Language Support:

Primary: Chinese, English
Strong: Japanese, Korean
Good: German, French, Spanish, Portuguese, Italian, Vietnamese, Thai
Basic: Arabic, Russian, and other less-represented European languages

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
Qwen2.5-0.5B	0.5B	Small ✅	Testing, prototypes	Minimal 🟢
Qwen2.5-1.5B	1.5B	Small ✅	Production-ready apps	Low 🟢
Qwen2.5-3B	3B	Small ✅	Complex applications	Moderate 🟡
Qwen2.5-7B	7B	Medium	High-performance needs	Significant 🟡
Qwen2.5-14B	14B	Large ⚠️	Specific high-accuracy needs	High 🔴
Qwen2.5-32B	32B	Very Large ⚠️	Only when validated as necessary	Very High 🔴
Qwen2.5-72B	72B	Very Large ⚠️	Specialized enterprise needs	Extreme 🔴

Qwen Code Models

Specialized for software development, these models excel at code generation, completion, and understanding. They support a wide range of programming languages and frameworks.

Language Support:

Primary: Python, JavaScript, Java, C++, TypeScript
Strong: Go, Rust, PHP, C#, Ruby
Good: Swift, Kotlin, SQL, Shell scripting
Basic: Scala, R, MATLAB, Assembly

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
Qwen2.5-Coder-0.5B	0.5B	Small ✅	Code completion, simple generation	Minimal 🟢
Qwen2.5-Coder-1.5B	1.5B	Small ✅	Most coding tasks	Low 🟢
Qwen2.5-Coder-3B	3B	Small ✅	Complex code generation	Moderate 🟡
Qwen2.5-Coder-7B	7B	Medium	Large coding projects	Significant 🟡
Qwen2.5-Coder-14B	14B	Large ⚠️	Advanced code generation	High 🔴
Qwen2.5-Coder-32B	32B	Very Large ⚠️	Enterprise code solutions	Very High 🔴

Qwen Math Models

Optimized for mathematical operations, these models excel at solving equations, proofs, and mathematical reasoning tasks.

Language Support:

Primary: Mathematical notation, LaTeX
Strong: English mathematical descriptions
Good: Chinese mathematical descriptions
Basic: Other language mathematical descriptions

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
Qwen2.5-Math-1.5B	1.5B	Small ✅	Most math operations	Low 🟢
Qwen2.5-Math-7B	7B	Medium	Complex calculations	Significant 🟡
Qwen2.5-Math-72B	72B	Very Large ⚠️	Research-grade math	Extreme 🔴

Qwen3 General Models

Qwen3 is the latest generation in the Qwen series, offering powerful advancements in multilingual text generation, reasoning, instruction-following, and overall performance across a wide range of applications.

Language Support:

Primary: Chinese, English
Strong: Japanese, Korean
Good: German, French, Spanish, Portuguese, Italian, Vietnamese, Thai
Basic: Arabic, Russian, and many others
Total Coverage: 100+ languages and dialects

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
Qwen3-0.6B	0.6B	Small ✅	Prototyping, evaluation	Minimal 🟢
Qwen3-1.7B	1.7B	Small ✅	Lightweight production apps	Low 🟢
Qwen3-4B	4B	Small ✅	Mid-scale applications	Moderate 🟡
Qwen3-8B	8.2B	Medium	Complex tasks, multi-turn	Significant 🟡
Qwen3-14B	14B	Large ⚠️	High-accuracy generation	High 🔴
Qwen3-32B	32B	Very Large ⚠️	Advanced enterprise solutions	Very High 🔴

About Qwen3

Qwen3 models deliver enhanced performance in instruction-following, multilingual tasks, reasoning, and code generation. They support over 100 languages and dialects and are well-suited for a wide variety of real-world use cases.

LLaMA 3 Models

Meta's LLaMA models are known for strong performance on text tasks—especially in English—while typically being optimized for European languages.

Language Support:

Primary: English
Strong: Spanish, German, French
Good: Italian, Portuguese, Dutch, and other major European languages
Basic: Asian languages and Arabic

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
Llama-3.2-1B	1B	Small ✅	Quick experiments	Minimal 🟢
Llama-3.2-3B	3B	Small ✅	Small applications	Low 🟢
Llama-3.1-8B	8B	Medium	Production apps	Significant 🟡
Llama-3.1-70B	70B	Very Large ⚠️	Enterprise needs	Extreme 🔴

Code LLaMA Models

Specialized version of LLaMA focused on code generation with strong multilingual code capabilities.

Language Support:

Primary: Python, JavaScript, Java, C++
Strong: PHP, C#, Ruby, Go
Good: Rust, Swift, TypeScript, Kotlin
Basic: Most other programming languages

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
CodeLlama-7b	7B	Medium	General coding	Significant 🟡
CodeLlama-13b	13B	Large ⚠️	Complex code projects	High 🔴
CodeLlama-34b	34B	Very Large ⚠️	Large-scale development	Very High 🔴
CodeLlama-70b	70B	Very Large ⚠️	Enterprise systems	Extreme 🔴

Phi Models

Microsoft's Phi models are designed for efficiency with a primary focus on English. They are especially strong in code generation (notably in Python and JavaScript), while their multilingual capabilities are more limited.

Language Support:

Primary: English
Strong (for code): Python, JavaScript
Limited: Other languages (only basic multilingual support, with non-English tasks generally underperforming)

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
Phi-3.5-mini-instruct	Mini	Small ✅	Quick deployment	Minimal 🟢
Phi-3-mini-4k-instruct	Mini	Small ✅	Testing	Minimal 🟢
Phi-3-mini-128k-instruct	Mini	Small ✅	General tasks	Low 🟢
Phi-3-small-8k-instruct	Small	Small ✅	Small applications	Low 🟢
Phi-3-medium-4k-instruct	Medium	Medium	Medium workloads	Moderate 🟡
Phi-3-medium-128k-instruct	Medium	Medium	Complex tasks	Moderate 🟡

DeepSeek R1 Models

DeepSeek models are specifically optimized for reasoning tasks and complex problem-solving. These models are distilled from larger models while maintaining impressive performance, especially in mathematics and coding tasks.

Language Support:

Primary: English, Chinese
Strong: Math notation, Programming languages
Good: European languages
Basic: Other languages

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
DeepSeek-R1-Distill-Qwen-1.5B	1.5B	Small ✅	Basic reasoning, Quick testing	Minimal 🟢
DeepSeek-R1-Distill-Qwen-7B	7B	Medium	Math problems, Code generation	Moderate 🟡
DeepSeek-R1-Distill-Llama-8B	8B	Medium	General reasoning tasks	Moderate 🟡
DeepSeek-R1-Distill-Qwen-14B	14B	Large ⚠️	Complex problem solving	High 🔴
DeepSeek-R1-Distill-Qwen-32B	32B	Very Large ⚠️	Advanced reasoning, Research	Very High 🔴
DeepSeek-R1-Distill-Llama-70B	70B	Very Large ⚠️	Enterprise applications	Extreme 🔴

Vision Models

Vision models combine text and image understanding capabilities for different specialized purposes.

ModelOne (manufactAI Labs)

Specialized model optimized for extracting structured information from documents and visual data.

Language Support:

Primary: 70+ languages with balanced representation
Core languages (14% each): English, Spanish, French, German, Italian, Russian
Additional Support: 64 other languages

Special Capabilities:

Structured data extraction from documents
Complex table and chart interpretation
Advanced multilingual OCR
Format-flexible outputs (CSV, JSON, YAML, XML)
Multi-page document processing

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
ModelOne-Vision	4.3B	Small ✅	Document extraction, Structured data	Moderate 🟡

ModelOne Dataset Coverage

Trained on diverse document types: - 49% Multipage Documents - 29% Real-world Images - 14% Single-page Documents - 8% Visual Representations (tables, charts)

Phi-3.5-Vision

A lightweight state-of-the-art multimodal model focused on general visual understanding and reasoning.

Language Support:

Primary: English
Strong: Common European languages
Good: Asian languages
Basic: Other languages

Model Name	Parameters	Size Category	Recommended Use Cases	Resource Impact
Phi-3.5-vision-instruct	4.2B	Small ✅	General visual tasks, Multi-frame analysis	Moderate 🟡

Best Practices:

Ensure images are clear and well-lit
Choose the appropriate model based on your specific use case:
- Phi-3.5-Vision for general visual understanding and reasoning
- ModelOne-Vision for structured document processing and data extraction with support for european languages

Making the Right Choice

Resource Impact Guide

🟢 Minimal/Low: Perfect for startups and individual developers
🟡 Moderate: Requires careful resource planning
🔴 High/Extreme: Significant infrastructure needed

When to Scale Up

Only consider larger models when you have:

✅ Tested smaller models thoroughly
✅ Identified specific performance gaps
✅ Measured and justified the resource trade-offs
✅ Budget for increased hosting costs

For most applications, start with models in the Small ✅ category. They offer excellent performance while keeping costs and complexity manageable.

🧠 Supported Models

Model Overview

Qwen General Models

Qwen Code Models

Qwen Math Models

Qwen3 General Models

LLaMA 3 Models

Code LLaMA Models

Phi Models

DeepSeek R1 Models

Vision Models

ModelOne (manufactAI Labs)

Phi-3.5-Vision

Making the Right Choice

Resource Impact Guide

When to Scale Up

Next Steps

Quick Start with Small Models

Performance Benchmarking

On this page