FactoryFactory

🧠 Supported Models

Smaller Can Be Better!

Bigger isn't always better! Smaller models (0.5B-7B) can be surprisingly powerful and offer significant advantages: - ⚑ Faster inference times - πŸ’° Lower hosting costs - πŸš€ Quicker deployment - πŸ”„ Easier testing and iteration

How to read this

Please note that language support, performance, and other specifications may vary based on your specific use case, data, and fine-tuning process. This information is intended as general guidanceβ€”your results might differ significantly!

Model Overview

Qwen General Models

Qwen models excel at general text generation, understanding, and dialogue. They are particularly strong in Asian languages while still performing well in many Western languages.

Language Support:

  • Primary: Chinese, English
  • Strong: Japanese, Korean
  • Good: German, French, Spanish, Portuguese, Italian, Vietnamese, Thai
  • Basic: Arabic, Russian, and other less-represented European languages
Model NameParametersSize CategoryRecommended Use CasesResource Impact
Qwen2.5-0.5B0.5BSmall βœ…Testing, prototypesMinimal 🟒
Qwen2.5-1.5B1.5BSmall βœ…Production-ready appsLow 🟒
Qwen2.5-3B3BSmall βœ…Complex applicationsModerate 🟑
Qwen2.5-7B7BMediumHigh-performance needsSignificant 🟑
Qwen2.5-14B14BLarge ⚠️Specific high-accuracy needsHigh πŸ”΄
Qwen2.5-32B32BVery Large ⚠️Only when validated as necessaryVery High πŸ”΄
Qwen2.5-72B72BVery Large ⚠️Specialized enterprise needsExtreme πŸ”΄

Qwen Code Models

Specialized for software development, these models excel at code generation, completion, and understanding. They support a wide range of programming languages and frameworks.

Language Support:

  • Primary: Python, JavaScript, Java, C++, TypeScript
  • Strong: Go, Rust, PHP, C#, Ruby
  • Good: Swift, Kotlin, SQL, Shell scripting
  • Basic: Scala, R, MATLAB, Assembly
Model NameParametersSize CategoryRecommended Use CasesResource Impact
Qwen2.5-Coder-0.5B0.5BSmall βœ…Code completion, simple generationMinimal 🟒
Qwen2.5-Coder-1.5B1.5BSmall βœ…Most coding tasksLow 🟒
Qwen2.5-Coder-3B3BSmall βœ…Complex code generationModerate 🟑
Qwen2.5-Coder-7B7BMediumLarge coding projectsSignificant 🟑
Qwen2.5-Coder-14B14BLarge ⚠️Advanced code generationHigh πŸ”΄
Qwen2.5-Coder-32B32BVery Large ⚠️Enterprise code solutionsVery High πŸ”΄

Qwen Math Models

Optimized for mathematical operations, these models excel at solving equations, proofs, and mathematical reasoning tasks.

Language Support:

  • Primary: Mathematical notation, LaTeX
  • Strong: English mathematical descriptions
  • Good: Chinese mathematical descriptions
  • Basic: Other language mathematical descriptions
Model NameParametersSize CategoryRecommended Use CasesResource Impact
Qwen2.5-Math-1.5B1.5BSmall βœ…Most math operationsLow 🟒
Qwen2.5-Math-7B7BMediumComplex calculationsSignificant 🟑
Qwen2.5-Math-72B72BVery Large ⚠️Research-grade mathExtreme πŸ”΄

LLaMA 3 Models

Meta’s LLaMA models are known for strong performance on text tasksβ€”especially in Englishβ€”while typically being optimized for European languages.

Language Support:

  • Primary: English
  • Strong: Spanish, German, French
  • Good: Italian, Portuguese, Dutch, and other major European languages
  • Basic: Asian languages and Arabic
Model NameParametersSize CategoryRecommended Use CasesResource Impact
Llama-3.2-1B1BSmall βœ…Quick experimentsMinimal 🟒
Llama-3.2-3B3BSmall βœ…Small applicationsLow 🟒
Llama-3.1-8B8BMediumProduction appsSignificant 🟑
Llama-3.1-70B70BVery Large ⚠️Enterprise needsExtreme πŸ”΄

Code LLaMA Models

Specialized version of LLaMA focused on code generation with strong multilingual code capabilities.

Language Support:

  • Primary: Python, JavaScript, Java, C++
  • Strong: PHP, C#, Ruby, Go
  • Good: Rust, Swift, TypeScript, Kotlin
  • Basic: Most other programming languages
Model NameParametersSize CategoryRecommended Use CasesResource Impact
CodeLlama-7b7BMediumGeneral codingSignificant 🟑
CodeLlama-13b13BLarge ⚠️Complex code projectsHigh πŸ”΄
CodeLlama-34b34BVery Large ⚠️Large-scale developmentVery High πŸ”΄
CodeLlama-70b70BVery Large ⚠️Enterprise systemsExtreme πŸ”΄

Phi Models

Microsoft’s Phi models are designed for efficiency with a primary focus on English. They are especially strong in code generation (notably in Python and JavaScript), while their multilingual capabilities are more limited.

Language Support:

  • Primary: English
  • Strong (for code): Python, JavaScript
  • Limited: Other languages (only basic multilingual support, with non-English tasks generally underperforming)
Model NameParametersSize CategoryRecommended Use CasesResource Impact
Phi-3.5-mini-instructMiniSmall βœ…Quick deploymentMinimal 🟒
Phi-3-mini-4k-instructMiniSmall βœ…TestingMinimal 🟒
Phi-3-mini-128k-instructMiniSmall βœ…General tasksLow 🟒
Phi-3-small-8k-instructSmallSmall βœ…Small applicationsLow 🟒
Phi-3-medium-4k-instructMediumMediumMedium workloadsModerate 🟑
Phi-3-medium-128k-instructMediumMediumComplex tasksModerate 🟑

DeepSeek R1 Models

DeepSeek models are specifically optimized for reasoning tasks and complex problem-solving. These models are distilled from larger models while maintaining impressive performance, especially in mathematics and coding tasks.

Language Support:

  • Primary: English, Chinese
  • Strong: Math notation, Programming languages
  • Good: European languages
  • Basic: Other languages
Model NameParametersSize CategoryRecommended Use CasesResource Impact
DeepSeek-R1-Distill-Qwen-1.5B1.5BSmall βœ…Basic reasoning, Quick testingMinimal 🟒
DeepSeek-R1-Distill-Qwen-7B7BMediumMath problems, Code generationModerate 🟑
DeepSeek-R1-Distill-Llama-8B8BMediumGeneral reasoning tasksModerate 🟑
DeepSeek-R1-Distill-Qwen-14B14BLarge ⚠️Complex problem solvingHigh πŸ”΄
DeepSeek-R1-Distill-Qwen-32B32BVery Large ⚠️Advanced reasoning, ResearchVery High πŸ”΄
DeepSeek-R1-Distill-Llama-70B70BVery Large ⚠️Enterprise applicationsExtreme πŸ”΄

Vision Models

Vision models combine text and image understanding capabilities for different specialized purposes.

ModelOne (manufactAI Labs)

Specialized model optimized for extracting structured information from documents and visual data.

Language Support:

  • Primary: 70+ languages with balanced representation
  • Core languages (14% each): English, Spanish, French, German, Italian, Russian
  • Additional Support: 64 other languages

Special Capabilities:

  • Structured data extraction from documents
  • Complex table and chart interpretation
  • Advanced multilingual OCR
  • Format-flexible outputs (CSV, JSON, YAML, XML)
  • Multi-page document processing
Model NameParametersSize CategoryRecommended Use CasesResource Impact
ModelOne-Vision4.3BSmall βœ…Document extraction, Structured dataModerate 🟑

ModelOne Dataset Coverage

Trained on diverse document types: - 49% Multipage Documents - 29% Real-world Images - 14% Single-page Documents - 8% Visual Representations (tables, charts)

Phi-3.5-Vision

A lightweight state-of-the-art multimodal model focused on general visual understanding and reasoning.

Language Support:

  • Primary: English
  • Strong: Common European languages
  • Good: Asian languages
  • Basic: Other languages
Model NameParametersSize CategoryRecommended Use CasesResource Impact
Phi-3.5-vision-instruct4.2BSmall βœ…General visual tasks, Multi-frame analysisModerate 🟑

Best Practices:

  • Ensure images are clear and well-lit
  • Choose the appropriate model based on your specific use case:
    • Phi-3.5-Vision for general visual understanding and reasoning
    • ModelOne-Vision for structured document processing and data extraction with support for european languages

Making the Right Choice

Resource Impact Guide

  • 🟒 Minimal/Low: Perfect for startups and individual developers
  • 🟑 Moderate: Requires careful resource planning
  • πŸ”΄ High/Extreme: Significant infrastructure needed

When to Scale Up

Only consider larger models when you have:

  • βœ… Tested smaller models thoroughly
  • βœ… Identified specific performance gaps
  • βœ… Measured and justified the resource trade-offs
  • βœ… Budget for increased hosting costs

Next Steps

Need Help?

For most applications, start with models in the Small βœ… category. They offer excellent performance while keeping costs and complexity manageable.

On this page