Available Models
Kuzco provides a variety of on-device models for text generation, vision, and image creation. All models run locally on the device.
Text Generation Models
These models are optimized for chat, completion, and general text generation tasks.
Qwen3 4B
.qwen3_4bExcellent balance of performance and size. Recommended for most use cases.
Qwen3 8B
.qwen3_8bHigher quality responses with more nuanced understanding.
LLaMA 3 3B
.llama3_3bFast and efficient. Good for simpler tasks and quick responses.
Phi-4 Mini
.phi4_miniMicrosoft's compact model with strong reasoning capabilities.
Gemma 3 4B
.gemma3_4bGoogle's efficient model optimized for mobile devices.
DeepSeek R1 1.5B
.deepseekR1_1_5bUltra-lightweight model for basic tasks with minimal memory footprint.
// Using text modelslet session = try await KuzcoSession(model: .qwen3_4b)let response = try await session.oneShot("Explain quantum computing simply.")Vision Models
Vision models can analyze images and answer questions about visual content.
Qwen3 VL
.qwen3VLMultimodal model for image understanding and visual Q&A.
SmolVLM
.smolVLMCompact vision-language model for efficient image analysis.
// Using vision modelslet session = try await KuzcoSession(model: .qwen3VL)let response = try await session.analyzeImage( image, prompt: "What objects are in this image?")Image Generation Models
Generate images from text prompts using diffusion models.
Stable Diffusion 2.1
.stableDiffusion21Generate images from text prompts with customizable dimensions.
// Using image generationlet generator = try await KuzcoImageGenerator(model: .stableDiffusion21)let image = try await generator.generate( prompt: "A serene mountain landscape at sunset", width: 512, height: 512)Choosing the Right Model
| Use Case | Recommended Model | Why |
|---|---|---|
| General chat | .qwen3_4b | Best balance of quality and speed |
| Complex reasoning | .qwen3_8b | Larger context, better understanding |
| Quick responses | .llama3_3b | Fastest generation speed |
| Low memory devices | .deepseekR1_1_5b | Smallest memory footprint |
| Code generation | .phi4_mini | Strong at coding tasks |
| Image analysis | .qwen3VL | Best vision capabilities |
| Image generation | .stableDiffusion21 | Only image gen option |
Model Download
Models are downloaded on first use or can be pre-downloaded for better user experience:
// Check if model is availablelet isAvailable = await KuzcoModelManager.shared.isModelAvailable(.qwen3_4b)// Download with progress trackingfor try await progress in KuzcoModelManager.shared.downloadModel(.qwen3_4b) { print("Download: \(Int(progress.progress * 100))%")}See Model Management for detailed download and storage management.