On-Device AI for Your iOS App
Ditch per-token API fees. Give your users unlimited AI access with one flat monthly cost. Ship in 3 lines of code.
import Kuzco
let session = try await KuzcoSession(model: .qwen3_4b)
for try await partial in session.streamResponse(to: "Hello!") {
print(partial.text, terminator: "")
}Everything you need for on-device AI
Kuzco SDK provides a complete toolkit for running AI models locally on iOS devices. No cloud required.
Text Generation
Stream responses in real-time. Build chatbots on-device.
Vision AI
Analyze images locally with vision models.
Image Generation
Create images with Stable Diffusion.
Privacy First
All processing on-device. Data never leaves.
Model Manager
Download and manage models easily.
SwiftUI Ready
Drop-in components for chat UIs and more.
import SwiftUIimport Kuzcovar body: some View {KuzcoChat()}
Built for performance
Optimized models, maximum context, and real-time generation across all Apple devices.
AI Models
Text, vision, and image generation — constantly growing.
Model Sizes
Optimized models from 1.1GB to 5GB.
32K Context
8x more context than Apple Intelligence.
All Apple Devices
iPhone, iPad, Mac, and Vision Pro.
Works Offline
No internet required. Run AI anywhere.
Faster Generation
Outperforms Apple Intelligence on-device.
Build AI apps in minutes
Simple, intuitive APIs for text, image, and vision AI. All running locally on-device.
import Kuzco// Create a session with your preferred modellet session = try await KuzcoSession(model: .qwen3_4b)// Stream responses in real-timefor try await partial in session.streamResponse(to: "Explain Swift") { print(partial.text, terminator: "") if partial.isComplete { print("Tokens: \(partial.usage?.totalTokens ?? 0)") }}// Or get a complete responselet response = try await session.respond(to: "Hello!")Get early access
Be among the first to build on-device AI apps with Kuzco SDK. No credit card required. We'll notify you when it's ready.