Skip to main content
Qualcomm® Dragonwing™ products bring generative AI (GenAI) to edge devices without relying solely on cloud infrastructure, lowering latency, enhancing privacy, and reducing costs for diverse IoT applications. The following image illustrates the GenAI architecture, including supported use cases and applications, GenAI frameworks, and the lower-level backends and libraries. GenAI architecture diagram Dragonwing IQ products integrate a CPU, GPU, and Hexagon NPU for heterogeneous computing, optimized for large language models (LLMs), vision, and multimodal AI tasks. Leading GenAI models — including Llama, Whisper, Stable Diffusion, and LLaVA — are supported for IoT scenarios such as robotics, retail, and industrial automation. For responsive LLM performance, Dragonwing IQ supports multi-billion parameter models with techniques such as prefill acceleration and self-speculative decoding. To enable bring-your-own-model (BYOM) workflows and streamline deployment, use Qualcomm AI Hub, Qualcomm AI Runtime SDK execution providers, and Qualcomm Generative AI Inference Extensions (Genie).

Why on-device GenAI matters

Generative AI on Dragonwing delivers context-aware, adaptive, and privacy-preserving solutions at the edge, enabling automation, personalization, and operational efficiency for use cases such as autonomous drones, predictive maintenance, multimodal agents, and on-device retrieval-augmented generation (RAG).

Get started

Select the option that matches your GenAI workflow.

Prepare a GenAI model

Prepare a GenAI model for execution on Qualcomm Dragonwing IoT devices.

Run a GenAI model

Run a prepared GenAI model on a Qualcomm Dragonwing IoT device.

Use GenAI models with Genie

Prepare, manage, and execute GenAI models with Qualcomm Generative AI Inference Extensions (Genie).

Develop a GenAI application

Build your own GenAI application based on a Qualcomm sample application.