qwen

@qwen

On GitHub since April 2014

Published Tools

1 Skill, 85 Agents across 3 categories

Qwen: Qwen3.5-9B

qwen

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design with early fusion of multimodal tokens, allowing the model to process and reason across text and images within the same context.

Agent · LLM Model

1 dir

Qwen: Qwen3.5-35B-A3B

qwen

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall performance is comparable to that of the Qwen3.5-27B.

Agent · LLM Model

1 dir

Qwen: Qwen3.5-27B

qwen

The Qwen3.5 27B native vision-language dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of the Qwen3.5-122B-A10B.

Agent · LLM Model

1 dir

Qwen: Qwen3.5-122B-A10B

qwen

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. In terms of overall performance, this model is second only to Qwen3.5-397B-A17B. Its text capabilities significantly outperform those of Qwen3-235B-2507, and its visual capabilities surpass those of Qwen3-VL-235B.

Agent · LLM Model

1 dir

Qwen: Qwen3.5-Flash

qwen

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.

Agent · LLM Model

1 dir

Qwen: Qwen3.5 Plus 2026-02-15

qwen

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference efficiency. Across a variety of task evaluations, the 3.5 series consistently performs on par with leading state-of-the-art models. Compared to the 3 series, these models show a leap forward in both pure-text and multimodal capabilities.

Agent · LLM Model

1 dir

Qwen: Qwen3.5 397B A17B

qwen

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers state-of-the-art performance comparable to leading-edge models across a wide range of tasks, including language understanding, logical reasoning, code generation, agent-based tasks, image understanding, video understanding, and graphical user interface (GUI) interactions.

Agent · LLM Model

1 dir

Qwen: Qwen3 Max Thinking

qwen

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it delivers major gains in factual accuracy, complex reasoning, instruction following, alignment with human preferences, and agentic behavior.

Agent · LLM Model

1 dir

Qwen: Qwen3 Coder Next

qwen

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per token, delivering performance comparable to models with 10 to 20x higher active compute, which makes it well suited for cost-sensitive, always-on agent deployment. The model is trained with a strong agentic focus and performs reliably on long-horizon coding tasks, complex tool usage, and recovery fro

Agent · LLM Model

1 dir

Qwen: Qwen3 VL 32B Instruct

qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text comprehension, enabling fine-grained spatial reasoning, document and scene analysis, and long-horizon video understanding. It offers robust OCR in 32 languages and enhanced multimodal fusion through Interleaved-MRoPE and DeepStack architectures. Optimized for agentic

Agent · LLM Model

1 dir

Qwen: Qwen3 VL 8B Thinking

qwen

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and long-context processing (native 256K, expandable to 1M tokens) for tasks such as scientific visual analysis, causal inference, and mathematical reasoning over image or video inputs. Compared to the Instruct edition, the Thinking version introduces d

Agent · LLM Model

1 dir

Qwen: Qwen3 VL 8B Instruct

qwen

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon temporal reasoning, DeepStack for fine-grained visual-text alignment, and text-timestamp alignment for precise event localization. The model supports a native 256K-token context window, extensible to 1M tokens, and handles both static and dynamic medi

Agent · LLM Model

1 dir

Qwen: Qwen3 VL 30B A3B Thinking

qwen

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels in perception of real-world/synthetic categories, 2D/3D spatial grounding, and long-form visual comprehension, achieving competitive multimodal benchmark results. For agentic use, it handles multi-image multi-turn instructions, video timeline alignments, GUI automation, and visual c

Agent · LLM Model

1 dir

Qwen: Qwen3 VL 30B A3B Instruct

qwen

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception of real-world/synthetic categories, 2D/3D spatial grounding, and long-form visual comprehension, achieving competitive multimodal benchmark results. For agentic use, it handles multi-image multi-turn instructions, video timeline alignments, GUI automation, and

Agent · LLM Model

1 dir

Qwen: Qwen3 VL 235B A22B Thinking

qwen

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math. The series emphasizes robust perception (recognition of diverse real-world and synthetic categories), spatial understanding (2D/3D grounding), and long-form visual comprehension, with competitive results on public multimodal benchmarks for both perception and reasoning. Beyond analysis,

Agent · LLM Model

1 dir

Qwen: Qwen3 VL 235B A22B Instruct

qwen

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table extraction, multilingual OCR). The series emphasizes robust perception (recognition of diverse real-world and synthetic categories), spatial understanding (2D/3D grounding), and long-form visual comprehension, with competitive results on public multimodal ben

Agent · LLM Model

1 dir

Qwen: Qwen3 Max

qwen

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It delivers higher accuracy in math, coding, logic, and science tasks, follows complex instructions in Chinese and English more reliably, reduces hallucinations, and produces higher-quality responses for open-ended Q&A, writing, and conversation. The model supports over 100 language

Agent · LLM Model

1 dir

Qwen: Qwen3 Coder Plus

qwen

Qwen3 Coder Plus is Alibaba's proprietary version of the open-source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and environment interaction, combining coding proficiency with versatile general-purpose abilities.

Agent · LLM Model

1 dir

Qwen: Qwen3 Coder Flash

qwen

Qwen3 Coder Flash is Alibaba's fast and cost-efficient version of its proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling and environment interaction, combining coding proficiency with versatile general-purpose abilities.

Agent · LLM Model

1 dir

Qwen: Qwen3 Next 80B A3B Thinking

qwen

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems (math proofs, code synthesis/debugging, logic, and agentic planning) and reports strong results across knowledge, reasoning, coding, alignment, and multilingual evaluations. Compared with prior Qwen3 variants, it emphasizes stability under long chains of thought and efficient scaling during inference, and it is tuned t

Agent · LLM Model

1 dir

Qwen: Qwen3 Next 80B A3B Instruct (free)

qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual use, while remaining robust on alignment and formatting. Compared with prior Qwen3 instruct variants, it focuses on higher throughput and stability on ultra-long inputs and multi-turn dialogues, making it well-suited for RAG, tool use, and agentic workflows

Agent · LLM Model

1 dir

Qwen: Qwen3 Next 80B A3B Instruct

qwen

Agent · LLM Model

1 dir

Qwen: Qwen Plus 0728 (thinking)

qwen

Qwen Plus 0728, based on the Qwen3 foundation model, is a hybrid reasoning model with a 1-million-token context window that balances performance, speed, and cost.

Agent · LLM Model

1 dir

Qwen: Qwen Plus 0728

qwen

Qwen Plus 0728, based on the Qwen3 foundation model, is a hybrid reasoning model with a 1-million-token context window that balances performance, speed, and cost.

Agent · LLM Model

1 dir

Qwen: Qwen3 30B A3B Thinking 2507

qwen

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for “thinking mode,” where internal reasoning traces are separated from final answers. Compared to earlier Qwen3-30B releases, this version improves performance across logical reasoning, mathematics, science, coding, and multilingual benchmarks. It also demonstrates stronger instruction following, tool use, and a

Agent · LLM Model

1 dir
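
The Qwen3-30B-A3B-Thinking-2507 description above says that internal reasoning traces are separated from final answers. The sketch below shows one way a client might do that split on a raw completion. It assumes the trace is wrapped in `<think>` and `</think>` tags (the tag name mentioned in the Qwen3-235B-A22B-Instruct-2507 entry); `split_thinking` and `ParsedReply` are hypothetical names, not part of any Qwen API.

```python
import re
from typing import NamedTuple


class ParsedReply(NamedTuple):
    reasoning: str  # concatenated contents of all <think> blocks
    answer: str     # everything outside the <think> blocks


# Assumption: reasoning traces are delimited by <think> ... </think>.
_THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(raw: str) -> ParsedReply:
    """Separate reasoning traces from the final answer in a raw completion."""
    traces = [block.strip() for block in _THINK_BLOCK.findall(raw)]
    answer = _THINK_BLOCK.sub("", raw).strip()
    return ParsedReply(reasoning="\n".join(traces), answer=answer)
```

For a completion with no `<think>` block, `reasoning` comes back empty and `answer` is the stripped input, so the same helper can sit in front of non-thinking models too.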

Qwen: Qwen3 Coder 30B A3B Instruct

qwen

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the Qwen3 architecture, it supports a native context length of 256K tokens (extendable to 1M with Yarn) and performs strongly in tasks involving function calls, browser use, and structured code completion. This model is optimized for instruction-following without “think

Agent · LLM Model

1 dir
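
The Qwen3-Coder-30B-A3B-Instruct entry above mentions 128 experts with 8 active per forward pass. As a rough illustration of what "8 of 128 active" means, here is a generic top-k router in plain Python. It is a textbook-style sketch, not Qwen's actual routing code: `route_token` is a hypothetical name, and normalizing with a softmax over only the selected logits is an assumption.

```python
import math
from typing import List, Tuple

NUM_EXPERTS = 128  # figure from the listing
TOP_K = 8          # figure from the listing


def route_token(router_logits: List[float], k: int = TOP_K) -> List[Tuple[int, float]]:
    """Pick the k highest-scoring experts and softmax-normalize their weights."""
    if not 0 < k <= len(router_logits):
        raise ValueError("k must be between 1 and the number of experts")
    # Indices of the k largest logits; ties go to the lower index.
    top = sorted(range(len(router_logits)), key=lambda i: (-router_logits[i], i))[:k]
    # Softmax over the selected logits only, shifted by the max for stability.
    peak = router_logits[top[0]]
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]
```

Each token's output would then be the weighted sum of the chosen experts' outputs, so only 8 of the 128 expert networks run for that token.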

Qwen: Qwen3 30B A3B Instruct 2507

qwen

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It operates in non-thinking mode and is designed for high-quality instruction following, multilingual understanding, and agentic tool use. Post-trained on instruction data, it demonstrates competitive performance across reasoning (AIME, ZebraLogic), coding (MultiPL-E, LiveCodeBench), and alignment (IFEval, WritingBench) benchmarks. It outperforms its non-instru

Agent · LLM Model

1 dir

Qwen: Qwen3 235B A22B Thinking 2507

qwen

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144 tokens of context. This "thinking-only" variant enhances structured logical reasoning, mathematics, science, and long-form generation, showing strong benchmark performance across AIME, SuperGPQA, LiveCodeBench, and MMLU-Redux. It enforces a special reasoning mode

Agent · LLM Model

1 dir

Qwen: Qwen3 Coder 480B A35B (free)

qwen

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over repositories. The model features 480 billion total parameters, with 35 billion active per forward pass (8 out of 160 experts). Pricing for the Alibaba endpoints varies by context length. Once a request is greater than 128k input tokens, the higher pricing is used.

Agent · LLM Model

1 dir
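
The Qwen3 Coder 480B A35B entry above gives two pieces of arithmetic: the share of the model that is active per forward pass, and a price tier that switches once a request exceeds 128k input tokens. The sketch below works through both. The prices are hypothetical placeholders, and reading "128k" as 128,000 tokens and billing the whole request at the higher rate are assumptions; check the provider's pricing page for the real values.

```python
from dataclasses import dataclass

# Figures stated in the listing.
TOTAL_PARAMS_B = 480
ACTIVE_PARAMS_B = 35
TOTAL_EXPERTS = 160
ACTIVE_EXPERTS = 8

# Assumption: "128k" means 128,000 tokens (a provider might count 131,072).
TIER_THRESHOLD_TOKENS = 128_000


@dataclass(frozen=True)
class Tier:
    name: str
    usd_per_million_input: float  # hypothetical placeholder price


STANDARD = Tier("standard", 1.0)          # hypothetical
LONG_CONTEXT = Tier("long-context", 2.0)  # hypothetical


def pick_tier(input_tokens: int) -> Tier:
    """Return the pricing tier for a request of the given input size."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    # The higher price applies only when the request is strictly above the threshold.
    return LONG_CONTEXT if input_tokens > TIER_THRESHOLD_TOKENS else STANDARD


def input_cost_usd(input_tokens: int) -> float:
    """Assumption: the whole request is billed at the selected tier's rate."""
    return input_tokens / 1_000_000 * pick_tier(input_tokens).usd_per_million_input


active_param_share = ACTIVE_PARAMS_B / TOTAL_PARAMS_B  # about 0.073
active_expert_share = ACTIVE_EXPERTS / TOTAL_EXPERTS   # 0.05
```

The parameter share (about 7.3%) is higher than the expert share (5%), which is what one would expect if some parameters outside the routed experts are used on every forward pass.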

Qwen: Qwen3 Coder 480B A35B

qwen

Agent · LLM Model

1 dir

Qwen: Qwen3 235B A22B Instruct 2507

qwen

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in know

Agent · LLM Model

1 dir

Qwen: Qwen3 4B (free)

qwen

Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.

Agent · LLM Model

1 dir

Qwen: Qwen3 30B A3B

qwen

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique ability to switch seamlessly between a thinking mode for complex reasoning and a non-thinking mode for efficient dialogue ensures versatile, high-quality performance. Significantly outperforming prior models like QwQ and Qwen2.5, Qwen3 delivers superior mathematics, coding, commonsen

Agent · LLM Model

1 dir

Qwen: Qwen3 8B

qwen

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to

Agent · LLM Model

1 dir

Qwen: Qwen3 14B

qwen

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, programming, and logical inference, and a "non-thinking" mode for general-purpose conversation. The model is fine-tuned for instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts a

Agent · LLM Model

1 dir

Qwen: Qwen3 32B

qwen

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles

Agent · LLM Model

1 dir

Qwen: Qwen3 235B A22B

qwen

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and code tasks, and a "non-thinking" mode for general conversational efficiency. The model demonstrates strong reasoning ability, multilingual support (100+ languages and dialects), advanced instruction-following, and agent tool-calling capabilities. It natively handles a 32K token c

Agent · LLM Model

1 dir

Qwen: Qwen2.5 Coder 7B Instruct

qwen

Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for code-related tasks such as code generation, reasoning, and bug fixing. Based on the Qwen2.5 architecture, it incorporates enhancements like RoPE, SwiGLU, RMSNorm, and GQA attention with support for up to 128K tokens using YaRN-based extrapolation. It is trained on a large corpus of source code, synthetic data, and text-code grounding, providing robust performance across programming languages and agentic co

Agent · LLM Model

1 dir

Qwen: Qwen2.5 VL 32B Instruct

qwen

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual interpretation within images, and precise event localization in extended videos. Qwen2.5-VL-32B demonstrates state-of-the-art performance across multimodal benchmarks such as MMMU, MathVista, and VideoMME, while maintaining strong re

Agent · LLM Model

1 dir

Qwen: QwQ 32B

qwen

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ can think and reason, which significantly improves its performance on downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model and achieves competitive performance against state-of-the-art reasoning models such as DeepSeek-R1 and o1-mini.

Agent · LLM Model

1 dir

Qwen: Qwen VL Plus

qwen

Qwen VL Plus is Qwen's enhanced large visual language model. It is significantly upgraded for detailed recognition and text recognition, and supports image input at resolutions of up to millions of pixels and at extreme aspect ratios. It delivers strong performance across a broad range of visual tasks.

Agent · LLM Model

1 dir

Qwen: Qwen VL Max

qwen

Qwen VL Max is a visual understanding model with a 7,500-token context length. It is aimed at a broader spectrum of complex tasks.

Agent · LLM Model

1 dir

Qwen: Qwen-Turbo

qwen

Qwen-Turbo, based on Qwen2.5, is a 1M-token context model that offers high speed and low cost, suitable for simple tasks.

Agent · LLM Model

1 dir

Qwen: Qwen2.5 VL 72B Instruct

qwen

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.

Agent · LLM Model

1 dir

Qwen: Qwen-Plus

qwen

Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K-token context model that balances performance, speed, and cost.

Agent · LLM Model

1 dir

Qwen: Qwen-Max

qwen

Qwen-Max, based on Qwen2.5, provides the best inference performance among Qwen models, especially for complex multi-step tasks. It's a large-scale MoE model that has been pretrained on over 20 trillion tokens and further post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) methodologies. The parameter count is unknown.

Agent · LLM Model

1 dir

Qwen2.5 Coder 32B Instruct

qwen

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significant improvements in code generation, code reasoning and code fixing. - A more comprehensive foundation for real-world applications such as Code Agents. Not only enhancing coding capabilities but also maintaining its strengths in mathematics and general competencies. To read more about its eval

Agent · LLM Model

1 dir

Qwen: Qwen2.5 7B Instruct

qwen

Qwen2.5 7B is part of the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and greatly improved capabilities in coding and mathematics, thanks to our specialized expert models in these domains. - Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g., tables), and generating structured outputs, especially JSON. More resilient to the diversity of sys

Agent · LLM Model

1 dir

Qwen2.5 72B Instruct

qwen

Qwen2.5 72B is part of the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and greatly improved capabilities in coding and mathematics, thanks to our specialized expert models in these domains. - Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g., tables), and generating structured outputs, especially JSON. More resilient to the diversity of sy

Agent · LLM Model

1 dir

Qwen: Qwen2.5-VL 7B Instruct

qwen

Qwen2.5 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements: - SoTA understanding of images of various resolution & ratio: Qwen2.5-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc. - Understanding videos of 20min+: Qwen2.5-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc. - Agent that can operate your mobiles, robo

Agent · LLM Model

1 dir

QVQ-72B-Preview

Qwen

QVQ-72B-Preview - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-0.6B

Qwen

Qwen3-0.6B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-1.7B

Qwen

Qwen3-1.7B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-14B

Qwen

Qwen3-14B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-235B-A22B

Qwen

Qwen3-235B-A22B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-235B-A22B-Instruct-2507

Qwen

Qwen3-235B-A22B-Instruct-2507 - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-235B-A22B-Thinking-2507

Qwen

Qwen3-235B-A22B-Thinking-2507 - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-30B-A3B

Qwen

Qwen3-30B-A3B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-30B-A3B-Thinking-2507

Qwen

Qwen3-30B-A3B-Thinking-2507 - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-32B

Qwen

Qwen3-32B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-4B

Qwen

Qwen3-4B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-8B

Qwen

Qwen3-8B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-Coder-30B-A3B-Instruct

Qwen

Qwen3-Coder-30B-A3B-Instruct - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-Coder-480B-A35B-Instruct

Qwen

Qwen3-Coder-480B-A35B-Instruct - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-Next-80B-A3B-Instruct

Qwen

Qwen3-Next-80B-A3B-Instruct - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-Next-80B-A3B-Thinking

Qwen

Qwen3-Next-80B-A3B-Thinking - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-VL-235B-A22B-Instruct

Qwen

Qwen3-VL-235B-A22B-Instruct - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-VL-8B-Instruct

Qwen

Qwen3-VL-8B-Instruct - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3-VL-8B-Thinking

Qwen

Qwen3-VL-8B-Thinking - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3.5-122B-A10B

Qwen

Qwen3.5-122B-A10B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3.5-27B

Qwen

Qwen3.5-27B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3.5-35B-A3B

Qwen

Qwen3.5-35B-A3B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen3.5-397B-A17B

Qwen

Qwen3.5-397B-A17B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

QwQ-32B

Qwen

QwQ-32B - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

QwQ-32B-Preview

Qwen

QwQ-32B-Preview - AI model available on ModelScope inference platform

Agent · AI Model

1 dir

Qwen: Qwen3.6 Plus Preview (free)

qwen

Qwen 3.6 Plus Preview is the next-generation evolution of the Qwen Plus series, featuring an advanced hybrid architecture that improves efficiency and scalability. It delivers stronger reasoning and more reliable agentic behavior compared to the 3.5 series. In benchmarks, it performs at or above leading state-of-the-art models. Designed as a flagship preview, it excels in agentic coding, front-end development, and complex problem-solving. Note: The model collects prompt and completion data that

Agent · LLM Model

1 dir

Qwen: Qwen3.6 Plus (free)

qwen

Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers major gains in agentic coding, front-end development, and overall reasoning, with a significantly improved “vibe coding” experience. The model excels at complex tasks such as 3D scenes, games, and repository-level problem solving, achieving a 78.8 score on SWE-bench Verifi

Agent · LLM Model

1 dir

Qwen: Qwen3.6 Plus

qwen

Agent · LLM Model

1 dir

Qwen: Qwen3.5 Plus 2026-04-20

qwen

Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...

Agent · LLM Model

1 dir

Qwen: Qwen3.6 Flash

qwen

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...

Agent · LLM Model

1 dir

Qwen: Qwen3.6 35B A3B

qwen

Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters per token. It uses a hybrid sparse mixture-of-experts architecture combining Gated...

Agent · LLM Model

1 dir

Qwen: Qwen3.6 Max Preview

qwen

Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic coding, tool use, and...

Agent · LLM Model

1 dir

Qwen: Qwen3.6 27B

qwen

Qwen3.6 27B is a dense 27-billion-parameter language model from the Qwen Team at Alibaba, released in April 2026. It features hybrid multimodal capabilities — accepting text, image, and video inputs...

Agent · LLM Model

1 dir

qwen-asr-dgxspark

Alibaba Qwen Team, AllenChou (DGX Spark port)

Qwen3-ASR python package, patched to run on NVIDIA DGX Spark (Grace-Blackwell) with torch>=2.11, transformers>=5, vllm>=0.20.

Skill · ai-ml

1 dir

Qwen: Qwen3.7 Max

qwen

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,...

Agent · LLM Model

1 dir

Qwen: Qwen3.7 Plus

qwen

Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and image input with text output, building on the series' text capabilities with a comprehensive upgrade to its...

Agent · LLM Model

1 dir