Qwen: Qwen3.5-Flash
Qwen3
Multimodal
Paid
The Qwen3.5 Flash models are native vision-language models built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts design to improve inference efficiency. Compared with the Qwen3 series, they deliver substantially better performance on both pure-text and multimodal tasks while keeping response times fast.
Parameters: -
Context Window: 1,000,000 tokens
Input Price: $0.065 per 1M tokens
Output Price: $0.26 per 1M tokens
Capabilities
Model capabilities and supported modalities
Performance
Reasoning: -
Math: -
Coding: -
Knowledge: -
Modalities
Input Modalities: text, image, video
Output Modalities: text
LLM Price Calculator
Calculate the cost of using this model
Input Cost: $0.000097
Output Cost: $0.000780
Total Cost: $0.000878
Estimated usage: 4,500 tokens
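The calculator figures can be reproduced from the listed per-token prices. The sketch below is illustrative only: the function name `request_cost` is hypothetical, and the split of the 4,500-token estimate into 1,500 input and 3,000 output tokens is an assumption inferred from the displayed costs, not something the page states.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.065")
OUTPUT_PRICE_PER_M = Decimal("0.26")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int):
    """Return (input_cost, output_cost, total_cost) in USD for one request."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost, output_cost, input_cost + output_cost


# Assumed split (inferred, not stated on the page): 1,500 input + 3,000 output.
inp, out, total = request_cost(1_500, 3_000)
print(f"Input: ${inp}  Output: ${out}  Total: ${total}")
```

Under that assumed split the exact values are 0.0000975, 0.00078, and 0.0008775 USD, which agree with the six-decimal figures displayed above up to rounding.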
Monthly Cost Estimator
Based on different usage levels
Light Usage: $0.0032 (~10 requests)
Moderate Usage: $0.0325 (~100 requests)
Heavy Usage: $0.3250 (~1,000 requests)
Enterprise: $3.2500 (~10,000 requests)
Note: Estimates based on current token count settings per request.
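The page does not say which token counts the tiered estimates use. As a rough sketch, the figures are consistent with about 1,000 input and 1,000 output tokens per request, so the example below adopts that split as an explicit assumption. The helper name `monthly_cost` is hypothetical; the tier names and request counts are taken from the listing.

```python
# Listed prices in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.065
OUTPUT_PRICE_PER_M = 0.26

# Assumption (inferred from the tier figures, not stated on the page):
# each request uses 1,000 input tokens and 1,000 output tokens.
ASSUMED_INPUT_TOKENS = 1_000
ASSUMED_OUTPUT_TOKENS = 1_000


def monthly_cost(requests: int) -> float:
    """Estimated USD cost for a number of requests at the assumed token split."""
    per_request = (
        ASSUMED_INPUT_TOKENS * INPUT_PRICE_PER_M
        + ASSUMED_OUTPUT_TOKENS * OUTPUT_PRICE_PER_M
    ) / 1_000_000
    return requests * per_request


TIERS = {"Light": 10, "Moderate": 100, "Heavy": 1_000, "Enterprise": 10_000}
estimates = {name: monthly_cost(count) for name, count in TIERS.items()}

for name, cost in estimates.items():
    print(f"{name}: ${cost:.4f}")
```

With a different token split per request, only the two `ASSUMED_*` constants need to change.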
Last Updated: 2026/04/01
