AllenAI: Molmo2 8B (free)
Other
Multimodal
Free
Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family, supporting image, video, and multi-image understanding and grounding. It is based on Qwen3-8B and uses SigLIP 2 as its vision backbone, outperforming other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.
Parameters
8B
Context Window
36,864
tokens
Input Price
$0
per 1M tokens
Output Price
$0
per 1M tokens
Capabilities
Model capabilities and supported modalities
Performance
Reasoning
Good reasoning with solid logical foundations
Math
-
Coding
-
Knowledge
-
Modalities
Input Modalities
text,image,video
Output Modalities
text
LLM Price Calculator
Calculate the cost of using this model
$0.000000
$0.000000
Input Cost:$0.000000
Output Cost:$0.000000
Total Cost:$0.000000
Estimated usage: 4,500 tokens
Monthly Cost Estimator
Based on different usage levels
Light Usage
$0.0000
~10 requests
Moderate Usage
$0.0000
~100 requests
Heavy Usage
$0.0000
~1000 requests
Enterprise
$0.0000
~10,000 requests
Note: Estimates based on current token count settings per request.
Last Updated: 2026/02/04
