LogoTop AI Hubs

Arcee AI: Spotlight

Other
Multimodal
Paid

Spotlight is a 7‑billion‑parameter vision‑language model derived from Qwen 2.5‑VL and fine‑tuned by Arcee AI for tight image‑text grounding tasks. It offers a 32 k‑token context window, enabling rich multimodal conversations that combine lengthy documents with one or more images. Training emphasized fast inference on consumer GPUs while retaining strong captioning, visual‐question‑answering, and diagram‑analysis accuracy. As a result, Spotlight slots neatly into agent workflows where screenshots, charts or UI mock‑ups need to be interpreted on the fly. Early benchmarks show it matching or out‑scoring larger VLMs such as LLaVA‑1.6 13 B on popular VQA and POPE alignment tests.

Parameters

-

Context Window

131,072

tokens

Input Price

$0.18

per 1M tokens

Output Price

$0.18

per 1M tokens

Capabilities

Model capabilities and supported modalities

Performance

Reasoning

-

Math

-

Coding

-

Knowledge

-

Modalities

Input Modalities

image

Output Modalities

text

LLM Price Calculator

Calculate the cost of using this model

$0.000270
$0.000540
Input Cost:$0.000270
Output Cost:$0.000540
Total Cost:$0.000810
Estimated usage: 4,500 tokens

Monthly Cost Estimator

Based on different usage levels

Light Usage
$0.0036
~10 requests
Moderate Usage
$0.0360
~100 requests
Heavy Usage
$0.3600
~1000 requests
Enterprise
$3.6000
~10,000 requests
Note: Estimates based on current token count settings per request.
Last Updated: 2025/05/06