Gemini 3.5 Flash vs UI-TARS 7B

Google vs ByteDance — benchmarks, pricing, and capabilities side by side.

  • UI-TARS 7B is cheaper on both input ($0.10 vs $1.50 per 1M tokens) and output ($0.20 vs $9.00 per 1M tokens)
  • Gemini 3.5 Flash has a larger context window (1,048,576 vs 128,000 tokens)
Model: Gemini 3.5 Flash | UI-TARS 7B
Intelligence index: 92.2
Developer: Google | ByteDance
Type: Multimodal | Multimodal
Access: API only | Open weights
Context window: 1,048,576 tokens | 128,000 tokens
Input price: $1.50 / 1M tokens | $0.10 / 1M tokens
Output price: $9.00 / 1M tokens | $0.20 / 1M tokens
Speed: 221 tok/s
Released: May 19, 2026 | July 22, 2025
Parameters:
Input modalities: Text, Image, Audio, Video | Image, Text
Output modalities: Text | Text
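
To see how the per-token prices translate into the cost of a single request, here is a minimal Python sketch. The prices are copied from the comparison above. The function name `request_cost` and the example workload (10,000 input tokens, 1,000 output tokens) are assumptions made for illustration only, and real bills may differ from list prices.

```python
# List prices in USD per 1M tokens, copied from the comparison above.
PRICES = {
    "Gemini 3.5 Flash": {"input": 1.50, "output": 9.00},
    "UI-TARS 7B": {"input": 0.10, "output": 0.20},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at list prices (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 10,000 input tokens and 1,000 output tokens.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 10_000, 1_000):.4f}")
```

Under that assumed workload the estimate is about $0.0240 for Gemini 3.5 Flash and $0.0012 for UI-TARS 7B. Output tokens account for a larger share of the Gemini figure because its output price is six times its input price.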