UI-TARS 7B
All models
BytedanceBytedance

UI-TARS 7B

128K context$0.100/M input$0.200/M output

UI-TARS 7B is an AI model from Bytedance built for agent workflows, with support for image, text input and text output. UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

What is UI-TARS 7B ?

UI-TARS 7B is an AI model from Bytedance that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare UI-TARS 7B against other models for agent workflows and production deployments.

Model ID

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

Modalities
Input
imagetext
Output
text
Supported Parameters
frequency_penaltylogit_biasmax_tokenspresence_penaltyrepetition_penaltyseedstoptemperaturetop_ktop_p

Related content