Rank #7 on AIDB
14 models in the AIDB database. Average AIDB score 84, top score 87, momentum index 140.3.
No-code bot platform for building, publishing and monetizing AI agents.
We present Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning.
We present Infinity, a bitwise visual autoregressive model capable of generating high-resolution, photorealistic images following language instructions.
ByteDance's flagship LLM, widely deployed in China.
We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the scale of more than 10,000 GPUs.
PixelDance V1.4 is a video generation model developed by the ByteDance Research team, built on the Diffusion Transformer (DiT) architecture.
We introduce SeedEdit, a diffusion model that is able to revise a given image with any text prompt.
We present LLaVA-OneVision, a family of open large multimodal models (LMMs) developed by consolidating our insights into data, models, and visual representations in the LLaVA-NeXT blog series.
We present GR-2, a state-of-the-art generalist robot agent for versatile and generalizable robot manipulation.
Seed1.6 is the latest general-purpose model series unveiled by the ByteDance Seed team.
ByteDance Seed video generation model.
ByteDance's image, video, and audio generation model tracked by Epoch, focused primarily on video generation.
ByteDance's foundation model for fast multimodal content creation across short-form video pipelines.
A professional-grade, self-developed LLM supporting a context window of up to 128k tokens, with fine-tuning available across the entire series.