MODEL WEIGHTS
572 listings · open vs closed weights · readme & download links
North Mini Code
open · Cohere
Cohere's first agentic coding model designed for developers. It combines efficiency with powerful coding capabilities, making it ideal for modern software engineering tasks.
Claude Fable 5
closed · Anthropic
Claude Fable 5 is a next-generation intelligence model designed for ambitious work. It excels at long-running tasks and can investigate codebases before acting.
Miso 1
open · Misa Labs
Miso 1 is the most emotive voice model in the world, capable of human-like emotional responses and fast reaction times. It is open source and comes with a full API for developers to build upon.
Gemma 4
open · Google DeepMind
Gemma 4 is an advanced model built from Gemini 3 research, designed to maximize intelligence-per-parameter. It supports multimodal reasoning and is optimized for various applications, including mobile and IoT devices.
Qwen3.7-Plus: Multimodal Agent Intelligence
closed · QwenTeam
Qwen3.7-Plus is a multimodal agent model that integrates vision and language capabilities into a single foundation. It excels in coding, tool use, and productivity workflows, offering a versatile solution for software engineering and automation tasks.
Grok Build 0.1
closed · xAI
Grok Build 0.1 is an intelligent coding model that powers the Grok Build CLI. It excels at agentic coding and is available via the xAI API in public beta.
LFM2.5-8B-A1B
open · Liquid AI
LFM2.5-8B-A1B is a device-optimized model designed for real-life applications on various devices including phones, laptops, and robots. It features an expanded context length and a hybrid MoE architecture, making it fast and reliable for diverse use cases.
Antigravity
closed
Antigravity is an agent-first AI development platform by Google designed for autonomous coding agents. It allows users to manage complex workflows, from coding to testing and debugging, all performed by multiple agents in parallel.
ESM Cambrian
open · Biohub
ESM Cambrian (ESMC) is a next-generation evolutionary scale model that predicts protein structure and facilitates the design of new proteins. It leverages billions of protein sequences to internalize the fundamental properties of protein biology, enabling high-accuracy predictions and innovative protein designs.
Claude Opus 4.8
closed · Anthropic
Claude Opus 4.8 introduces enhancements in coding, long-running agentic work, and complex knowledge tasks. It offers a fast mode for quicker outputs and a new effort dial for response customization.
LocateAnything
open · The Hong Kong Polytechnic University, Princeton University, Nanjing University, University of Illinois Urbana-Champaign
Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding. LocateAnything performs diverse localization tasks under a unified vision-language model, including document understanding, GUI grounding, dense object detection, and OCR localization.
LongCat Video Avatar 1.5
open · victor
LongCat Video Avatar 1.5 is a model designed for creating animated video avatars. It leverages advanced techniques to generate lifelike representations in video format.
Kronos
open · shiyu-coder
Kronos is the first open-source foundation model for financial candlesticks, trained on data from over 45 global exchanges. It is designed to handle the unique characteristics of financial data, providing a specialized solution for forecasting in financial markets.
Aleph 2.0
closed · Runway
Aleph 2.0 is an upgraded video editing model that allows users to modify video content efficiently. It enables users to edit a single frame and apply those changes across the entire video while preserving unaltered elements.
Qwen 3.7-Max
closed · QwenTeam
Qwen 3.7-Max is a proprietary model designed for the agent era, excelling in coding, office automation, and long-horizon reasoning tasks. It offers versatile capabilities for writing and debugging code, automating workflows, and executing complex tasks autonomously.
Composer 2.5
closed · Cursor
Composer 2.5 is a substantial improvement over its predecessor, offering enhanced intelligence and behavior for long-running tasks. It excels in following complex instructions and provides a more pleasant collaboration experience.
Cohere Command A+
open · Cohere Labs
Cohere Command A+ is an open-source LLM optimized for agentic, multilingual, and reasoning-heavy tasks. It supports vision inputs and is designed for efficient deployment on minimal hardware.
Marlin 2B
open · NemoStation
Marlin 2B is a video VLM designed to extract structured information from videos, providing precise scene and event captions with timestamps. It excels in dense captioning and temporal grounding tasks.
Lance
open · ByteDance
Lance is a 3B native unified multimodal model that supports image and video understanding, generation, and editing within a single framework. At that scale it delivers strong performance across various benchmarks.
Gemini 3.5 Flash
closed · Google DeepMind
Gemini 3.5 Flash is designed for executing complex, agentic workflows with exceptional speed and intelligence. It excels in coding and long-horizon tasks, providing real-world utility for developers and enterprises.
Starchild-1: The First Real-Time Multimodal World Model
closed · Odyssey
Starchild-1 is the world's first multimodal world model that generates synchronized audio and video in real time while responding to user input. It represents a significant advancement in generative intelligence by learning directly from the world through large-scale video.
Perception 1.0
closed · Ceptory
Perception 1.0 is the core model layer behind Ceptory's enterprise video intelligence, enabling natural language search, multimodal analysis, and operational monitoring. It provides structured outputs ready for API integration and supports retrieval from large video libraries.
MiniMax M2.5
closed · MiniMax
MiniMax M2.5 is a state-of-the-art model designed for real-world productivity, excelling in coding, agentic tool use, and office work. It offers significant improvements in task completion speed and cost-effectiveness, making it ideal for complex applications.
Dramabox
open · Resemble AI
Dramabox is an expressive text-to-speech model with voice cloning capabilities. It allows users to control speaker identity, emotion, and delivery through prompts, making it ideal for creating dynamic audio content.