llms / directory

MODEL WEIGHTS

572listings · open vs closed weights · readme & download links

North Mini Code

open

Cohere

Cohere's first agentic coding model designed for developers. It combines efficiency with powerful coding capabilities, making it ideal for modern software engineering tasks.

code· 30B· 64,000 ctx
0 · 0 commentsweights link →

Claude Fable 5

closed

Anthropic

Claude Fable 5 is a next-generation intelligence model designed for ambitious work. It excels at long-running tasks and can investigate codebases before acting.

code
0 · 0 comments

Miso 1

open

Misa Labs

Miso 1 is the most emotive voice model in the world, capable of human-like emotional responses and fast reaction times. It is open source and comes with a full API for developers to build upon.

voice
0 · 0 comments

Gemma 4

open

Google DeepMind

Gemma 4 is an advanced model built from Gemini 3 research, designed to maximize intelligence-per-parameter. It supports multimodal reasoning and is optimized for various applications, including mobile and IoT devices.

language· 12B, 26B, 31B
0 · 0 commentsweights link →

Qwen3.7-Plus: Multimodal Agent Intelligence

closed

QwenTeam

Qwen3.7-Plus is a multimodal agent model that integrates vision and language capabilities into a single foundation. It excels in coding, tool use, and productivity workflows, offering a versatile solution for software engineering and automation tasks.

multimodal
0 · 0 comments

Grok Build 0.1

closed

xAI

Grok Build 0.1 is an intelligent coding model that powers the Grok Build CLI. It excels at agentic coding and is available via the xAI API in public beta.

code
0 · 0 comments

LFM2.5-8B-A1B

open

Liquid AI

LFM2.5-8B-A1B is a device-optimized model designed for real-life applications on various devices including phones, laptops, and robots. It features an expanded context length and a hybrid MoE architecture, making it fast and reliable for diverse use cases.

· 8B· 128,000 ctx
0 · 0 comments

Antigravity

closed

Google

Antigravity is an agent-first AI development platform by Google designed for autonomous coding agents. It allows users to manage complex workflows, from coding to testing and debugging, all performed by multiple agents in parallel.

code
1 · 0 comments

ESM Cambrian

open

Biohub

ESM Cambrian (ESMC) is a next-generation evolutionary scale model that predicts protein structure and facilitates the design of new proteins. It leverages billions of protein sequences to internalize the fundamental properties of protein biology, enabling high-accuracy predictions and innovative protein designs.

protein-language
1 · 0 comments

Claude Opus 4.8

closed

Anthropic

Claude Opus 4.8 introduces enhancements in coding, long-running agentic work, and complex knowledge tasks. It offers a fast mode for quicker outputs and a new effort dial for response customization.

language
1 · 0 comments

LocateAnything

open

The Hong Kong Polytechnic University, Princeton University, Nanjing University, University of Illinois Urbana-Champaign

Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding. LocateAnything performs diverse localization tasks under a unified vision-language model, including document understanding, GUI grounding, dense object detection, and OCR localization.

vision-language
0 · 0 comments

LongCat Video Avatar 1.5

open

victor

LongCat Video Avatar 1.5 is a model designed for creating animated video avatars. It leverages advanced techniques to generate lifelike representations in video format.

generative-media
0 · 0 comments

Kronos

open

shiyu-coder

Kronos is the first open-source foundation model for financial candlesticks, trained on data from over 45 global exchanges. It is designed to handle the unique characteristics of financial data, providing a specialized solution for forecasting in financial markets.

language· 499.2M· 512 ctx
0 · 0 commentsweights link →

Aleph 2.0

closed

Runway

Aleph 2.0 is an upgraded video editing model that allows users to modify video content efficiently. It enables users to edit a single frame and apply those changes across the entire video while preserving unaltered elements.

generative-media
0 · 0 comments

Qwen 3.7-Max

closed

QwenTeam

Qwen 3.7-Max is a proprietary model designed for the agent era, excelling in coding, office automation, and long-horizon reasoning tasks. It offers versatile capabilities for writing and debugging code, automating workflows, and executing complex tasks autonomously.

code
0 · 0 comments

Composer 2.5

closed

Cursor

Composer 2.5 is a substantial improvement over its predecessor, offering enhanced intelligence and behavior for long-running tasks. It excels in following complex instructions and provides a more pleasant collaboration experience.

language
0 · 0 comments

Cohere Command A+

open

Cohere Labs

Cohere Command A+ is an open-source LLM optimized for agentic, multilingual, and reasoning-heavy tasks. It supports vision inputs and is designed for efficient deployment on minimal hardware.

language· 25B· 128,000 ctx
0 · 0 commentsweights link →

Marlin 2B

open

NemoStation

Marlin 2B is a video VLM designed to extract structured information from videos, providing precise scene and event captions with timestamps. It excels in dense captioning and temporal grounding tasks.

video-language· 2B
0 · 0 comments

Lance

open

ByteDance

Lance is a 3B native unified multimodal model that supports image and video understanding, generation, and editing within a single framework. It is efficient at 3B scale, delivering strong performance across various benchmarks.

multimodal· 3B
0 · 0 commentsweights link →

Gemini 3.5 Flash

closed

Google DeepMind

Gemini 3.5 Flash is designed for executing complex, agentic workflows with exceptional speed and intelligence. It excels in coding and long-horizon tasks, providing real-world utility for developers and enterprises.

language
0 · 0 comments

Starchild-1: The First Real-Time Multimodal World Model

closed

Odyssey

Starchild-1 is the world's first multimodal world model that generates synchronized audio and video in real-time while responding to user input. It represents a significant advancement in generative intelligence by learning directly from the world through large-scale video.

world model
0 · 0 comments

Perception 1.0

closed

Ceptory

Perception 1.0 is the core model layer behind Ceptory's enterprise video intelligence, enabling natural language search, multimodal analysis, and operational monitoring. It provides structured outputs ready for API integration and supports retrieval from large video libraries.

video-intelligence
0 · 0 commentsweights link →

MiniMax M2.5

closed

MiniMax

MiniMax M2.5 is a state-of-the-art model designed for real-world productivity, excelling in coding, agentic tool use, and office work. It offers significant improvements in task completion speed and cost-effectiveness, making it ideal for complex applications.

code
0 · 0 comments

Dramabox

open

Resemble AI

Dramabox is an expressive text-to-speech model with voice cloning capabilities. It allows users to control speaker identity, emotion, and delivery through prompts, making it ideal for creating dynamic audio content.

text-to-speech· 3.3B
0 · 0 commentsweights link →