TrendHub

Models Intelligence

Hugging Face and model radar

This workspace now blends Hugging Face launches with frontier benchmark context, open-weight watchlists and official model specs. GitHub repository scanning stays on the main Trend Radar page.

Performance Atlas

Frontier rankings, open-weight leaders and official model specs

This layer combines the live OpenAI model catalog, benchmark watchlists and vendor documentation so the model board is not limited to Hugging Face launch activity alone.

Synced May 11, 2026 • 3-source stack

Frontier Models

Closed-model performance watch

Provider APIs + benchmark watchlists + vendor docs

OpenAI

GPT-5.5

Frontier

OpenAI lane is now sourced from the live OpenAI API catalog. The site promotes the newest available GPT family model first and keeps the previous static GPT-5.4 entry only as historical fallback context.

Catalog source

Live API

Internal rank

ELO 1990

Context

1M

Model family

GPT

Anthropic

Claude Opus 4.7

Frontier

Anthropic lane now tracks the official Anthropic fallback seed. Claude Opus 4.7 is treated as the current flagship fallback and is replaced automatically when the configured Anthropic Models API returns a newer Opus-family model.

Catalog source

Fallback

Internal rank

ELO 1907

Context

1M

Max output

128k
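
The replacement rule described for the Anthropic lane can be sketched roughly as follows. This is an illustration under stated assumptions, not TrendHub's actual code: the function name `pick_flagship`, the shape of each catalog entry (a dict with `id` and an ISO-8601 `created_at`), and the sample entries are all hypothetical. As a simplification, the sketch treats the newest Opus-family entry in the live listing as the flagship and falls back to the seed only when the listing contains no Opus-family entry.

```python
from datetime import datetime


def pick_flagship(live_models, seed_id="claude-opus-4-7", family="opus"):
    """Return the model id to display as the flagship.

    live_models: list of dicts with "id" and ISO-8601 "created_at" keys
    (an assumed shape); pass an empty list when no API key is configured.
    """
    # Keep only entries whose id mentions the family name.
    candidates = [m for m in live_models if family in m["id"].lower()]
    if not candidates:
        # Nothing usable from the live listing: use the fallback seed.
        return seed_id
    # Otherwise promote the most recently created family member.
    newest = max(candidates, key=lambda m: datetime.fromisoformat(m["created_at"]))
    return newest["id"]


# Illustrative listing; the ids and dates below are made up for the example.
sample = [
    {"id": "claude-sonnet-4-6", "created_at": "2026-02-01T00:00:00+00:00"},
    {"id": "claude-opus-4-7", "created_at": "2026-03-01T00:00:00+00:00"},
    {"id": "example-opus-next", "created_at": "2026-04-01T00:00:00+00:00"},
]
print(pick_flagship([]))       # fallback seed is used
print(pick_flagship(sample))   # newest Opus-family entry is promoted
```

A stricter variant could also compare the live entry against the seed's own release date before replacing it; that check is omitted here because the page does not state how the comparison is made.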

Google

Gemini 3.1 Pro Preview

Frontier

Holds the best current Artificial Analysis intelligence score and a top-three Arena text position, which makes it a strong general-purpose frontier reference point.

AA intelligence

57

Arena overall

#3

Coding

#3

Creative

#2

Longer query

#3

Method stack

How this layer stays useful

Rule 1

OpenAI model availability is pulled from the live Models API when NUXT_OPENAI_API_KEY is present.

Rule 2

Anthropic model availability is pulled from the live Models API when NUXT_ANTHROPIC_API_KEY is present; otherwise the official Claude Opus 4.7 fallback seed is used.

Rule 3

Arena and Artificial Analysis remain benchmark references for relative quality, while provider APIs decide whether a model is currently available.

Rule 4

Official vendor docs remain the source of truth for API availability, context windows and pricing.
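
Rules 1 and 2 amount to a per-provider switch on whether an API key is configured. The sketch below shows one way that switch could look; it is a hedged illustration only. The environment variable names and the "Live API" / "Fallback" labels come from this page, while the function name `resolve_catalog_source` and the provider keys `"openai"` and `"anthropic"` are hypothetical.

```python
import os

# Environment variable names as listed in the rules above.
KEY_NAMES = {
    "openai": "NUXT_OPENAI_API_KEY",
    "anthropic": "NUXT_ANTHROPIC_API_KEY",
}


def resolve_catalog_source(provider, env=None):
    """Return "Live API" when the provider's key is set, else "Fallback"."""
    env = os.environ if env is None else env
    # Treat a missing, empty or whitespace-only value as "no key".
    key = env.get(KEY_NAMES[provider], "").strip()
    return "Live API" if key else "Fallback"


# Example with an explicit mapping instead of the real environment.
example_env = {"NUXT_OPENAI_API_KEY": "sk-example"}
print(resolve_catalog_source("openai", example_env))
print(resolve_catalog_source("anthropic", example_env))
```

Passing the environment as a parameter keeps the decision testable without touching real credentials. Rules 3 and 4 are not modelled here: benchmark placement and vendor specs stay separate inputs and do not affect this availability switch.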

Open Models

Open-weight leaders worth tracking

AA open-weight board + HF open LLM board

Z.ai

GLM-5 (Reasoning)

Current open-weight benchmark leader on Artificial Analysis, with a 200k context window and MIT licensing called out in the comparison pages.

AA open rank: #1
AA intelligence: 50
Context: 200k
License: MIT

Moonshot

Kimi K2.5 (Reasoning)

Still sits in the top open-weight tier for reasoning-heavy work and remains one of the clearest self-hostable alternatives to frontier closed models.

AA open rank: #2
AA intelligence: 47
Profile: Reasoning
Track: Open weights

Qwen

Qwen3.5 397B A17B

A top-three open-weight contender on Artificial Analysis that keeps showing up as a practical agentic and coding checkpoint to watch.

AA open rank: #3
AA intelligence: 45
Profile: Agentic
Track: Open weights

Official Specs

Source-of-truth docs

OpenAI

GPT-5.5 / GPT-5.5 Pro / GPT-5.4

Docs

Live OpenAI catalog lane for the newest GPT family available to the configured API key.

  • The OpenAI Models API returned GPT-5.5 / GPT-5.5 Pro / GPT-5.4 for the current key.
  • Official model docs remain the source of truth for context windows, pricing and intended use.
  • Keep benchmark placement separate from API availability: a model can appear in the catalog before third-party leaderboards fully settle.

Anthropic

Claude Opus 4.7 / Claude Sonnet 4.6

Docs

Official Claude lineup centered on Opus 4.7 for maximum agentic coding, Sonnet 4.6 for balanced production use and Haiku 4.5 for low-latency throughput.

  • The Anthropic API key was unavailable at build/runtime, so TrendHub is using the Claude Opus 4.7 fallback seed.
  • Anthropic lists Claude Opus 4.7 as the most capable generally available Claude model and exposes it as claude-opus-4-7.
  • Published specs list a 1M context window and 128k max output for Claude Opus 4.7, with the same $5 / $25 per MTok headline pricing as Opus 4.6.

Google

Gemini 2.5 Pro / 2.5 Flash / 2.5 Flash-Lite

Docs

Official Gemini lineup optimized around deep reasoning, price-performance and ultra-fast throughput.

  • Google positions Gemini 2.5 Pro as the most advanced option for complex tasks.
  • Gemini 2.5 Flash is framed as the best price-performance choice for low-latency reasoning.
  • Gemini 2.5 Flash-Lite is the fastest and most budget-friendly multimodal option in the family.

Latest Launch

anindita-13/healthmate-llm

General • Created May 11

9 items in the latest lane

Hot Momentum

SulphurAI/Sulphur-2-base

text-to-video • Trend 408

9 items in the hot lane

Update Watch

csukuangfj2/sherpa-onnx-libs

General • Updated May 11

9 items in the updated lane

HF Frontier Feed

Fresh releases, crowd momentum and active model updates

Latest, hottest and recently updated Hugging Face models

Leaderboard Snapshot

Model Vault

1. GPT-5.5

ELO 1990

OpenAI • text

Context

1M

Speed

430

2. GPT-5.5 Pro

ELO 1984

OpenAI • text

Context

1M

Speed

430

3. GPT-5.4

ELO 1978

OpenAI • text

Context

400K

Speed

430

4. Claude Opus 4.7

ELO 1907

Anthropic • text

Context

1M

Speed

350

5. Claude Sonnet 4.6

ELO 1885

Anthropic • text

Context

1M

Speed

520

6. GPT-5.3-Codex

ELO 1810

OpenAI • code

Context

128K

Speed

550

7. Gemini 3.1 Flash-Lite

ELO 1785

Google • text

Context

1M

Speed

1.2K

8. Gemini 3 Pro

ELO 1720

Google • text

Context

2M

Speed

480

Action playbook

Model evaluation steps

Validate SulphurAI/Sulphur-2-base against one benchmark workflow before adopting it.

Review the card and tags for anindita-13/healthmate-llm to see whether it is a genuine launch or a thin checkpoint wrapper.

Re-check csukuangfj2/sherpa-onnx-libs whenever its lastModified timestamp moves it in the updated lane, especially if it already fits your stack.

Use the hot, latest and updated lanes together before moving a model into production testing.

Task Mix

The most common task categories across the latest, hot and updated lanes.

General

HF lane density

17

text-generation

HF lane density

2

text-to-video

HF lane density

1

image-text-to-image

HF lane density

1

any-to-any

HF lane density

1

text-to-image

HF lane density

1
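
A tally like the one above can be produced by counting task tags across the three lanes. The sketch below is a minimal illustration and not TrendHub's implementation: it assumes each lane is a list of dicts carrying an optional `pipeline_tag` field (the field name Hugging Face uses for a model's task), and it assumes that models without a tag are the ones shown as "General". Whether TrendHub de-duplicates models that appear in more than one lane is not stated, so this version counts every lane entry.

```python
from collections import Counter


def task_mix(lanes):
    """Count task categories across lanes, most common first.

    lanes: dict mapping a lane name to a list of model dicts; each dict may
    carry a "pipeline_tag" (assumed field), otherwise it counts as "General".
    """
    counts = Counter()
    for models in lanes.values():
        for model in models:
            counts[model.get("pipeline_tag") or "General"] += 1
    return counts.most_common()


# Small illustrative input using the three lane leaders named on this page.
lanes = {
    "latest": [{"id": "anindita-13/healthmate-llm"}],
    "hot": [{"id": "SulphurAI/Sulphur-2-base", "pipeline_tag": "text-to-video"}],
    "updated": [{"id": "csukuangfj2/sherpa-onnx-libs", "pipeline_tag": None}],
}
print(task_mix(lanes))  # [('General', 2), ('text-to-video', 1)]
```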