LLM Engines

  1. Home
  2. AI
  3. Agent, RAG, MCP & ML
  4. LLM Engines

A reference list of LLM and embedding models by provider: generative models, reasoners, embeddings, and rerankers. Each row lists modalities, parameters, API or repo id, use cases, deployment, pricing, benchmark metrics where public numbers exist (vendor papers, MTEB, or common eval suites), and coarse Scale.

Scale is capacity class: tiny, medium, large; top vendor lines also use large / frontier; teacher–student distillations use tiny / distilled, medium / distilled, or large / distilled matching backbone size (details in prose where it matters). Economical embedding tiers use tiny / medium; strong API defaults use large; flagship lines (e.g. OpenAI text-embedding-3-large, Cohere embed-v4, Voyage voyage-3-large / voyage-code-3, Gemini Embedding 2) use large / frontier. Rerankers and niche tools often stay N/A.