Reka Vision

Reka Vision is Reka AI's enterprise multimodal product: a platform, rather than a single model, that wraps Reka's Core and Flash models to provide image, video, and document understanding at scale. It offers visual search, captioning, chapter generation, and Q&A over media libraries, and is aimed at media, e-commerce, and surveillance customers.

Model specs

Vendor: Reka AI
Family: Reka
Released: 2024-09
Context window: 128,000 tokens
Modalities: text, vision, video
Input price: n/a
Output price: n/a
Pricing as of: 2026-04-20

Strengths

  • Built on Reka's frontier multimodal stack
  • Supports long-form video understanding
  • Enterprise deployment options (VPC, on-prem)

Limitations

  • Pricing is sales-led, with no public per-token rate
  • Smaller ecosystem than hyperscaler visual AI products
  • Requires careful pipeline integration for best accuracy

Use cases

  • Enterprise video search and chapter generation
  • E-commerce visual product search
  • Content moderation pipelines with context awareness
  • Media intelligence and surveillance review

Benchmarks

Benchmark          Score    As of
Perception Test    ~60%     2026-04
MMMU               ~56%     2026-04
VideoMME           ~55%     2026-04

Frequently asked questions

What is Reka Vision?

Reka Vision is Reka AI's enterprise product for multimodal understanding of images and video. It combines Reka Core/Flash models with retrieval, search, and captioning workflows.

How is Reka Vision different from Reka Core?

Reka Core is a model; Reka Vision is a product that uses Reka's models (Core and Flash) plus retrieval and indexing infrastructure to operate on large video and image libraries.
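
To make the model-versus-product distinction concrete, here is a toy sketch in Python. It assumes a model has already produced a text caption for each video segment, and it shows the kind of indexing and search layer a product adds on top. Every name here (Segment, build_index, search) and all of the sample captions are hypothetical illustrations. None of this is Reka's API or a description of how Reka Vision is implemented.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    video_id: str
    start_s: int  # segment start time in seconds
    caption: str  # text a multimodal model might produce for this segment


def build_index(segments):
    """Map each lowercase caption word to the segments that mention it."""
    index = defaultdict(set)
    for seg in segments:
        for word in seg.caption.lower().split():
            index[word].add(seg)
    return index


def search(index, query):
    """Return segments whose captions contain every query word, in time order."""
    words = query.lower().split()
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits, key=lambda s: (s.video_id, s.start_s))


# Hypothetical captions; a real system would obtain these from a model call.
segments = [
    Segment("v1", 0, "a red car enters the parking lot"),
    Segment("v1", 30, "two people walk past a red door"),
    Segment("v2", 10, "a delivery truck parks near the entrance"),
]
index = build_index(segments)
results = search(index, "red car")
```

In this sketch the captions stand in for the model's output, while the index and the search function stand in for the product layer. A production system would more plausibly use embeddings and a vector store than exact keyword matching, but the division of labor is the same.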

Sources

  1. Reka Vision homepage — accessed 2026-04-20
  2. Reka AI research page — accessed 2026-04-20