Skip to main content
Dundie·Feature Parity Map← all products

grok capabilities

14 mapped capabilities, each graded and dated. This is the diagnosis — the migration guide is the cure.


Capabilities

Chat and model family (Grok 4.x)

provisionalverified 2 days ago

Grok is xAI's AI assistant available on grok.com, iOS, Android, and embedded in X (Twitter). It runs the Grok 4.x model family — currently Grok 4.3 as the cost-efficient flagship and Grok 4 Heavy as the top-tier variant — offering multimodal text, image, and video inputs with up to a 1-million-token context window.

low confidence · 0.65docssource ↗source ↗

Custom instructions and Skills

provisionalverified 2 days ago

Grok supports global custom instructions (up to 12,000 characters) that apply to all new conversations, letting users set persistent tone, format, or domain preferences. Skills (launched May 18, 2026) extend this to reusable, named workflow definitions — teach a workflow once by describing it or uploading reference files, and Grok activates it automatically whenever relevant.

low confidence · 0.55docssource ↗source ↗

DeepSearch / DeeperSearch (agentic web research)

provisionalverified 2 days ago

DeepSearch enables Grok to browse the web and X in real time, verify sources, synthesize conflicting information, and produce a cited research report in response to a query. DeeperSearch is a heavier variant that runs multiple verification agents for peer-review-style synthesis.

low confidence · 0.50docssource ↗source ↗

Developer API

provisionalverified 2 days ago

xAI provides a commercial API exposing Grok models for text/reasoning, image generation, video generation, and real-time voice — all under a single platform with OpenAI-SDK compatibility, native xAI SDK (Python), and Vercel AI SDK support.

low confidence · 0.65docssource ↗source ↗

File and image upload (document analysis and vision)

provisionalverified 2 days ago

Users can attach documents (PDF, TXT, CSV, JSON, HTML, Excel, code files) and images (JPEG, PNG) directly in chat. Grok uses a server-side attachment_search tool to semantically retrieve relevant passages across multiple uploaded files, and applies vision understanding to images.

low confidence · 0.60docssource ↗source ↗

Image generation (Aurora / Grok Imagine)

provisionalverified 2 days ago

Grok generates images from text prompts using the Aurora model — an autoregressive mixture-of-experts transformer that builds images patch by patch, excelling at text rendering, lifelike portraits, and consistent stylistic output up to 1024x1024 pixels. The Grok Imagine interface also supports image editing and generating up to 10 variations per prompt.

low confidence · 0.60docssource ↗source ↗

Persistent memory

provisionalverified 2 days ago

Grok remembers facts, preferences, and context from previous conversations and applies them automatically in future sessions. Users can view every stored memory item individually, delete specific items, clear all memory, or disable memory entirely from Data Controls settings.

low confidence · 0.55docssource ↗source ↗

Plan, pricing, and usage gating

provisionalverified 2 days ago

Grok is available in a free tier with strict rate limits and across five paid subscription tiers — X Premium, SuperGrok Lite, SuperGrok, X Premium+, and SuperGrok Heavy — each unlocking progressively higher usage limits, model access, and advanced features.

low confidence · 0.60docssource ↗source ↗

Reasoning / Think mode

provisionalverified 2 days ago

Think mode activates Grok's extended chain-of-thought reasoning, letting it break down problems step by step and show its reasoning process before producing a final answer. A higher-compute variant called Big Brain mode applies multi-agent orchestration for the most demanding queries.

low confidence · 0.50docssource ↗source ↗

Tasks (scheduled agentic actions)

provisionalverified 2 days ago

Grok Tasks lets users schedule automated prompts to run at a future time, with results delivered via push notification or email. Supported frequencies include one-time, daily, specific days of the week, monthly, and annual schedules.

low confidence · 0.50docssource ↗source ↗

Video generation (Grok Imagine)

provisionalverified 2 days ago

Grok Imagine generates short videos from text prompts or by animating still images, producing clips with natively synthesized audio in a single pass. Clips run up to 10 seconds at 720p resolution; an Extend from Frame feature chains clips together for sequences up to 15 seconds per extension.

low confidence · 0.60docssource ↗source ↗

Voice mode (real-time speech conversation)

provisionalverified 2 days ago

Grok supports real-time spoken conversation through voice mode in the Grok app, enabling bidirectional voice interaction where users can speak and receive spoken responses. A camera-pointing feature lets users direct Grok to analyze what they see during a voice session.

low confidence · 0.65docssource ↗source ↗

Workspaces and Projects

provisionalverified 2 days ago

Workspaces are isolated environments within a Grok account, each carrying its own custom instructions, uploaded files, and separate conversation history. Projects (accessible at grok.com/project) extend this with task management, cloud file integration (e.g., Google Drive), and X-platform collaboration features.

low confidence · 0.55docssource ↗source ↗

X (Twitter) platform integration and real-time data access

provisionalverified 2 days ago

Grok has exclusive native access to the X post firehose, enabling real-time analysis of tweets, trending topics, influencer conversations, and breaking news as they happen — without the crawl-delay gap that affects other AI assistants using web search.

low confidence · 0.55docssource ↗source ↗

Editorial guidance, not a warranty. AI tools change weekly; every entry carries the date it was last verified. Verify before relying on a specific capability.