grok capabilities
14 mapped capabilities, each graded and dated. This is the diagnosis — the migration guide is the cure.
Capabilities
Chat and model family (Grok 4.x)
provisionalverified 2 days agoGrok is xAI's AI assistant available on grok.com, iOS, Android, and embedded in X (Twitter). It runs the Grok 4.x model family — currently Grok 4.3 as the cost-efficient flagship and Grok 4 Heavy as the top-tier variant — offering multimodal text, image, and video inputs with up to a 1-million-token context window.
Custom instructions and Skills
provisionalverified 2 days agoGrok supports global custom instructions (up to 12,000 characters) that apply to all new conversations, letting users set persistent tone, format, or domain preferences. Skills (launched May 18, 2026) extend this to reusable, named workflow definitions — teach a workflow once by describing it or uploading reference files, and Grok activates it automatically whenever relevant.
DeepSearch / DeeperSearch (agentic web research)
provisionalverified 2 days agoDeepSearch enables Grok to browse the web and X in real time, verify sources, synthesize conflicting information, and produce a cited research report in response to a query. DeeperSearch is a heavier variant that runs multiple verification agents for peer-review-style synthesis.
Developer API
provisionalverified 2 days agoxAI provides a commercial API exposing Grok models for text/reasoning, image generation, video generation, and real-time voice — all under a single platform with OpenAI-SDK compatibility, native xAI SDK (Python), and Vercel AI SDK support.
File and image upload (document analysis and vision)
provisionalverified 2 days agoUsers can attach documents (PDF, TXT, CSV, JSON, HTML, Excel, code files) and images (JPEG, PNG) directly in chat. Grok uses a server-side attachment_search tool to semantically retrieve relevant passages across multiple uploaded files, and applies vision understanding to images.
Image generation (Aurora / Grok Imagine)
provisionalverified 2 days agoGrok generates images from text prompts using the Aurora model — an autoregressive mixture-of-experts transformer that builds images patch by patch, excelling at text rendering, lifelike portraits, and consistent stylistic output up to 1024x1024 pixels. The Grok Imagine interface also supports image editing and generating up to 10 variations per prompt.
Persistent memory
provisionalverified 2 days agoGrok remembers facts, preferences, and context from previous conversations and applies them automatically in future sessions. Users can view every stored memory item individually, delete specific items, clear all memory, or disable memory entirely from Data Controls settings.
Plan, pricing, and usage gating
provisionalverified 2 days agoGrok is available in a free tier with strict rate limits and across five paid subscription tiers — X Premium, SuperGrok Lite, SuperGrok, X Premium+, and SuperGrok Heavy — each unlocking progressively higher usage limits, model access, and advanced features.
Reasoning / Think mode
provisionalverified 2 days agoThink mode activates Grok's extended chain-of-thought reasoning, letting it break down problems step by step and show its reasoning process before producing a final answer. A higher-compute variant called Big Brain mode applies multi-agent orchestration for the most demanding queries.
Tasks (scheduled agentic actions)
provisionalverified 2 days agoGrok Tasks lets users schedule automated prompts to run at a future time, with results delivered via push notification or email. Supported frequencies include one-time, daily, specific days of the week, monthly, and annual schedules.
Video generation (Grok Imagine)
provisionalverified 2 days agoGrok Imagine generates short videos from text prompts or by animating still images, producing clips with natively synthesized audio in a single pass. Clips run up to 10 seconds at 720p resolution; an Extend from Frame feature chains clips together for sequences up to 15 seconds per extension.
Voice mode (real-time speech conversation)
provisionalverified 2 days agoGrok supports real-time spoken conversation through voice mode in the Grok app, enabling bidirectional voice interaction where users can speak and receive spoken responses. A camera-pointing feature lets users direct Grok to analyze what they see during a voice session.
Workspaces and Projects
provisionalverified 2 days agoWorkspaces are isolated environments within a Grok account, each carrying its own custom instructions, uploaded files, and separate conversation history. Projects (accessible at grok.com/project) extend this with task management, cloud file integration (e.g., Google Drive), and X-platform collaboration features.
X (Twitter) platform integration and real-time data access
provisionalverified 2 days agoGrok has exclusive native access to the X post firehose, enabling real-time analysis of tweets, trending topics, influencer conversations, and breaking news as they happen — without the crawl-delay gap that affects other AI assistants using web search.