By using the Cloudglue SDK and/or the MCP server, you agree to the Cloudglue Terms of Service and acknowledge our Privacy Policy.
Quick Start
The Cloudglue MCP server lets you connect AI assistants like Claude Desktop, Cursor and Windsurf to Cloudglue’s video understanding capabilities through the Model Context Protocol. Using the Cloudglue MCP server, you can bring advanced video analysis, transcription, and extraction features directly into your favorite AI chat applications. This allows your AI assistants to understand, analyze, and extract insights from video content without leaving your workflow - whether you’re looking to transcribe meetings, extract structured data from video clips, or have conversations about video collections. The video below walks through how to setup Cloudglue MCP server in a Claude Desktop clientPrerequisites
First, get a Cloudglue API Key from cloudglue.dev, this will be used to authenticate the MCP server with your Cloudglue account.Node.js
The Filesystem Server and many other MCP servers require Node.js to run. Verify your Node.js installation by opening a terminal or command prompt and running:Setup
1. Configure MCP Client
Configure your MCP client (Claude Desktop or Cursor) to use the Cloudglue MCP server:- Claude Desktop
- Cursor
There are two ways to configure the Cloudglue MCP server in Claude Desktop:
Replace
Method 1: Using Claude Desktop Extension
Method 1: Using Claude Desktop Extension
- Download the latest Cloudglue Claude Desktop Extension from the releases page
- Double click to open with Claude Desktop (you need to have Claude Desktop running prior to this)
- Click ‘Install’
- When prompted, enter your API key
- Enable the extension
Method 2: Using manual configuration
Method 2: Using manual configuration
- Open the Claude menu on your computer and select “Settings…”
- Click on “Developer” in the left-hand bar of the Settings pane
- Click on “Edit Config”
claude_desktop_config.json file (located at ~/Library/Application Support/Claude/claude_desktop_config.json on macOS or %APPDATA%\Claude\claude_desktop_config.json on Windows).Add the following configuration to the file:<CLOUDGLUE-YOUR-API-KEY> with your Cloudglue API key.After updating the configuration, restart Claude Desktop completely.2. Verify Installation
Once configured:- In Claude Desktop: You should see a slider icon in the bottom left corner of the input box. Click it to see the available Cloudglue tools.
- In Cursor: The Agent will automatically detect and use relevant Cloudglue tools when needed.
Remote MCP Server
Cloudglue also provides a remote MCP server that you can connect to directly without installing the local package.Transport Options
The remote server supports two transport types:Server-Sent Events (SSE)
- URL:
https://mcp.cloudglue.dev/sse - Authentication:
- Header Name:
Authorization - Bearer Token: Your Cloudglue API Key
- Header Name:
Streamable HTTP
- URL:
https://mcp.cloudglue.dev/mcp - Authentication:
- Header Name:
Authorization - Bearer Token: Your Cloudglue API Key
- Header Name:
Cursor Configuration
For Cursor, you can use the remote server with this configuration:Local Proxy Workaround
Since all MCP clients do not fully support remote MCP servers yet, you can use this proxy configuration:Note: OAuth-based authentication is coming soon to simplify the setup process and eliminate the need to manually manage API keys in configurations.
Supported URL Formats
The Cloudglue MCP Server supports multiple URL formats for video analysis:Cloudglue Platform (Default)
- Format:
cloudglue://files/file-id - Usage: Use file ID from
list_videostool - Best for: Videos already uploaded to Cloudglue
YouTube URLs
- Formats:
https://www.youtube.com/watch?v=...https://youtu.be/...
- Supported tools:
describe_video,extract_video_entities,segment_video_chapters - Note: Not supported for
segment_video_camera_shots
Public HTTP Video URLs
- Format: Direct links to MP4 files (e.g.,
https://example.com/video.mp4) - Best for: Publicly accessible video files
Data Connector URLs
Requires setup in Cloudglue account - see Data Connectors Documentation-
Dropbox:
- Shareable links:
https://www.dropbox.com/scl/fo/SOME_ID/SOME_OTHER_ID?rlkey=SOME_KEY&st=SOME_TIME&dl=0 - Direct path:
dropbox://<path>/<to>/<file>
- Shareable links:
-
Google Drive:
- Format:
gdrive://file/<file_id>
- Format:
-
Zoom:
- Meeting UUID:
zoom://uuid/QFwZYEreTl2e6MBFSslXjQ%3D%3D - Meeting ID:
zoom://id/81586198865
- Meeting UUID:
Available Tools
Once configured, your AI assistant can access the following Cloudglue tools:Discovery & Navigation
-
list_collections: Discover available video collections and their basic metadata. Use this first to understand what video collections exist before using other collection-specific tools. Shows collection IDs needed for other tools, video counts, and collection types. Pagination guidance: For comprehensive exploration, paginate through all collections (check has_more and increment offset by limit) to ensure you don’t miss any collections. Use smaller limits (5-10) for quick overviews, larger limits (25-50) for thorough exploration. -
list_videos: Browse and search video metadata with powerful filtering options. Use this to explore available videos, find specific content by date, or see what’s in a collection. Returns essential video info like duration, filename, and IDs needed for other tools. Pagination guidance: For exhaustive exploration, paginate through all videos (check has_more and increment offset by limit) to ensure complete coverage. Use date filtering to focus on specific time periods, then paginate within those results.
Individual Video Analysis
-
describe_video: Get comprehensive transcripts and descriptions from individual videos with intelligent cost optimization. Automatically checks for existing transcripts before creating new ones. Use this for individual video analysis - for analyzing multiple videos in a collection, use retrieve_collection_descriptions instead. Supports all URL formats: Cloudglue URLs, YouTube URLs, public HTTP video URLs, and data connector URLs (Dropbox, Google Drive, Zoom). -
extract_video_entities: Extract structured data and entities from videos using custom prompts with intelligent cost optimization. Automatically checks for existing extractions before creating new ones. Use this for individual video analysis - for analyzing multiple videos in a collection, use retrieve_collection_entities instead. Supports all URL formats: Cloudglue URLs, YouTube URLs, public HTTP video URLs, and data connector URLs (Dropbox, Google Drive, Zoom). The quality of results depends heavily on your prompt specificity. -
get_video_metadata: Get comprehensive technical metadata about a Cloudglue video file including duration, resolution, file size, processing status, and computed statistics. Use this when you need video specifications, file details, or processing information rather than content analysis. Different from content-focused tools like describe_video. -
segment_video_camera_shots: Segment videos into camera shots with intelligent cost optimization. Automatically checks for existing shot segmentation jobs before creating new ones. Returns timestamps and metadata for each camera shot detected. Supports Cloudglue URLs, public HTTP video URLs, and data connector URLs (Dropbox, Google Drive, Zoom). Note: YouTube URLs are not supported for camera shot segmentation. -
segment_video_chapters: Segment videos into chapters with intelligent cost optimization. Automatically checks for existing chapter segmentation jobs before creating new ones. Returns timestamps and descriptions for each chapter detected. Supports all URL formats: Cloudglue URLs, YouTube URLs, public HTTP video URLs, and data connector URLs (Dropbox, Google Drive, Zoom).
Collection Analysis
-
retrieve_summaries: Bulk retrieve video summaries and titles from a collection to quickly understand its content and themes. Works with both rich-transcripts and media-descriptions collections. Perfect for getting a high-level overview of what’s in a collection, identifying common topics, or determining if a collection contains relevant content for a specific query. Use this as your first step when analyzing a collection - it’s more efficient than retrieving full descriptions and helps you determine if you need more detailed information. Only proceed to retrieve_descriptions if you need the full multimodal context for specific videos identified through the summaries. For targeted content discovery, consider using search_video_summaries or search_video_moments instead of browsing through all summaries. Pagination guidance: For comprehensive collection analysis, paginate through all summaries (check has_more and increment offset by limit) to ensure complete coverage. Use larger limits (25-50) for efficient bulk analysis, smaller limits (5-10) for targeted exploration. -
retrieve_descriptions: Bulk retrieve rich multimodal descriptions (text, audio, and visual) from a collection with advanced filtering. Works with both rich-transcripts and media-descriptions collections. Use this only after using retrieve_summaries to identify specific videos that need detailed analysis. This tool is more resource-intensive and limited to 10 descriptions per request, so it’s best used for targeted analysis of specific videos rather than broad collection overview. For single videos, use describe_video instead. Use date filtering to focus on specific time periods. For targeted content discovery, consider using search_video_moments or search_video_summaries instead of parsing through dense full descriptions. Pagination guidance: Due to the 10-item limit, pagination is essential for comprehensive analysis. Always check has_more and paginate through all descriptions when user intent requires exhaustive coverage. Use date filtering first to narrow scope, then paginate within filtered results. -
retrieve_entities: Batch retrieve structured entity data from multiple videos in a collection. Entities can be user-defined based on what’s important for your collection (people, objects, concepts, custom categories). Perfect for data mining, building datasets, or analyzing previously extracted entities at scale. Pagination guidance: For comprehensive entity analysis, paginate through all entities (check has_more and increment offset by limit) to ensure complete data coverage. Use date filtering to focus on specific time periods, then paginate within those results. Essential for building complete datasets or performing exhaustive entity analysis. -
search_video_moments: AI-powered semantic search to find specific video segments within a collection. Uses Cloudglue’s search API to locate relevant moments across speech, on-screen text, and visual descriptions. Returns structured search results with timestamps and metadata. Perfect for finding needle-in-haystack spoken and visual content, specific discussions, or thematic analysis. -
search_video_summaries: AI-powered semantic search to find relevant videos within a collection. Uses Cloudglue’s search API to locate videos based on their content, summaries, and metadata. Works with rich-transcripts and media-descriptions collections. Returns structured search results with video information and relevance scores. Perfect for discovering videos by topic, theme, or content similarity.
Pagination Strategy
When to Paginate Exhaustively
- User asks for “all” or “complete” analysis of collections/videos
- User wants comprehensive coverage or exhaustive exploration
- User requests data mining, dataset building, or complete entity extraction
- User asks for “everything” in a collection or time period
How to Paginate
- Start with initial request (offset=0, appropriate limit)
- Check response
pagination.has_morefield - If
has_more: true, increment offset by limit and repeat - Continue until
has_more: falseor you have sufficient data - Use date filtering first to narrow scope, then paginate within results
Limit Guidelines
- Quick overviews: 5-10 items
- Comprehensive analysis: 25-50 items
- Exhaustive exploration: Use maximum limits (50 for summaries, 10 for descriptions/entities)
When to Use Which Tool
- Start exploring: Use
list_collectionsandlist_videosto explore available content - For single videos: Use
describe_video,extract_video_entities,segment_video_camera_shots, orsegment_video_chapters - For collection overview: Always start with
retrieve_summariesto efficiently understand what’s in a collection - For detailed analysis: Only use
retrieve_descriptionsfor specific videos that need full multimodal context, identified through summaries - For structured data: Use
retrieve_entitiesfor bulk entity extraction - For specific content: Use
search_video_momentsfor targeted segment search,search_video_summariesfor video-level search - For technical specs: Use
get_video_metadata
Troubleshooting
If the MCP server isn’t working:- Ensure you have Node.js installed (
node --versionin terminal) - Verify your API key is correct
- Restart the client application completely
- Check logs for any error messages
Resources
- Official Cloudglue MCP Server NPM Package
- Cloudglue API Docs
- Model Context Protocol
- Claude Desktop MCP Guide
- Cursor MCP Documentation