Integrate Cloudglue’s powerful video understanding capabilities directly into Claude Desktop, Cursor, Windsurf, and other AI assistants
Open the claude_desktop_config.json file (located at ~/Library/Application Support/Claude/claude_desktop_config.json on macOS or %APPDATA%\Claude\claude_desktop_config.json on Windows).
Add the following configuration to the file:
Replace <CLOUDGLUE-YOUR-API-KEY> with your Cloudglue API key from step 1.
After updating the configuration, restart Claude Desktop completely.

list_collections: Discover available video collections and their basic metadata. Use this first to understand what video collections exist before using other collection-specific tools. Shows collection IDs needed for other tools, video counts, and collection types.
list_videos: Browse and search video metadata with powerful filtering options. Use this to explore available videos, find specific content by date, or see what’s in a collection. Returns essential video info like duration, filename, and IDs needed for other tools.
describe_video: Get comprehensive transcripts and descriptions from individual videos (YouTube or Cloudglue uploads) with intelligent cost optimization. Automatically checks for existing transcripts before creating new ones. Use this for individual video analysis; to analyze multiple videos in a collection, use retrieve_collection_transcripts instead. Supports both YouTube and Cloudglue videos with different analysis levels.
extract_video_entities: Extract structured data and entities from videos using custom prompts with intelligent cost optimization. Automatically checks for existing extractions before creating new ones. Use this for individual video analysis; to analyze multiple videos in a collection, use retrieve_collection_entities instead. The quality of results depends heavily on how specific your prompt is.
get_video_metadata: Get comprehensive technical metadata about a Cloudglue video file, including duration, resolution, file size, processing status, and computed statistics. Use this when you need video specifications, file details, or processing information rather than content analysis. For content analysis, use a content-focused tool such as describe_video.
retrieve_transcript_summaries: Bulk retrieve video summaries and titles from a collection to quickly understand its content and themes. Perfect for getting a high-level overview of what’s in a collection, identifying common topics, or determining if a collection contains relevant content for a specific query. Use this as your first step when analyzing a collection; it is more efficient than retrieving full transcripts and helps you determine if you need more detailed information. Only proceed to retrieve_collection_transcripts if you need the full multimodal context for specific videos identified through the summaries. Returns up to 50 summaries per request with pagination support.
retrieve_collection_transcripts: Bulk retrieve rich multimodal transcripts (text, audio, and visual) from a collection with advanced filtering. Use this only after using retrieve_transcript_summaries to identify specific videos that need detailed analysis. This tool is more resource-intensive and limited to 10 transcripts per request, so it’s best used for targeted analysis of specific videos rather than broad collection overview. For single videos, use describe_video instead. Use date filtering to focus on specific time periods.
retrieve_collection_entities: Batch retrieve structured entity data from multiple videos in a collection. Entities can be user-defined based on what’s important for your collection (people, objects, concepts, custom categories). Perfect for data mining, building datasets, or analyzing previously extracted entities at scale. Supports pagination and date-based filtering to manage large result sets. For individual video entities, use extract_video_entities instead.
find_video_collection_moments: AI-powered semantic search to find specific moments, topics, or content within a video collection. Returns relevant segments with context, timestamps, and citations. Perfect for finding needle-in-a-haystack content, specific discussions, or thematic analysis across multiple videos. Much more targeted than the bulk retrieval tools.
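The summaries-first pattern and the per-request caps described above (50 summaries, 10 transcripts) can be sketched as a small pagination helper. This is only an illustration under stated assumptions: `call_tool` is a hypothetical hook standing in for whatever MCP client you use, the `collection_id`, `limit`, and `offset` argument names are guesses rather than confirmed parameters, and the `title` and `summary` fields on each record are assumed as well.

```python
from typing import Any, Callable, Dict, Iterator, List

# Per-request caps quoted in the tool descriptions above.
SUMMARY_PAGE_SIZE = 50      # retrieve_transcript_summaries
TRANSCRIPT_PAGE_SIZE = 10   # retrieve_collection_transcripts

# Hypothetical client hook: takes (tool_name, arguments), returns a list of records.
ToolCaller = Callable[[str, Dict[str, Any]], List[Dict[str, Any]]]


def paginate(call_tool: ToolCaller, tool: str, collection_id: str,
             page_size: int) -> Iterator[Dict[str, Any]]:
    """Yield every record from a paginated tool, one page per request."""
    offset = 0
    while True:
        # "collection_id", "limit" and "offset" are assumed argument names.
        page = call_tool(tool, {"collection_id": collection_id,
                                "limit": page_size, "offset": offset})
        yield from page
        if len(page) < page_size:  # a short (or empty) page means no more data
            return
        offset += len(page)


def find_relevant(summaries: List[Dict[str, Any]],
                  keyword: str) -> List[Dict[str, Any]]:
    """Keep summaries whose title or summary text mentions the keyword."""
    needle = keyword.lower()
    return [s for s in summaries
            if needle in s.get("title", "").lower()
            or needle in s.get("summary", "").lower()]
```

A caller would typically run `paginate(call_tool, "retrieve_transcript_summaries", collection_id, SUMMARY_PAGE_SIZE)` first, filter the result with `find_relevant`, and only then page through `retrieve_collection_transcripts` with `TRANSCRIPT_PAGE_SIZE`, which keeps the more resource-intensive tool for the few videos that need it.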
Start with list_collections and list_videos to explore available content.
For individual videos, use describe_video or extract_video_entities.
For collections, start with retrieve_transcript_summaries to efficiently understand what’s in a collection.
Use retrieve_collection_transcripts only for specific videos that need full multimodal context, identified through summaries.
Use retrieve_collection_entities for bulk entity extraction.
Use find_video_collection_moments for targeted semantic search.
Use get_video_metadata for technical specifications rather than content analysis.
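The guidance above amounts to a lookup from what you are working on and what you want to the tool to call. A minimal sketch of that lookup follows; the tool names come from this page, while the `scope` and `goal` labels and the `choose_tool` helper are hypothetical names introduced only for illustration.

```python
from typing import Dict, Tuple

# (scope, goal) -> tool name. The scope and goal labels are illustrative only;
# the tool names are the ones listed in this guide.
TOOL_ROUTES: Dict[Tuple[str, str], str] = {
    ("account", "explore"): "list_collections",
    ("collection", "explore"): "list_videos",
    ("video", "describe"): "describe_video",
    ("video", "entities"): "extract_video_entities",
    ("video", "metadata"): "get_video_metadata",
    ("collection", "overview"): "retrieve_transcript_summaries",
    ("collection", "transcripts"): "retrieve_collection_transcripts",
    ("collection", "entities"): "retrieve_collection_entities",
    ("collection", "search"): "find_video_collection_moments",
}


def choose_tool(scope: str, goal: str) -> str:
    """Return the tool this guide recommends for a scope/goal pair."""
    try:
        return TOOL_ROUTES[(scope, goal)]
    except KeyError:
        # Unknown combinations are rejected instead of guessing a tool.
        raise ValueError(f"no tool for scope={scope!r}, goal={goal!r}") from None
```

For example, `choose_tool("collection", "overview")` returns `retrieve_transcript_summaries`, reflecting the advice to read summaries before requesting full transcripts.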
Check that Node.js is installed (run node --version in terminal).
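If the setup does not work, the checks mentioned in this guide (the config file location and the `node --version` check) can be scripted. The sketch below assumes nothing about the Cloudglue configuration itself: it only builds the documented config path, confirms the file parses as JSON, and looks for a `node` executable on the PATH. The helper names are hypothetical.

```python
import json
import shutil
from pathlib import Path, PurePosixPath, PureWindowsPath
from typing import List

CONFIG_NAME = "claude_desktop_config.json"


def claude_config_path(system: str, home: str, appdata: str = "") -> str:
    """Build the config path documented above for 'Darwin' (macOS) or 'Windows'."""
    if system == "Darwin":
        return str(PurePosixPath(home) / "Library" / "Application Support"
                   / "Claude" / CONFIG_NAME)
    if system == "Windows":
        return str(PureWindowsPath(appdata) / "Claude" / CONFIG_NAME)
    raise ValueError(f"no documented config location for {system!r}")


def config_problems(config_file: str) -> List[str]:
    """Return human-readable problems with the config file (empty if none)."""
    path = Path(config_file)
    if not path.is_file():
        return [f"config file not found: {config_file}"]
    try:
        json.loads(path.read_text(encoding="utf-8"))
    except json.JSONDecodeError as exc:
        return [f"config file is not valid JSON: {exc.msg}"]
    return []


def node_on_path() -> bool:
    """True if a 'node' executable can be found on the PATH."""
    return shutil.which("node") is not None
```

In practice the arguments would come from `platform.system()`, `Path.home()`, and the `APPDATA` environment variable. A stray comma or missing brace in the JSON is an easy mistake to make when editing the file by hand, and `config_problems` reports it before you restart Claude Desktop.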