Introduction

What is Cloudglue?

Cloudglue is an API service that transforms videos into structured, LLM-ready data. Using advanced multimodal AI, we extract meaningful information from video content including speech, visual scenes, and on-screen text - making video content programmable and searchable for your applications.

Whether you’re building video knowledge bases, creating AI chatbots that understand video content, or extracting structured data at scale, Cloudglue provides the tools to turn any video into actionable data.

Get Started in 3 Steps

Get your API key

Install an SDK

Choose from our Python or JavaScript SDKs, or use the MCP Server for Claude/Cursor integration.

Upload and extract

Upload your first video and extract structured data or rich transcripts in minutes.

Quick Example

Here’s how easy it is to extract structured data from any video:

from cloudglue import CloudGlue

client = CloudGlue()

# Upload and extract

uploaded = client.files.upload(
'path/to/local/video.mp4',
wait_until_finish=True
)

extraction = client.extract.run(
url=uploaded.uri,
prompt="Extract all speakers and main topics discussed",
schema={"speakers": ["string"], "topics": ["string"]}
)

print(extraction.data)

# {"speakers": ["John Smith", "Sarah Johnson"], "topics": ["AI", "Marketing"]}

Core Features

Extract

Convert video content into structured JSON data using custom schemas to target specific information, making it easy to program against and organize for your exact application needs.

Transcribe

Get comprehensive multimodal transcriptions including speech, visual scene descriptions, and on-screen text. Perfect for capturing every detail from videos across all modalities.

Chat Completion

Create AI conversations that can access and reason about video content, allowing users to ask questions about specific videos or compare content across multiple sources.

Advanced Capabilities

Collections & Organization

Entity Collections - Process multiple videos with consistent schemas
Rich Transcript Collections - Organize videos for searchable transcriptions
Collection Chat - Have conversations across entire video libraries

Integrations & Tools

MCP Server - Direct integration with Claude Desktop and Cursor
Playground - Test and experiment with video processing
Schema Builder - Visual tool for creating extraction schemas
Webhooks - Real-time processing notifications

Multimodal Understanding

Speech Transcription - Accurate speech-to-text with speaker identification
Visual Scene Analysis - Detailed descriptions of what’s happening visually
Scene Text Recognition - Extract text visible on screen (captions, presentations, etc.)
YouTube Integration - Process videos directly from YouTube URLs

Popular Use Cases

Video Q&A Chatbots

Build intelligent chatbots that can answer questions about video content, perfect for training materials, meetings, and educational content.

Structured Data Extraction

Extract specific information like product details, people, locations, or any custom data schema from video content at scale.

Video Knowledge Bases

Create searchable knowledge bases from video libraries, making hours of content instantly accessible and queryable.

What Makes Cloudglue Different

Multimodal AI: We don’t just transcribe speech - we understand visual content, on-screen text, and context
Custom Schemas: Define exactly what data you want extracted with flexible schema definitions
Collection Processing: Organize and process multiple videos with consistent approaches
Developer-First: Clean APIs, comprehensive SDKs, and tools built for developers
Real-time Integration: MCP server support for direct AI assistant integration

Next Steps

Choose your path based on what you want to accomplish:

New to Cloudglue?

Start with our setup guide to get your API key and SDK installed.

See Examples

Explore detailed use cases with step-by-step implementations.

API Reference

Dive into the full API documentation and endpoint details.

Try the Playground

Test video processing directly in your browser without any code.

Resources

API Documentation: Full API Reference
SDKs: JavaScript • Python
Tools: Playground • Schema Builder
Community: Discord • Support
Need an SDK or integration? Let us know!

Getting Started

Core Concepts

Deep Dives

Use Cases

What is Cloudglue?

Get Started in 3 Steps

Quick Example

Core Features

Extract

Transcribe

Chat Completion

Advanced Capabilities

Collections & Organization

Integrations & Tools

Multimodal Understanding

Popular Use Cases

Video Q&A Chatbots

Structured Data Extraction

Video Knowledge Bases

What Makes Cloudglue Different

Next Steps

New to Cloudglue?

See Examples

API Reference

Try the Playground

Resources

Getting Started

Core Concepts

Deep Dives

Use Cases

​What is Cloudglue?

​Get Started in 3 Steps

​Quick Example

​Core Features

Extract

Transcribe

Chat Completion

​Advanced Capabilities

​Collections & Organization

​Integrations & Tools

​Multimodal Understanding

​Popular Use Cases

Video Q&A Chatbots

Structured Data Extraction

Video Knowledge Bases

​What Makes Cloudglue Different

​Next Steps

New to Cloudglue?

See Examples

API Reference

Try the Playground

​Resources

What is Cloudglue?

Get Started in 3 Steps

Quick Example

Core Features

Advanced Capabilities

Collections & Organization

Integrations & Tools

Multimodal Understanding

Popular Use Cases

What Makes Cloudglue Different

Next Steps

Resources