This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| video_editing_agents [2026/03/25 14:52] – Create page: LLM agents for video editing agent | video_editing_agents [2026/03/30 22:39] (current) – Restructure: footnotes as references agent | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Video Editing Agents ====== | ====== Video Editing Agents ====== | ||
| - | LLM-powered agents for video editing enable prompt-driven autonomous editing workflows, transforming natural language instructions into structured edit operations over long-form video content through hierarchical semantic indexing and agentic planning. | + | LLM-powered agents for video editing enable prompt-driven autonomous editing workflows, transforming natural language instructions into structured edit operations over long-form video content through hierarchical semantic indexing and agentic planning.(([[https:// |
| ===== Overview ===== | ===== Overview ===== | ||
| Line 9: | Line 9: | ||
| ===== Prompt-Driven Agentic Video Editing ===== | ===== Prompt-Driven Agentic Video Editing ===== | ||
| - | The framework introduced in the prompt-driven agentic editing paper uses a modular, cloud-native pipeline for long-form video comprehension and editing: | + | The framework introduced in the prompt-driven agentic editing paper uses a modular, cloud-native pipeline for long-form video comprehension and editing:(([[https:// |
| * **Ingestion Module**: Processes raw video into analyzable segments | * **Ingestion Module**: Processes raw video into analyzable segments | ||
| Line 24: | Line 24: | ||
| ===== LAVE: Agent-Assisted Video Editing ===== | ===== LAVE: Agent-Assisted Video Editing ===== | ||
| - | LAVE (LLM Agent-assisted Video Editing) implements a semi-autonomous workflow in which the agent collaborates with the user: | + | LAVE (LLM Agent-assisted Video Editing) implements a semi-autonomous workflow in which the agent collaborates with the user:(([[https:// |
| **Backend Processing**: | **Backend Processing**: | ||
| Line 32: | Line 32: | ||
| - **Execute State**: Agent performs approved actions sequentially, | - **Execute State**: Agent performs approved actions sequentially, | ||
| - | In a user study with 8 participants (novices to experts), LAVE produced satisfactory videos and was rated easy to use and useful, and participants reported enhanced creativity and a sense of co-creation. | + | In a user study with 8 participants (novices to experts), LAVE produced satisfactory videos and was rated easy to use and useful, and participants reported enhanced creativity and a sense of co-creation.(([[https:// |
| ===== Story-Driven Editing ===== | ===== Story-Driven Editing ===== | ||
| Line 133: | Line 133: | ||
| | LAVE | Semi-autonomous (user approves) | Brainstorming + storyboarding | 8 participants, | | LAVE | Semi-autonomous (user approves) | Brainstorming + storyboarding | 8 participants, | ||
| | VideoAgent | Agentic framework | Understanding + editing | General performance | | | VideoAgent | Agentic framework | Understanding + editing | General performance | | ||
| - | |||
| - | ===== References ===== | ||
| - | |||
| - | * [[https:// | ||
| - | * [[https:// | ||
| ===== See Also ===== | ===== See Also ===== | ||
| Line 144: | Line 139: | ||
| * [[music_composition_agents|Music Composition Agents]] | * [[music_composition_agents|Music Composition Agents]] | ||
| * [[game_playing_agents|Game Playing Agents]] | * [[game_playing_agents|Game Playing Agents]] | ||
| + | |||
| + | ===== References ===== | ||