Skip to main content

Media Skills

Media Skills give the OpenClaw Agent multimedia processing capabilities — from generating presentations and managing music to summarizing YouTube videos. These Skills expand the Agent's range from pure text to multimodal.


#25 — Felo Slides

PropertyDetails
Rank#25 / 50
CategoryMedia
Total Score55 / 80
Maturity🟡 Beta
Installationclawhub install felo-slides
Target UsersUsers who need to quickly create presentations

Feature Overview

Generate professional presentations (PPT/PDF) from natural language descriptions:

  • Natural language to slides: Describe the topic and outline to generate
  • Template system: Multiple professional design templates
  • Chart generation: Automatically generate charts and infographics
  • Multilingual support: Traditional Chinese, English, Japanese, and more
  • Export formats: PPTX, PDF, Google Slides
  • Content population: Combines with search Skills to auto-fill latest data

Why It Matters

Creating presentations is one of the most time-consuming tasks for knowledge workers. Felo Slides lets the Agent produce a well-structured, professionally designed presentation in minutes — you just review and fine-tune.

Score Breakdown

RELCOMTRCVALMNTRLBSECLRNTotal
7778767655

Installation & Setup

clawhub install felo-slides

openclaw skill configure felo-slides \
--api-key your_felo_slides_key

# Usage examples
openclaw run "Create a 10-slide presentation about AI Agent trends in enterprise using Felo Slides"

openclaw run felo-slides \
--template business-modern \
--language en \
--topic "Q1 Operations Report"

Dependencies & Security

  • Dependencies: Felo Slides API Key
  • Permissions Required: Network access (API calls), local file write (export)
  • Security: SEC 7/10 — presentation content is sent to Felo servers for processing
  • Alternatives: Google Slides API direct manipulation; Markdown to PPTX conversion tools

#27 — Spotify

PropertyDetails
Rank#27 / 50
CategoryMedia
Total Score55 / 80
Maturity🟡 Beta
Installationclawhub install community/spotify-claw
Target UsersSpotify Premium users

Feature Overview

Control Spotify playback and manage music:

  • Play/pause/skip tracks
  • Search songs, albums, and artists
  • Manage playlists
  • Context-based music recommendations (e.g., "Play music suitable for coding")
  • View recent listening history

Why It Matters

Music is part of many people's workflow. Let the Agent automatically adjust background music based on your calendar and tasks — Lo-fi during focused work, relaxing music before meetings, high-energy playlists during exercise.

Score Breakdown

RELCOMTRCVALMNTRLBSECLRNTotal
6777778655

Installation & Setup

clawhub install community/spotify-claw

# OAuth authorization
openclaw auth spotify

# Usage examples
openclaw run "Play some Lo-fi music suitable for working"
openclaw run "Add the current song to my 'Favorites' playlist"

Dependencies & Security

  • Dependencies: Spotify Premium account, Spotify OAuth
  • Permissions Required: user-modify-playback-state, playlist-modify-private
  • Security: SEC 8/10 — Spotify permissions are limited in scope, no sensitive data involved
  • Alternatives: Apple Music integration is in community development

#32 — YouTube Digest

PropertyDetails
Rank#32 / 50
CategoryMedia
Total Score53 / 80
Maturity🟡 Beta
Installationclawhub install community/youtube-digest
Target UsersHeavy YouTube users, learners

Feature Overview

Extract knowledge from YouTube videos:

  • Subtitle extraction: Retrieve auto/manual subtitles from videos
  • Content summaries: Generate key-point summaries
  • Timestamped notes: Mark important time points
  • Channel tracking: Follow new videos from specific channels
  • Playlist analysis: Consolidate knowledge from entire playlists

Why It Matters

YouTube is one of the world's largest knowledge repositories, but watching a 1-hour tutorial takes significant time. YouTube Digest lets the Agent generate a video summary in seconds so you can quickly decide whether it's worth watching in full.

Score Breakdown

RELCOMTRCVALMNTRLBSECLRNTotal
7667678653

Installation & Setup

clawhub install community/youtube-digest

# Optional: YouTube API Key improves efficiency
openclaw skill configure youtube-digest \
--youtube-api-key your_api_key # optional

# Usage examples
openclaw run "Summarize this video: https://youtube.com/watch?v=xxxxx"
openclaw run "Follow the Fireship channel and give me weekly new video summaries"

Dependencies & Security

  • Dependencies: YouTube API Key (optional — works without it but slower)
  • Permissions Required: YouTube Data API (read-only)
  • Security: SEC 8/10 — read-only access, does not modify any data
  • Alternatives: Use the Web Browsing Skill to browse YouTube and summarize manually

#35 — Image Generation

PropertyDetails
Rank#35 / 50
CategoryMedia
Total Score52 / 80
Maturity🟡 Beta
Installationclawhub install community/image-gen
Target UsersUsers needing quick image generation

Feature Overview

Create images through multiple AI image generation services:

  • Text to image: Generate images from natural language descriptions
  • Multi-model support: DALL-E 3, Stable Diffusion, Midjourney API
  • Image editing: Modify existing images
  • Batch generation: Generate multiple variants at once
  • Style control: Specify art style, aspect ratio, and resolution

Why It Matters

When you need social media post images, presentation illustrations, or concept visualizations, the Image Generation Skill lets you produce them without leaving OpenClaw. Works especially well paired with Felo Slides.

Score Breakdown

RELCOMTRCVALMNTRLBSECLRNTotal
7667667752

Installation & Setup

clawhub install community/image-gen

# Configure image generation service (choose one)
openclaw skill configure image-gen \
--provider openai \
--api-key sk-xxxxxxxxxxxx

# Usage example
openclaw run "Generate a cyberpunk-style illustration of Taipei 101"

Dependencies & Security

  • Dependencies: Image generation API Key (OpenAI, Stability AI, etc.)
  • Permissions Required: API calls + local file write
  • Security: SEC 7/10 — image descriptions are sent to third-party services
  • Alternatives: Use DALL-E or Midjourney official interfaces directly

#39 — TweetClaw

PropertyDetails
Rank#39 / 50
CategoryMedia / Social
Total Score50 / 80
Maturity🟠 Alpha
Installationclawhub install community/tweetclaw
Target UsersSocial media managers, X/Twitter users

Feature Overview

X (formerly Twitter) content management:

  • Read latest tweets from specific accounts
  • Search tweets by topic
  • Draft tweets (manual confirmation required before publishing)
  • Track trending topics
  • Analyze tweet engagement metrics

Why It Matters

For users managing social media, TweetClaw lets the Agent track industry trends, draft tweets, and analyze engagement. Pair with the Summarize Skill for daily Twitter digests.

Score Breakdown

RELCOMTRCVALMNTRLBSECLRNTotal
6666667750

Installation & Setup

clawhub install community/tweetclaw

# Set X API (requires Basic or Pro plan)
openclaw skill configure tweetclaw \
--api-key your_x_api_key \
--api-secret your_x_api_secret \
--access-token your_access_token \
--access-secret your_access_secret
X API Costs

The X API is now paid. The Basic plan costs $100/month with limited features. TweetClaw's read-only features can be substituted with the free Web Browsing Skill.

Dependencies & Security

  • Dependencies: X API Key (paid)
  • Permissions Required: Tweet read/write
  • Security: SEC 7/10 — recommend restricting to read-only mode, publish tweets manually
  • Alternatives: Web Browsing Skill with X search (free but limited)

#40 — Voice / Vapi

PropertyDetails
Rank#40 / 50
CategoryMedia
Total Score50 / 80
Maturity🟠 Alpha
Installationclawhub install community/voice-vapi
Target UsersUsers who want voice interaction

Feature Overview

Add voice capabilities to the OpenClaw Agent:

  • Voice input: Speech-to-Text (STT)
  • Voice output: Text-to-Speech (TTS)
  • Voice conversation: Real-time voice interaction mode
  • Multilingual: Supports Chinese, English, Japanese, Korean, and more
  • Custom voices: Choose from different TTS voices

Why It Matters

Voice is the most natural human-computer interaction method. The Voice/Vapi Skill lets you control the Agent by speaking, perfect for scenarios where typing is inconvenient — driving, doing housework, etc.

Score Breakdown

RELCOMTRCVALMNTRLBSECLRNTotal
6657667750

Installation & Setup

clawhub install community/voice-vapi

# Configure voice service (Vapi)
openclaw skill configure voice-vapi \
--provider vapi \
--api-key your_vapi_key \
--voice-id alloy \
--language en-US

# Start voice mode
openclaw voice

Dependencies & Security

  • Dependencies: Vapi API Key or other TTS/STT service
  • Permissions Required: Microphone access, network access
  • Security: SEC 7/10 — voice data is sent to third-party services for processing
  • Alternatives: Future OpenClaw versions may include built-in voice capabilities (see Module 11)

Content Creator

clawhub install felo-slides
clawhub install community/image-gen
clawhub install community/tweetclaw
clawhub install community/spotify-claw

Learner

clawhub install community/youtube-digest
clawhub install community/summarize
clawhub install community/obsidian-claw

Multimodal Agent

clawhub install community/voice-vapi
clawhub install community/image-gen
clawhub install community/youtube-digest
# See, hear, speak, draw — a fully multimodal Agent