🌊

songsee

Name: songsee
Author: OpenClaw

Generate spectrograms and rich feature-panel visualizations from any audio file using the songsee CLI.

作者 OpenClawv1.0.0

Design & MediaOpen SourceAutomationCLIDeveloper Tool

正在連線至 VM...

npx clawhub@latest install songsee

8目前安裝數

v1.0.0版本

系統需求

songsee

songsee turns audio files into detailed visual representations — from classic spectrograms to multi-panel feature grids. Whether you need a quick frequency snapshot or a comprehensive breakdown of mel filterbanks, chroma, HPSS, loudness, tempograms, MFCCs, and spectral flux, songsee renders it all as a single exportable image.It works directly from the command line, supports piping audio via stdin, and lets you slice any time window out of a track — making it equally useful for quick inspection and batch processing workflows.

運作原理

songsee wraps the songsee CLI binary to give your AI agent audio visualization capabilities. Here's the typical flow:Point songsee at an audio file (MP3, WAV, or other formats via ffmpeg): songsee track.mp3Choose one or more visualization panels with --viz — e.g., spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux. Multiple panels are rendered as a grid.Customize the output with flags for color palette (--style), image dimensions (--width / --height), FFT settings (--window / --hop), frequency range (--min-freq / --max-freq), and time slicing (--start / --duration).Export as jpg or png to a file or stdout.WAV and MP3 are decoded natively; other formats are handled automatically if ffmpeg is available on the system.

核心功能

Multiple Visualization Types — Supports spectrogram, mel filterbank, chroma, HPSS, self-similarity matrix, loudness, tempogram, MFCC, and spectral flux panels.Multi-Panel Grid — Combine any mix of visualizations into a single image with one command.Time Slicing — Extract and visualize any segment of audio using --start and --duration flags.Color Palettes — Choose from five styles: classic, magma, inferno, viridis, and gray.Stdin Support — Pipe audio directly into songsee for seamless integration with shell pipelines.Flexible Output — Export as JPEG or PNG with configurable width and height.FFT Control — Tune window size, hop length, and frequency range for precise analysis.Broad Format Support — Native MP3/WAV decoding; other formats handled via ffmpeg if installed.

系統需求

songsee CLI — The core binary that performs audio analysis and rendering. Installed automatically by this skill via Homebrew (steipete/tap/songsee). Required.ffmpeg — Enables decoding of audio formats beyond MP3 and WAV (e.g., FLAC, OGG, AAC). Optional — the skill works without it for MP3 and WAV files.

使用情境

Music Production Review — Visualize frequency content, dynamics, and rhythm features of a mix to identify mastering issues or compare versions.Audio Research & Analysis — Generate multi-panel feature grids (chroma, MFCC, tempogram) for academic or ML dataset inspection.Podcast & Speech QA — Quickly check a recorded segment for noise, clipping, or silence by slicing and visualizing specific time windows.Batch Pipeline Integration — Pipe audio from other tools into songsee via stdin to automate visualization in CI or data processing workflows.

安裝方式

Run in your terminal

npx clawhub@latest install songsee

Click the Install button at the top of this page for one-click setup

正在連線至 VM...

npx clawhub@latest install songsee

8目前安裝數

v1.0.0版本

系統需求

songsee

常見問題

評價

0 則評價

登入後撰寫評價

尚無評價。來分享你的使用體驗吧！