🌊

songsee

Generate spectrograms and rich feature-panel visualizations from any audio file using the songsee CLI.

por OpenClawv1.0.0
Design & MediaOpen SourceAutomationCLIDeveloper Tool
Conectando a la VM...
Conectando a la VM...
npx clawhub@latest install songsee
8Instalaciones actuales
v1.0.0Versión

Requisitos

songsee

songsee turns audio files into detailed visual representations — from classic spectrograms to multi-panel feature grids. Whether you need a quick frequency snapshot or a comprehensive breakdown of mel filterbanks, chroma, HPSS, loudness, tempograms, MFCCs, and spectral flux, songsee renders it all as a single exportable image.It works directly from the command line, supports piping audio via stdin, and lets you slice any time window out of a track — making it equally useful for quick inspection and batch processing workflows.

Cómo funciona

songsee wraps the songsee CLI binary to give your AI agent audio visualization capabilities. Here's the typical flow:Point songsee at an audio file (MP3, WAV, or other formats via ffmpeg): songsee track.mp3Choose one or more visualization panels with --viz — e.g., spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux. Multiple panels are rendered as a grid.Customize the output with flags for color palette (--style), image dimensions (--width / --height), FFT settings (--window / --hop), frequency range (--min-freq / --max-freq), and time slicing (--start / --duration).Export as jpg or png to a file or stdout.WAV and MP3 are decoded natively; other formats are handled automatically if ffmpeg is available on the system.

Características principales

Multiple Visualization Types — Supports spectrogram, mel filterbank, chroma, HPSS, self-similarity matrix, loudness, tempogram, MFCC, and spectral flux panels.Multi-Panel Grid — Combine any mix of visualizations into a single image with one command.Time Slicing — Extract and visualize any segment of audio using --start and --duration flags.Color Palettes — Choose from five styles: classic, magma, inferno, viridis, and gray.Stdin Support — Pipe audio directly into songsee for seamless integration with shell pipelines.Flexible Output — Export as JPEG or PNG with configurable width and height.FFT Control — Tune window size, hop length, and frequency range for precise analysis.Broad Format Support — Native MP3/WAV decoding; other formats handled via ffmpeg if installed.

Requisitos

songsee CLI — The core binary that performs audio analysis and rendering. Installed automatically by this skill via Homebrew (steipete/tap/songsee). Required.ffmpeg — Enables decoding of audio formats beyond MP3 and WAV (e.g., FLAC, OGG, AAC). Optional — the skill works without it for MP3 and WAV files.

Casos de uso

Music Production Review — Visualize frequency content, dynamics, and rhythm features of a mix to identify mastering issues or compare versions.Audio Research & Analysis — Generate multi-panel feature grids (chroma, MFCC, tempogram) for academic or ML dataset inspection.Podcast & Speech QA — Quickly check a recorded segment for noise, clipping, or silence by slicing and visualizing specific time windows.Batch Pipeline Integration — Pipe audio from other tools into songsee via stdin to automate visualization in CI or data processing workflows.

Cómo instalar

1
Run in your terminal
npx clawhub@latest install songsee
or
2
Click the Install button at the top of this page for one-click setup

Preguntas frecuentes

Reseñas

0 reseñas

Inicia sesión para escribir una reseña

Aún no hay reseñas. ¡Sé el primero en compartir tu experiencia!