🌊

songsee

Generate spectrograms and rich feature-panel visualizations from any audio file using the songsee CLI.

von OpenClawv1.0.0
Design & MediaOpen SourceAutomationCLIDeveloper Tool
Verbindung zu VM wird hergestellt...
Verbindung zu VM wird hergestellt...
npx clawhub@latest install songsee
8Aktuelle Installationen
v1.0.0Version

Voraussetzungen

songsee

songsee turns audio files into detailed visual representations — from classic spectrograms to multi-panel feature grids. Whether you need a quick frequency snapshot or a comprehensive breakdown of mel filterbanks, chroma, HPSS, loudness, tempograms, MFCCs, and spectral flux, songsee renders it all as a single exportable image.It works directly from the command line, supports piping audio via stdin, and lets you slice any time window out of a track — making it equally useful for quick inspection and batch processing workflows.

Funktionsweise

songsee wraps the songsee CLI binary to give your AI agent audio visualization capabilities. Here's the typical flow:Point songsee at an audio file (MP3, WAV, or other formats via ffmpeg): songsee track.mp3Choose one or more visualization panels with --viz — e.g., spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux. Multiple panels are rendered as a grid.Customize the output with flags for color palette (--style), image dimensions (--width / --height), FFT settings (--window / --hop), frequency range (--min-freq / --max-freq), and time slicing (--start / --duration).Export as jpg or png to a file or stdout.WAV and MP3 are decoded natively; other formats are handled automatically if ffmpeg is available on the system.

Hauptfunktionen

Multiple Visualization Types — Supports spectrogram, mel filterbank, chroma, HPSS, self-similarity matrix, loudness, tempogram, MFCC, and spectral flux panels.Multi-Panel Grid — Combine any mix of visualizations into a single image with one command.Time Slicing — Extract and visualize any segment of audio using --start and --duration flags.Color Palettes — Choose from five styles: classic, magma, inferno, viridis, and gray.Stdin Support — Pipe audio directly into songsee for seamless integration with shell pipelines.Flexible Output — Export as JPEG or PNG with configurable width and height.FFT Control — Tune window size, hop length, and frequency range for precise analysis.Broad Format Support — Native MP3/WAV decoding; other formats handled via ffmpeg if installed.

Voraussetzungen

songsee CLI — The core binary that performs audio analysis and rendering. Installed automatically by this skill via Homebrew (steipete/tap/songsee). Required.ffmpeg — Enables decoding of audio formats beyond MP3 and WAV (e.g., FLAC, OGG, AAC). Optional — the skill works without it for MP3 and WAV files.

Anwendungsfälle

Music Production Review — Visualize frequency content, dynamics, and rhythm features of a mix to identify mastering issues or compare versions.Audio Research & Analysis — Generate multi-panel feature grids (chroma, MFCC, tempogram) for academic or ML dataset inspection.Podcast & Speech QA — Quickly check a recorded segment for noise, clipping, or silence by slicing and visualizing specific time windows.Batch Pipeline Integration — Pipe audio from other tools into songsee via stdin to automate visualization in CI or data processing workflows.

Installation

1
Run in your terminal
npx clawhub@latest install songsee
or
2
Click the Install button at the top of this page for one-click setup

FAQ

Bewertungen

0 Bewertungen

Anmelden, um eine Bewertung zu schreiben

Noch keine Bewertungen. Sei der Erste, der seine Erfahrungen teilt!