ResonanceID-cli

A Rust-based audio fingerprinting CLI inspired by Shazam-style matching.

This project is being built for a Design and Analysis of Algorithms course, with focus on:

fingerprint pipeline design
matching quality vs false positives
practical CLI workflows
measurable runtime behavior

Features

Store songs into a local SQLite fingerprint DB
Recognize unknown clips against stored references
🔊 Live system audio recognition - identify songs playing on your computer (YouTube, Spotify, etc.)
Show ranked candidates (top matches)
Manage DB from CLI (list-songs, remove-song, db-stats)
Config layering (/etc, user config, local config)
CLI overrides for all key tuning params
Optional clipping for reference indexing (--clip-start, --clip-duration, --auto-clip)

Tech Stack

Rust
SQLite (rusqlite)
FFT (rustfft)
WAV I/O (hound)
TOML config (serde, toml)
Audio capture (libpulse-binding for Linux/PulseAudio)

Pipeline (High-Level)

Store / Remember

Read WAV samples
(Optional) clip audio range
STFT spectrogram
Peak extraction (constellation points)
Fingerprint generation (hash, anchor_time_ms)
Insert song metadata + fingerprints into SQLite

Recognize

Read WAV samples
STFT spectrogram
Peak extraction
Fingerprint generation
Hash lookup in DB + offset voting
Rank songs by strongest offset consistency

Installation / Run

cargo build
cargo run -- --help

Note: pass app args after -- when using cargo run.

Diagnose issues using

cargo test

CLI Commands

Store a reference track

cargo run -- store <wav_path> "<Title>" "<Artist>" [options]

Alias:

cargo run -- remember <wav_path> "<Title>" "<Artist>" [options]

Recognize a clip

cargo run -- recognize <wav_path> [options]

🔊 Live system audio recognition (NEW!)

cargo run -- listen [duration] [options]

Capture and identify audio playing on your computer in real-time.

Examples:

# List available audio devices (shows monitors and microphones)
cargo run -- list-devices

# Capture system audio for 10 seconds (default)
cargo run -- listen --monitor

# Capture system audio for 5 seconds
cargo run -- listen 5 --monitor

# Use specific device by index
cargo run -- listen --device 0

Key features:

🎧 Works with headphones (captures before audio output)
🔇 Works at any volume level (even muted)
📻 Recognizes music from YouTube, Spotify, web browsers, etc.
🎯 Uses PulseAudio monitor sources (Linux)

Show ranked candidates

cargo run -- list-top-matches <wav_path> [options]

Database management

cargo run -- list-songs [--db <db_path>]
cargo run -- remove-song <song_id> [--db <db_path>]
cargo run -- db-stats [--db <db_path>]

Common Options

--db <db_path>
--config <path>
--no-config

Fingerprint options:

--window-size <n>
--hop-size <n>
--anchor-window <n>
--threshold-db <f32>

Recognition options:

--min-match-score <n>
--dynamic-gate-scale <f32>
--small-query-threshold <n>
--max-results <n>

Clip options (store/remember):

--clip-start <seconds>
--clip-duration <seconds>
--auto-clip (center clip; default 20s if duration not specified)

Config

Search order (when --config is not given):

/etc/resonanceid-cli/config.toml
~/.config/resonanceid-cli/config.toml
./resonanceid-cli.toml

Precedence:

CLI flags > config file > defaults

Example config:

[fingerprint]
window_size = 1024
hop_size = 512
anchor_window = 50
threshold_db = -20.0

[recognition]
min_match_score = 2
dynamic_gate_scale = 30.0
small_query_threshold = 1000
max_results = 5

You can copy from resonanceid-cli.toml.example.

Quick Demo

File-based recognition

# 1) Convert audio to WAV (mono, 44.1k)
ffmpeg -y -i input.mp3 -ac 1 -ar 44100 input.wav

# 2) Store reference
cargo run -- store input.wav "My Song" "My Artist"

# 3) Recognize clip
cargo run -- recognize clip.wav

System audio recognition (Shazam-style)

# 1) Store some reference songs
cargo run -- store song1.wav "Song 1" "Artist 1"
cargo run -- store song2.wav "Song 2" "Artist 2"

# 2) Play music on your computer (YouTube, Spotify, etc.)

# 3) Identify what's playing
cargo run -- listen --monitor

Notes

File-based commands (store, recognize) expect WAV input files
Use ffmpeg for mp3/flac conversion before running file-based commands
For stable matching quality, reference clips around 20–45 seconds are recommended
System audio capture (listen) works directly - no file conversion needed
The listen command is Linux-only (requires PulseAudio/PipeWire)
For other platforms, use file-based recognition with recognize

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
songs		songs
src		src
.gitignore		.gitignore
Cargo.toml		Cargo.toml
README.md		README.md
resonanceid-cli.toml.example		resonanceid-cli.toml.example

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ResonanceID-cli

Features

Tech Stack

Pipeline (High-Level)

Store / Remember

Recognize

Installation / Run

CLI Commands

Store a reference track

Recognize a clip

🔊 Live system audio recognition (NEW!)

Show ranked candidates

Database management

Common Options

Config

Quick Demo

File-based recognition

System audio recognition (Shazam-style)

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

rugbedbugg/ResonanceID-cli

Folders and files

Latest commit

History

Repository files navigation

ResonanceID-cli

Features

Tech Stack

Pipeline (High-Level)

Store / Remember

Recognize

Installation / Run

CLI Commands

Store a reference track

Recognize a clip

🔊 Live system audio recognition (NEW!)

Show ranked candidates

Database management

Common Options

Config

Quick Demo

File-based recognition

System audio recognition (Shazam-style)

Notes

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages