AI AGENT SKILLS

Galdr

一个面向 Automation 场景的 Agent 技能。原始说明：galdr analyzes music from YouTube or local audio files into structured listener-state traces for AI agents: pulse, pattern, momentum, breath, texture, harmon...

下载技能包打开来源页 Automation

SKILL.md

name: galdr
description: Use galdr's default ARC workflow to turn YouTube URLs or local audio files into grounded, time-ordered listening-experience prompts backed by listener-state traces: pattern, attention, pulse, heard pressure, texture, harmony, melody, overtones, and silence/re-entry structure. Use when asked to analyze a song, explain what makes a track work structurally, generate a listening experience, compare tracks, or extract video frames from a music video.
version: 0.4.0
author: Sellemain
license: MIT
platforms: [linux, macos]

galdr

Deterministic ears for AI agents. Audio in, listener-state traces out. Acoustic signal becomes structured evidence an LLM can encounter.

galdr is a music perception CLI for AI agents. Its default experience is ARC: analyze a track into time-ordered listener-state traces, then assemble those traces into a grounded listening-experience prompt. The metrics are evidence. The ARC prompt is the main user-facing output.

Install

Preferred trusted sources:

PyPI: <https://pypi.org/project/galdr/>
Source: <https://github.com/sellemain/galdr>

pip install galdr

# or from source:
git clone https://github.com/sellemain/galdr.git
cd galdr
pip install -e .

This skill teaches an agent how to use galdr; it does not install the galdr command itself. Check: galdr --version. If missing, install the CLI before proceeding. If provenance matters, verify the PyPI metadata or install from the source repository above before running it.

Core Workflows

Default: ARC listening experience

Use this path unless the user explicitly asks for raw metrics, comparison, debugging, or agent-internal traces. ARC turns galdr's evidence into a prose prompt for a grounded, time-ordered listening experience.

The shape is:

Fetch or listen to the track.
Analyze it into listener-state traces.
Assemble the ARC prompt with --template arc --mode full.
Review the prompt, then write or send it to the requested model.

YouTube URL → ARC prompt (most common)

# Step 1: fetch audio + context (slug auto-derived from title)
galdr fetch "https://youtu.be/..." --analyze

# galdr prints the slug at the end:
#   Slug : artist-song-title
#   Next : galdr assemble artist-song-title --template arc --mode full

# Step 2: assemble the prompt locally
galdr assemble artist-song-title --template arc --mode full > prompt.txt

Override auto-derived metadata if needed:

galdr fetch "https://youtu.be/..." --artist "Oliver Anthony" --title "Rich Men North of Richmond" --analyze

If YouTube download behavior is flaky:

galdr doctor
galdr update-deps

galdr doctor reports the active Python executable, yt-dlp command/version, ffmpeg/ffprobe, JavaScript runtimes, and impersonation support. galdr update-deps upgrades yt-dlp[default,curl-cffi] in the same Python environment galdr is using.

Local file → ARC prompt

The analysis command is galdr listen, not galdr analyze.

galdr listen track.wav --name my-track
galdr assemble my-track --template arc --mode full > prompt.txt

Raw second-by-second analysis (advanced)

Galdr is strongest when read as a time-ordered listener-state trace. The stream is the primary evidence. Whole-track interpretation comes after walking the track through time.

Start with:

analysis/<slug>/<slug>_stream.json
analysis/<slug>/<slug>_perception.json
docs/PERCEPTION-MODEL.md

Useful extras:

*_harmony_stream.json
*_melody_stream.json
*_overtone_stream.json
*_report.json
galdr assemble <slug> --mode blind

Reading order:

Read PERCEPTION-MODEL.md first.
Treat *_stream.json as the main evidence surface.
Walk the track in order.
Mark transitions: silence, re-entry, pattern breaks, attention shifts, pressure-state changes, harmonic movement.
Translate pressure fields into listening language: comes forward, holds, releases, empties. Do not quote LUFS values in experience prose.
Only then compress upward into a larger interpretation.

Do not:

jump straight to a whole-song mood summary
treat summary metrics as more important than the stream
ignore silence/re-entry structure
overclaim emotional certainty from structure alone
quote loudness/LUFS readings as if they were the experience

Minimal recipe:

galdr listen track.wav --name my-track
jq '.[0:12]' analysis/my-track/my-track_stream.json
jq '.summary' analysis/my-track/my-track_perception.json
galdr assemble my-track --mode blind > prompt.txt

Send the ARC prompt to another model

Only do this if the operator explicitly wants model-written prose. Review the assembled ARC prompt before piping it to claude, llm, or any other external model endpoint.

galdr assemble my-track --template arc --mode full | claude
galdr assemble my-track --template arc --mode full | llm

Optional Python agent pattern

import subprocess, re

fetch = subprocess.run(
    ["galdr", "fetch", url, "--analyze"],
    capture_output=True, text=True, check=True
)
slug = re.search(r"Slug\s*:\s*(\S+)", fetch.stdout).group(1)

prompt = subprocess.run(
    ["galdr", "assemble", slug, "--template", "arc", "--mode", "full"],
    capture_output=True, text=True, check=True
).stdout

# Review prompt before sending it to any external model endpoint.

Mode and template flags

| Mode | What's included |
|------|----------------|
| full (default) | metrics + lyrics + background + frames |
| lyrics | metrics + lyrics |
| context | metrics + background |
| blind | metrics only (structural, no cultural context) |

--template arc prepends the default listening-experience rules: tone, format, interpretation bounds, and the instruction to walk the track through time. Omit it only when you want a raw data block.

Interpreting galdr Output

ARC is the default output path. The metrics exist to keep that prose grounded: use them as evidence for what changes, returns, releases, locks, or breaks over time.

See references/metrics.md for full metric reference.

Quick read:

pattern near 1.0 → listener is locked; near 0 → constant disruption
texture negative → harmonic dominant (warm, tonal); positive → percussive dominant
pressure_balance building/releasing/sustaining → heard-pressure shape across the track
Clustered pattern_breaks at the end → planned release; distributed → varied structure
silence depth below -60dB with re-lock above 0.93 attention → structured withdrawal/return

Writing ARC Experience Prose (without piping)

When writing experience prose yourself from galdr evidence, prefer galdr assemble <slug> --template arc --mode full. If you are writing from raw assembled output without the template:

First-person listener perspective, present tense
Timestamps only at structural pivots (silences, pattern breaks, major energy shifts)
Translate metrics — describe what they mean, don't quote numbers
LUFS/pressure values are evidence, not prose; write “pressure comes forward / holds / releases / empties”
Body anchors (chest, jaw, sternum) sparingly — two or three for the whole piece
End at the final sound event; no aftermath, no reflection
~800 words, no section headers

Other Commands

galdr compare track-a track-b          # side-by-side structural comparison
galdr frames slug                      # extract + describe video frames at structural moments
galdr fetch "url" --no-download        # context only (Wikipedia + lyrics), no audio
galdr fetch "url" --censor             # sanitize explicit lyrics before saving
galdr doctor                           # inspect yt-dlp/media runtime health
galdr update-deps                      # upgrade yt-dlp reliability extras
galdr catalog                          # list all indexed tracks
galdr catalog --track NAME             # summary card for one track

适用场景

分类

Automation Automation 低风险技能筛选

风险等级

风险标签

network access file access

文件

2

Automation

Self-Improving Agent

一个面向 Automation 场景的 Agent 技能。原始说明：Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Clau...

Automation 低风险

Self-Improving + Proactive Agent

一个面向 Automation 场景的 Agent 技能。原始说明：Self-reflection + Self-criticism + Self-learning + Self-organizing memory. Agent evaluates its own work, catches mistakes, and improves permanently. Use when...

Automation 未知

Proactive Agent

一个面向 Automation 场景的 Agent 技能。原始说明：Transform AI agents from task-followers into proactive partners that anticipate needs and continuously improve. Now with WAL Protocol, Working Buffer, Autonomous Crons, and battle-tested patterns. Part of the Hal Stack 🦞

Automation 未知

ontology

一个面向 Automation 场景的 Agent 技能。原始说明：Typed knowledge graph for structured agent memory and composable skills. Use when creating/querying entities (Person, Project, Task, Event, Document), linkin...

Automation 低风险

Skill Creator

一个面向 Automation 场景的 Agent 技能。原始说明：Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.

Automation 未知

Desktop Control

一个面向 Automation 场景的 Agent 技能。原始说明：Advanced desktop automation with mouse, keyboard, and screen control

SKILL.md

galdr

Install

Core Workflows

Default: ARC listening experience

YouTube URL → ARC prompt (most common)

Local file → ARC prompt

Raw second-by-second analysis (advanced)

Send the ARC prompt to another model

Optional Python agent pattern

Mode and template flags

Interpreting galdr Output

Writing ARC Experience Prose (without piping)

Other Commands

相关技能

Self-Improving Agent

Self-Improving + Proactive Agent

Proactive Agent

ontology

Skill Creator

Desktop Control