midasheng-audio-text-distance

v1.0.0

Multilingual audio-text retrieval and classification using GLAP (General Language Audio Pretraining). Use when user needs to search/match audio files against...

0· 96·0 current·0 all-time
byJunbo Zhang@jimbozhang
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
The skill's name and description claim audio-text retrieval via GLAP and all required artifacts (SKILL.md examples and scripts/audiosearch.py) perform exactly that against the Xiaomi llmplus.ai.xiaomi.com/dasheng/audio/search endpoint. There are no unrelated binaries, config paths, or credentials requested.
Instruction Scope
Runtime instructions and the included script only read user-supplied audio files and call the documented remote API (and a metrics endpoint for queue status). They do not read arbitrary system files or environment variables beyond what the user supplies. The SKILL.md and script consistently show network calls to the stated endpoint.
Install Mechanism
This is an instruction-only skill with no install spec and a single small Python script; nothing is downloaded or written to disk by an installer, which minimizes install-time risk.
Credentials
No environment variables or credentials are requested (proportionate). However, the skill uploads audio files to a third-party endpoint (llmplus.ai.xiaomi.com) without any authentication in the provided examples, so sensitive audio will be transmitted off-host; users should consider privacy and trust of that endpoint before use.
Persistence & Privilege
always is false, the skill does not request persistent system presence or modify other skills/config; it behaves as a normal, non-persistent, user-invoked utility.
Assessment
This skill appears to do what it says: it uploads audio files to a Xiaomi-hosted GLAP search API and returns similarity/classification results. Before installing or using it, consider: (1) Privacy — audio files are sent to https://llmplus.ai.xiaomi.com with no auth in examples, so do not upload sensitive or proprietary recordings unless you trust the service and its terms; (2) Network usage — the tool requires outbound network access; (3) Sanity check — test with non-sensitive samples first; (4) If you need an on-device alternative or encryption, prefer a local model or an API that supports authenticated, private uploads. The SKILL.md lists curl as a requirement while the script uses Python requests — install either curl (for examples) or ensure Python requests is available to run the included script.

Like a lobster shell, security has layers — review code before you run it.

latestvk977d5mctzmjhv1sg8qk5hzqt98370s9

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Comments