{"skill":{"slug":"vlm-grounding","displayName":"vlm-grounding","summary":"Use GLM-4.7V's multimodal grounding capability to detect and locate objects/text in images. Activate when user asks to find, locate, detect, or ground specif...","tags":{"latest":"1.0.0"},"stats":{"comments":0,"downloads":185,"installsAllTime":0,"installsCurrent":0,"stars":0,"versions":1},"createdAt":1773909716294,"updatedAt":1777272617828},"latestVersion":{"version":"1.0.0","createdAt":1773909716294,"changelog":"- Initial release of multimodal grounding skill using GLM-4.7V.\n- Supports detecting and locating objects, text, and UI elements in images with bounding box outputs.\n- Provides end-to-end workflow: model API call, bounding box parsing, and visualization on images.\n- Includes robust parsing for various bracket styles and auto-renormalization of coordinates.\n- Triggers automatically on grounding-related user requests in both English and Chinese.","license":"MIT-0"},"metadata":null,"owner":{"handle":"qijimrc","userId":"s17dxhxp6n7b66j7evz56mb7zn83w4q2","displayName":"Ji Qi","image":"https://avatars.githubusercontent.com/u/28889175?v=4"},"moderation":{"isSuspicious":true,"isMalwareBlocked":false,"verdict":"suspicious","reasonCodes":["suspicious.llm_suspicious"],"summary":"Detected: suspicious.llm_suspicious","engineVersion":"v2.4.0","updatedAt":1777272617828}}