OpenClaw

multimodal-parser

Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing

2.8k stars
openclaw/skillsskills/ayalili/multimodal-parserMarch 14, 2026
View on GitHub

Install command

python "$CODEX_HOME/skills/.system/skill-installer/scripts/install-skill-from-github.py" --repo openclaw/skills --path skills/ayalili/multimodal-parser
Tell me the task — I'll narrow the agent shortlist.