multimodal-parser

Unified multimodal content parser for images, PDF, DOCX, and audio. It automatically applies OCR or transcription and outputs structured text for LLM processing.

2.8k stars

openclaw/skills | skills/ayalili/multimodal-parser | March 14, 2026

Install command

python "$CODEX_HOME/skills/.system/skill-installer/scripts/install-skill-from-github.py" --repo openclaw/skills --path skills/ayalili/multimodal-parser
