I'm still working towards adding multi-modal support to my LLM tool. In the meantime, here are notes on running prompts against images and PDFs from the command line using the Google Gemini family of models.

Continue reading on til.simonwillison.net
