site banner

Tinker Tuesday for June 24, 2025

This thread is for anyone working on personal projects to share their progress, and hold themselves somewhat accountable to a group of peers.

Post your project, your progress from last week, and what you hope to accomplish this week.

If you want to be pinged with a reminder asking about your project, let me know, and I'll harass you each week until you cancel the service

1
Jump in the discussion.

No email address required.

Haha, yep, tables and rich extraction is pretty bad out of the box.

In this case though, I can confidently say I'm an expert on PDF extraction for llm use.

I can confidently say I'm an expert on PDF extraction for llm use.

ANy tips and tricks you picked up regarding this not available out there on the web? I basically just throw the most powerful vision model at it and YOLO it.

Why not just use one of the many existing commercial solutions? That's what we did last I dealt with OCR'ing PDFs, just used Azure's API and then processed the data. Would be surprised if a raw vision model is cheaper or higher quality.