This thread is for anyone working on personal projects to share their progress, and hold themselves somewhat accountable to a group of peers.
Post your project, your progress from last week, and what you hope to accomplish this week.
If you want to be pinged with a reminder asking about your project, let me know, and I'll harass you each week until you cancel the service

Jump in the discussion.
No email address required.
Notes -
Haha, yep, tables and rich extraction is pretty bad out of the box.
In this case though, I can confidently say I'm an expert on PDF extraction for llm use.
ANy tips and tricks you picked up regarding this not available out there on the web? I basically just throw the most powerful vision model at it and YOLO it.
Why not just use one of the many existing commercial solutions? That's what we did last I dealt with OCR'ing PDFs, just used Azure's API and then processed the data. Would be surprised if a raw vision model is cheaper or higher quality.
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link