Hacker News
Gedxx on Sept 14, 2020 | on: What's so hard about PDF text extraction?
If you are interested in extracting tables from PDFs, I recommend Tabula. Here is an example:
https://www.ikkaro.net/convert-pdf-to-excel-csv/
codegladiator on Sept 14, 2020
Also, tetpdf (paid) works really well. I used it to extract transactions from account statement PDFs (the demo works for 2-3 pages). I actually used a combination of Tabula and tetpdf.