2 points by nnurmanov 21 hours ago | 1 comment
Is anyone building such a benchmark?
nnurmanov 20 hours ago
Docling is OK with tables, but fails with cyrillic text;
marker-pdf is OK with tables, but it also fails with cyrillic text;
What other pdf parser libraries exist? I am looking for preferably on-premise solutions, but if I won't find a reliable and accurate solution, I might consider cloud based solutions as well.