Product: PDFNet Windows
I am trying to identify tables and lists elements in the PDF in order to identify and tag them appropriately to make a logical structural tree for the PDF document. I saw that TextExtractor could be used to extract the text, but not sure how to use it to extract tables. Are there any ways of doing it? Can you provide an example of how to do it?
(P.s) I am trying to make an HTML a PDF/UA compliant PDF using PdfTron. So far I could identify texts and images, tag them accordingly, and add them to the logical tree using ElementReader. I am stuck in the tables and lists content. I am coding using C#.