Currently we are working on a Win32 application written in C++ (we are evaluating the 6.6.1 sdk).
We have a document with Mongolia content(attachment is sample).
When using Adobe PDF Reader, the text copied unicode is: 1830 1822 182A 1830 1822 182D 180C
When using PDFViewSimpleTest, the text copied unicode is: 1830 1822 182A 1830 1822 182D
When using ElementReaderAdvTest, “for (CharIterator itr = element.GetCharIterator(); itr.HasNext(); itr.Next())” unicode is: 1830 1822 182A 1830 1822 182D
PDFNet SDK missing 1 characters “180C”.
How can we correct this?
This is important for us.Attachment is the different display for 180C.
I want to know how Adobe got MONGOLIAN FREE VARIATION SELECTOR TWO ? By font info? Or other mapping?
在 2017年1月5日星期四 UTC+8上午3:19:57,Ryan写道:
Adobe was the only PDF reader that returned U+180C. Every other reader I tried did not return this.
Note that U+180C is a MONGOLIAN FREE VARIATION SELECTOR TWO
Is getting this unicode selector important for you?