Hello, I am an amateur programmer and recently I built a .NET project. In my project, the pdf file will be downloaded from the Internet. Is there a way to convert pdf files to text strings or characters in android?
This article on XsPDF.com is what you looking for, check out: It's possible to convert pdf to text
Answers
Of course, this is possible and converting PDF to text string is not so difficult. It seems that your requirement is to extract text from PDF document. In this way, you can define what text content you want to get.
If the downloaded PDF contains only images, then you can try OCR toolkit. In general, if your PDF is a scanned document, it is usually possible to use OCR toolkit directly. Scan drivers often insert text as invisible text so that users can directly search contents of PDF document.
You can find an open source directly on the Internet. However, the output of many open source solutions is usually not ideal.
Why not try XsPDF SDK? I have used it in my project and can successfully extract text from pdf and output as string.