开发者

PDF: hyperlink/position [closed]

开发者 https://www.devze.com 2023-01-09 00:14 出处:网络
Closed. This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only 开发者_如何学Cby
Closed. This question needs to be more focused. It is not currently accepting answers.

Want to improve this question? Update the question so it focuses on one problem only 开发者_如何学Cby editing this post.

Closed 5 years ago.

Improve this question

Is there a "console way" to find out the position and target of hyperlinks within PDF-documents? The popular tools for converting pdf-to-* end up with text or very broken html documents. That's why I wonder if there is a way to know where (exact position) within the PDF we can find a link and where it is pointing to.

Looking forward to helpful reply. Thanks and kind regards


You didn't specify the language/platform, so here is a .NET solution:

Docotic.Pdf library (disclaimer: I work for Bit Miracle) can be used to retrieve all hyperlinks in a document. You may retrieve bounding box, text and other properties of a link. Please take a look at "Extract text from link target" sample.


I've just published a Python solution on Github. http://github.com/j-norwood-young/PDFLinkFinder

0

精彩评论

暂无评论...
验证码 换一张
取 消