开发者

Indexing PDF file by SOLR

开发者 https://www.devze.com 2023-04-07 20:02 出处:网络
I\'m using Solrj to index PDF files whith SOLR, but some files can\'t开发者_StackOverflow社区 index and make an exception

I'm using Solrj to index PDF files whith SOLR, but some files can't开发者_StackOverflow社区 index and make an exception

GRAVE: Error: Could not parse predefined CMAP file for 'Adobe-Identity-UCS'
java.lang.NoSuchMethodError: org.apache.fontbox.cmap.CMap.lookup(II)Ljava/lang/String;

can you tell me what's the problem? Thanks


Seems some mismatch with the apache fontbox jars, which mentions the method not found.
Can you confirm the jars for tika and all its dependencies are in sync and are the ones with the build.
you can also check standalone if the parsing of documents work fine using the Apache Tika project jars.

0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号