开发者

Reducing the memory size of Index for Lucene

开发者 https://www.devze.com 2023-03-08 19:37 出处:网络
I use Lucene for searching the HTML documents. The issue I have is on increased size of index files, I have a开发者_Go百科bt 300-400MB size of HTML files but the index is running upto .98Gb. The reaso

I use Lucene for searching the HTML documents. The issue I have is on increased size of index files, I have a开发者_Go百科bt 300-400MB size of HTML files but the index is running upto .98Gb. The reason I see because of specification we have. Like we index the same contents for four different fields, which I guess is the problem ( we use same contents, one case sensitive and other otherwise, one casesensitive with special characters and other otherwise).

Is there a way to reduce the size of index? Keeping the same requirements? Is there a different way we index the same and search differently to support all?


I assume your problem is that you are storing these fields instead of just indexing them. So the solution is: don't store them.

0

精彩评论

暂无评论...
验证码 换一张
取 消