-->
These old forums are deprecated now and set to read-only. We are waiting for you on our new forums!
More modern, Discourse-based and with GitHub/Google/Twitter authentication built-in.

All times are UTC - 5 hours [ DST ]



Forum locked This topic is locked, you cannot edit posts or make further replies.  [ 2 posts ] 
Author Message
 Post subject: Lucene Index Files Size?
PostPosted: Mon Jan 11, 2010 7:11 pm 
Newbie

Joined: Tue Dec 15, 2009 3:49 pm
Posts: 6
I am currently preparing to implement Hibernate Search in my application. I would like to determine how much space is required on a machine to store Lucene index files. Is there a way that I can roughly estimate this based on number of rows?

I am working with ~100,000 rows and ~60 columns (text, number and date fields).


Top
 Profile  
 
 Post subject: Re: Lucene Index Files Size?
PostPosted: Tue Jan 12, 2010 10:07 am 
Hibernate Team
Hibernate Team

Joined: Thu Apr 05, 2007 5:52 am
Posts: 1689
Location: Sweden
I am not sure if a simple formula exist. The index size will depend on many factors, for example whether or not you keep the indexed data (store=Store.YES) and how the data is indexed, meaning which analyzer you are using. Imagine for example an synonym analyzer which will add additional tokens into the token stream.

Given that the Lucene index format is available on the Lucene website, I guess with some effort one could derive some estimates given that you make some assumptions about your data, but it is for sure not easy.

Probably best to just build a test index.

--Hardy


Top
 Profile  
 
Display posts from previous:  Sort by  
Forum locked This topic is locked, you cannot edit posts or make further replies.  [ 2 posts ] 

All times are UTC - 5 hours [ DST ]


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
© Copyright 2014, Red Hat Inc. All rights reserved. JBoss and Hibernate are registered trademarks and servicemarks of Red Hat, Inc.