-->
These old forums are deprecated now and set to read-only. We are waiting for you on our new forums!
More modern, Discourse-based and with GitHub/Google/Twitter authentication built-in.

All times are UTC - 5 hours [ DST ]



Forum locked This topic is locked, you cannot edit posts or make further replies.  [ 3 posts ] 
Author Message
 Post subject: Need a Forward Index
PostPosted: Thu Dec 15, 2011 7:55 pm 
Newbie

Joined: Thu Dec 15, 2011 7:52 pm
Posts: 2
I found some people asking if anyone had made one in Lucene. Didn't seem so. Basically, the Lucene indexes are inverted, meaning they go from terms to documents. I need to find the list of terms found in a document. The reason is I have a lot of documents that are attachments and I don't want to be parsing them again and again, and I need to get the terms out of them for classification purposes.

Thanks.


Top
 Profile  
 
 Post subject: Re: Need a Forward Index
PostPosted: Fri Dec 16, 2011 5:49 am 
Hibernate Team
Hibernate Team

Joined: Thu Apr 05, 2007 5:52 am
Posts: 1689
Location: Sweden
Hi,

what is your actual question?
Btw, if you are storing the term vector in your documents (enabled via an option), you can retrieve it later via IndexReader#getTermFreqVectors(int docNumber)

--Hardy


Top
 Profile  
 
 Post subject: Re: Need a Forward Index
PostPosted: Fri Dec 16, 2011 2:46 pm 
Newbie

Joined: Thu Dec 15, 2011 7:52 pm
Posts: 2
Yeah I want to be able to get the terms and frequencies found in a document.

I am kind of confused about what you didn't get in my original question. I have a bunch of attachments associated with an entity. I need to index the text in the attachments. Then I need to know what terms were found later. I don't want to have to go fetch all the attachments and parse them again. Kinda simple.

Looks like your solution will do that for me. Thanks.


Top
 Profile  
 
Display posts from previous:  Sort by  
Forum locked This topic is locked, you cannot edit posts or make further replies.  [ 3 posts ] 

All times are UTC - 5 hours [ DST ]


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
© Copyright 2014, Red Hat Inc. All rights reserved. JBoss and Hibernate are registered trademarks and servicemarks of Red Hat, Inc.