Hibernate Books

All times are UTC - 5 hours [ DST ]



 Post subject: indexing
Posted: Mon Feb 27, 2012 8:15 am
Regular

Joined: Thu Jun 16, 2011 12:03 pm
Posts: 94
Hi all,

I have to index text from a PDF, but I don't want to save this content in the database, only in the index. Is it possible to save data only in the index and not in the database?

My concern is whether, if I reindex everything, I would lose all the information previously saved in the index...

thanks in advance,

Hibernator,


 
 Post subject: Re: indexing
Posted: Mon Feb 27, 2012 8:40 am
Regular

Joined: Thu Jun 16, 2011 12:03 pm
Posts: 94
Hi all,

I have found section 13.2.1, "Utilizing PDFBox to index PDF documents", in the book Hibernate Search in Action.

My new question: I want the PDF text to be related to another entity in the index, like this:

idbook -> text book (tokenized)

Should I generate the index with two fields:

-idbook
-tokenized text content

Is this right?

thanks in advance,


 
 Post subject: Re: indexing
Posted: Mon Feb 27, 2012 10:02 am
Regular

Joined: Thu Jun 16, 2011 12:03 pm
Posts: 94
I have found this:

https://community.jboss.org/wiki/Hibern ... Extraction

I think I'll try this one...

Thanks


 
 Post subject: Re: indexing
Posted: Tue Feb 28, 2012 5:10 am
Regular

Joined: Thu Jun 16, 2011 12:03 pm
Posts: 94
I have been searching for examples, but none of them solves my problem.

In the book Hibernate Search in Action, the examples save the content in the database and, as I said, I don't want this.

I am seriously thinking about creating my own index with two fields: idBook and texttokenized.

thanks in advance,


 
 Post subject: Re: indexing
Posted: Tue Feb 28, 2012 1:08 pm
Hibernate Team

Joined: Fri Oct 05, 2007 4:47 pm
Posts: 2296
Location: Third rock from the Sun
Hi,
the article at https://community.jboss.org/wiki/Hibern ... Extraction focuses on the lazy field, but even if you skip the complexity of lazy loading the PDF it's still a good example: you can store the file path in the database, and in the FieldBridge on the file you load the external file from the filesystem and extract the text you need. Just return the text.
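
For illustration only, here is a minimal, library-free sketch of that idea in plain Java. It is not taken from the linked article, and every name in it (`TextExtractor`, `ExternalFileTextBridge`, `objectToString`, `readPlainText`) is hypothetical. It only shows the shape of the logic: the entity holds a file path, and a bridge-like class loads the file and returns its text for indexing. Wiring this into a real Hibernate Search bridge, and swapping the plain-text reader for a PDF extraction call, is left out so the sketch compiles without extra libraries.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical extraction hook; a PDF library call would sit behind it. */
interface TextExtractor {
    String extract(Path file) throws IOException;
}

/**
 * Hypothetical bridge-like class (not a Hibernate Search type):
 * the entity stores only a file path, and this class turns that
 * path into the text that should go into the index.
 */
class ExternalFileTextBridge {
    private final TextExtractor extractor;

    ExternalFileTextBridge(TextExtractor extractor) {
        this.extractor = extractor;
    }

    /** Returns the text to index for the stored path, or null if no path is set. */
    String objectToString(Object value) {
        if (value == null) {
            return null;
        }
        Path file = Paths.get(value.toString());
        try {
            return extractor.extract(file);
        } catch (IOException e) {
            throw new UncheckedIOException("Cannot read " + file, e);
        }
    }

    /** Stand-in extractor that reads a UTF-8 text file; PDF extraction would replace it. */
    static String readPlainText(Path file) throws IOException {
        return new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
    }
}
```

A caller would build it as `new ExternalFileTextBridge(ExternalFileTextBridge::readPlainText)` and pass the stored path to `objectToString`. Since only the path is persisted in this setup, a full reindex would presumably re-read the files, so the extracted text itself never has to live in the database.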

_________________
Sanne
http://in.relation.to/


 
© Copyright 2014, Red Hat Inc. All rights reserved. JBoss and Hibernate are registered trademarks and servicemarks of Red Hat, Inc.