-->
All times are UTC - 5 hours [ DST ]



Forum locked This topic is locked, you cannot edit posts or make further replies.  [ 3 posts ] 
 Post subject: StemFilter for prefixes
Posted: Thu Aug 20, 2009 7:20 am
Newbie

Hi,
I am using the GermanStemFilter. It stems very well, but it only removes suffixes.

An example: the word is "gelaufen". The GermanStemFilter stems it to "gelauf". I would like such words to be stemmed to "lauf", so that both the prefix "ge" and the suffix "en" are removed.

Is there any StemFilter that also strips prefixes from words?


 Post subject: Re: StemFilter for prefixes
Posted: Thu Aug 20, 2009 12:54 pm
Hibernate Team
Hi,

I don't have any experience with the German stemmer beyond using the Snowball stemmer that is included in Lucene. You can find more information about how it works here: http://snowball.tartarus.org/algorithms ... emmer.html . The list of words with their stemmed versions is especially nice.

That said, I am not sure you can just remove the 'ge' in general. What about 'gewinnen', for example? There the 'ge' is part of the verb itself, not a participle prefix.
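
To make the 'gewinnen' problem concrete, here is a minimal plain-Java sketch (no Lucene dependency). Everything in it is hypothetical and for illustration only: the class `GePrefixSketch`, both method names, the minimum-length rule and the tiny exception list are assumptions, not part of GermanStemFilter or Snowball. It also assumes the input has already been suffix-stemmed, and that 'gewinnen' comes out of the stemmer as "gewinn".

```java
import java.util.Set;

/** Illustrative only: strips a leading "ge" from an already suffix-stemmed German token. */
final class GePrefixSketch {

    // Hypothetical exception list: stems where "ge" belongs to the word itself.
    // A real list would have to be much longer and maintained by hand.
    private static final Set<String> KEEP_PREFIX = Set.of("gewinn", "gefall");

    /** Blindly removes a leading "ge" when more than two letters would remain. */
    static String naiveStripGe(String stem) {
        if (stem.startsWith("ge") && stem.length() > 4) {
            return stem.substring(2);
        }
        return stem;
    }

    /** Same as naiveStripGe, but leaves stems on the exception list untouched. */
    static String guardedStripGe(String stem) {
        return KEEP_PREFIX.contains(stem) ? stem : naiveStripGe(stem);
    }

    public static void main(String[] args) {
        System.out.println(naiveStripGe("gelauf"));   // lauf
        System.out.println(naiveStripGe("gewinn"));   // winn (wrong: "ge" is part of the verb)
        System.out.println(guardedStripGe("gewinn")); // gewinn
        System.out.println(guardedStripGe("gelb"));   // gelb (too short to strip)
    }
}
```

The naive version gives the "lauf" asked for above, but it also turns "gewinn" into "winn". The guarded version only avoids that for words someone has listed explicitly, which is why a purely rule-based prefix stripper is hard to get right.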

Try searching the Lucene/Solr forums for stemmer-related questions. You might have more luck there. If you find something, feel free to post it here.

--Hardy


 Post subject: Re: StemFilter for prefixes
Posted: Fri Aug 21, 2009 7:14 am
Newbie

Yes, I know this site. But you mentioned a very good counterexample against stemming prefixes. If I find something in another forum, I'll post it here.


© Copyright 2014, Red Hat Inc. All rights reserved. JBoss and Hibernate are registered trademarks and servicemarks of Red Hat, Inc.