b***@mnogosearch.org
2013-11-27 13:53:48 UTC
Author: Felix Heller
Email: ***@aimcom.de
Message:
Hello,
I've installed and configured MnoGoSearch as a powerful full text search engine for
CMS websites a few days ago. But right now I am a little bit confused about the
configuration of document sections.
I would like to index the headlines (<h1>, <h2>, <h3>) in special fields so that I
can weight them more in comparison to the body text.
There is one example given in indexer.conf:
Section h1 26 128 "<h1>(.*)</h1>" $1
This works fine because normally there is only one <h1> on a webpage. But when I try
to index all <h2> headlines using the regular expression "<h2>(.*)</h2>" $1, the
whole content between the first <h2> and the last <h2> gets indexed. What I would
like to get is only the text between the <h2>...</h2> tags.
Could somebody please tell me if there is a solution for that problem?
Thanks a lot for your help
Felix
Reply: <http://www.mnogosearch.org/board/message.php?id=21590>
Email: ***@aimcom.de
Message:
Hello,
I've installed and configured MnoGoSearch as a powerful full text search engine for
CMS websites a few days ago. But right now I am a little bit confused about the
configuration of document sections.
I would like to index the headlines (<h1>, <h2>, <h3>) in special fields so that I
can weight them more in comparison to the body text.
There is one example given in indexer.conf:
Section h1 26 128 "<h1>(.*)</h1>" $1
This works fine because normally there is only one <h1> on a webpage. But when I try
to index all <h2> headlines using the regular expression "<h2>(.*)</h2>" $1, the
whole content between the first <h2> and the last <h2> gets indexed. What I would
like to get is only the text between the <h2>...</h2> tags.
Could somebody please tell me if there is a solution for that problem?
Thanks a lot for your help
Felix
Reply: <http://www.mnogosearch.org/board/message.php?id=21590>