Discussion:
`./indexer -Eblob` on utf8 words indexes them as question marks (e.g. ?????)
Yasser Zamani
2014-04-29 07:47:05 UTC
Hi there,

I have crawled my site using utf8 as char-set. I can see the words
correctly saved into `bdicti` table with utf8 encoding.

But when I run `./indexer -Eblob`, they are not indexed correctly: the
`bdict` table contains only question marks (e.g. ?????) instead of the
words. So searching works only for ASCII words, not for any UTF-8
ones!

Please help, as I have no idea how to resolve this issue.

Thanks in advance!
Yasser Zamani
2014-04-30 13:16:16 UTC
Sorry, my mistake :(

CREATE DATABASE mnogosearch
DEFAULT CHARACTER SET utf8
DEFAULT COLLATE utf8_general_ci;

i.e. creating the database with utf8, RESOLVED the issue :)
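
The question marks are consistent with a lossy character set conversion: when text is converted to a character set that cannot represent a character, a `?` is commonly substituted. The sketch below only illustrates that mechanism using Python's codecs; it does not talk to MySQL, `store_as` is a hypothetical helper, and the assumption that the original database used a single-byte default such as latin1 is not confirmed anywhere in this thread.

```python
# Illustration only: Python's codecs stand in for a database charset conversion.
def store_as(text: str, charset: str) -> str:
    """Encode text into charset, replacing unrepresentable characters with '?'."""
    return text.encode(charset, errors="replace").decode(charset)


word = "سلام"  # a non-Latin sample word, chosen only for illustration

print(store_as(word, "latin-1"))     # every letter is replaced by '?'
print(store_as(word, "utf-8"))       # the word survives unchanged
print(store_as("hello", "latin-1"))  # ASCII words are unaffected
```

This matches the symptom reported above: ASCII words stay searchable while every other word collapses into question marks.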
_______________________________________________
General mailing list
http://lists.mnogosearch.org/listinfo/general
d***@hodei.net
2014-05-05 10:32:33 UTC
Hi,

I have accented characters in my web search.

To solve this problem, I modified the database with these queries:

ALTER TABLE bdict CONVERT TO CHARACTER SET utf8 COLLATE utf8_general_ci;
ALTER TABLE bdicti CONVERT TO CHARACTER SET utf8 COLLATE utf8_general_ci;

and I set the variables *$localcharset* and *$browsercharset* to
utf-8 in my indexer.conf.

But I still have the problem!

Do you have any idea?

Thanks
__________________________________________________________________
My config:

* Debian 3.2.51-1 x86_64 GNU/Linux
* mnogosearch 3.3.15
* indexer.conf :
......
DBAddr mysql://root:***@localhost/mnogosearch/?dbmode=blob
......


---
This email contains no viruses or malware because avast! Antivirus protection is active.
http://www.avast.com
Alexander Barkov
2014-05-29 08:14:49 UTC
Try adding the SetNames=utf8 part, like this:


DBAddr mysql://root:***@localhost/mnogosearch/?SetNames=utf8&dbmode=blob

......
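
To check whether a DBAddr line already carries the SetNames parameter suggested above, the query string can be inspected with a few lines of Python. This is only a sketch: `dbaddr_params` is a hypothetical helper, not part of mnoGoSearch, and it assumes the line has the `DBAddr <url>` shape quoted in this thread with parameter names spelled exactly as shown.

```python
from urllib.parse import parse_qs, urlsplit


def dbaddr_params(line: str) -> dict:
    """Return the query parameters of an indexer.conf-style 'DBAddr <url>' line."""
    keyword, url = line.split(None, 1)
    if keyword != "DBAddr":
        raise ValueError("not a DBAddr line")
    query = urlsplit(url.strip()).query
    # parse_qs returns a list per key; keep the first value of each.
    return {key: values[0] for key, values in parse_qs(query).items()}


line = "DBAddr mysql://root:***@localhost/mnogosearch/?SetNames=utf8&dbmode=blob"
params = dbaddr_params(line)
print(params.get("SetNames"))  # None would mean the parameter is missing
print(params.get("dbmode"))
```

Running the same check on the earlier `?dbmode=blob` line returns no SetNames entry, which is the difference the reply points at.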
d***@hodei.net
2014-06-26 14:49:21 UTC
Hi,

I have a problem when indexing my database:
___________________________________________________________________________________________
***@botujo:/home/jean# /usr/local/mnogosearch/sbin/indexer -Eblob
indexer[4787]: Indexing
indexer[4787]: Loading URL list
{sql.c:1513} Query: SELECT rec_id, site_id, pop_rank, last_mod_time FROM url

indexer[4787]: MySQL driver: #144: Table './mnogosearch/url' is marked
as crashed and last (automatic?) repair failed


Here is my database information in phpMyAdmin:

name          rows          size
------------------------------------------------------
bdict         864,575       1.1 GB
bdicti        in use
bdict_tmp                   2.0 KB
categories                  1.0 KB
crossdict                   1.0 KB
dict                        1.0 KB
links                       1.0 KB
qcache                      1.0 KB
qinfo                       2.0 KB
qtrack                      1.0 KB
server        889           156.7 KB
srvinfo                     1.0 KB
url           in use
urlinfo       11,009,854    27.4 GB


Do you have any idea?

Thanks

__________________________________________________________________
My config:

* Debian 3.2.51-1 x86_64 GNU/Linux
* mnogosearch 3.3.15
* indexer.conf :
......
DBAddr mysql://root:***@localhost/mnogosearch/?dbmode=blob


Alexander Barkov
2014-07-02 07:02:45 UTC
Hi,
Does manual "REPAIR TABLE url" help?