FUDforum
Fast Uncompromising Discussions. FUDforum will get your users talking.

Home » FUDforum » How To » Rebuild search index part by part
Show: Today's Messages :: Polls :: Message Navigator
Switch to threaded view of this topic Create a new topic Submit Reply
Rebuild search index part by part [message #30461] Mon, 27 February 2006 10:51 Go to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

Hello!

I had a big database crash, now forum is working, but ... i do not have search index (fud26_search, fud26_index)...
Now I should rebuild search indexes, but I cannot.
I cannot do it because about 20 seconds after start to rebuild all is stopped (Throbber in my Firefox stops to animate). Probably because of any anti DoS or something on server.
In theory, I can do it in my home computer, but the database created at home is incompatible with mySQL on server, I cannot upload some tables, some has broken national characters.

What i'm looking for?
The rebuild script takes about 400% of power of my computer, probably same on server, and probably it is because the process is killed by the server. Any chances for the rebuild index script which will work:
- much, much longer
- can make the work in parts (no one more than 4 minutes)
- not use so much CPU power at once

This way I (and every other with similar problemm) can rebuild search index part by part on temporary disabled forum.

Is/will/may be it possible???

Any other ideas what can I do to rebuild index?


Re: Rebuild search index part by part [message #30464 is a reply to message #30461] Mon, 27 February 2006 14:15 Go to previous messageGo to next message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
You can run the search rebuild via command line, by following instructions at the top of the indexdb.php script.

FUDforum Core Developer
Re: Rebuild search index part by part [message #30471 is a reply to message #30464] Mon, 27 February 2006 14:34 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

Ilia wrote on Mon, 27 February 2006 15:15

You can run the search rebuild via command line, by following instructions at the top of the indexdb.php script.


Yes i know that, but i don't have shell access on server where forum is installed. So only modification of script, other script, but sill started "via apache" is possibe here.

BTW ... I'm looking into indexdb.php (FUDforum 2.7.4) ... as i understand, after clicking yes (sending form) the first what should i see is:
line 45: echo '<br>Disabling the forum for the duration of maintenance run<br>';


and before this line there is nothing what can take much time. Am I wrong? Because... on the server where i have the problemm the page is starting to load, but it is nor refreshing. It is still same page with same form, and Firefox Throbber stops after few seconds.

On my home computer where reindexing is working, it looks like something is working a while (few minutes), and than in few seconds i'm getting new page with botg disabling forum, blah, blah, blah, reenabling forum. Is it normal? It is like it should work? Maby there is a little bug in reindexing script.


I'm looking inside the file, but in fact i don't understand how it is working. What can I comment to make it working faster, what script can be extracted from it to make reIndex forum piece after piece. What I mean to make e.g.

manualReindexStart.php - blocking forum, dropping tables
manualReindex1.php - creating index for first forum
manualReindex2.php - creating index for 2nd forum
...
manualReindexn.php - creating index for last forum
manualReindexEnd.php - Enabling the forum.

Maby it is possible to make it like this, so i can start via apache every piece one after one. The manualReindex1-n.php is one file with some parameter changes to parse next file.

I'm not sure if you understand my idea, but it is only solution for my problemm I can imagine for now.


Re: Rebuild search index part by part [message #30479 is a reply to message #30471] Mon, 27 February 2006 19:22 Go to previous messageGo to next message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
The only way you can make it work in your situation is by making it index few messages at a time and then reload the page. Eventually it will index all of the forum's messages.

FUDforum Core Developer
Re: Rebuild search index part by part [message #30484 is a reply to message #30479] Mon, 27 February 2006 21:27 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

Ilia wrote on Mon, 27 February 2006 20:22

The only way you can make it work in your situation is by making it index few messages at a time and then reload the page. Eventually it will index all of the forum's messages.

OK but ... how?
By running new/special script which will stop every few messages?

Or just by using Admin Panel in way:
1. Admin panel
2. Rebuild search index
3. Yes (rebuild)
4. Throbber stopped
5. Return to forum
6. Goto 1.

???
If it is the second one how can I be sure that it is working?


Re: Rebuild search index part by part [message #30501 is a reply to message #30484] Tue, 28 February 2006 14:29 Go to previous messageGo to next message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
The idea is that you use LIMIT clause to fetch only a few messages each time, lets say 100 for example sakes. When you processed those 100 messages you would redirect the browsers to the same page, this time starting from message 101 and processing the next 100 messages. This process will be then repeated until there are no more messages.

FUDforum Core Developer
Re: Rebuild search index part by part [message #30514 is a reply to message #30461] Tue, 28 February 2006 20:46 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

Do you mean to set

@set_time_limit(2400);

in adm/indexdb.php

To low value, e.g. 100 seconds and than start to rebuild index via admin panel?

Or maby:

you mean to change the line:

74: q('INSERT INTO '.$tbl.'search_cache (srch_query, query_type, expiry, msg_id, n_match) VALUES(\'\', -'.$r[0].', 0,0,0)');
75: index_text($subj, read_msg_body($r[3], $r[2], $r[4]), $r[0]);


to:


74: q('INSERT INTO '.$tbl.'search_cache (srch_query, query_type, expiry, msg_id, n_match) VALUES(\'\', -'.$r[0].', 0,0,0) LIMIT 100');


I'm not sure where it (LIMIT 100) should be placed ?

maby:

66: $c = q('SELECT id, subject, length, foff, file_id FROM '.$tbl.'msg WHERE '.($sid ? ' id>'.$sid.' AND ' : '').' apr=1 ORDER BY subject');


like:

66: $c = q('SELECT id, subject, length, foff, file_id FROM '.$tbl.'msg WHERE '.($sid ? ' id>'.$sid.' AND ' : '').' apr=1 ORDER BY subject' LIMIT 100);


please tell me if you are talking just about change like this in the file and than just rebuild index. I don't understand this script, and I'm affraid to make something wrong. I takes me 3 days (and 2 nights) to make a forum usable, don't want to kill it by wrong modification of indexdb.php, but also i want to get back search funcionality.


[Updated on: Wed, 01 March 2006 16:38]

Report message to a moderator

Re: Rebuild search index part by part [message #30524 is a reply to message #30514] Wed, 01 March 2006 18:08 Go to previous messageGo to next message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
Line 66 is where you want to make your changes.

FUDforum Core Developer
Re: Rebuild search index part by part [message #30531 is a reply to message #30461] Wed, 01 March 2006 18:17 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

Can you be a little more precisous? Or maby FUDforum script is for programists only?

Probably I understend now how you want to do this, but have no idea how to do it Sad


Re: Rebuild search index part by part [message #30548 is a reply to message #30531] Thu, 02 March 2006 14:14 Go to previous messageGo to next message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
Well, the query needs to be modified to start working from an offset and fetch X records (lets say 100 each time). At the end it would redirect to itself via header("Location: ..."); and via a GET parameter specify a new offset to start from.

FUDforum Core Developer
Re: Rebuild search index part by part [message #30613 is a reply to message #30461] Sun, 05 March 2006 03:33 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

OK, i thing now i understand... i'll try to write it.
But can you please tell me. The posts are ordered by subject, is it important? Because to make a piece of work, and back to it l8r it is better to order messages by id.
And if it is important how can I back to work?


Re: Rebuild search index part by part [message #30614 is a reply to message #30613] Sun, 05 March 2006 16:27 Go to previous messageGo to next message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
It does not matter what ORDER BY statement you use. Normally the forum orders messages by id for the purpose of re-indexing, but you can easily change the ordering to "by-subject", there is no harm in doing that.

FUDforum Core Developer
Re: Rebuild search index part by part [message #30630 is a reply to message #30614] Sun, 05 March 2006 22:12 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

Ilia wrote on Sun, 05 March 2006 17:27

It does not matter what ORDER BY statement you use. Normally the forum orders messages by id for the purpose of re-indexing, but you can easily change the ordering to "by-subject", there is no harm in doing that.


Wink Normally it is ordered by subject, and I will order it by Id Wink ... i mean the query $c=.... in adm/indexdb.php


Re: Rebuild search index part by part [message #30647 is a reply to message #30461] Mon, 06 March 2006 23:50 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

Ok ... maby i did ... it seems to be wotking on test base, but before i start it on my real forum, please can you please look on and tell me what do you think?

I'm attaching indexdb.php which is changed to have possibility to start indexdb-poor.php

and indexdb-poor.php (in next post) which is the indexing db script made in a way you talked about (if not i didn't understand you Sad )
  • Attachment: indexdb.php
    (Size: 3.41KB, Downloaded 623 times)


[Updated on: Mon, 06 March 2006 23:57]

Report message to a moderator

Re: Rebuild search index part by part [message #30648 is a reply to message #30461] Mon, 06 March 2006 23:50 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

indexdb-poor.php

Re: Rebuild search index part by part [message #30657 is a reply to message #30648] Tue, 07 March 2006 14:35 Go to previous messageGo to next message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
The code is fine, but you need to remove db_lock() call since you are not doing an unlock anymore.

FUDforum Core Developer
Re: Rebuild search index part by part [message #30659 is a reply to message #30461] Tue, 07 March 2006 16:30 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

And any ideas, why db_unlock() is not working? If i uncomment it I get:

(/home/wariat/www/forum/ZorannorumNB/include/theme/default/db.inc:49
/home/wariat/www/forum/indexdb-poor.php:62
) internal: internal
Query: DB_UNLOCK: no previous lock established
_GET: array ( 'S' => 'ca17...', 'SQ' => 'b0c...', 'Indexed' => '50', )
Server Version: 4.1.14-log
[Referring Page] http://www.wariat.org.pl/Zorannorum/adm/indexdb.php?S=ca17...&SQ=b0c....


it says that no lock established, but i did it in first Pass, so what is more correct?

- to lock it and than unlock on every pass of script?
- don't lock anywhere?

If it will work correct on my slow server, any chanses to this feature (or somenthing like this) to official release?


Re: Rebuild search index part by part [message #30674 is a reply to message #30659] Wed, 08 March 2006 13:59 Go to previous messageGo to next message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
it is not working because the lock it was supposed to be unlocking was not created. The error message is pretty clear in that regard.

FUDforum Core Developer
Re: Rebuild search index part by part [message #30795 is a reply to message #30461] Mon, 13 March 2006 15:07 Go to previous messageGo to next message
wariat is currently offline  wariat   Poland
Messages: 22
Registered: April 2005
Location: Poznań/Poland
Karma: 0
Junior Member

But you didn't tell me what is better in your opinion:

Quote:


it says that no lock established, but i did it in first Pass, so what is more correct?

- to lock it and than unlock on every pass of script?
- don't lock anywhere?



and:

Quote:


If it will work correct on my slow server, any chanses to this feature (or something like this) to official release?




Re: Rebuild search index part by part [message #30799 is a reply to message #30795] Tue, 14 March 2006 14:26 Go to previous message
Ilia is currently offline  Ilia   Canada
Messages: 13241
Registered: January 2002
Karma: 0
Senior Member
Administrator
Core Developer
The lock cannot stay active between passes, you need to create it and release it on every pass.

It should work on your server, if it works I'll onsider making something like this part of the main release.


FUDforum Core Developer
  Switch to threaded view of this topic Create a new topic Submit Reply
Previous Topic: TREE VIEW not staying as default....it is always switching back to flat.
Next Topic: different themes for different forums.
Goto Forum:
  

-=] Back to Top [=-
[ Syndicate this forum (XML) ] [ RSS ]

Current Time: Fri Nov 01 11:26:17 GMT 2024

Total time taken to generate the page: 0.02723 seconds