Hi,
I have trouble with my site, when I try to index it for use with a search engine Fluid Dynamics Search Engine v2.0.0.0072.
The problem is that the PHPSESSID variable is added to the url's, and it makes the search engine loop, or at least, makes it very very slow. It shows it has to index about 76000 pages, which is way too much.
I looked up some doc on the net, and I tried setting these settings in php files:
ini_set('session.use_trans_sid', 0);
ini_set('session.use_cookies', 1);
and these ones in .htaccess files:
php_flag session.use_trans_sid off
php_flag session.use_cookies on
but without any result. I also asked my host service to set the php settings right; they've done that.
I use XOOPS and xcgal on my site (
http://www.flyingsnowman.com).
I pasted a copy herebelow with the myguestbook directory. You can see that the robot always takes the same directory , filename, and parameters, because the PHPSESSID is always different.
Is there any way to strip this PHPSESSID off by default?
thanks,
Eric
Admin Page / Rebuilding realm 'www.flyingsnowman.com':
Indexing website '^http\:\/\/www\.flyingsnowman\.com\/' for realm 'www.flyingsnowman.com':
Running in automatic mode on batch 28. Status: 270 documents indexed; 2 failed; 76068 waiting to be indexed.
Please wait patiently while pages are indexed. If this process times out, click here to restart it.
Crawling remote sites and updating the database may take a long time. Please be patient...
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=a3e9fa01f24607a7ebb7368d31bf17c6'... took 0 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=b381c388722a9681068227843b37197b'... took 0 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=c950ac0dfe7c0e507975bc2262326999'... took 1 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=d05759710608182b9a0c562cff5f6e1f'... took 0 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=d121157861251baed9ab8e2bb9d6be57'... took 0 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=d12e9fd0e4f99fe678b16d8f7353e61e'... took 0 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=d396f45e6c64cd04f430773b56d4b59e'... took 0 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=e9781d618c364cb12a4c6fee360063f7'... took 0 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=e9a04c20d7eab797e97340f2a4209e65'... took 0 seconds.
-> Requesting 'http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=f8cac8c5be397401394fe4ca68e253c8'... took 1 seconds.
Finished crawling. Now parsing files and updating index...
1. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=a3e9fa01f24607a7ebb7368d31bf17c6 - 36KB - 23 Nov 2004 [ Updated existing record. ]
2. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=b381c388722a9681068227843b37197b - 36KB - 23 Nov 2004 [ Updated existing record. ]
3. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=c950ac0dfe7c0e507975bc2262326999 - 36KB - 23 Nov 2004 [ Updated existing record. ]
4. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=d05759710608182b9a0c562cff5f6e1f - 36KB - 23 Nov 2004 [ Updated existing record. ]
5. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=d121157861251baed9ab8e2bb9d6be57 - 36KB - 23 Nov 2004 [ Updated existing record. ]
6. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=d12e9fd0e4f99fe678b16d8f7353e61e - 36KB - 23 Nov 2004 [ Updated existing record. ]
7. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=d396f45e6c64cd04f430773b56d4b59e - 36KB - 23 Nov 2004 [ Updated existing record. ]
8. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=e9781d618c364cb12a4c6fee360063f7 - 36KB - 23 Nov 2004 [ Updated existing record. ]
9. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=e9a04c20d7eab797e97340f2a4209e65 - 36KB - 23 Nov 2004 [ Updated existing record. ]
10. *** Flying Snowman Productions - music compositions, recordings & productions - music software for the pc - website development *** [ Edit | Crawl | Delete ]
URL:
http://www.flyingsnowman.com/modules/myguestbook/index.php?start=5&PHPSESSID=f8cac8c5be397401394fe4ca68e253c8 - 36KB - 23 Nov 2004 [ Updated existing record. ]
Crawler Finished
The crawler has indexed 10 web pages and is finished with this batch. The number of web pages per batch is controlled by the Crawler: Max Pages Per Batch setting.
There are now 534 web pages in the 'www.flyingsnowman.com' realm - 0 records created; 10 updated; 0 removed.
Continue:
This page should automatically refresh in 15 seconds in order to continue. If it does not, click here.