294061
Rincewind
Re: Hacked the REF Hack :o)
  • 2002/2/28 20:57

  • Rincewind

  • Just popping in

  • Posts: 16

  • Since: 2002/1/14


Wow there Half-dead. Sounds like your talking about cloaking. From Search Engine World:

Quote => Using some system to hide code or content from a user, and deliver custom content to a search engine spider. The word Cloak comes from Star Trek where the Klingons were capable of "cloaking" their ships invisible. There are three main types of cloaking: IP based, User Agent based, and the combination of those two. IP based cloaking custom delivers a page based on the users IP address (this can be used to deliver custom language based sites or target groups of users from particular ISP's such as AOL or @home users). User Agent cloaking sends a custom page based upon the users Agent (most often use to take advantage of a particular agents strengths or features). Finally, the combination of Agent and IP cloaking is use to target specific users <= end quote

There are two schools of thought when it comes to cloaking. The search engines say "don't do it. It's spamming and we don't like it." and the web masters say "The search sites use cloacking themselves (ever tried to get to altavista US and ended up in UK or where ever your from) so it must be OK. And also if you don't catch us, what you don't see don't hurt."

Your probably thinking, if the sites cloacked how could the SE's see you. Well the occasional human does actualy confirm a proportion of listings. They compare your real web page to the one in the spiders cache and if they don't match However if the only dif is the meta tags then a human will never notice, right, wrong. One of the first checks is a simple bit count. If the cached page has a different bit count form the real page then further investigation is taken.

Basicly what I'm saying is cloaking sections of your site can be very veryuseful but also fraught with danger. For more info check out these two articles Here and here.



294062
Anonymous
Re: Hacked the REF Hack :o)
  • 2002/2/28 18:02

  • Anonymous

  • Posts: 0

  • Since:


Was just thinkin of this.... might be smart to check against user agents and if its (netscape, opera, or internet explorer)... well we skip the keyword extraction to keep the cpu usage down

Google and the others that are targeted by this report other user agents as far as i know, right?



294063
JM2
Re: Hacked the REF Hack :o)
  • 2002/2/28 17:37

  • JM2

  • Just popping in

  • Posts: 2

  • Since: 2002/1/2 2


Quote:

header("Expires: Mon, 26 Jul 1997 05:00:00 GMT");


It's browser cache file Expires time.
browser never cache.
xoops site shows update page always.



294064
lykoszine
Re: Hacked the REF Hack :o)
  • 2002/2/28 17:18

  • lykoszine

  • Module Developer

  • Posts: 244

  • Since: 2002/1/2 2


Call me dumb, but why:

header("Expires: Mon, 26 Jul 1997 05:00:00 GMT");

line 43, header.php



294065
lykoszine
Re: Hacked the REF Hack :o)
  • 2002/2/28 16:45

  • lykoszine

  • Module Developer

  • Posts: 244

  • Since: 2002/1/2 2


Hey Man!

I keep downloading these and you release a new one before I get round to intalling it!!!



Way to go



294066
Anonymous
Re: Hacked the REF Hack :o)
  • 2002/2/28 14:04

  • Anonymous

  • Posts: 0

  • Since:


Seems like they takin a bit time with the news :o)

REF Hack v4.1

Already has a wordlist(currently bout 70words big)... and it also does links, downloads, faq, & forum, in addition to the original articles

---
updated link

<small>[ Edited by Half-Dead on 2002/3/1 23:59:00 ]</small>



294067
Rincewind
Re: Hacked the REF Hack :o)
  • 2002/2/28 12:11

  • Rincewind

  • Just popping in

  • Posts: 16

  • Since: 2002/1/14


This is a very usefull hack. However there is a few more feature I would find usefull.

You script cuts out short words like "the" "and" "to" but many words in english are longer than three letters but should also be cut out such as "from", "because", "also", and others. Would it be posible to create a data file of words to exclude. Or crossreferance the keywords with a dictionary file and only include nouns and verbs. Thus excluding all proverbs. I understand this is a bit more than just a hack and could be a leanthy mod. Plus it would have to be upadted with each fresh language supported. However it could be significanly benificial.

Sencondly: Would it be posible for someone out there to create a module that parsed the text on your pages a came back with a word count on each keyword used. This would be usefull to determine how well the search engines may rank you in thier listings. You would be able to alter the keyword density of your documents to optomise your ranking without the overkill of spamming keywords on pages, or the underkill of not enough keywords. I have found problems in underkill before. I once had a web design company site which ranked better under aromatherapy than under web design. This was because one of the clients was an alternative medicine group and so lots of my pages were discussing the clients site and not mine. Indeed at one point my web design site ranked higher for aroatherapy that the aromatherapy site itself. A mod like that discribed above would help predict how search engines will rank your pages before you submit them so you can correct things before it's to late. It's hard enough getting listed in search engines without worrying about listing incorectly.



294068
Anonymous
Re: Hacked the REF Hack :o)
  • 2002/2/28 11:06

  • Anonymous

  • Posts: 0

  • Since:


Damn! I gotta stop playing with this hack :o)

Just added the possibility to make tags from any section of the site: forum, downloads, links, faq..

It'll be on the main XOOPS news page soon, so stay tuned



294069
Anexia
Re: Hacked the REF Hack :o)
  • 2002/2/27 21:41

  • Anexia

  • Just popping in

  • Posts: 3

  • Since: 2002/2/22


Euh ! Linamix vu que tu parle français/anglais tu pourrais nous traduire ça sur le forum de XOOPS France avec le lien de la nouvelle version de Ref_hack !!!

Sorry ! My english is very bad.
I read english.... but, not speak or write...

<small>[ Edited by Anexia on 2002/2/28 7:36:54 ]</small>



294070
Anonymous
Re: Hacked the REF Hack :o)
  • 2002/2/27 21:31

  • Anonymous

  • Posts: 0

  • Since:


Just updated the hack again, and added the possibility to define unwanted words in the config file...very usefull to remove common crap

Repacked REF Hack

---
updated link

<small>[ Edited by Half-Dead on 2002/3/1 23:59:56 ]</small>







Login

Who's Online

158 user(s) are online (89 user(s) are browsing Support Forums)


Members: 0


Guests: 158


more...

Donat-O-Meter

Stats
Goal: $100.00
Due Date: May 31
Gross Amount: $0.00
Net Balance: $0.00
Left to go: $100.00
Make donations with PayPal!

Latest GitHub Commits