1
tedsmith
What file formats can standard search tools search through?
  • 2005/4/9 17:55

  • tedsmith

  • Home away from home

  • Posts: 1151

  • Since: 2004/6/2 1


I quick query.

A colleague and I had a bit of a 'discussion' (argument) the other day about search facilities and their ability to index web content.

My argument was that to index and search through a traditionally none-web-based document such, as a Word or Excel document, you needed a specialist search tool, such as e-swish or some of the other alternatives to that. I stated that for that reason, Google only return values from either html or pdf formats because it isn't designed to search through standard files.

His argument was that any search facility can index and search through word or Excel documents and the only reason that Google and the like does not return values from those file types is because webmasters do not traditionally publish such files for access over the Internet. In other words, it's simply coincidence that that I've never seen a search hit from a file type other than html or pdf.

Who's correct?

The reason I ask is that we have a large repository of Word documents that ideally we'd like the XOOPS search bar to be able to index and return search hits from. I said it can't do it and that XOOPS can only return values from data stored in the database and you can't make XOOPS go out and index an external Word file. He said it should be able to do it.

Who's correct?

Thanks

2
Dave_L
Re: What file formats can standard search tools search through?
  • 2005/4/9 19:06

  • Dave_L

  • XOOPS is my life!

  • Posts: 2277

  • Since: 2003/11/7


When a URL is requested, by a human or a search bot, the response isn't really a file such as .html or .pdf, but a document with a MIME content type and body. As long as the server (or the script running on the server) returns a content type and body that the search engine is happy with, I don't think the file extension matters.

3
tedsmith
Re: What file formats can standard search tools search through?
  • 2005/4/11 12:29

  • tedsmith

  • Home away from home

  • Posts: 1151

  • Since: 2004/6/2 1


Thanks Dave_L.

So could the XOOPS search be used to search through a Word document etc stored on our network using a direct UNC path?

4
ackbarr
Re: What file formats can standard search tools search through?

No. The XOOPS search by itself only works by searching via SQL queries. However, I think that the DMS module uses the Swish-E file indexer, which can index the contents of Word, Excel, PDF, and others.

Login

Who's Online

431 user(s) are online (314 user(s) are browsing Support Forums)


Members: 0


Guests: 431


more...

Donat-O-Meter

Stats
Goal: $100.00
Due Date: Nov 30
Gross Amount: $0.00
Net Balance: $0.00
Left to go: $100.00
Make donations with PayPal!

Latest GitHub Commits