You are here: silicon.com > Networks > WebWatch

WebWatch

Dell gets Googled

Googlebot unearths confidential info for all to see...

Tags: googlebot, dell, google

By Elinor Mills

Published: 2 February 2006 08:40 GMT

Dell apparently learnt the hard way this week that companies have to be careful to ensure information they store on the internet and want to keep hidden is not automatically added to a search engine index for everyone on the web to see.

Specifications for future Dell laptops were accessible via Google's search site before the content was pulled from a Dell file transfer protocol site and from Google's cache.

Google, like the other major search engines, has an automated search engine that sends software robots called "spiders" out to crawl the web and find sites to add to the index of websites it maintains. Because the spiders follow links running from one website to others, they pick up sites on their own without webmasters having to manually submit them to search engines.

Webmasters can also provide the URL, or numerical web address, for pages they want crawled, and they can submit detailed site maps to Google, according to Google's "information for webmasters" pages.

Webmasters who want to keep some or all of their site private from the Googlebot can put a standard document called "robot.txt" at the root of the server that instructs the crawler not to download content. If the removal request is urgent, the webmaster can submit a request via Google's automatic URL removal system but must provide an email address and password first.

Content that has been removed can still be viewed through Google's cache, which is a "snapshot" and archive of each page crawled. Webmasters can prevent pages from being cached by inserting specific code on them.

Elinor Mills writes for CNET News.com

  1. Zones
  2. Management
  3. Networks
  4. Software
  5. IT Services
  6. Hardware
  1. Verticals
  2. Public Sector
  3. Financial Services
  4. Retail & Leisure
Read and write about internet access at the airports of the world at atlarge.com. Be the first to rate an airport, win champagne...

Steve Ranger Editor's Blog: The naked truth about DSL Is it time to rethink broadband pricing?

Natasha Lomas ¿Dónde está el iPhone 3G? Comment: It's clear who calls the shots in this relationship...


  • Jobs
SEO optimisation specialist

A leading creative agency requires a search engine optimisation specialist to manage a number of there end client sites. Candidates should have ...

Graduate and Intern Opportunities with Google

Our work at Google also requires ideas from many non-technical fields, and we currently have New Graduate and Intern positions available in ...

Senior Software Engineer (JAVA/J2EE)

Ability to work with large, multiple data sets -Proficient in Object Oriented design and development -Ability to formally communicate architectural ...

CIO50 2008
The silicon.com CIO50 2008 profiles the most influential and innovative tech chiefs in the UK across all industries and organisation size, from the biggest FTSE100 companies to high growth dot-com start ups and the public sector. The list was voted on by the UK CIO community and a panel of experts. Find out more in our latest special report.





Quick Sitemap Links: