Blocking Indexing

Qudoc allows you to hide your part or all of your website and supports the robots.txt files and the the noindex and nofollow directives.

Robots.txt

We support the following directives in a robots.txt file:

Directive Description Example
User-agent We obey all rules for the agents * and qubot. Our agent runs as qubot on IP 58.108.186.149. Rules for qubot will override rules for *. User-agent: qubot
Crawl-delay Qubot accepts a crawl delay in seconds, with millisecond resolution. By default it has a delay of 100ms. crawl-delay: 1.5
Disallow Paths matching the expression will not be crawled. The most basic form matches any path starting with the expression. Qubot also accepts the wild cards * and $.
* will match any sequence of zero (0) or more characters.
$ will force match the whole path.
Where a Disallow and an Allow directive both match a path, the longest will be used.
Disallow: /cgi-bin/

Disallow: *.jpg$
Allow Paths matching the expression will be crawled. This is used to override Disallow commands. Qubot also accepts the wild cards * and $.
* will match any sequence of zero (0) or more characters.
$ will force match the whole path.
Where a Disallow and an Allow directive both match a path, the longest will be used.
Allow: /documents/

Allow: *.doc$
Sitemap There can be any number of sitemap or siteindex entries. These must specify the full URL to the sitemap or siteindex entry. User-agent does not effect sitemaps. Sitemap: http://www.example.com/sitemap.xml

Noindex and Nofollow

Qubot supports the following HTML metadata directives:

Directive Description Example
noindex Qubot will not index pages with this metadata value. <meta name="qubot" content="noindex" />
nofollow Qubot will not follow links on a page with this metadata value. This can also be used for single links in anchor tags. <meta name="robots" content="nofollow" />

<a href="http://www.example.com/" rel="nofollow">Example link</a>