Just type (or Cut&Paste) the URL for the page you want to validate into the text field on the form and press the "Check now" button.
You can link directly to the Validator home page, or you can call the Validator
CGI program. The home page is http://archiveready.com/ at the moment
(and for the foreseeable future) and the CGI program can be reached at
http://archiveready.com/check?url=http://yousite.com.
ArchiveReady is checking several website attributes such as:
Web archiving is the process of collecting portions of the World Wide Web and ensuring the collection is preserved in an archive, such as an archive site, for future researchers, historians, and the public. http://en.wikipedia.org/wiki/Web_archiving
To enable the archive of a site by the Portuguese Web Archive, it is fundamental that the site presents a crawler-friendly homepage. Portuguese Web Archive Crawler
Heritrix (ExtractorJS) has trouble finding the links that are not hardcoded strings in javascript. Heritrix Known Issues
If fancy features such as JavaScript, cookies, session IDs, frames, DHTML, or Flash keep you from seeing all of your site in a text browser, then search engine spiders may have trouble crawling your site. How Googlebot sees your webpages
The content of your robots.txt file tells search engine crawlers how they should visit your site. Google Webmaster Guidelines
ArchiveReady is built using Python and various different libraries such as requests, Beautiful Soup. Nginx and uwsgi are also used. Everything is written in Vim.
Papers, projects, initiatives, relevant to web preservation.