Posted by admin on

Web Page Automation

I needed to return to this topic, so here are my notes, starting with some products. The related pages widget has also been tuned.

  1. http://docs.seleniumhq.org/projects/ide/
  2. https://www.seleniumhq.org/projects/webdriver/
  3. http://wwwsearch.sourceforge.net/mechanize/
  4. http://maxq.tigris.org/
  5. http://twill.idyll.org/

I found these articles in the summer of 2020:

  1. https://thenextweb.com/syndication/2020/07/22/how-to-use-python-and-selenium-to-scrape-websites/
    Web scraping has been used to extract data from websites almost from the time the World Wide Web was born. In the early days, scraping was mainly done on static pages – those with known elements, tags, and data.
  2. https://towardsdatascience.com/web-scraping-a-less-brief-overview-of-scrapy-and-selenium-part-ii-3ad290ce7ba1
    The first rule of web crawling is you do not harm the website. The second rule of web crawling is you do NOT harm the website.
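
The "do not harm the website" rule is usually read as: honour robots.txt and pace your requests. This is a minimal sketch of that idea, not taken from either article. It uses only Python's standard `urllib.robotparser`. The robots.txt content, the `notes-bot` user agent, the example.com URLs and the helper names are all made up for illustration, and the rules are inlined so that it runs without touching a real site.

```python
from urllib import robotparser

# Hypothetical robots.txt content, inlined so the sketch runs offline.
# A real crawler would use set_url() and read() to load it from the target site.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Crawl-delay: 2
"""


def build_parser(robots_txt):
    """Build a RobotFileParser from robots.txt text."""
    parser = robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def allowed_urls(parser, user_agent, urls):
    """Keep only the URLs that robots.txt lets this user agent fetch."""
    return [url for url in urls if parser.can_fetch(user_agent, url)]


def polite_delay(parser, user_agent, default=1.0):
    """Seconds to wait between requests: the site's Crawl-delay, else a default."""
    delay = parser.crawl_delay(user_agent)
    return float(delay) if delay is not None else default


if __name__ == "__main__":
    rules = build_parser(SAMPLE_ROBOTS_TXT)
    candidates = [
        "https://example.com/public/page",
        "https://example.com/private/report",
    ]
    print(allowed_urls(rules, "notes-bot", candidates))
    print(polite_delay(rules, "notes-bot"))
```

A scraper built on Selenium or Scrapy could run its URL queue through `allowed_urls` and sleep for `polite_delay` seconds between page loads. Scrapy also has its own settings for this, so treat the helpers here as an illustration of the principle only.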

Also on this wiki:

Browser Scripting


Comment ( 1 )

  1. Dave
    I have added some content about Scrapy and more recent insights into web scraping.
