Web Scraping (also termed Screen Scraping, Web Data Extraction, Web Harvesting, etc.) is a technique for extracting large amounts of data from websites and save the the extracted data to a local file or to a database. In this course, you will learn how to perform web scraping using Python 3 and the Beautiful Soup, a free open-source library written in Python for parsing HTML. We will use lxml, which is an extensive library for parsing XML and HTML documents very quickly; it can even handle messed up tags. We will also be using the Requests module instead of the already built-in urllib2 module due to improvements in speed and readability. Finally, we will use Selenium alongside Beautiful Soup to crawl AJAX & JavaScript driven pages. The course cover the following topics: accessing web pages programmatically; scraping web pages to extract the required data using Beautiful Soup to parse web pages; interacting with web pages to do different things with them programmatically; and using Selenium for web scraping and when we need it.
Learn moreHas discount |
|
||
---|---|---|---|
Expiry period | Lifetime | ||
Made in | English | ||
Last updated at | Tue Dec 2024 | ||
Level |
|
||
Total lectures | 7 | ||
Total quizzes | 0 | ||
Total duration | 01:53:00.000000 Hours | ||
Total enrolment | 0 | ||
Number of reviews | 0 | ||
Avg rating |
|
||
Short description | Web Scraping (also termed Screen Scraping, Web Data Extraction, Web Harvesting, etc.) is a technique for extracting large amounts of data from websites and save the the extracted data to a local file or to a database. In this course, you will learn how to perform web scraping using Python 3 and the Beautiful Soup, a free open-source library written in Python for parsing HTML. We will use lxml, which is an extensive library for parsing XML and HTML documents very quickly; it can even handle messed up tags. We will also be using the Requests module instead of the already built-in urllib2 module due to improvements in speed and readability. Finally, we will use Selenium alongside Beautiful Soup to crawl AJAX & JavaScript driven pages. The course cover the following topics: accessing web pages programmatically; scraping web pages to extract the required data using Beautiful Soup to parse web pages; interacting with web pages to do different things with them programmatically; and using Selenium for web scraping and when we need it. | ||
Outcomes |
|
||
Requirements |
|