Libraries & Tools
Beautiful Soup Methods
HTML & Attributes
Challenges & Ethics
Code & Logic
100

Which library is used to send HTTP GET requests in Python?

The requests library.
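A minimal sketch of a GET request with requests. The helper name fetch_html, the timeout value, and the example URL are illustrative assumptions, not part of the quiz material.

```python
import requests


def fetch_html(url, timeout=10):
    """Send an HTTP GET request and return the response body as text."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx status codes
    return response.text


# Example call (performs a real network request, so it is left commented out):
# html = fetch_html("https://example.com")
```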

100

Which method returns the first matching element it finds?

find().

100

In an <a> tag, which attribute is used to store the link URL?

The href attribute.

100

What is the primary purpose of web scraping?

To automatically extract data from websites.

100

What is the shortcut used to access the very first paragraph tag in a document?

soup.p.
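As a small illustration, the shortcut can be tried on a made-up HTML string (the markup below is an assumption chosen for the example):

```python
from bs4 import BeautifulSoup

# Hypothetical sample document with two paragraphs.
html = "<html><body><p>First</p><p>Second</p></body></html>"
soup = BeautifulSoup(html, "html.parser")

# Attribute-style access returns the first <p> tag,
# the same element that soup.find("p") would return.
first_paragraph = soup.p
print(first_paragraph.text)
```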

200

Which library is used in Python to parse HTML content?

Beautiful Soup.

200

Which method is used to return all matching elements?

find_all().

200

Which attribute of an <img> tag contains the actual URL of the image?

The src attribute.
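A short sketch of reading tag attributes with Beautiful Soup, using an invented HTML snippet. It shows both dictionary-style access and the .get() method, which returns None when an attribute is missing.

```python
from bs4 import BeautifulSoup

# Hypothetical markup containing one link and one image.
html = (
    '<a href="https://example.com/about">About</a>'
    '<img src="/images/logo.png" alt="Logo">'
)
soup = BeautifulSoup(html, "html.parser")

link = soup.find("a")
image = soup.find("img")

link_url = link["href"]          # dictionary-style access; raises KeyError if absent
image_url = image.get("src")     # .get() returns None if the attribute is absent
missing_title = image.get("title")

print(link_url, image_url, missing_title)
```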

200

What should you always review on a website before you begin scraping it?

The terms of service (permissions).
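Alongside reading the terms of service, many sites also publish a robots.txt file. Checking it is an addition to the quiz answer, not a replacement for it. The sketch below parses an invented robots.txt with the standard library; the rules, the user agent name, and the URLs are all assumptions made for the example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; a real one lives at https://<site>/robots.txt.
robots_txt = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())  # parse in memory, no network access

# "my-scraper" is an illustrative user agent name.
can_scrape_public = parser.can_fetch("my-scraper", "https://example.com/public/page")
can_scrape_private = parser.can_fetch("my-scraper", "https://example.com/private/data")

print(can_scrape_public, can_scrape_private)
```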

200

Write the specific line of code required to import the Beautiful Soup library.

from bs4 import BeautifulSoup

300

Which Python library is used to write data into CSV files?

The csv module (part of the standard library).

300

What attribute or method is used to extract the text content from a specific tag?

The .text attribute (get_text() is the equivalent method).
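A brief sketch of extracting text from a tag, using an invented paragraph. In Beautiful Soup, .text is accessed without parentheses, and get_text() is the method form that gives the same result here.

```python
from bs4 import BeautifulSoup

# Hypothetical paragraph with a nested <b> tag.
html = "<p>Hello, <b>world</b>!</p>"
soup = BeautifulSoup(html, "html.parser")

paragraph = soup.find("p")
text_via_attribute = paragraph.text        # no parentheses
text_via_method = paragraph.get_text()     # method call, same result

print(text_via_attribute)
```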

300

What is it called when one HTML element is placed inside another element?

Nesting.
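To illustrate nesting, the sketch below walks an invented list in which each <a> sits inside an <li>, which in turn sits inside a <ul>. Searching from the outer element only looks at what is nested within it.

```python
from bs4 import BeautifulSoup

# Hypothetical nested markup: <ul> contains <li>, which contains <a>.
html = (
    "<ul>"
    "<li><a href='/one'>One</a></li>"
    "<li><a href='/two'>Two</a></li>"
    "</ul>"
)
soup = BeautifulSoup(html, "html.parser")

outer_list = soup.find("ul")
# find_all() on a tag searches only inside that tag.
nested_links = [a["href"] for a in outer_list.find_all("a")]
# .parent moves one level up, from the inner element to the one enclosing it.
parent_of_first_link = soup.find("a").parent.name

print(nested_links, parent_of_first_link)
```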

300

What is the name of the puzzle used to verify that a website user is a human and not a bot?

A CAPTCHA.

300

True or False: The find_all() method is used to return only the first matching element.

False (It returns all matching elements; find() returns the first).
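The difference between the two methods can be sketched on a made-up list. The example also shows what each returns when nothing matches: find() gives None, while find_all() gives an empty list.

```python
from bs4 import BeautifulSoup

# Hypothetical list with three items.
html = "<ul><li>Alpha</li><li>Beta</li><li>Gamma</li></ul>"
soup = BeautifulSoup(html, "html.parser")

first_item = soup.find("li")          # a single tag (the first match)
all_items = soup.find_all("li")       # a list-like collection of every match

missing_one = soup.find("table")      # None when there is no match
missing_all = soup.find_all("table")  # empty collection when there is no match

print(first_item.text, [item.text for item in all_items])
```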

400

Which specific method is used to write a single row to a CSV file?

The writerow() method.

400

What is the process of breaking down HTML into parts that are easier for the program to understand?

Parsing.
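A rough sketch of what parsing produces, assuming Beautiful Soup with Python's built-in "html.parser" backend and an invented document. The raw string becomes a tree of tags that can be listed and queried.

```python
from bs4 import BeautifulSoup

# Hypothetical raw HTML, as it might arrive from a web request.
html = (
    "<html><head><title>Demo</title></head>"
    "<body><h1>Heading</h1><p class='intro'>Intro text</p></body></html>"
)

# Parsing turns the flat string into a navigable tree of tag objects.
soup = BeautifulSoup(html, "html.parser")

page_title = soup.title.string
tag_names = [tag.name for tag in soup.find_all(True)]  # True matches every tag
intro_classes = soup.find("p")["class"]                # class values come back as a list

print(page_title, tag_names, intro_classes)
```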

400

True or False: It is always allowed to scrape any website without checking for permissions.

False.

400

Which method is used if you want to write multiple rows of data to a CSV file all at once?

The writerows() method.
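A compact sketch contrasting writerow() and writerows(). It writes to an in-memory buffer so it runs without touching the disk; the column names and rows are invented for the example.

```python
import csv
import io

# Hypothetical scraped data.
header = ["title", "url"]
rows = [["Home", "/"], ["About", "/about"]]

# With a real file, use: open("output.csv", "w", newline="")
buffer = io.StringIO()
writer = csv.writer(buffer)

writer.writerow(header)   # one row per call
writer.writerows(rows)    # many rows in a single call

csv_text = buffer.getvalue()
print(csv_text)
```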

500

Which challenge refers to websites that update their content automatically without needing a page refresh?

Dynamic content.
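Why dynamic content is a challenge can be sketched with an invented page whose value is filled in by JavaScript. Beautiful Soup only parses the HTML it is given and does not run scripts, so the container appears empty.

```python
from bs4 import BeautifulSoup

# Hypothetical page: the <div> is empty until the browser runs the script.
html = """
<div id="prices"></div>
<script>
  document.getElementById("prices").textContent = "42";
</script>
"""
soup = BeautifulSoup(html, "html.parser")

container = soup.find("div", id="prices")
scraped_text = container.get_text()  # the script was never executed

print(repr(scraped_text))
```

Scraping such pages generally requires a tool that actually executes the page's JavaScript before the HTML is read.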

500

True or False: HTML tags are used to identify and locate data during the scraping process.

True.