This project is designed to extract large datasets of book information from Goodreads based on user ratings. It includes a Python script, main.py, which scrapes book data from Goodreads pages and saves it to an Excel file.
Provide Input: Enter the exact link to the book list on the Goodreads website. Specify the minimum rating threshold for the books you want to extract. Input the total number of web pages in your book list.
Data Extraction: The script will start extracting book data based on the specified criteria. It will create an Excel file named books.xlsx in the project directory. The extracted book information will be saved in this file. Excel File:
You can find the extracted book data in the books.xlsx file in the project folder.