Google-Maps-Scrapper
This Python script utilizes the Playwright library to perform web scraping and data extraction from Google Maps. It is particularly designed for obtaining information about businesses, including their name, address, website, phone number, reviews, and more.
Table of Contents
Prerequisite
- This code requires a python version below 3.10
- Any version of python beyond 3.9 may cause issues and may not work properly
Key Features
-
Data Scraping: The script scrapes data from Google Maps listings, extracting valuable information about businesses, such as their name, address, website, and contact details.
-
Review Analysis: It extracts review counts and average ratings, providing insights into businesses' online reputation.
-
Business Type Detection: The script identifies whether a business offers in-store shopping, in-store pickup, or delivery services.
-
Operating Hours: It extracts information about the business's operating hours.
-
Introduction Extraction: The script also scrapes introductory information about the businesses when available.
-
Data Cleansing: It cleanses and organizes the scraped data, removing redundant or unnecessary columns.
-
CSV Export: The cleaned data is exported to a CSV file for further analysis or integration with other tools.
Installation
-
Clone this repository:
git clone https://github.com/zohaibbashir/google-maps-scraping.git
-
Navigate to the project directory:
cd google-maps-scraping
-
Install the required Python packages:
pip install -r requirements.txt
How to Use:
To use this script, follow these steps:
-
Run the script with Python:
python main.py -s "search term" -t total
Write the name of the place/business in "search term" and a number in place of "total" to get the number of listings. If listings are less than the number provided it is because there are fewer listings than the number provided such as
python main.py -s "Turkish Restaurants in Toronto Canada" -t 20
-
The script will launch a browser, perform the search, and start scraping information. It will display the progress and save the results to a CSV file called result.csv.