Scrapfly scrapers
Scalable Python web scraping scripts for +40 popular domains
This repository contains educational example scrapers for popular web scraping targets using the [ScrapFly](https://scrapfly.io) web scraping API and Python. Most Scrapers use a simple web scraping stack: - Python version 3.10+ - [Scrapfly's Python SDK](https://github.com/scrapfly/python-scrapfly) for sending HTTP request, bypass blocking and parsing the HTML using the built-in [parsel](https://pypi.org/project/parsel/) selector. - [asyncio](https://pypi.org/project/asyncio/) for writing concurr... The project is written primarily in Python, distributed under the Other license, first published in 2023. It has gained significant community traction with 1,004 stars and 193 forks on GitHub. Key topics include: antibot, automation, captcha-bypass, crawler, crawling.
ScrapFly Scrapers 🕷️
This repository contains educational example scrapers for popular web scraping targets using the ScrapFly web scraping API and Python.
Most Scrapers use a simple web scraping stack:
- Python version 3.10+
- Scrapfly's Python SDK for sending HTTP request, bypass blocking and parsing the HTML using the built-in parsel selector.
- asyncio for writing concurrent code using the async/await syntax.
- JMESPath and nested-lookup for JSON parsing when needed.
- loguru for logging.
To learn more about web scraping see our full tutorials on how to scrape these targets (and many others) see the scrapeguide directory.
List of Scrapers
Below is the list of available web scrapers for the supported domains along with their scrape guide, sample datasets, and status. 👇
<table> <tr> <td><strong>Domain</strong></td> <td><strong>Guide</strong></td> <td><strong>Sample Datasets</strong></td> <td><strong>Status</strong></td> </tr> <tr> <td><a href="/aliexpress-scraper/">Aliexpress.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-aliexpress/">How to Scrape Aliexpress.com (2025 Update)</a></td> <td> <ul> <li><a href="./aliexpress-scraper/results/product.json">Product pages</a></li> <li><a href="./aliexpress-scraper/results/search.json">Search pages</a></li> <li><a href="./aliexpress-scraper/results/reviews.json">Product reviews</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Aliexpress_scraper-success-brightgreen" alt="aliexpress-scraper-status"></td> </tr> <tr> <td><a href="/amazon-scraper/">Amazon.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-amazon/">How to Scrape Amazon.com Product Data and Reviews</a></td> <td> <ul> <li><a href="./amazon-scraper/results/product.json">Product pages</a></li> <li><a href="./amazon-scraper/results/search.json">Search pages</a></li> <li><a href="./amazon-scraper/results/reviews.json">Product reviews</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Amazon_scraper-success-brightgreen" alt="amazon-scraper-status"></td> </tr> <tr> <td><a href="/bestbuy-scraper/">BestBuy.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-bestbuy-product-offer-and-review-data/">How to Scrape BestBuy Product, Offer and Review Data</a></td> <td> <ul> <li><a href="./bestbuy-scraper/results/promos.json">Sitemap pages</a></li> <li><a href="./bestbuy-scraper/results/products.json">Product pages</a></li> <li><a href="./bestbuy-scraper/results/reviews.json">Review pages</a></li> <li><a href="./bestbuy-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/BestBuy_scraper-success-brightgreen" alt="bestbuy-scraper-status"></td> </tr> <tr> <td><a href="/bing-scraper/">Bing.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-bing-search-using-python/">How to Scrape Bing Search with Python</a></td> <td> <ul> <li><a href="./bing-scraper/results/serps.json">SERP data</a></li> <li><a href="./bing-scraper/results/keywords.json">Keyword data</a></li> <li><a href="./bing-scraper/results/rich_snippets.json">Rich snippet data</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Bing_scraper-success-brightgreen" alt="bing-scraper-status"></td> </tr> <tr> <td><a href="/booking-scraper/">Booking.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-bookingcom/">How to Scrape Booking.com (2025 Update)</a></td> <td> <ul> <li><a href="./bookingcom-scraper/results/hotel.json">Hotel pages</a></li> <li><a href="./bookingcom-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Booking_scraper-success-brightgreen" alt="booking-scraper-status"></td> </tr> <tr> <td><a href="/crunchbase-scraper/">Crunchbase.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-crunchbase/">How to Scrape Crunchbase in 2025</a></td> <td> <ul> <li><a href="./crunchbase-scraper/results/company.json">Company pages</a></li> <li><a href="./crunchbase-scraper/results/person.json">Investor pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Crunchbase_scraper-success-brightgreen" alt="crunchbase-scraper-status"></td> </tr> <tr> <td><a href="/domaincom-scraper/">Domain.com.au</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-domain-com-au-real-estate-property-data/">How to Scrape Domain.com.au Real Estate Property Data</a></td> <td> <ul> <li><a href="./domaincom-scraper/results/properties.json">Property pages</a></li> <li><a href="./domaincom-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Domain.com.au_scraper-success-brightgreen" alt="domaincom-scraper-status"></td> </tr> <tr> <td><a href="/ebay-scraper/">Ebay.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-ebay/">How to Scrape Ebay Using Python (2025 Update)</a></td> <td> <ul> <li><a href="./ebay-scraper/results/product.json">Product pages</a></li> <li><a href="./ebay-scraper/results/product-with-variants.json">Product pages with variant</a></li> <li><a href="./ebay-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Ebay_scraper-success-brightgreen" alt="ebay-scraper-status"></td> </tr> <tr> <td><a href="/etsy-scraper/">Etsy.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-etsy-com-product-review-data/">How to Scrape Etsy.com Product, Shop and Search Data</a></td> <td> <ul> <li><a href="./etsy-scraper/results/products.json">Product pages</a></li> <li><a href="./etsy-scraper/results/shops.json">Shop pages</a></li> <li><a href="./etsy-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Etsy_scraper-success-brightgreen" alt="etsy-scraper-status"></td> </tr> <tr> <td><a href="/fashionphile-scraper/">Fashionphile.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-fashionphile/">How to Scrape Fashionphile for Second Hand Fashion Data</a></td> <td> <ul> <li><a href="./fashionphile-scraper/results/products.json">Product pages</a></li> <li><a href="./fashionphile-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Fashionphile_scraper-success-brightgreen" alt="fashionphile-scraper-status"></td> </tr> <tr> <td><a href="/glassdoor-scraper/">Glassdoor.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-glassdoor/">How to Scrape Glassdoor (2025 update)</a></td> <td> <ul> <li><a href="./glassdoor-scraper/results/jobs.json">Job pages</a></li> <li><a href="./glassdoor-scraper/results/reviews.json">Review pages</a></li> <li><a href="./glassdoor-scraper/results/salaries.json">Salary pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Glassdoor_scraper-success-brightgreen" alt="glassdoor-scraper-status"></td> </tr> <tr> <td><a href="/goat-scraper/">Goat.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-goat-com-fashion-apparel/">How to Scrape Goat.com for Fashion Apparel Data in Python</a></td> <td> <ul> <li><a href="./goat-scraper/results/products.json">Product pages</a></li> <li><a href="./goat-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Goat_scraper-success-brightgreen" alt="goat-scraper-status"></td> </tr> <tr> <td><a href="/google-scraper/">Google.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-google/">How to Scrape Google Search Results</a> - <a href="https://scrapfly.io/blog/how-to-scrape-google-maps/">How to Scrape Google Maps</a></td> <td> <ul> <li><a href="./google-scraper/results/serp.json">SERP data</a></li> <li><a href="./google-scraper/results/keywords.json">Keyword data</a></li> <li><a href="./google-scraper/results/google_map_places_urls.json">Google Maps place URLs</a></li> <li><a href="./google-scraper/results/google_map_places.json">Google Maps place data</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Google_scraper-success-brightgreen" alt="goat-scraper-status"></td> </tr> <tr> <td><a href="/homegate-scraper/">Homegate.ch</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-homegate-ch-real-estate-property-data/">How to Scrape Homegate.ch Real Estate Property Data</a></td> <td> <ul> <li><a href="./homegate-scraper/results/properties.json">Property pages</a></li> <li><a href="./homegate-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Homegate_scraper-success-brightgreen" alt="homegate-scraper-status"></td> </tr> <tr> <td><a href="/idealista-scraper/">Idealista.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-idealista/">How to Scrape Idealista.com in Python - Real Estate Property Data</a></td> <td> <ul> <li><a href="./idealista-scraper/results/properties.json">Property pages</a></li> <li><a href="./idealista-scraper/results/search_data.json">Search pages</a></li> <li><a href="./idealista-scraper/results/search_URLs.json">Provinces pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Idealista_scraper-success-brightgreen" alt="idealista-scraper-status"></td> </tr> <tr> <td><a href="/immobilienscout24-scraper/">Immobilienscout24.de</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-immobillienscout24-real-estate-property-data/">How to Scrape Immobilienscout24.de Real Estate Data</a></td> <td> <ul> <li><a href="./immobilienscout24-scraper/results/properties.json">Property pages</a></li> <li><a href="./immobilienscout24-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Immobilienscout24_scraper-success-brightgreen" alt="immobilienscout24-scraper-status"></td> </tr> <tr> <td><a href="/immoscout24-scraper/">Immoscout24.ch</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-immoscout24-ch-real-estate-property-data/">How to Scrape Immoscout24.ch Real Estate Property Data</a></td> <td> <ul> <li><a href="./immoscout24-scraper/results/properties.json">Property pages</a></li> <li><a href="./immoscout24-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Immoscout24_scraper-success-brightgreen" alt="immoscout24-scraper-status"></td> </tr> <tr> <td><a href="/immowelt-scraper/">Immowelt.de</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-immowelt-de-real-estate-properties/">How to Scrape Immowelt.de Real Estate Data</a></td> <td> <ul> <li><a href="./immowelt-scraper/results/properties.json">Property pages</a></li> <li><a href="./immowelt-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Immowelt_scraper-success-brightgreen" alt="immowelt-scraper-status"></td> </tr> <tr> <td><a href="/indeed-scraper/">Indeed.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-indeedcom/">How to Scrape Indeed.com (2025 Update)</a></td> <td> <ul> <li><a href="./indeed-scraper/results/jobs.json">Job pages</a></li> <li><a href="./indeed-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Indeed_scraper-success-brightgreen" alt="indeed-scraper-status"></td> </tr> <tr> <td><a href="/instagram-scraper/">Instagram.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-instagram/">How to Scrape Instagram in 2025</a></td> <td> <ul> <li><a href="./instagram-scraper/results/user.json">User data</a></li> <li><a href="./instagram-scraper/results/all-user-posts.json">All user posts</a></li> <li><a href="./instagram-scraper/results/multi-image-post.json">Multi image post</a></li> <li><a href="./instagram-scraper/results/video-post.json">Video Post</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Instagram_scraper-success-brightgreen" alt="instagram-scraper-status"></td> </tr> <tr> <td><a href="/leboncoin-scraper/">Leboncoin.fr</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-leboncoin-marketplace-real-estate/">How to Web Scrape Leboncoin.fr using Python</a></td> <td> <ul> <li><a href="./leboncoin-scraper/results/ads.json">Ad pages</a></li> <li><a href="./leboncoin-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Leboncoin_scraper-success-brightgreen" alt="leboncoin-scraper-status"></td> </tr> <tr> <td><a href="/nordstorm-scraper/">Nordstorm.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-nordstrom/">How to Scrape Nordstrom Fashion Product Data</a></td> <td> <ul> <li><a href="./nordstorm-scraper/results/products.json">Product pages</a></li> <li><a href="./nordstorm-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Nordstorm_scraper-success-brightgreen" alt="nordstorm-scraper-status"></td> </tr> <tr> <td><a href="/realestatecom-scraper/">Realestate.com.au</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-realestate-com-au-property-listing-data/">How to Scrape Realestate.com.au Property Listing Data</a></td> <td> <ul> <li><a href="./realestatecom-scraper/results/properties.json">Property pages</a></li> <li><a href="./realestatecom-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Realestate_scraper-success-brightgreen" alt="realestate-scraper-status"></td> </tr> <tr> <td><a href="/realtorcom-scraper/">Realtor.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-realtorcom/">How to Scrape Realtor.com - Real Estate Property Data</a></td> <td> <ul> <li><a href="./realtorcom-scraper/results/property.json">Property pages</a></li> <li><a href="./realtorcom-scraper/results/search.json">Search pages</a></li> <li><a href="./realtorcom-scraper/results/feed.json">Feed pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Realtor_scraper-success-brightgreen" alt="realtor-scraper-status"></td> </tr> <tr> <td><a href="/reddit-scraper/">Reddit.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-reddit-social-data/">How to Scrape Reddit Posts, Subreddits and Profiles </a></td> <td> <ul> <li><a href="./reddit-scraper/results/post.json">Post pages</a></li> <li><a href="./reddit-scraper/results/subreddit.json">Subreddit pages</a></li> <li><a href="./reddit-scraper/results/user_comments.json">User comment pages</a></li> <li><a href="./reddit-scraper/results/user_posts.json">User post pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Reddit_scraper-success-brightgreen" alt="Reddit-scraper-status"></td> </tr> <tr> <td><a href="/redfin-scraper/">Redfin.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-redfin/">How to Scrape Redfin Real Estate Property Data in Python </a></td> <td> <ul> <li><a href="./redfin-scraper/results/properties_for_sale.json">Property pages for sale</a></li> <li><a href="./redfin-scraper/results/properties_for_rent.json">Property pages for rent</a></li> <li><a href="./redfin-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Redfin_scraper-success-brightgreen" alt="redfin-scraper-status"></td> </tr> <tr> <td><a href="./rightmove-scraper/">Rightmove.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-rightmove/">How to Scrape RightMove Real Estate Property Data with Python</a></td> <td> <ul> <li><a href="./rightmove-scraper/results/properties.json">Property pages</a></li> <li><a href="./rightmove-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Rightmove_scraper-success-brightgreen" alt="Rightmove-scraper-status"></td> </tr> <tr> <td><a href="./seloger-scraper/">Seloger.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-seloger-com-listing-real-estate-ads/">How to Scrape Seloger.com - Real Estate Listing Data</a></td> <td> <ul> <li><a href="./seloger-scraper/results/property.json">Property pages</a></li> <li><a href="./seloger-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Seloger_scraper-success-brightgreen" alt="Seloger-scraper-status"></td> </tr> <tr> <td><a href="./similarweb-scraper/">Similarweb.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-similarweb/">How to Scrape SimilarWeb Website Traffic Analytics</a></td> <td> <ul> <li><a href="./similarweb-scraper/results/websites.json">Website pages</a></li> <li><a href="./similarweb-scraper/results/websites_compare.json">Website compare pages</a></li> <li><a href="./similarweb-scraper/results/trends.json">Trend pages</a></li> <li><a href="./similarweb-scraper/results/sitemap_urls.json">Sitemaps</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Similarweb_scraper-success-brightgreen" alt="Similarweb-scraper-status"></td> </tr> <tr> <td><a href="./stockx-scraper/">Stockx.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-stockx/">How to Scrape StockX e-commerce Data with Python</a></td> <td> <ul> <li><a href="./stockx-scraper/results/product.json">Product pages</a></li> <li><a href="./stockx-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Stockx_scraper-success-brightgreen" alt="Stockx-scraper-status"></td> </tr> <tr> <td><a href="./threads-scraper/">Threads.net</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-threads/">How to scrape Threads by Meta using Python (2025 Update)</a></td> <td> <ul> <li><a href="./threads-scraper/results/profile.json">Profile pages</a></li> <li><a href="./threads-scraper/results/thread.json">Thread pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Threads_scraper-success-brightgreen" alt="Threads-scraper-status"></td> </tr> <tr> <td><a href="./tiktok-scraper/">TikTok.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-tiktok-python-json/">How To Scrape TikTok in 2025</a></td> <td> <ul> <li><a href="./tiktok-scraper/results/comments.json">Comment data</a></li> <li><a href="./tiktok-scraper/results/posts.json">Post data</a></li> <li><a href="./tiktok-scraper/results/profiles.json">Profile data</a></li> <li><a href="./tiktok-scraper/results/channel.json">Channel data</a></li> <li><a href="./tiktok-scraper/results/search.json">Search data</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/TikTok_scraper-success-brightgreen" alt="TikTok-scraper-status"></td> </tr> <tr> <td><a href="./tripadvisor-scraper/">Tripadvisor.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-tripadvisor/">How to Scrape TripAdvisor.com (2025 Updated)</a></td> <td> <ul> <li><a href="./tripadvisor-scraper/results/hotels.json">Hotel pages</a></li> <li><a href="./tripadvisor-scraper/results/search.json">Search pages</a></li> <li><a href="./tripadvisor-scraper/results/location.json">Location pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Tripadvisor_scraper-success-brightgreen" alt="Tripadvisor-scraper-status"></td> </tr> <tr> <td><a href="./trustpilot-scraper/">Trustpilot.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-trustpilot-com-reviews/">How to Scrape Trustpilot.com Reviews and Company Data</a></td> <td> <ul> <li><a href="./trustpilot-scraper/results/companies.json">Company pages</a></li> <li><a href="./trustpilot-scraper/results/reviews.json">Reviews pages</a></li> <li><a href="./trustpilot-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Trustpilot_scraper-success-brightgreen" alt="Trustpilot-scraper-status"></td> </tr> <tr> <td><a href="./twitter-scraper/">Twitter(X).com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-twitter/">How to Scrape X.com (Twitter) using Python (2025 Update)</a></td> <td> <ul> <li><a href="./twitter-scraper/results/profile.json">Profile pages</a></li> <li><a href="./twitter-scraper/results/tweet.json">Tweet pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Twitter_scraper-success-brightgreen" alt="Twitter-scraper-status"></td> </tr> <tr> <td><a href="./vestiairecollective-scraper/">VestiaireCollective.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-vestiairecollective/">How to Scrape Vestiaire Collective for Fashion Product Data</a></td> <td> <ul> <li><a href="./vestiairecollective-scraper/results/products.json">Product pages</a></li> <li><a href="./vestiairecollective-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Vestiaire_Collective_scraper-success-brightgreen" alt="Vestiaire-Collective-scraper-status"></td> </tr> <tr> <td><a href="./g2-scraper/">G2.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-g2-company-data-and-reviews/">How to Scrape G2 Company Data and Reviews</a></td> <td> <ul> <li><a href="./g2-scraper/results/reviews.json">Review pages</a></li> <li><a href="./g2-scraper/results/search.json">Search pages</a></li> <li><a href="./g2-scraper/results/alternatives.json">Alternatives pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/G2_scraper-success-brightgreen" alt="G2-scraper-status"></td> </tr> <tr> <td><a href="/walmart-scraper/">Walmart.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-walmartcom/">How to Scrape Walmart.com Product Data (2025 Update)</a></td> <td> <ul> <li><a href="./walmart-scraper/results/products.json">Product pages</a></li> <li><a href="./walmart-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Walmart_scraper-success-brightgreen" alt="Walmart-scraper-status"></td> </tr> <tr> <td><a href="/wellfound-scraper/">Wellfound.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-wellfound-aka-angellist/">How to Scrape Wellfound Company Data and Job Listings</a></td> <td> <ul> <li><a href="./wellfound-scraper/results/companies.json">Company pages</a></li> <li><a href="./wellfound-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Wellfound_scraper-success-brightgreen" alt="Wellfound-scraper-status"></td> </tr> <tr> <td><a href="/linkedin-scraper/">Linkedin.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-linkedin-person-profile-company-job-data/">How to Scrape LinkedIn in 2025</a></td> <td> <ul> <li><a href="./linkedin-scraper/results/profile.json">Profile pages</a></li> <li><a href="./linkedin-scraper/results/company.json">Company pages</a></li> <li><a href="./linkedin-scraper/results/job_search.json">Job search pages</a></li> <li><a href="./linkedin-scraper/results/jobs.json">Job pages</a></li> <li><a href="./linkedin-scraper/results/articles.json">Article pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/LinkedIn_scraper-success-brightgreen" alt="LinkedIn-scraper-status"></td> </tr> <tr> <td><a href="/yellowpages-scraper/">Yellowpages.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-yellowpages/">How to Scrape YellowPages.com in 2025</a></td> <td> <ul> <li><a href="./yellowpages-scraper/results/business_pages.json">Business pages</a></li> <li><a href="./yellowpages-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Yellowpages_scraper-success-brightgreen" alt="Yellowpages-scraper-status"></td> </tr> <tr> <td><a href="/yelp-scraper/">Yelp.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-yelpcom/">How to Web Scrape Yelp.com (2025 update)</a></td> <td> <ul> <li><a href="./yelp-scraper/results/business_pages.json">Business pages</a></li> <li><a href="./yelp-scraper/results/reviews.json">Review pages</a></li> <li><a href="./yelp-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Yelp_scraper-success-brightgreen" alt="Yelp-scraper-status"></td> </tr> <tr> <td><a href="/youtube-scraper/">YouTube.com</a></td></a></td> <td><a href="https://scrapfly.io/blog/posts/how-to-scrape-youtube-in-2025">How to Scrape YouTube in 2025</a></td> <td> <ul> <li><a href="./youtube-scraper/results/channel_videos.json">Channel videos</a></li> <li><a href="./youtube-scraper/results/channels.json">Channel metadata</a></li> <li><a href="./youtube-scraper/results/channel_videos.json">Channel videos</a></li> <li><a href="./youtube-scraper/results/videos.json">Video metadata</a></li> <li><a href="./youtube-scraper/results/comments.json">Video comments</a></li> <li><a href="./youtube-scraper/results/shorts.json">Shorts' metadata</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/YouTube_scraper-success-brightgreen" alt="YouTube-scraper-status"></td> </tr> <tr> <td><a href="/zillow-scraper/">Zillow.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-zillow/">How to Scrape Zillow Real Estate Property Data in Python</a></td> <td> <ul> <li><a href="./zillow-scraper/results/property.json">Property pages</a></li> <li><a href="./zillow-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Zillow_scraper-success-brightgreen" alt="Zillow-scraper-status"></td> </tr> <tr> <td><a href="/zoominfo-scraper/">Zoominfo.com</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-zoominfo/">How to Scrape Zoominfo Company Data (2025 Update)</a></td> <td> <ul> <li><a href="./zoominfo-scraper/results/companies.json">Company pages</a></li> <li><a href="./zoominfo-scraper/results/directory.json">Directory pages</a></li> <li><a href="./zoominfo-scraper/results/faqs.json">FAQs data</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Zoominfo_scraper-success-brightgreen" alt="Zoominfo-scraper-status"></td> </tr> <tr> <td><a href="/zoopla-scraper/">Zoopla.co.uk</a></td> <td><a href="https://scrapfly.io/blog/how-to-scrape-zoopla/">How to Scrape Zoopla Real Estate Property Data in Python</a></td> <td> <ul> <li><a href="./zoopla-scraper/results/properties.json">Property pages</a></li> <li><a href="./zoopla-scraper/results/search.json">Search pages</a></li> </ul> </td> <td><img src="https://img.shields.io/badge/Zoopla_scraper-success-brightgreen" alt="Zoopla-scraper-status"></td> </tr> </table>Fair Use and Legal Disclaimer
This repository contains educational reference material to illustrate how accessible web scraping can be and the provided programs are not intented to be used in web scraping production.
That being said, Scrapfly team is constantly updating and improving all of this code for optimal experience.
Scrapfly does not offer legal advice and as always, consult a lawyer when creating programs that interact with other people's websites though here's a good general intro of what NOT to do:
- Do not store PII (personally identifiable information) of EU citizens who are protected by GDPR.
- Do not scrape and repurpose entire public datasets which can be protected by database protection laws in some countries.
- Do not scrape at rates that could damage the website and scrape only publicly available data.
Contributors
Showing top 8 contributors by commit count.
