In this video, we'll see how you can scrape complex webpages, including pages that use hydration( ie. load their data in the browser using JavaScript), in Python using the Playwright framework.
This lets you scrape websites that cannot be scraped using simpler tools such as the requests library.
Using Playwright, you can also take screenshots, scrape websites that require authentication, and much more. Playwright is a modern alternative to Selenium. It is comparable to Puppeteer, with the difference that Playwright is not JavaScript-only, it provides official Python bindings.
👍 Please like if you found this video helpful, and subscribe to stay updated with my latest tutorials. 🔔
The code is available here: [ Ссылка ]
❤️ You can support this channel by buying me a ☕: [ Ссылка ]
🔖 Chapters:
00:00 Intro
02:17 Scraping Using Requests
05:16 Scraping Using Playwright
09:13 Converting Table to DataFrame
10:46 Screenshot
12:12 Other Features
🔗 Video links:
Playwright: [ Ссылка ]
🐍 More Vincent Codes Finance:
- ✍🏻 Blog: [ Ссылка ]
- 🐦 X: [ Ссылка ]
- 🧵 Threads: [ Ссылка ]
- 😺 GitHub: [ Ссылка ]
- 📘 Facebook: [ Ссылка ]
- 👨💼 LinkedIn: [ Ссылка ]
- 🎓 Academic website: [ Ссылка ]
#scraping #python #playwright #programming #code #nlp #opensource #pandas #puppeteer #selenium #bigdata #research #researchtips #vscode #professor #datascience #dataanalytics #dataanalysis #webscraping
Ещё видео!