Web Scraping & Topic Modeling using R
2024-03-20
Open edu.nl/qh34g in a new tab
When you are ready put a green sticky note on your laptop and follow the instructions on the presenter’s screen
What are the main farming-related topics discussed in social media?
Social media data
from Facebook or X
In the digital media era, social media platforms and websites serve as valuable sources for collecting user-generated content, which in turn represents the voices and opinions of citizens. Twitter (X)’s real-time nature and vast array of tweets cover diverse topics, providing insights into current events, public reactions, and emerging trends. Similarly, Facebook’s extensive user base and features like status updates and comments offer rich data on user experiences and interactions.
Web scraping using the Chrome extension
Web Data Research Assistant
Web scraping is the process of extracting data from websites. It involves automated techniques to collect information from web pages or social media website, typically in formats such as HTML, XML, or JSON. Web scraping allows you to retrieve specific data elements, such as text, images, or tables, from web pages and store them for analysis or other purposes.
Open script_socialmedia.qmd
and follow the instructions there
What are the main farming-related topics discussed in Dutch news since 2022?
51 newspaper articles
in Dutch
Quantitative analysis:
Topic modeling
Why?
Good to know:
Topic modeling using Latent Dirichlet Allocation (LDA) is a method used to reveal latent topics in unstructured text data.
In an LDA model:
Open script.qmd
and follow the instructions there
Reach out to us with questions:
Follow Rbanism:
Citation:
Zhang, S., & Forgaci, C. (2024). What do Facebook or X users say about farming in the Netherlands? - Word cloud analysis using R TU Delft.
Forgaci, C. (2024). What do the Dutch news say about farming communities? - A Topic modeling approach using R TU Delft.
Web Scraping & Topic Modeling with R