IS 930 Web Scraping for Researchers

Content

In today’s data-driven world, the ability to obtain web data is a game-changer for empirical research. Top-tier research papers in marketing, management, accounting, finance, and information systems increasingly rely on data obtained from scraping websites or querying APIs. Examples of such data include customer reviews from Amazon, search trends on Google, Instagram posts, job postings, investment disclosures, and much more. The skill to obtain novel, rich, and unique web data has become critical for business researchers.

This hands-on course equips doctoral students and postdocs in business with the essential skills to collect, extract, and process web data for research purposes. You will follow a structured, step-by-step approach to:

Identify requirements for effective data extraction
Automate data collection from websites or APIs
Parse and clean data
Navigate ethical and legal considerations

Through practical exercises in small groups, you will develop a fully functional web scraper using the Python programming language, which you can easily adapt for your own research projects later on.

You do not need to be familiar with Python to attend the course. If you have not worked with Python yet, we provide setup instructions before the course starts.

Learning Outcomes

By the end of the course, students ..:

..can evaluate the costs and benefits of automating data extraction from websites and APIs,
..can implement web scraping techniques in Python to collect, parse, and clean data from websites and APIs,
..can address common technical challenges in web scraping, including dynamic content and scalability,
..can explain ethical and methodological considerations related to web scraping.

Necessary prerequisites:

–

Recommended prerequisites:

–

Modulkatalog

IS 930 Modulbeschreibung
( PDF , 216 KB )

Kontakt

Bild: Elene Rakviashvili

Luisa Buck, M.Sc. (sie/ihr)

E-Mail: luisa.buckuni-mannheim.de
Tel: +49 621 181-2153

Adresse:
Universität Mannheim
L5, 1–6 – Raum 722
68131 Mannheim

Sprechstunde:
Nach Vereinbarung

IS 930 Web Scraping for Researchers

Modulkatalog

Kontakt

Luisa Buck, M.Sc. (sie/ihr)

Lehrstuhl für Information Systems II

FORUM