Download full text
(466.1Kb)
Citation Suggestion
Please use the following Persistent Identifier (PID) to cite this document:
https://nbn-resolving.org/urn:nbn:de:0168-ssoar-98479-3
Exports for your reference manager
Improving the Quality of Individual-Level Web Tracking: Challenges of Existing Approaches and Introduction of a New Content and Long-Tail Sensitive Academic Solution
[journal article]
Abstract This article evaluates the quality of data collection in individual-level desktop web tracking used in the social sciences and shows that the existing approaches face sampling issues, validity issues due to the lack of content-level data and their disregard for the variety of devices and long-tail c... view more
This article evaluates the quality of data collection in individual-level desktop web tracking used in the social sciences and shows that the existing approaches face sampling issues, validity issues due to the lack of content-level data and their disregard for the variety of devices and long-tail consumption patterns as well as transparency and privacy issues. To overcome some of these problems, the article introduces a new academic web tracking solution, WebTrack, an open-source tracking tool maintained by a major European research institution, GESIS. The design logic, the interfaces, and the backend requirements for WebTrack are discussed, followed by a detailed examination of the strengths and weaknesses of the tool. Finally, using data from 1,185 participants, the article empirically illustrates how an improvement in data collection through WebTrack leads to innovative shifts in the use of tracking data. As WebTrack allows for collecting the content people are exposed to beyond the classical news platforms, it can greatly improve the detection of politics-related information consumption in tracking data through automated content analysis compared to traditional approaches that rely on the source-level analysis.... view less
Keywords
data; data quality; information; information collection; content analysis; validity; transparency; data protection
Classification
Basic Research in the Social Sciences
Free Keywords
online tracking; automated content analysis; WebTrack; content; long-tail consumption
Document language
English
Publication Year
2024
Page/Pages
p. 1-21
Journal
Social Science Computer Review (2024) Online First
DOI
https://doi.org/10.1177/08944393241287793
ISSN
0894-4393
Status
Preprint; peer reviewed
Licence
Deposit Licence - No Redistribution, No Modifications