Bai, Dan and Gu, Yan (2026) Harnessing Big Data, Hindered by Bias: Evaluating TikTok Research API for Fair and Optimal Social Sciences. Social Science Computer Review. DOI https://doi.org/10.1177/08944393251413277
Bai, Dan and Gu, Yan (2026) Harnessing Big Data, Hindered by Bias: Evaluating TikTok Research API for Fair and Optimal Social Sciences. Social Science Computer Review. DOI https://doi.org/10.1177/08944393251413277
Bai, Dan and Gu, Yan (2026) Harnessing Big Data, Hindered by Bias: Evaluating TikTok Research API for Fair and Optimal Social Sciences. Social Science Computer Review. DOI https://doi.org/10.1177/08944393251413277
Abstract
Digital platforms now serve as crucial archives for analysing societal trends, yet their research APIs pose methodological challenges. This study critically evaluates TikTok Research API through comparative analysis of 6,373 videos from 14 creators in the United States and United Kingdom (2020–2022), contrasting API-derived outputs with manual collection and third-party analytics. The API demonstrated scalability, retrieving more videos than alternative methods and providing 22 variables, including eight unavailable elsewhere. However, limitations were substantial: transcriptions covered about 10% of the content, with more transcripts returned from American male creators. Engagement metrics exhibited inconsistent accuracy across data sources, with the API showing systematically lower view counts but higher comment and share counts compared to manual collection. The number of videos varied depending on sample composition, indicating that small changes in inclusion criteria could shift outcomes disproportionately. These results highlight systematic inconsistencies, showing why multi-method approaches remain necessary despite automation. While TikTok Research API offers valuable scale and ethical compliance, its demographic biases and metadata inconsistencies compromise validity. The study advocates integrated auditing protocols and targeted API refinements to improve representativeness and accuracy in platform-based research.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | digital platform; TikTok; API; big data; data audit |
| Divisions: | Faculty of Science and Health Faculty of Science and Health > Psychology, Department of |
| SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
| Depositing User: | Unnamed user with email elements@essex.ac.uk |
| Date Deposited: | 25 Mar 2026 14:48 |
| Last Modified: | 25 Mar 2026 14:48 |
| URI: | http://repository.essex.ac.uk/id/eprint/42550 |
Available files
Filename: bai-gu-2026-harnessing-big-data-hindered-by-bias-evaluating-tiktok-research-api-for-fair-and-optimal-social-sciences.pdf
Licence: Creative Commons: Attribution 4.0