
News

Data sources for news content, media coverage, and online search trends. For social media platforms, see Social Media.

  • Nexis Uni [Global] — Full text of articles from Dutch and international newspapers and news magazines. Also provides company information on Dutch and international businesses, and Market Insight. Accessible to UU students and staff via the Utrecht University Library.
  • Structure of News [CSV, Global] — Structured dataset mapping the topics, sources, and formats found in news journalism across countries and outlets. Designed for computational analysis of news content and media diversity. Useful for studying media framing, information dissemination, and coverage patterns in financial markets research.
  • Google Trends [API, Global] — Relative search interest over time for any query, by country, region, and category. Widely used as a real-time proxy for investor attention, household financial concerns, and information demand. Values are normalised to a 0–100 scale within each query, so levels are not directly comparable across separate queries. Accessible via the website or programmatically with the unofficial pytrends Python library.
  • Wikimedia REST API [API, Python, Global] — Free API providing daily and monthly page view counts for any Wikipedia article across all language editions, from 2015 to present. Useful as a proxy for public interest in and attention to companies, industries, events, and topics. No authentication required. Accessible via the Pageviews endpoint or with the pageviewsapi Python package (pip install pageviewsapi).
  • GDELT (Global Database of Events, Language, and Tone) [API, CSV, Global] — Large open dataset monitoring world news in 100+ languages, updated every 15 minutes. Covers events (who did what to whom, when, and where), themes, organizations, sentiment/tone, and source URLs. Raw data files are downloadable from the GDELT Data page or queryable via Google BigQuery (free tier). Also accessible via the GDELT DOC 2.0 API. Widely used in event studies, media sentiment analysis, and geopolitical risk research.
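
As an illustration of the Wikimedia Pageviews endpoint listed above, the following is a minimal sketch that uses only the Python standard library rather than a wrapper package. The helper names (pageviews_url, parse_pageviews, fetch_pageviews) and the User-Agent string are hypothetical and chosen for this example. The URL layout and the "items" / "timestamp" / "views" response fields reflect the per-article Pageviews route as commonly documented, and should be checked against the current Wikimedia API reference before use.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen

# Per-article route of the Wikimedia Pageviews API.
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def pageviews_url(article, start, end, project="en.wikipedia",
                  access="all-access", agent="user", granularity="daily"):
    """Build the request URL; start and end are YYYYMMDD strings."""
    # Article titles use underscores for spaces and must be URL-encoded.
    title = quote(article.replace(" ", "_"), safe="")
    return f"{BASE}/{project}/{access}/{agent}/{title}/{granularity}/{start}/{end}"


def parse_pageviews(payload):
    """Map each date (YYYYMMDD) to its view count."""
    # Timestamps arrive as YYYYMMDDHH; keep only the date part.
    return {item["timestamp"][:8]: item["views"]
            for item in payload.get("items", [])}


def fetch_pageviews(article, start, end, **kwargs):
    """Download and parse page views (performs a network request)."""
    # No API key is needed, but a descriptive User-Agent is expected.
    # The value below is a placeholder (assumption); replace it with your own.
    req = Request(pageviews_url(article, start, end, **kwargs),
                  headers={"User-Agent": "research-example/0.1 (you@example.org)"})
    with urlopen(req, timeout=30) as resp:
        return parse_pageviews(json.load(resp))
```

A call such as fetch_pageviews("Utrecht University", "20240101", "20240131") would return a dictionary of daily view counts. Passing agent="user" is intended to exclude traffic identified as automated, which is usually the series of interest for attention research.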
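
For the GDELT DOC 2.0 API listed above, a query can be sketched in a similar way with the standard library. The function names (gdelt_doc_url, parse_articles, fetch_articles) are hypothetical. The parameter names (query, mode, format, timespan, maxrecords) and the "articles" response key follow the DOC 2.0 API as publicly described, and are assumptions to verify against the GDELT documentation.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint of the GDELT DOC 2.0 API.
DOC_API = "https://api.gdeltproject.org/api/v2/doc/doc"


def gdelt_doc_url(query, mode="ArtList", timespan="7d", max_records=50):
    """Build a DOC API URL that requests JSON output."""
    params = {
        "query": query,            # keywords or a quoted phrase
        "mode": mode,              # ArtList returns a list of matching articles
        "format": "json",
        "timespan": timespan,      # look-back window, e.g. "24h" or "7d"
        "maxrecords": max_records,
    }
    return f"{DOC_API}?{urlencode(params)}"


def parse_articles(payload):
    """Extract (seendate, domain, title) tuples from an ArtList response."""
    # Field names are assumed from typical ArtList output; missing keys give None.
    return [(a.get("seendate"), a.get("domain"), a.get("title"))
            for a in payload.get("articles", [])]


def fetch_articles(query, **kwargs):
    """Download and parse matching articles (performs a network request)."""
    with urlopen(gdelt_doc_url(query, **kwargs), timeout=30) as resp:
        text = resp.read().decode("utf-8")
    # The service may answer with plain text on errors, in which case
    # json.loads raises ValueError.
    return parse_articles(json.loads(text))
```

For example, fetch_articles('"central bank" inflation', timespan="24h", max_records=25) would request recent coverage matching that query. For long historical samples, the raw files or BigQuery tables mentioned above are likely a better fit than this API.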

See also: Social Media for platform data (Twitter/X, Reddit, Meta) | Sentiment & Culture for text-based sentiment indices | Company Filings (SEC EDGAR) for EDGAR full-text search.