Wikipedia Views
A look at some of the largest firms and their daily wikipedia page views and trends.
Data is updated quarterly as data arrives after market close US-EST time.
Tutorials
are the best documentation — Wikipedia Views Tutorial
Input Datasets
Wikipedia Scrapers
Models Used
Fuzzy Matching
Model Outputs
Views and Trends
Description
This dataset provides daily Wikipedia page view data and trends for major companies, offering insights into public interest and market sentiment.
It includes metrics on view counts, relative views, and derived alpha/beta proxies to help investors gauge short-term and long-term trends in public attention towards specific stocks.
Data Access
Latest Data
All Data
This data is around 1GB if you download the entire dataset.
Filtered Dataset
Data Dictionary
Sure, let's update the markdown table with a more precise description of each feature, incorporating the detailed understanding of how alpha and beta proxies are calculated:
Feature Name | Description | Type | Example |
---|---|---|---|
views | The total number of page views for a specific ticker on a given date. | Float | 0.992128 |
relative_views | The relative number of page views for a ticker, normalized against views on other dates or other tickers. | Float | 0.992128 |
alpha_short | Short-term alpha proxy representing the change in the Exponential Moving Average (EMA) of page views over a short period. It indicates the short-term momentum or trend. | Float | 0.443640 |
beta_short | Short-term beta proxy measuring the deviation of actual page views from their short-term EMA. It represents the short-term volatility or variability in interest. | Float | 0.456864 |
alpha_long | Long-term alpha proxy similar to alpha_short but calculated over a longer time frame. It reflects the long-term trend or momentum in page views. | Float | 0.482683 |
beta_long | Long-term beta proxy indicating the deviation of actual page views from their long-term EMA. It measures the long-term volatility or variation in attention. | Float | 0.480479 |
long_short_alpha | The difference between long-term and short-term alpha proxies, highlighting the change in trend strength over varying time frames. | Float | 0.627361 |
long_short_beta | The difference between long-term and short-term beta proxies, showing the change in volatility or variability over different time periods. | Float | 0.683407 |
search_pressure | A composite metric calculated from a combination of alpha and beta proxies over different time frames, designed to provide a holistic view of the changing interest in a ticker. | Float | 0.508743 |
This table offers a succinct yet comprehensive overview of each feature, tailored to facilitate a clear understanding of the data's dimensions and their relevance in financial analysis, especially in the context of assessing public interest and market sentiment toward different financial entities.
Use Cases
This dataset is designed to provide investors with a detailed analysis of market interest and sentiment towards various financial entities, as reflected in Wikipedia page views. By analyzing page view trends and volatility, investors can gain insights into public interest and market sentiment, which are crucial factors in investment decision-making.
This dataset can be leveraged by investors for various purposes:
Market Sentiment Analysis: By analyzing trends and volatility in page views, investors can gauge public interest and sentiment towards specific tickers.
Investment Decision Support: Insights from the dataset can support buy, hold, or sell decisions based on the perceived interest and sentiment dynamics.
Risk Assessment: Variability in page views, as indicated by beta proxies, can aid in assessing the market's perception of risk associated with certain tickers.
Trend Identification: Alpha proxies provide a means to identify emerging trends in investor interest, which can be precursors to market movements.
Comparative Analysis: Comparing alpha and beta metrics across different tickers can help identify outperformers or underperformers in terms of market interest.
Last updated