# Box Office Stats

{% hint style="success" %}
Dataset contains 100+ tickers, available from 1997-07-01 onwards.
{% endhint %}

`Tutorials` are the best documentation — [<mark style="color:blue;">`Box Office Movie Analysis Tutorial`</mark>](https://colab.research.google.com/github/sovai-research/sovai-public/blob/main/notebooks/datasets/Movies%20Box%20Office.ipynb)

## Description

This dataset provides detailed box office performance data for movies, including daily revenue, theater counts, and distributor information.

It links movies to their producer companies via ticker symbols, enabling analysis of box office success across different production studios and distributors over time.

## Data Access

#### Retrieving Data

```python
import sovai as sov 
df_movies = sov.data("movies/boxoffice")
```

<figure><img src="/files/dsdfVlIHeEP3sMdjuuwv" alt=""><figcaption></figcaption></figure>

### Data Dictionary

| Column Name          | Data Type | Description                                                                | Example                                             |
| -------------------- | --------- | -------------------------------------------------------------------------- | --------------------------------------------------- |
| ticker               | string    | Ticker symbol of the movie producer company                                | "ZEEL"                                              |
| date                 | date      | Date of the movie's box office performance                                 | 2022-03-18                                          |
| title                | string    | Title of the movie                                                         | "The Kashmir Files"                                 |
| distributor          | string    | Distributor of the movie                                                   | "Zee Studios"                                       |
| gross                | integer   | Gross box office revenue for the movie on the specified date               | 413000                                              |
| percent\_yd          | float     | Percentage change in gross revenue compared to the previous day            | 0.0                                                 |
| percent\_lw          | float     | Percentage change in gross revenue compared to the previous week           | 0.2                                                 |
| theaters             | integer   | Number of theaters screening the movie on the specified date               | 230                                                 |
| per\_theater         | float     | Average gross revenue per theater on the specified date                    | 1796.0                                              |
| total\_gross         | integer   | Cumulative gross box office revenue for the movie up to the specified date | 413000                                              |
| days\_in\_release    | integer   | Number of days the movie has been in release as of the specified date      | 1                                                   |
| parent\_company      | string    | Parent company of the movie producer                                       | "Zee Entertainment Enterprises Limited"             |
| distributor\_address | string    | Address of the movie distributor                                           | "Laxmi Industrial Estate, Off New Link Road, An..." |
| distributor\_website | string    | Website of the movie distributor                                           | "<https://www.zee.com/>"                            |
| release\_date        | date      | Initial release date of the movie                                          | 2022-03-17                                          |

### Dataset Structure

The dataset is organized as a table with 228,484 rows and 15 columns. Each row represents a specific movie's box office performance on a particular date.

### Data Types

The dataset contains the following data types:

* String: ticker, title, distributor, parent\_company, distributor\_address, distributor\_website
* Date: date, release\_date
* Integer: gross, theaters, total\_gross, days\_in\_release
* Float: percent\_yd, percent\_lw, per\_theater

### Missing Values

If a movie does not have any data for a particular column on a specific date, the corresponding cell may contain missing values.

### Example Rows

Here are a few example rows from the dataset:

| ticker | date       | title             | distributor   | gross  | percent\_yd | percent\_lw | theaters | per\_theater | total\_gross | days\_in\_release | parent\_company                       | distributor\_address                              | distributor\_website          | release\_date |
| ------ | ---------- | ----------------- | ------------- | ------ | ----------- | ----------- | -------- | ------------ | ------------ | ----------------- | ------------------------------------- | ------------------------------------------------- | ----------------------------- | ------------- |
| 600579 | 2011-02-11 | Raymond Did It    | Plastic Age … | 2999   | 0.0         | 0.0         | 1.0      | 2999.0       | 2999         | 1                 | KraussMaffei Group                    | 7295 Tellier St, Montreal, Quebec H1N 3S9, CA     | <https://plastic-age.com/en/> | 2011-02-10    |
| 600579 | 2011-02-12 | Raymond Did It    | Plastic Age … | 193    | -0.94       | 0.0         | 1.0      | 193.0        | 3192         | 2                 | KraussMaffei Group                    | 7295 Tellier St, Montreal, Quebec H1N 3S9, CA     | <https://plastic-age.com/en/> | 2011-02-10    |
| ZEEL   | 2022-03-18 | The Kashmir Files | Zee Studios   | 413000 | 0.0         | 0.0         | 230.0    | 1796.0       | 413000       | 1                 | Zee Entertainment Enterprises Limited | Laxmi Industrial Estate, Off New Link Road, An... | <https://www.zee.com/>        | 2022-03-17    |

This data dictionary provides an overview of the movie producer and movie dataset, including the column descriptions, data types, examples, and sample rows.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.sov.ai/realtime-datasets/sectorial-datasets/box-office-stats.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
