circle-nodesClustering Panels

Clustering specifically designed for multivariate panel clustering of financial and time-series data

Tutorials are the best documentation — Clustering Panels Tutorialarrow-up-right

Introduction

Can be used to cluster any panel dataset. It is particularly useful for financial analysts, data scientists, and researchers working with time-series data across multiple entities (e.g., stocks, companies) and variables.

Initialization

The CustomDataFrame can be initialized using the sov.data() function:

import sovai as sov

sov.token_auth(token="your_token_here")
df = sov.data("accounting/weekly")

Basic Clustering

Perform clustering on all features:

df_cluster = df.cluster()

Feature-Specific Clustering

Cluster based on specific features:

Summary Clustering

Get a quick summary of the last 6-months data:

Visualization Methods

Line Plot

Visualize cluster centroids and distances:

Scatter Plot

Create a scatter plot of clustered data:

Animation Plot

Generate an animated plot of cluster evolution:

Advanced Analysis

Distance Calculation

Calculate distances between ticker-cluster combinations:

Examples

Basic Clustering and Visualization

Feature-Specific Clustering and Distance Analysis

Last updated