Bankruptcy Predictions
Chapter 7 and Chapter 11 bankruptcy predictions made easy for over 5,000 US publicly traded stocks.
Monthly corporate bankruptcy predictions arrive the 2nd of every month.
Tutorials
are the best documentation — Corporate Bankruptcy Tutorial
Input Datasets
SEC Bankruptcies, Delistings, Market Data, Financial Statements
Models Used
CNN, LightGBM, RocketModel, AutoEncoder
Model Outputs
Calibrated Probabilities, Shapley Values
Description
The model predicts the likelihood of bankruptcies in the next 6-months for US publicly listed companies using advanced machine learning models.
With an accuracy of around 89% and ROC-AUC of 85%, these models represent a large improvement over traditional methods of bankruptcy prediction for equity selection.
Advanced modeling techniques used in this dataset:
The Boosting Model: Utilizes LightGBM technology, integrating both fundamental and market data for accurate predictions.
The Convolutional Model: Employs a Convolutional Neural Network (CNN) for efficient pattern recognition in market trends.
The Rocket Model: Specializes in time series data, using random convolutional kernels for effective classification and forecasting.
The Encoder Model: Combines LightGBM with CNN autoencoders, enhancing feature engineering for more precise predictions.
The Fundamental Model: Focuses solely on fundamental data via LightGBM, without extra architectural layers, for straightforward financial analysis.
Data Access
Monthly Probabilities
Specific Tickers
Specific Dates
Latest Data
All Data
Daily Probabilities
The daily probabilities are experimental, and have a very short history of just a couple of months.
Feature Importance (Shapleys)
Feature Importance (Shapley Values) calculates the contribution of each input variable (features) such as Debt, Assets, and Revenue to predict bankruptcy risk.
Reports
Sorting and Filtering
Filter the outputs based on the top by Sector, Marketcap, and Revenue and bankruptcy risk. You can also change ranking
to change
to investigate the month on month change.
Plots
Bankruptcy Comparison
Timed Feature Importance
Total Feature Importance
Bankruptcy and Returns
PCA Statistical Similarity
Correlation Similarity
Trend Similarity
Model Performance
Confusion Matrix
Threshold Plots
Lift Curve
Global Explainability
Computations
Leverage advanced computational tools for deeper analysis:
Distance Matrix:
Assess the similarity between entities based on selected attributes.
Percentile Calculation:
Calculate the relative standing of values within a dataset.
Feature Mapping:
Map accounting features to standardized metrics.
PCA Calculation:
Perform principal component analysis for dimensionality reduction.
For more advanced applications, see the tutotrial.
Data Dictionary
ticker
Stock ticker symbol.
TEXT
"TSLA"
date
Record date.
DATE
2023-09-30
probability_light
LightGBM Boosting Model prediction.
FLOAT
1.46636
probability_convolution
CNN Model prediction for bankruptcies
FLOAT
0.135975
probability_rocket
Rocket Model prediction for time series classification
FLOAT
0.02514
probability_encoder
LightGBM and CNN autoencoders Model prediction.
FLOAT
0.587817
probability_fundamental
Prediction using accounting data only.
FLOAT
1.26148
probability
Average probability across models.
FLOAT
0.553823
sans_market
Fundamental prediction adjusted for market predictions.
FLOAT
-0.20488
volatility
Variability of model predictions.
FLOAT
0.62934
multiplier
Coefficient for model prediction calibration.
FLOAT
1.951868
version
Model/data record version.
INT
20240201
When sans_market
is positive, it means that the fundamentals show a larger predicted bankruptcy than what the market predicts (stock might go down in medium term) , when sans_market
is negative, the market might have overreacted, and predict a larger probability of bankruptcy than what the fundamentals suggest (stock might go up in medium term).
Use Cases
Bankruptcy Prediction Analysis: Offer insights into predicted corporate bankruptcies and identify key factors, clarifying main drivers across different cycles.
Variable Impact Breakdown: Analyze how each individual variable affects bankruptcy predictions, providing in-depth feature contribution insights.
Temporal Feature Distribution Analysis: Reveal how variables contribute to predictions over time, emphasizing key features in forecasting models.
Correlation Discovery: Identify stocks with similar bankruptcy probability trends, revealing correlated market behaviors.
Probability Shift Overview: Showcase changes in bankruptcy probabilities among correlated stocks, providing a comprehensive market perspective.
Sentiment Inversion Analysis: Convert bankruptcy predictions into positive sentiment indicators to gauge potential impacts on stock returns.
Behavioral Similarity Mapping: Locate stocks with similar behaviors to a selected reference, based on bankruptcy trends and PCA feature analysis.
Last updated