Bankruptcy Predictions
Chapter 7 and Chapter 11 bankruptcy predictions made easy for over 5,000 US publicly traded stocks.
Monthly corporate bankruptcy predictions arrive the 2nd of every month.
Tutorials
are the best documentation — Corporate Bankruptcy Tutorial
Input Datasets
SEC Bankruptcies, Delistings, Market Data, Financial Statements
Models Used
CNN, LightGBM, RocketModel, AutoEncoder
Model Outputs
Calibrated Probabilities, Shapley Values
Description
The model predicts the likelihood of bankruptcies in the next 6-months for US publicly listed companies using advanced machine learning models.
With an accuracy of around 89% and ROC-AUC of 85%, these models represent a large improvement over traditional methods of bankruptcy prediction for equity selection.
Advanced modeling techniques used in this dataset:
The Boosting Model: Utilizes LightGBM technology, integrating both fundamental and market data for accurate predictions.
The Convolutional Model: Employs a Convolutional Neural Network (CNN) for efficient pattern recognition in market trends.
The Rocket Model: Specializes in time series data, using random convolutional kernels for effective classification and forecasting.
The Encoder Model: Combines LightGBM with CNN autoencoders, enhancing feature engineering for more precise predictions.
The Fundamental Model: Focuses solely on fundamental data via LightGBM, without extra architectural layers, for straightforward financial analysis.
Data Access
Monthly Probabilities
Specific Tickers
Specific Dates
Latest Data
All Data
Daily Probabilities
The daily probabilities are experimental, and have a very short history of just a couple of months.
Feature Importance (Shapleys)
Feature Importance (Shapley Values) calculates the contribution of each input variable (features) such as Debt, Assets, and Revenue to predict bankruptcy risk.
Reports
Sorting and Filtering
Filter the outputs based on the top by Sector, Marketcap, and Revenue and bankruptcy risk. You can also change ranking
to change
to investigate the month on month change.
Plots
Bankruptcy Comparison
Timed Feature Importance
Total Feature Importance
Bankruptcy and Returns
PCA Statistical Similarity
Correlation Similarity
Trend Similarity
Model Performance
Confusion Matrix
Threshold Plots
Lift Curve
Global Explainability
Computations
Leverage advanced computational tools for deeper analysis:
Distance Matrix:
Assess the similarity between entities based on selected attributes.
Percentile Calculation:
Calculate the relative standing of values within a dataset.
Feature Mapping:
Map accounting features to standardized metrics.
PCA Calculation:
Perform principal component analysis for dimensionality reduction.
For more advanced applications, see the tutotrial.
Data Dictionary
Name | Description | Type | Example |
---|---|---|---|
| Stock ticker symbol. | TEXT | "TSLA" |
| Record date. | DATE | 2023-09-30 |
| LightGBM Boosting Model prediction. | FLOAT | 1.46636 |
| CNN Model prediction for bankruptcies | FLOAT | 0.135975 |
| Rocket Model prediction for time series classification | FLOAT | 0.02514 |
| LightGBM and CNN autoencoders Model prediction. | FLOAT | 0.587817 |
| Prediction using accounting data only. | FLOAT | 1.26148 |
| Average probability across models. | FLOAT | 0.553823 |
| Fundamental prediction adjusted for market predictions. | FLOAT | -0.20488 |
| Variability of model predictions. | FLOAT | 0.62934 |
| Coefficient for model prediction calibration. | FLOAT | 1.951868 |
| Model/data record version. | INT | 20240201 |
When sans_market
is positive, it means that the fundamentals show a larger predicted bankruptcy than what the market predicts (stock might go down in medium term) , when sans_market
is negative, the market might have overreacted, and predict a larger probability of bankruptcy than what the fundamentals suggest (stock might go up in medium term).
Use Cases
Bankruptcy Prediction Analysis: Offer insights into predicted corporate bankruptcies and identify key factors, clarifying main drivers across different cycles.
Variable Impact Breakdown: Analyze how each individual variable affects bankruptcy predictions, providing in-depth feature contribution insights.
Temporal Feature Distribution Analysis: Reveal how variables contribute to predictions over time, emphasizing key features in forecasting models.
Correlation Discovery: Identify stocks with similar bankruptcy probability trends, revealing correlated market behaviors.
Probability Shift Overview: Showcase changes in bankruptcy probabilities among correlated stocks, providing a comprehensive market perspective.
Sentiment Inversion Analysis: Convert bankruptcy predictions into positive sentiment indicators to gauge potential impacts on stock returns.
Behavioral Similarity Mapping: Locate stocks with similar behaviors to a selected reference, based on bankruptcy trends and PCA feature analysis.
Last updated