Price Breakout
A dataset with daily updated predictions of price breaking upwards for US Equities.
Daily predictions arrive between 11 pm - 4 am before market open in the US for 13,000+ stocks.
Tutorials
are the best documentation — Price Breakout Prediction Tutorial
Input Datasets
Historical Stock Prices, Trading Volumes, Technical Indicators, Order Book.
Models Used
Classification Algorithms, Regression Models, Conformal Predictors
Model Outputs
Price Movement Predictions, Probability Scores, Confidence Intervals
Description
This datasets identifies potential price breakout stocks over the next 30-60 days for US Equities. This dataset provides daily predictions of upward price breakouts for over 13,000 US equities.
The accuracy is around 65% and ROC-AUC of 68%, it is one of the most accurate breakout models on the market. It is retrained on a weekly basis.
Several machine learning models are trained using the prepared dataset:
Calibrated Classifier: A classification model trained on the engineered features to predict the binary target.
Proprietory Regressor: A proprietory regression model is used to predict the probability of a price increase.
Conformal Regressor: Used to provide calibrated confidence intervals around the predictions, offering an additional measure of uncertainty.
Data Access
Retrieving Data
Latest Data
Full history
Specific Ticker
Plots
Line Predictions
Breakout Predictions
Visualize breakout predictions using the SDK's plotting capabilities:
Prediction Accuracy
Assess the accuracy of breakout predictions:
Data Dictionary
Column | Description | Type | Example |
---|---|---|---|
| Stock ticker symbol. | object | "AAPL" |
| Date when the data was recorded. | datetime64[ns] | 2023-09-30 |
| Target variable for predictions. | float64 | 0.05 |
| Future returns of the stock. | float32 | 0.10 |
| Predicted probability from the model. | float64 | 1.25 |
| Lower bound of the prediction interval. | float64 | 1.20 |
| Upper bound of the prediction interval. | float64 | 1.30 |
| Standard deviation of the predictions. | float64 | 0.02 |
| Lower bound of the conformal prediction interval. | float64 | 1.18 |
| Upper bound of the conformal prediction interval. | float64 | 1.32 |
| Slope derived from the rolling regression of predictions over a window. | float64 | 0.003 |
Use Case
Understood. I'll focus on the use cases that would be most relevant to professional investors. Here's the refined list:
• Portfolio optimization:
Identify potential new additions to diversified stock portfolios
Rebalance existing holdings based on breakout predictions
• Risk management:
Use confidence intervals and standard deviations to assess potential downside risk
Implement more precise hedging strategies based on predicted price movements
• Sector and market analysis:
Identify trends across industry sectors or the broader market
Compare breakout potentials across different stock categories (e.g., large-cap vs. small-cap)
• Market timing:
Use aggregate predictions across multiple stocks to gauge overall market sentiment
Time entry and exit points for broader market positions
Last updated