BITCOIN ANALYSIS, VISUALIZATION, FORECASTING, AND PREDICTION WITH PYTHON GUI
Author | : Vivian Siahaan |
Publisher | : BALIGE PUBLISHING |
Total Pages | : 278 |
Release | : 2022-06-04 |
Genre | : Business & Economics |
ISBN | : |
Bitcoin is a digital currency created in January 2009. It follows the ideas set out in a whitepaper by the mysterious and pseudonymous Satoshi Nakamoto.1 The identity of the person or persons who created the technology is still a mystery. Bitcoin offers the promise of lower transaction fees than traditional online payment mechanisms and, unlike government-issued currencies, it is operated by a decentralized authority. This dataset provides the history of daily prices of Bitcoin. The data starts from 17-Sep-2014 and is updated till 09-July-2021. It contains 2747 rows and 7 columns. The columns in the dataset are Date, Open, High, Low, Close, Adj Close, and Volume. In this project, you will involve technical indicators such as daily returns, Moving Average Convergence-Divergence (MACD), Relative Strength Index (RSI), Simple Moving Average (SMA), lower and upper bands, and standard deviation. To perform forecasting based on regression on Adj Close price of Bitcoin, you will use: Linear Regression, Random Forest regression, Decision Tree regression, Support Vector Machine regression, Naïve Bayes regression, K-Nearest Neighbor regression, Adaboost regression, Gradient Boosting regression, Extreme Gradient Boosting regression, Light Gradient Boosting regression, Catboost regression, MLP regression, Lasso regression, and Ridge regression. The machine learning models used predict Bitcoin daily returns as target variable are K-Nearest Neighbor classifier, Random Forest classifier, Naive Bayes classifier, Logistic Regression classifier, Decision Tree classifier, Support Vector Machine classifier, LGBM classifier, Gradient Boosting classifier, XGB classifier, MLP classifier, and Extra Trees classifier. Finally, you will develop GUI to plot boundary decision, distribution of features, feature importance, predicted values versus true values, confusion matrix, learning curve, performance of the model, and scalability of the model.