Risk Assessment System
In the project, we aimed to tackle the issue of early suicide prediction using data science and machine learning techniques. The project made use of the SuicideWatch dataset from Kaggle, which provided a large volume of relevant data for this problem. To achieve the highest possible accuracy in our predictions, we adopted a hybrid approach that combined deep learning models with a lexicon-based technique.
The deep learning models were trained on the dataset, and the lexicon-based approach was used to supplement the predictions made by the models. The combination of these two techniques allowed us to achieve a higher accuracy compared to if we had used only traditional machine learning algorithms.
Additionally, we designed a user interface that could be used to provide support to individuals who are potentially at risk of suicide. This interface allowed for easy and effective communication between the users and a support network, and it was integrated into the overall solution.
Interface
Parametrics of models
Results
- Evaluation Metrics: Performance of Bi-RNN and LSTM models can be assessed using metrics like accuracy, F1 score, precision, recall, and training time. Training Time: Bi-RNN model demonstrates faster training compared to LSTM, as indicated by the training time graph.
- Accuracy and F1 Score: Bi-RNN slightly outperforms LSTM in terms of accuracy and F1 score, with a preference towards precision for Bi-RNN and recall for LSTM. Overall, Bi-RNN performs slightly better on this dataset, but model choice should align with the specific task and data.
In conclusion, our project demonstrated the potential of data science and machine learning techniques to address important social issues like early suicide prediction. The combination of deep learning models and lexicon-based approach allowed us to achieve improved accuracy and the user interface provided a means to support those in need.