This document provides a detailed review of the test results from the latest model evaluation. The results are presented through tables and visualized with graphs to aid in the interpretation of the model's performance.
The table below summarizes the key performance metrics across different datasets:
The confusion matrix for the Sentiment Analysis dataset. It illustrates the distribution of true positives, true negatives, false positives, and false negatives.
The metric values per epochs graph to analyze best epoch value
The model demonstrates strong performance across all datasets, with particularly high accuracy and F1-scores in the Sentiment Analysis dataset. The visualizations indicate consistent improvement over time, suggesting effective tuning and optimization.