The Study Habit Derivatives project aims to analyze student survey data to understand their study habits and predict potential professions based on their responses. By using NLP techniques, the project identifies patterns in study habits and suggests suitable careers.
Upload your survey data CSV file and run the provided Jupyter notebook. The notebook processes the data and predicts potential professions based on the study habits of the students.
- pandas: For data manipulation and analysis.
- transformers: For natural language processing tasks using pre-trained models.
- google.colab: For file upload functionality in Google Colab.
- StringIO: For handling in-memory file operations.
The code performs the following key functions:
- Import Libraries: Imports necessary libraries for data manipulation, NLP, file uploads, and handling in-memory file operations.
- Upload and Load Data: Handles file upload, decoding, and loading the CSV data into a pandas DataFrame.
- Initialize the NLP Model: Initializes a masked language model using the
transformers
library and demonstrates its functionality. - Data Preparation: Renames the columns for easier handling and selects the relevant columns for analysis.
- Generate Sentences for Prediction: Creates sentences incorporating study habits, ready for masked language model prediction.
- Predict Professions: Defines a function to apply the masked language model and predict potential professions, storing the results in a new column.
This analysis opens up several possibilities for applications:
- Career Guidance: Predicting suitable professions based on study habits can help in personalized career guidance.
- Educational Insights: Understanding study habits can lead to improved educational strategies and resource allocation.
- Personal Development: Students can gain insights into their strengths and potential career paths, aiding in personal development.