An easy-to-use Snowflake-based text clustering or LLM tool/framework
Updated Jul 28, 2024 - Python
Data-Driven Sentiment Insight into Twitter (X) Trends | Kafka | Spark | Spark MLlib | Docker
Created a SparkML RandomForest model to predict total employee compensation. Queried data with SparkSQL and ran PySpark scripts to perform EDA, pre-process the data, and train the model, achieving an R² score of 0.98.
User, Event, and Predictive Metric Dashboard on 2GB/month of log files from Brackets IDE
Work-in-progress NBA game predictor using Spark
Solving Kaggle Titanic with PySpark libraries
Scala library for extracting useful information from a trained Spark model (DecisionTreeClassificationModel)
Recommendation engine in Java, based on the ALS algorithm (Apache Spark). Trains a new model after N seconds.
EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.
Intra-course homework and final homework for a Big Data Engineering course. Includes KPMG Hackathon 'University Trends' documentation.
A Dianping-style personalized recommendation system built with SpringBoot, ElasticSearch, the ELK components, and Spark ML Lib (not
Spark MLlib ALS (written in Scala and Java) used in a commodity recommendation system
Introduction to Apache Spark.
Big Data Project - SSML - Spark Streaming for Machine Learning
Used SparkML and Scikit-Learn to train several machine learning models for distinguishing fraudulent from legitimate transactions. The models then make predictions on Kafka-generated real-time data streams. Built an interface for displaying these predictions in real time using the Streamlit framework.
Introductory Big Data concepts using the Spark framework and various libraries
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.