Research Overview
Learn about the research topic, objectives, and motivation behind improving the Apriori algorithm.
This repository contains the research, code, documentation, and presentation materials for COS-781: Data Mining, focusing on improved algorithms for frequent itemset mining. The project includes complete implementations of:
The implementation includes comprehensive benchmarking, visualization, and comparison capabilities to analyze algorithm performance on real-world datasets.
Research Overview
Learn about the research topic, objectives, and motivation behind improving the Apriori algorithm.
Algorithm Documentation
Detailed documentation of the Apriori algorithm, FP-Growth algorithm, and the proposed improvements.
Exploratory Data Analysis
Comprehensive data exploration including dataset inspection, visualizations, and insights for frequent itemset mining.
Data Preprocessing
Data preprocessing pipeline covering missing data handling, feature engineering, and filtering strategies.
Implementation
Code documentation and implementation details for the improved algorithm.
Presentation
Access slides and presentation materials for the final presentation.
This monorepo contains: