Book Reco

Online Bookstore with a Hybrid Recommendation Systems

An online bookstore featuring a hybrid recommendation system. The platform leverages content-based and collaborative filtering algorithms to provide personalized book recommendations based on user preferences, purchase history, or ratings.

Laravel

Vue.js

Inertia.js

Tailwind CSS

FastAPI

PHP

Python

MySQL

Redis

Docker

Midtrans

Machine Learning

Content-Based Filtering

Collaborative Filtering

SVD Matrix Factorization

Visit

Book Reco

Deep Dives

Covering technical aspects, such as architecture, design decisions, and implementation details.

Case Studies

Information Overload

The overwhelming abundance of book data (retrieved from the Gramedia API scraping) makes it difficult for users to find relevant books.

The Need for Personalization

Modern e-commerce requires a engine capable of understanding the unique preferences of each individual.

Technical Challenges

Building a system that remains lightweight on the web front-end while intelligently handling heavy matrix computation on the back-end.

The system is intended for research and demonstration of recommendation techniques and includes tools for scraping, preprocessing, training, and serving recommendations.

System Architecture

Main Application (Laravel 12)

Handles transactions, book data management, payment gateway (Midtrans) integration, and Admin panel.

Model Service (FastAPI)

A dedicated Python engine for Data Science processes (Training & Tuning).

Containerization

The entire ecosystem is wrapped in Docker containers to ensure scalability and ease of deployment.

Recommendation System Methods

Using a Hybrid Recommendation System approach.

Content-Based Filtering

Utilizes book metadata (title, author) processed with TF-IDF Vectorization and Cosine Similarity to identify product similarities.

Collaborative Filtering (Model-Based)

Implementing Singular Value Decomposition (SVD) to extract latent factors from user-item interactions (ratings and transaction quantities).(Diedit)Pulihkan teks asli

Data Science Flow (Pipeline)

Starting with the data acquisition process (scraping), followed by data preprocessing (data cleaning), the model training and tuning phase (Grid Search), and finally the serving phase via a microservice API integrated with the e-commerce website.

1. Data Acquisition

Scraping data from Gramedia's public API.

2. Preprocessing

Cleaning book metadata data in Laravel.

3. Model Training

Sending the dataset to the microservice via the API.

4. Evaluation

Calculating RMSE and MAE metrics to ensure model accuracy before deploying to production.

5. Serving

Recommendation results are displayed on the frontend using Inertia.js (Vue 3).

The system can be configured to automatically train models at specific intervals whenever book, transaction, and rating data changes, ensuring users receive recommendations based on the most recent interaction patterns.

Dataset/Materials and Visualization

1. Data Source

Dataset collected through legal web scraping techniques on Gramedia's public API, resulting in thousands of book metadata items including title, author, cover image, ISBN, price, and category.

2. Materials

User interaction dataset (dummy/transaction simulation) including user ID, book ID, rating score, and purchase amount.

3. Visualization

Error Analysis: Visualization of the decrease in RMSE and MAE during the hyperparameter tuning process (Grid Search).

Top-N Recommendations: Visual representation of the most frequently recommended books compared to best-sellers.

Analysis Results & Insights

1. Model Evaluation & Baseline

Based on test results on simulated (dummy) data, the SVD model produced a relatively high RMSE value. This was identified as a result of the high level of sparsity (rare data) and random rating distribution in the dummy data, which does not fully reflect complex (logical/patterned) human behavior patterns.

2. Pipeline Stability

The key insight is not the absolute RMSE figure, but rather the stability of the pipeline. The system has proven capable of automating the end-to-end process: from transaction aggregation in Laravel, sending data to FastAPI, to consistent prediction calculations despite high data variance.

3. Cold-Start Effectiveness

The popularity-fallback strategy ensures a high retention rate for new users even when preference data is not yet available.

4. Strategy Comparison

Analysis shows that under conditions of limited/random data, the Content-Based Filtering approach provides more qualitatively stable results (title relevance), while Collaborative Filtering acts as a user-relationship discovery engine that becomes more precise as the density of real transaction data increases.

Potentials

1. Application in Local/Independent Bookstores

Helping small bookstores digitize their inventory and have a recommendation system as sophisticated as large marketplaces (Amazon/Gramedia) with efficient infrastructure costs (due to the use of microservices).

2. Campus Digital Library

Increasing student interest in reading by recommending books based on the borrowing history of similar students in the same major.

3. Personalized Marketing Tool

This system can be integrated with Email Marketing or Push Notifications to offer books that customers "truly desire," thereby increasing the Conversion Rate (CVR) for business partners.

4. System Scalability

The FastAPI-based architecture allows this system to be developed into a SaaS (Software as a Service) platform that can be used by multiple book e-commerce partners simultaneously.