← Registry

Case File

Log:route-optimizer·

Personality-Based Place Recommendation & Routing System

Survey data linked personality dimensions to place labels; classifiers were compared on the same feature matrix.

Domain
Recommendation Systems
Focus
Model comparison, data analysis
Stack
Python, Scikit-learn, Pandas
Status
Completed
Year
2025

Case cover

Feature importance (RIASEC and demographic predictors)

Feature importance (RIASEC and demographic predictors)

Problem

Preferences are mostly implicit; inference had to come from surveys and labels rather than direct scores. Generalization of competing classifiers was unknown upfront.

Approach

Fixed the feature matrix and used a reproducible train/hold-out split. Metrics: accuracy, F1, confusion matrix; weighted F1 under class imbalance.

Technical Structure

Data preparation

Pulled personality and place-preference fields from surveys; dropped incomplete or inconsistent rows.

Feature logic

Big-Five-like dimensions numerically encoded; venue types one-hot or ordinal encoded.

Model comparison

Logistic Regression, Random Forest, Gradient Boosting on the same split; small grid search only.

Evaluation metrics

Accuracy, precision, recall, macro/micro F1; default decision threshold 0.5.

Outcome

Selected the top F1 model; feature set and encoding scheme documented for reruns.

Model comparison

MetricScenarioDecision TreesRandom ForestBoosting Methods
Decision TreesMulti Output ClassifierMulti Output RegressorClassifier ChainXGBoostLightGBM
Mikro F-1Standart Model0.62700.67690.67340.64100.65700.6772
Demography0.65990.72470.72290.69380.70860.6807
Hyperparameter-0.73110.7288-0.71490.7287
Makro F-1Standart Model0.58040.58230.62270.57720.60170.6285
Demography0.62050.64920.65640.63500.65450.6297
Hyperparameter-0.65290.6495-0.65430.6792
Hamming LossStandart Model0.28120.22110.23260.24020.24540.2297
Demography0.25380.18950.19680.20460.20920.2237
Hyperparameter-0.18610.1887-0.19940.1956

Related records