Jan Westerdiep, Rahiel Kasim and Jaro Camphuijsen
How we won/lost the data mining competition
from rankpy.models import LambdaMART
.
.
.
model = LambdaMART(
metric='nDCG@38', max_leaf_nodes=7, shrinkage=0.1,
estopping=10, n_jobs=-1, min_samples_leaf=50,
random_state=42
)
We convert the timestamp, ,
to four additional features: month, week, day and hour:
, , , .
[1] (5th place) X. Liu, B. Xu, Y. Zhang, Q. Yan, L. Pang, Q. Li, H. Sun, and B. Wang. Combination of Diverse Ranking Models for Personalized Expedia Hotel Searches.
For all searches, group all ratings per hotel and take the average/median/std.
Do this for the $$ \verb|prop_starrating|, \verb|prop_review_score|, \verb|prop_location_score1|, \text{and } \verb|prop_location_score2| $$
Final score: