Solution of team baseline_business at MTS Machine Learning Cup 2023 competition.

vorobeevich/mts-ml-cup

mts-ml-cup

Public & Private scores: 1.64 (~ top 25% of leaderboard).

Brief report:

The starting point was the baseline notebook from the competition organizers (https://www.kaggle.com/code/dmitriy89/mts-ml-cup-2023-baseline/notebook). The baseline builds ALS embeddings over user_id and url_host from the sum of request_cnt; we built the same kind of embeddings for the price feature as well. In addition to the sum, we added mean, minimum and maximum aggregations. The embedding size was increased from 50 to 64, and the number of iterations of the matrix factorization algorithm from 30 to 32. The embeddings for the different features were concatenated. Together, these changes raised the score from 1.48 to 1.54.
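The data preparation around the ALS step can be sketched as follows. This is a minimal, standard-library-only illustration, not the repository's code: the helper names `interaction_values` and `concat_embeddings`, the toy rows, and the two-dimensional stand-in vectors are all assumptions. The ALS factorization itself is left out; in a real pipeline each aggregated matrix would be factorized and the resulting per-user vectors passed to the concatenation step.

```python
from collections import defaultdict
from statistics import mean

# The four aggregations named in the report.
AGGS = {"sum": sum, "mean": mean, "min": min, "max": max}

def interaction_values(rows, value_col):
    """Aggregate value_col per (user_id, url_host) pair, once per aggregation."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[(row["user_id"], row["url_host"])].append(row[value_col])
    return {
        name: {pair: fn(vals) for pair, vals in grouped.items()}
        for name, fn in AGGS.items()
    }

def concat_embeddings(embedding_tables):
    """Concatenate each user's vectors from several factorizations, in order."""
    users = sorted(set.intersection(*(set(t) for t in embedding_tables)))
    return {u: [x for t in embedding_tables for x in t[u]] for u in users}

# Toy rows (hypothetical data, not taken from the competition dataset).
rows = [
    {"user_id": 1, "url_host": "a.com", "request_cnt": 2, "price": 100.0},
    {"user_id": 1, "url_host": "a.com", "request_cnt": 4, "price": 300.0},
    {"user_id": 2, "url_host": "b.com", "request_cnt": 1, "price": 50.0},
]

# One sparse user-by-host matrix per (feature, aggregation) combination.
matrices = {
    (col, agg): values
    for col in ("request_cnt", "price")
    for agg, values in interaction_values(rows, col).items()
}

# Stand-in vectors: in practice these would be the ALS user factors
# (size 64 in the report) obtained from two of the matrices above.
emb_a = {1: [0.1, 0.2], 2: [0.3, 0.4]}
emb_b = {1: [0.5, 0.6], 2: [0.7, 0.8]}
user_features = concat_embeddings([emb_a, emb_b])
```

With 64-dimensional factors, concatenating the vectors from several matrices gives a feature block whose width is 64 times the number of matrices used.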

Next, we added the sum of request_cnt for each of the top_k most popular url_hosts, one feature per host. We tried top_k = 256, 512 and 1024, and the score improved steadily as top_k grew.
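A minimal sketch of the top_k features, assuming that popularity is measured by total request_cnt over all users (the report does not say how popularity was defined) and that each feature is computed per user. The function name `top_k_host_features` and the toy rows are hypothetical.

```python
from collections import Counter

def top_k_host_features(rows, top_k):
    """Return the top_k hosts and, per user, the request_cnt sum for each of them."""
    # Assumption: a host's popularity is its total request_cnt over all users.
    host_totals = Counter()
    for row in rows:
        host_totals[row["url_host"]] += row["request_cnt"]
    top_hosts = [host for host, _ in host_totals.most_common(top_k)]
    column = {host: i for i, host in enumerate(top_hosts)}

    # Every user gets a vector, even if they never visited a top host.
    features = {row["user_id"]: [0] * len(top_hosts) for row in rows}
    for row in rows:
        idx = column.get(row["url_host"])
        if idx is not None:
            features[row["user_id"]][idx] += row["request_cnt"]
    return top_hosts, features

# Toy rows (hypothetical data).
rows = [
    {"user_id": 1, "url_host": "a.com", "request_cnt": 2},
    {"user_id": 1, "url_host": "b.com", "request_cnt": 1},
    {"user_id": 2, "url_host": "a.com", "request_cnt": 5},
    {"user_id": 2, "url_host": "c.com", "request_cnt": 1},
    {"user_id": 3, "url_host": "c.com", "request_cnt": 1},
]
top_hosts, features = top_k_host_features(rows, top_k=2)
```

Here `b.com` falls outside the top 2, so user 1's request to it is ignored and the feature vectors have one column each for `a.com` and `c.com`.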

All categorical features (date, region_name, city_name, cpe_manufacturer_name, cpe_model_name, cpe_type_cd, cpe_model_os_type, part_of_day) were also converted to numbers with label encoding. The encoded values were then aggregated with sum, mean, minimum, maximum, median, mode and standard deviation.
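The encoding and aggregation step might look like the sketch below, shown for a single column. It assumes the aggregation is done per user and uses the population standard deviation so that users with a single row get 0 rather than an undefined value; the report does not specify either detail. The names `label_encode` and `aggregate_per_user` and the sample values are hypothetical.

```python
from collections import defaultdict
from statistics import mean, median, mode, pstdev

def label_encode(values):
    """Map each distinct value to an integer code (sorted order)."""
    codes = {v: i for i, v in enumerate(sorted(set(values)))}
    return [codes[v] for v in values], codes

# The seven aggregations listed in the report; std here is the
# population standard deviation (an assumption).
AGGS = {
    "sum": sum, "mean": mean, "min": min, "max": max,
    "median": median, "mode": mode, "std": pstdev,
}

def aggregate_per_user(user_ids, codes):
    """Compute every aggregation of the encoded column for each user."""
    grouped = defaultdict(list)
    for user_id, code in zip(user_ids, codes):
        grouped[user_id].append(code)
    return {
        user_id: {name: fn(vals) for name, fn in AGGS.items()}
        for user_id, vals in grouped.items()
    }

# Toy column (hypothetical values for part_of_day).
user_ids = [1, 1, 1, 2]
part_of_day = ["morning", "evening", "morning", "night"]
encoded, mapping = label_encode(part_of_day)
features = aggregate_per_user(user_ids, encoded)
```

Repeating this for all eight categorical columns yields 8 * 7 = 56 features per user.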

Final number of features: 64 * 10 (als) + 1024 (top_urls) + 8 * 7 (categorical features) = 1720.
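The count above can be checked with a few lines of arithmetic. The constant names are illustrative only; the numbers are the ones stated in the report.

```python
ALS_DIM = 64          # embedding size
N_ALS_BLOCKS = 10     # number of concatenated ALS embeddings, as stated
TOP_URLS = 1024       # one feature per top url_host
N_CATEGORICAL = 8     # label-encoded categorical columns
N_AGGS = 7            # sum, mean, min, max, median, mode, std

als_features = ALS_DIM * N_ALS_BLOCKS
categorical_features = N_CATEGORICAL * N_AGGS
total = als_features + TOP_URLS + categorical_features
print(als_features, TOP_URLS, categorical_features, total)
```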

The final score was 1.64. Tuning the CatBoost hyperparameters with Optuna did not improve it.
