Have you ever wondered when the best time of year to book a hotel room is? Or the optimal length of stay to get the best daily rate? What if you wanted to predict whether or not a hotel was likely to receive a disproportionately high number of special requests? This hotel booking dataset can help you explore those questions!
This data set contains booking information for a city hotel and a resort hotel and includes information such as when the booking was made, length of stay, the number of adults, children, and/or babies, and the number of available parking spaces, among other things. All personally identifying information has been removed from the data.
- Hotel: Type of hotel (Resort Hotel or City Hotel)
- Is_canceled: Whether the booking was canceled (0 for Not canceled, 1 for Canceled)
- Lead_time: Number of days between the date of booking and arrival date
- Arrival_date_year: Year of arrival from 2015 to 2017
- Arrival_date_month: Month of arrival from Jan to Dec
- Arrival_date_week_number: Week number of year from 1 to 53
- Arrival_date_day_of_month: Day of arrival date
- Stays_in_weekend_nights: Number of Saturday/Sunday nights stayed or booked to stay at the hotel
- Stays_in_week_nights: Number of Monday to Friday nights stayed or booked to stay at the hotel
- And more...
The dataset has 119,390 rows and 32 columns.
- Removed columns "company" and "agent" due to considerable missing values.
- Filled null values in the "children" column with the rounded median value.
- Removed rows containing missing country values.
- Converted the "children" column from float to integer.
- Analysis of bookings in different types of hotels
- Annual number and percentage of arriving visitors
- Booking distribution for each year separated by hotel type (not canceled)
- Length of stay in hotels
- Dominating meal bookings
- Months with the highest and lowest number of bookings
- Correlation matrix for important numerical features
- Relation between Average Daily Rate (ADR) and number of people
- Top booking countries
- Time-based count of bookings
- Car parking space requirements
- Target sectors for advertisement
- Months with cheaper booking rates
- City hotels have a higher number of bookings.
- Bookings peaked in 2016 but declined in 2017.
- About 1/3 of bookings were canceled.
- The countries "PRI", "GBR", and "FRA" had the highest number of bookings.
- ADR tends to increase with the increase in the number of people.
- November to January has cheaper booking rates. The majority of distribution channels and market segments involve travel agencies, suggesting a focus on targeting them.
The EDA provides valuable insights for understanding booking trends and informing decision-making processes in the hospitality industry. Further analysis and modeling can be conducted based on these findings to optimize hotel management strategies.