Welcome to BnBFacts! I started this blog in 2020 as a side-project to explore the short-term rental industry using my 10+ years of experience in the field of data science and analytics.
Since then, I’ve done several in-depth analyses using Python and R including:
- Uncovering factors which drive the occupancy rate using a gradient boost classifier model and interpreting the results using Shapley values
- Determining how hosts price their listings using a lasso regression
- How guests define value using logistic regression
- How to write a listing description using natural language processing (NLP) and network visualizations
- Most common reasons for bad reviews using NLP, sentiment analysis, and topic modelling.
- Creating a segmentation for Airbnb hosts using k-means clustering
- Understanding how guests score the overall rating as a function of other categories ratings using relative importance.
My goal for this blog is to develop data-driven insights for Airbnb hosts. All the code used in this project can be found on my GitHub repo.
If you’d like to suggest a topic or contact me for any reason, drop me a line at [email protected]