As an aviation enthusiast, I always look forward to flying whenever I plan to travel. Since the Covid-19 pandemic ended, aviation market began to recover slowly from rock bottom ticket prices in peak of pandemic. With the war in Ukraine and rise in ATF prices, ticket prices have been through the roof reaching new high. As a result of this, I decided to do an exploratory data analysis for ticket prices to better understand the factors affecting them in India. I aim to answer questions like No. of flights available across India, Ticket availability across different class, Price range across different class, etc. Check out the project here
Data is gathered from the Kaggle and considered as secondary data. The dataset includes details on the ticket booking alternatives available through the website "Easemytrip" for flights between India's top 6 metro areas. The cleaned dataset contains 11 characteristics and 300261 datapoints. Data was gathered in two stages: for business class tickets and for economy class tickets. The site provided a total of 300261 unique flight booking possibilities. 50 days of data were gathered, from February 11 to March 31 of 2022. Dataset Link: Kaggle
The various features of the cleaned dataset are explained below:
- Airline: The airline column contains the name of the airline firm. There are six different airlines, making it a category trait.
- Flight: The flight code of the aircraft is stored in flight.
- Source City: City where the flight departs from is a classification feature with 6 distinctive cities.
- Departure Time: This is a categorical feature that was deduced from time periods being divided into bins. It has six different time labels and stores information about the departure time.
- Stops: A category feature that holds the number of stops between the source and destination cities and has 3 different values.
- Arrival Time: This derived categorical feature was developed by binning time intervals. It maintains information regarding the arrival time and has six different time labels.
- Destination City: The location of the aircraft's landing. It is a classification feature with 6 distinctive cities.
- Class: A permanent feature that shows the total number of hours needed to travel between cities.
- Duration: A permanent feature that shows the total number of hours needed to travel between cities. 10)Days Left: The trip date is subtracted from the booking date to arrive at this derived feature.
- Price: Information about the ticket price is stored in the target variable.
Click on Power BI Dashboard to view the dashboard
Using Power BI Dashboard one can quickly get to know ticket prices between different cities flying with different airlines in different classes along with flight duration
- 'Air Asia' offers the cheapest flight tickets while flying Economy class while 'Air India' is cheapest while flying Business class.
- Booking tickets 3-7 weeks before travel will be cheaper than buying them within 3 weeks of travel as prices rise rapidly in 2-20 days period. Tickets can be cheap when bought just 1 day before however they might not be as cheap as when bought more than 3 weeks before
- Ticket price grow linearly with duration of flight peaking when duration of flight reaches 20 hours. However due to some outliers they again fall for for flights with duration of more than 20 hours. Relation can be approximated by 2nd degree curve
- Flight departing late at night and arriving early morning or late at night are cheapest.
- Flight prices increase with increase in number of stops.
- Delhi offers the cheapest flights while Hyderabad is most expensive city to fly to