AIRLINE FARE PRICE PREDICTION

IRJET Journal
IRJET JournalFast Track Publications

https://www.irjet.net/archives/V9/i5/IRJET-V9I5571.pdf

International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2448
AIRLINE FARE PRICE PREDICTION
Kartik Rathi1, Anubhav Kumar2, Manish Yadav3
1,2,3 Students, Computer science and Engineering department, MIET Meerut,Uttar Pradesh,India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract The price of airline ticket changes frequently
nowadays and there's plenty of difference. Price change
keeps happening within few hours for the identical flight.
The shoppers want to induce themostcost-effectivepossible
price while the airline companies wanttheutmostprofitand
revenue possible. To unravel this problem, researchers
introduce different models to avoid wasting consumers
money- minimum price predicting model and models that
tell us an optimal time to shop for a ticket while airlines use
techniques like demand prediction and price discrimination
to maximize their revenue
1. INTRODUCTION
This project aims to develop an application which can
predict the flight prices for various flights using different
machine learning techniques. The user will get the expected
values and with its reference the user can plan to book their
tickets suitably. At this time, carrier ticket costs can shift
powerfully and fundamentally for an identical flight, in any
event, for accessible seats inside the identical cabin. Clients
are attempting to urge the foremost minimal cost while
Airlines companies try to stay their general income as high
as could reasonably be expected and boost their benefit.
Airlines utilize different computational methods to extend
their income, as an example, demand forecast and value
segregation. The proposed system can help save
immeasurable rupees of shoppers by proving them the
knowledge to book tickets at thecorrecttime.Parameterson
which fares are calculated-
Now we can perform Exploratory Data Analysis onthegiven
information. we'll discover correlation between the
highlights. At that time a Machine Learning model are made
to utilizing those highlights.
1.1 Proposed Methodology
For this project, we've implemented the machine learning
life cycle to make a basic web application which is able to
predict the flight fare by applying machine learning
algorithms on historical flight data using some python
libraries like Pandas, NumPy, Matplotlib, seaborn, and
sklearn. Below image shows the number of steps that we
followed from the life cycle.
Data selection is that the initial step where historical data of
flight is assembles for the model to predict prices. Our
dataset consists of quite 10,000+ records of information
associated with flights and costs. A number of thefeaturesof
the dataset are source, destination, departure date, point,
and number of stops, point in time, prices. Within the
exploratory data analysis step, we cleaned the dataset by
removing the duplicate values and null values. If the null
values aren't removed, the accuracy of the model are
affected. Next step is data pre-processing where we
observed that almost all of the information was present in
string format. Data from each feature is extracted i.e., day
and month is extracted from date of journey in integer
format, hours and minutes is extracted from time of
departure. Features like sourceanddestinationneededtobe
converted into values as they were of categorical type. For
this One hot-encoding and label encoding techniques are
wont to convert categorical data into the integer data.
Feature selection step is involved in selecting important
features that are more correlated to the value. So, some
features to be selected and passed to the group of models.
Random forest is an ensemblelearningmethodthatbasically
uses group of decision trees as group of models. Random
amount of knowledge is passed to decision trees and every
decision tree predicts values in line with the datasetgiven to
that. From the predictions made by the choice trees the
features like extra information and route which are
unnecessary features which can affect the accuracy of the
model and so, they have to be removed before getting our
model ready for prediction.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2449
After selecting the features which are more correlated to
cost the following step involves applying machine algorithm
and creating a model. As our dataset incorporates labeled
data, we've got to use supervised machine learning
algorithms also in supervised we are going to be using
regression algorithms as our dataset contains continuous
values within the features. Regression models are wont to
describe relationship between dependent and independent
variables. The machinelearningalgorithmsthatwearegoing
to be using in our project are:
1.2 Linear Regression
In simple linear regression there's only 1 independent and
one dependent feature but as our dataset consists of the
many independent features on which the worthmayrely on,
we are going to be using multiple linear regression which
estimates relationship between two or more independent
variables and one dependent variable. The multiple linear
regression models are represented by:
C
Where, y = the predicted value of the dependent variable
= the independent variables
m= independent variables coefficients
C= y-intercept x
1.3 Decision Tree
There are basically of two type’s Decision tree i.e.,
classification and regression tree where classification is
employed for categorical values and regression is employed
for continuous values. Decision tree chooses independent
variable from dataset as decision nodes for decision
making.It divides the entire dataset in several sub-section
and when test data is passed to the model the output is
determined by checking the section to which the info point
belongs to. And to whichever section the info point belongs
to the choice tree will give output because the average value
of all the information points within the sub-section.
1.4 Random Forest
Random Forest is an ensemble learning technique where
training model uses multiple learning algorithms then
combine individual results to urge a final predicted result.
Under ensemble learning random forest falls into bagging
category where random number of featuresandrecordswill
average value of the expected values if considered because
the output of the random forest model.
2. PERFORMANCE METRICS
Performance metrics are statistical models which is able to
be accustomed compare the accuracy of the machine
learning models trained by different algorithms. The
sklearn.metrics module are accustomed implement the
functions to live the errors from each model using the
regression metrics. Following metricsareaccustomedcheck
the error measure of every model.
2.1 MAE (Mean Absolute Error)
Mean Absolute Error is basically the sum of average of the
absolute difference between the expected and actual values.
MAE = 1/n [∑(y-ý)]
y = actual output values,
ý = predicted output values
n = Total number of data points
Lesser the value of MAE better the performance of your
model.
2.2 MSE (Mean Square Error)
Mean Square Error squares the difference of actual and
predicted output values before summing all rather than
using absolute value.
MSE = 1/n [∑ ( ]
y=actual output values
y’=predicted output values
n = Total number of data points
Lower the value of MSE better the performanceofthemodel.
2.2 RMSE (Root Mean Square Error)
RMSE is measured by taking the square root of the average
of the squared difference between the prediction and also
the actual value.
RMSE = √1/n [∑ ( ]
y=actual output values
y’=predicted output values
n = Total number of data points
RMSE is greater than MAE and lesser the value of RMSE
between different models the better the performanceofthat
model.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2450
2.3 (Coefficient of determination)
It helps you to grasp how well the independent variable
adjusted with the variance in your model.
= 1 − ∑ ( / ∑ (
The worth of R-square lies between 0 to 1. The closer its
value to at least one, the higher your model is when
comparing with other model values. There are different
cross-validation techniques like GridsearchCV and
RandomizedsearchCV which can be used for improving the
accuracy of the model.Parametersofthemodelslikenumber
of trees in random forest or max depth of decision tree will
be changed using this method which can help us in further
enhancement of the accuracy.
The last three steps of the life cycle model are involved
within the deployment of the trained machine learning
model. Therefore, after getting the model with the most
effective accuracy we store that model in a very file using
pickle module. The back-end of the applying are going to be
created using Flask Framework where API end-points such
and acquire and POST are going to be created to perform
operations associated with fetching and displaying data on
the front-end of the appliance. The front-end oftheapplying
are going to be created using the bootstrap framework
where user will have the functionality of enteringtheirflight
data. This data is sent to the back-end service where the
model will predict the output consistent with the provided
data. the anticipated value is distributedtothefront-endand
displayed.
3. RESULTS
We had used many algorithms for prepared our ML model
i.e., Linear Regression, Decision tree, Random Forest. From
all of these, Random Forest gives us the most accurate
predictions. Models' performance using a few metrics are
given below:
A) LINEAR REGRESSION:
R2 SCORE : 0.6220180540708748
MAE: 1866.1170014687825
MSE: 6974960.920111607
RMSE: 2641.0151306101234
B) DECISION TREE:
R2 SCORE : 0.74098856555086889
MAE : 1270.242692247699
MSE : 4779579.164036292
RMSE : 2186.224865844383
C) RANDOM FOREST:
R2 SCORE : 0.8598264560438941
MAE : 1055.5042333744152
MSE : 2586644.6841106885
RMSE : 1608.3049101804945
By observing all of the performancemetricswecanconclude
that Random Forest will give us accurate and better results.
So, we use Random Forest model for the deployment.
4. CONCLUSION
This project can result in saving money of inexperienced
people by providing them the information related to trends
of the flight prices and also give them a predicted value of
the price which they use to decide whether to book ticket
now or later. On working with different models, itwasfound
out that Random Forest algorithmgivesthehighestaccuracy
in predicting the output.
REFERENCES
[1] K. Tziridis T. Kalampokas G.Papakostas and K.
Diamantaras "Airfare price prediction using machine
learning techniques" in European Signal Processing
Conference (EUSIPCO), DOI: 10.23919/EUSIPCO
.2017.8081365L. Li Y. Chen and Z. Li” Yawning detection for
monitoring driver fatigue based on two cameras” Proc. 12th
Int. IEEE Conf. Intell. Transp. Syst. pp. 1-6 Oct. 2009.
[2] William Groves and Maria Gini "An agent for optimizing
airline ticket purchasing" in proceedings of the 2013
international conference on autonomous agents and multi-
agent systems.
[3] Supriya Rajankar, Neha sakhrakar and Omprakash
rajankar “Flight fare prediction using machine learning
algorithms” International journal of Engineering Research
and Technology (IJERT) June 2019.
[4] Tianyi wang, samira Pouyanfar,haimanTianandYudong
Tao "A Framework for airline price prediction: A machine
learning approach"
[5] T. Janssen "A linear quantile mixed regression model for
prediction of airline ticket prices"
[6] medium.com/analytics-vidhya/mae-mse-rmse-
coefficient-ofdetermination-adjusted-r-squared-which-
metric-is-bettercd0326a5697e article on performance
metrics
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2451
[7] www.keboola.com/blog/random-forest-regression
article on random forest
[8] https://towardsdatascience.com/machine-learning-
basics-decisiontree-regression-1d73ea003fda article on
decision tree regression
BIOGRAPHIES
KARTIK RATHI
MIET,MEERUT(STUDENT)
AKTU
ANUBHAV KUMAR
MIET,MEERUT(STUDENT)
AKTU
MANISH YADAV
MIET,MEERUT(STUDENT)
AKTU

Recomendados

14.05.12 Analysis and Prediction of Flight Prices using Historical Pricing Da... von
14.05.12 Analysis and Prediction of Flight Prices using Historical Pricing Da...14.05.12 Analysis and Prediction of Flight Prices using Historical Pricing Da...
14.05.12 Analysis and Prediction of Flight Prices using Historical Pricing Da...Swiss Big Data User Group
9.3K views22 Folien
Air Ticket Price Prediction.pdf von
Air Ticket Price Prediction.pdfAir Ticket Price Prediction.pdf
Air Ticket Price Prediction.pdfAdityaAryan45
478 views15 Folien
Prediction of House Sales Price von
Prediction of House Sales PricePrediction of House Sales Price
Prediction of House Sales PriceAnirvan Ghosh
6.7K views24 Folien
Feature selection von
Feature selectionFeature selection
Feature selectionDong Guo
14.1K views29 Folien
House Sale Price Prediction von
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Predictionsriram30691
9.1K views27 Folien
Machine Learning Clustering von
Machine Learning ClusteringMachine Learning Clustering
Machine Learning ClusteringRupak Roy
129 views16 Folien

Más contenido relacionado

Was ist angesagt?

HANDWRITTEN DIGIT RECOGNITION USING k-NN CLASSIFIER von
HANDWRITTEN DIGIT RECOGNITION USING k-NN CLASSIFIERHANDWRITTEN DIGIT RECOGNITION USING k-NN CLASSIFIER
HANDWRITTEN DIGIT RECOGNITION USING k-NN CLASSIFIERvineet raj
3K views10 Folien
Random forest von
Random forestRandom forest
Random forestMusa Hawamdah
53.3K views23 Folien
House price prediction von
House price predictionHouse price prediction
House price predictionKaranseth30
2.2K views16 Folien
Linear regression in machine learning von
Linear regression in machine learningLinear regression in machine learning
Linear regression in machine learningShajun Nisha
103 views25 Folien
Performance Metrics for Machine Learning Algorithms von
Performance Metrics for Machine Learning AlgorithmsPerformance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning AlgorithmsKush Kulshrestha
1.7K views35 Folien
Classification using back propagation algorithm von
Classification using back propagation algorithmClassification using back propagation algorithm
Classification using back propagation algorithmKIRAN R
1.9K views24 Folien

Was ist angesagt?(20)

HANDWRITTEN DIGIT RECOGNITION USING k-NN CLASSIFIER von vineet raj
HANDWRITTEN DIGIT RECOGNITION USING k-NN CLASSIFIERHANDWRITTEN DIGIT RECOGNITION USING k-NN CLASSIFIER
HANDWRITTEN DIGIT RECOGNITION USING k-NN CLASSIFIER
vineet raj3K views
House price prediction von Karanseth30
House price predictionHouse price prediction
House price prediction
Karanseth302.2K views
Linear regression in machine learning von Shajun Nisha
Linear regression in machine learningLinear regression in machine learning
Linear regression in machine learning
Shajun Nisha103 views
Performance Metrics for Machine Learning Algorithms von Kush Kulshrestha
Performance Metrics for Machine Learning AlgorithmsPerformance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning Algorithms
Kush Kulshrestha1.7K views
Classification using back propagation algorithm von KIRAN R
Classification using back propagation algorithmClassification using back propagation algorithm
Classification using back propagation algorithm
KIRAN R1.9K views
Predicting house price von Divya Tiwari
Predicting house pricePredicting house price
Predicting house price
Divya Tiwari11K views
Predicting house prices_Regression von Sruti Jain
Predicting house prices_RegressionPredicting house prices_Regression
Predicting house prices_Regression
Sruti Jain2.2K views
Analysis of crop yield prediction using data mining techniques von eSAT Journals
Analysis of crop yield prediction using data mining techniquesAnalysis of crop yield prediction using data mining techniques
Analysis of crop yield prediction using data mining techniques
eSAT Journals2.1K views
Handwritten Digit Recognition(Convolutional Neural Network) PPT von RishabhTyagi48
Handwritten Digit Recognition(Convolutional Neural Network) PPTHandwritten Digit Recognition(Convolutional Neural Network) PPT
Handwritten Digit Recognition(Convolutional Neural Network) PPT
RishabhTyagi4818.7K views
Image classification with Deep Neural Networks von Yogendra Tamang
Image classification with Deep Neural NetworksImage classification with Deep Neural Networks
Image classification with Deep Neural Networks
Yogendra Tamang7.5K views
Feature selection concepts and methods von Reza Ramezani
Feature selection concepts and methodsFeature selection concepts and methods
Feature selection concepts and methods
Reza Ramezani9K views
k medoid clustering.pptx von Roshan86572
k medoid clustering.pptxk medoid clustering.pptx
k medoid clustering.pptx
Roshan86572473 views
Breast cancer diagnosis machine learning ppt von AnkitGupta1476
Breast cancer diagnosis machine learning pptBreast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning ppt
AnkitGupta147618.8K views
House price ppt 18 bcs6588_md. tauhid alam von ArmanMalik66
House price ppt  18 bcs6588_md. tauhid alamHouse price ppt  18 bcs6588_md. tauhid alam
House price ppt 18 bcs6588_md. tauhid alam
ArmanMalik661.3K views
Machine Learning - Ensemble Methods von Andrew Ferlitsch
Machine Learning - Ensemble MethodsMachine Learning - Ensemble Methods
Machine Learning - Ensemble Methods
Andrew Ferlitsch2.2K views

Similar a AIRLINE FARE PRICE PREDICTION

IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach... von
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET Journal
81 views5 Folien
Airplane Crash Analysis And Prediction Using Machine Learning von
Airplane Crash Analysis And Prediction Using Machine LearningAirplane Crash Analysis And Prediction Using Machine Learning
Airplane Crash Analysis And Prediction Using Machine LearningMichele Thomas
5 views6 Folien
IRJET - Airplane Crash Analysis and Prediction using Machine Learning von
IRJET - Airplane Crash Analysis and Prediction using Machine LearningIRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET - Airplane Crash Analysis and Prediction using Machine LearningIRJET Journal
23 views6 Folien
Real Estate Investment Advising Using Machine Learning von
Real Estate Investment Advising Using Machine LearningReal Estate Investment Advising Using Machine Learning
Real Estate Investment Advising Using Machine LearningIRJET Journal
37 views5 Folien
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm von
Water Quality Index Calculation of River Ganga using Decision Tree AlgorithmWater Quality Index Calculation of River Ganga using Decision Tree Algorithm
Water Quality Index Calculation of River Ganga using Decision Tree AlgorithmIRJET Journal
12 views11 Folien
A Firefly based improved clustering algorithm von
A Firefly based improved clustering algorithmA Firefly based improved clustering algorithm
A Firefly based improved clustering algorithmIRJET Journal
45 views8 Folien

Similar a AIRLINE FARE PRICE PREDICTION(20)

IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach... von IRJET Journal
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET Journal81 views
Airplane Crash Analysis And Prediction Using Machine Learning von Michele Thomas
Airplane Crash Analysis And Prediction Using Machine LearningAirplane Crash Analysis And Prediction Using Machine Learning
Airplane Crash Analysis And Prediction Using Machine Learning
Michele Thomas5 views
IRJET - Airplane Crash Analysis and Prediction using Machine Learning von IRJET Journal
IRJET - Airplane Crash Analysis and Prediction using Machine LearningIRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET Journal23 views
Real Estate Investment Advising Using Machine Learning von IRJET Journal
Real Estate Investment Advising Using Machine LearningReal Estate Investment Advising Using Machine Learning
Real Estate Investment Advising Using Machine Learning
IRJET Journal37 views
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm von IRJET Journal
Water Quality Index Calculation of River Ganga using Decision Tree AlgorithmWater Quality Index Calculation of River Ganga using Decision Tree Algorithm
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
IRJET Journal12 views
A Firefly based improved clustering algorithm von IRJET Journal
A Firefly based improved clustering algorithmA Firefly based improved clustering algorithm
A Firefly based improved clustering algorithm
IRJET Journal45 views
Machine Learning, K-means Algorithm Implementation with R von IRJET Journal
Machine Learning, K-means Algorithm Implementation with RMachine Learning, K-means Algorithm Implementation with R
Machine Learning, K-means Algorithm Implementation with R
IRJET Journal3 views
IRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms von IRJET Journal
IRJET- Sentimental Analysis for Online Reviews using Machine Learning AlgorithmsIRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET Journal52 views
IRJET- Performance Evaluation of Various Classification Algorithms von IRJET Journal
IRJET- Performance Evaluation of Various Classification AlgorithmsIRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET Journal10 views
IRJET- Performance Evaluation of Various Classification Algorithms von IRJET Journal
IRJET- Performance Evaluation of Various Classification AlgorithmsIRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET Journal15 views
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES von IRJET Journal
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESCASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
IRJET Journal2 views
IRJET - Stock Market Prediction using Machine Learning Algorithm von IRJET Journal
IRJET - Stock Market Prediction using Machine Learning AlgorithmIRJET - Stock Market Prediction using Machine Learning Algorithm
IRJET - Stock Market Prediction using Machine Learning Algorithm
IRJET Journal76 views
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati... von IRJET Journal
IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET Journal8 views
IRJET - Smart Vet Locator for Hybrid Pets von IRJET Journal
IRJET -  	  Smart Vet Locator for Hybrid PetsIRJET -  	  Smart Vet Locator for Hybrid Pets
IRJET - Smart Vet Locator for Hybrid Pets
IRJET Journal16 views
AMAZON STOCK PRICE PREDICTION BY USING SMLT von IRJET Journal
AMAZON STOCK PRICE PREDICTION BY USING SMLTAMAZON STOCK PRICE PREDICTION BY USING SMLT
AMAZON STOCK PRICE PREDICTION BY USING SMLT
IRJET Journal10 views
House price prediction von SabahBegum
House price predictionHouse price prediction
House price prediction
SabahBegum2.1K views
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi... von IRJET Journal
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET Journal32 views
Analysis on Fraud Detection Mechanisms Using Machine Learning Techniques von IRJET Journal
Analysis on Fraud Detection Mechanisms Using Machine Learning TechniquesAnalysis on Fraud Detection Mechanisms Using Machine Learning Techniques
Analysis on Fraud Detection Mechanisms Using Machine Learning Techniques
IRJET Journal16 views
A02610104 von theijes
A02610104A02610104
A02610104
theijes203 views

Más de IRJET Journal

SOIL STABILIZATION USING WASTE FIBER MATERIAL von
SOIL STABILIZATION USING WASTE FIBER MATERIALSOIL STABILIZATION USING WASTE FIBER MATERIAL
SOIL STABILIZATION USING WASTE FIBER MATERIALIRJET Journal
19 views7 Folien
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles... von
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...IRJET Journal
8 views7 Folien
Identification, Discrimination and Classification of Cotton Crop by Using Mul... von
Identification, Discrimination and Classification of Cotton Crop by Using Mul...Identification, Discrimination and Classification of Cotton Crop by Using Mul...
Identification, Discrimination and Classification of Cotton Crop by Using Mul...IRJET Journal
7 views5 Folien
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula... von
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...IRJET Journal
13 views11 Folien
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR... von
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...IRJET Journal
14 views6 Folien
Performance Analysis of Aerodynamic Design for Wind Turbine Blade von
Performance Analysis of Aerodynamic Design for Wind Turbine BladePerformance Analysis of Aerodynamic Design for Wind Turbine Blade
Performance Analysis of Aerodynamic Design for Wind Turbine BladeIRJET Journal
7 views5 Folien

Más de IRJET Journal(20)

SOIL STABILIZATION USING WASTE FIBER MATERIAL von IRJET Journal
SOIL STABILIZATION USING WASTE FIBER MATERIALSOIL STABILIZATION USING WASTE FIBER MATERIAL
SOIL STABILIZATION USING WASTE FIBER MATERIAL
IRJET Journal19 views
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles... von IRJET Journal
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...
IRJET Journal8 views
Identification, Discrimination and Classification of Cotton Crop by Using Mul... von IRJET Journal
Identification, Discrimination and Classification of Cotton Crop by Using Mul...Identification, Discrimination and Classification of Cotton Crop by Using Mul...
Identification, Discrimination and Classification of Cotton Crop by Using Mul...
IRJET Journal7 views
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula... von IRJET Journal
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...
IRJET Journal13 views
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR... von IRJET Journal
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...
IRJET Journal14 views
Performance Analysis of Aerodynamic Design for Wind Turbine Blade von IRJET Journal
Performance Analysis of Aerodynamic Design for Wind Turbine BladePerformance Analysis of Aerodynamic Design for Wind Turbine Blade
Performance Analysis of Aerodynamic Design for Wind Turbine Blade
IRJET Journal7 views
Heart Failure Prediction using Different Machine Learning Techniques von IRJET Journal
Heart Failure Prediction using Different Machine Learning TechniquesHeart Failure Prediction using Different Machine Learning Techniques
Heart Failure Prediction using Different Machine Learning Techniques
IRJET Journal7 views
Experimental Investigation of Solar Hot Case Based on Photovoltaic Panel von IRJET Journal
Experimental Investigation of Solar Hot Case Based on Photovoltaic PanelExperimental Investigation of Solar Hot Case Based on Photovoltaic Panel
Experimental Investigation of Solar Hot Case Based on Photovoltaic Panel
IRJET Journal3 views
Metro Development and Pedestrian Concerns von IRJET Journal
Metro Development and Pedestrian ConcernsMetro Development and Pedestrian Concerns
Metro Development and Pedestrian Concerns
IRJET Journal2 views
Mapping the Crashworthiness Domains: Investigations Based on Scientometric An... von IRJET Journal
Mapping the Crashworthiness Domains: Investigations Based on Scientometric An...Mapping the Crashworthiness Domains: Investigations Based on Scientometric An...
Mapping the Crashworthiness Domains: Investigations Based on Scientometric An...
IRJET Journal3 views
Data Analytics and Artificial Intelligence in Healthcare Industry von IRJET Journal
Data Analytics and Artificial Intelligence in Healthcare IndustryData Analytics and Artificial Intelligence in Healthcare Industry
Data Analytics and Artificial Intelligence in Healthcare Industry
IRJET Journal3 views
DESIGN AND SIMULATION OF SOLAR BASED FAST CHARGING STATION FOR ELECTRIC VEHIC... von IRJET Journal
DESIGN AND SIMULATION OF SOLAR BASED FAST CHARGING STATION FOR ELECTRIC VEHIC...DESIGN AND SIMULATION OF SOLAR BASED FAST CHARGING STATION FOR ELECTRIC VEHIC...
DESIGN AND SIMULATION OF SOLAR BASED FAST CHARGING STATION FOR ELECTRIC VEHIC...
IRJET Journal36 views
Efficient Design for Multi-story Building Using Pre-Fabricated Steel Structur... von IRJET Journal
Efficient Design for Multi-story Building Using Pre-Fabricated Steel Structur...Efficient Design for Multi-story Building Using Pre-Fabricated Steel Structur...
Efficient Design for Multi-story Building Using Pre-Fabricated Steel Structur...
IRJET Journal10 views
Development of Effective Tomato Package for Post-Harvest Preservation von IRJET Journal
Development of Effective Tomato Package for Post-Harvest PreservationDevelopment of Effective Tomato Package for Post-Harvest Preservation
Development of Effective Tomato Package for Post-Harvest Preservation
IRJET Journal4 views
“DYNAMIC ANALYSIS OF GRAVITY RETAINING WALL WITH SOIL STRUCTURE INTERACTION” von IRJET Journal
“DYNAMIC ANALYSIS OF GRAVITY RETAINING WALL WITH SOIL STRUCTURE INTERACTION”“DYNAMIC ANALYSIS OF GRAVITY RETAINING WALL WITH SOIL STRUCTURE INTERACTION”
“DYNAMIC ANALYSIS OF GRAVITY RETAINING WALL WITH SOIL STRUCTURE INTERACTION”
IRJET Journal5 views
Understanding the Nature of Consciousness with AI von IRJET Journal
Understanding the Nature of Consciousness with AIUnderstanding the Nature of Consciousness with AI
Understanding the Nature of Consciousness with AI
IRJET Journal12 views
Augmented Reality App for Location based Exploration at JNTUK Kakinada von IRJET Journal
Augmented Reality App for Location based Exploration at JNTUK KakinadaAugmented Reality App for Location based Exploration at JNTUK Kakinada
Augmented Reality App for Location based Exploration at JNTUK Kakinada
IRJET Journal6 views
Smart Traffic Congestion Control System: Leveraging Machine Learning for Urba... von IRJET Journal
Smart Traffic Congestion Control System: Leveraging Machine Learning for Urba...Smart Traffic Congestion Control System: Leveraging Machine Learning for Urba...
Smart Traffic Congestion Control System: Leveraging Machine Learning for Urba...
IRJET Journal18 views
Enhancing Real Time Communication and Efficiency With Websocket von IRJET Journal
Enhancing Real Time Communication and Efficiency With WebsocketEnhancing Real Time Communication and Efficiency With Websocket
Enhancing Real Time Communication and Efficiency With Websocket
IRJET Journal4 views
Textile Industrial Wastewater Treatability Studies by Soil Aquifer Treatment ... von IRJET Journal
Textile Industrial Wastewater Treatability Studies by Soil Aquifer Treatment ...Textile Industrial Wastewater Treatability Studies by Soil Aquifer Treatment ...
Textile Industrial Wastewater Treatability Studies by Soil Aquifer Treatment ...
IRJET Journal4 views

Último

Saikat Chakraborty Java Oracle Certificate.pdf von
Saikat Chakraborty Java Oracle Certificate.pdfSaikat Chakraborty Java Oracle Certificate.pdf
Saikat Chakraborty Java Oracle Certificate.pdfSaikatChakraborty787148
13 views1 Folie
A multi-microcontroller-based hardware for deploying Tiny machine learning mo... von
A multi-microcontroller-based hardware for deploying Tiny machine learning mo...A multi-microcontroller-based hardware for deploying Tiny machine learning mo...
A multi-microcontroller-based hardware for deploying Tiny machine learning mo...IJECEIAES
10 views10 Folien
802.11 Computer Networks von
802.11 Computer Networks802.11 Computer Networks
802.11 Computer NetworksTusharChoudhary72015
9 views33 Folien
Object Oriented Programming with JAVA von
Object Oriented Programming with JAVAObject Oriented Programming with JAVA
Object Oriented Programming with JAVADemian Antony D'Mello
64 views28 Folien
MSA Website Slideshow (16).pdf von
MSA Website Slideshow (16).pdfMSA Website Slideshow (16).pdf
MSA Website Slideshow (16).pdfmsaucla
39 views8 Folien
Art of Writing Research article slide share.pptx von
Art of Writing Research article slide share.pptxArt of Writing Research article slide share.pptx
Art of Writing Research article slide share.pptxsureshc91
14 views42 Folien

Último(20)

A multi-microcontroller-based hardware for deploying Tiny machine learning mo... von IJECEIAES
A multi-microcontroller-based hardware for deploying Tiny machine learning mo...A multi-microcontroller-based hardware for deploying Tiny machine learning mo...
A multi-microcontroller-based hardware for deploying Tiny machine learning mo...
IJECEIAES10 views
MSA Website Slideshow (16).pdf von msaucla
MSA Website Slideshow (16).pdfMSA Website Slideshow (16).pdf
MSA Website Slideshow (16).pdf
msaucla39 views
Art of Writing Research article slide share.pptx von sureshc91
Art of Writing Research article slide share.pptxArt of Writing Research article slide share.pptx
Art of Writing Research article slide share.pptx
sureshc9114 views
How I learned to stop worrying and love the dark silicon apocalypse.pdf von Tomasz Kowalczewski
How I learned to stop worrying and love the dark silicon apocalypse.pdfHow I learned to stop worrying and love the dark silicon apocalypse.pdf
How I learned to stop worrying and love the dark silicon apocalypse.pdf
Machine Element II Course outline.pdf von odatadese1
Machine Element II Course outline.pdfMachine Element II Course outline.pdf
Machine Element II Course outline.pdf
odatadese16 views
CHI-SQUARE ( χ2) TESTS.pptx von ssusera597c5
CHI-SQUARE ( χ2) TESTS.pptxCHI-SQUARE ( χ2) TESTS.pptx
CHI-SQUARE ( χ2) TESTS.pptx
ssusera597c520 views
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L... von Anowar Hossain
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...
Anowar Hossain10 views
13_DVD_Latch-up_prevention.pdf von Usha Mehta
13_DVD_Latch-up_prevention.pdf13_DVD_Latch-up_prevention.pdf
13_DVD_Latch-up_prevention.pdf
Usha Mehta9 views
Thermal aware task assignment for multicore processors using genetic algorithm von IJECEIAES
Thermal aware task assignment for multicore processors using genetic algorithm Thermal aware task assignment for multicore processors using genetic algorithm
Thermal aware task assignment for multicore processors using genetic algorithm
IJECEIAES29 views
STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptx von AnnieRachelJohn
STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptxSTUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptx
STUDY OF SMART MATERIALS USED IN CONSTRUCTION-1.pptx
AnnieRachelJohn25 views

AIRLINE FARE PRICE PREDICTION

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2448 AIRLINE FARE PRICE PREDICTION Kartik Rathi1, Anubhav Kumar2, Manish Yadav3 1,2,3 Students, Computer science and Engineering department, MIET Meerut,Uttar Pradesh,India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract The price of airline ticket changes frequently nowadays and there's plenty of difference. Price change keeps happening within few hours for the identical flight. The shoppers want to induce themostcost-effectivepossible price while the airline companies wanttheutmostprofitand revenue possible. To unravel this problem, researchers introduce different models to avoid wasting consumers money- minimum price predicting model and models that tell us an optimal time to shop for a ticket while airlines use techniques like demand prediction and price discrimination to maximize their revenue 1. INTRODUCTION This project aims to develop an application which can predict the flight prices for various flights using different machine learning techniques. The user will get the expected values and with its reference the user can plan to book their tickets suitably. At this time, carrier ticket costs can shift powerfully and fundamentally for an identical flight, in any event, for accessible seats inside the identical cabin. Clients are attempting to urge the foremost minimal cost while Airlines companies try to stay their general income as high as could reasonably be expected and boost their benefit. Airlines utilize different computational methods to extend their income, as an example, demand forecast and value segregation. The proposed system can help save immeasurable rupees of shoppers by proving them the knowledge to book tickets at thecorrecttime.Parameterson which fares are calculated- Now we can perform Exploratory Data Analysis onthegiven information. we'll discover correlation between the highlights. At that time a Machine Learning model are made to utilizing those highlights. 1.1 Proposed Methodology For this project, we've implemented the machine learning life cycle to make a basic web application which is able to predict the flight fare by applying machine learning algorithms on historical flight data using some python libraries like Pandas, NumPy, Matplotlib, seaborn, and sklearn. Below image shows the number of steps that we followed from the life cycle. Data selection is that the initial step where historical data of flight is assembles for the model to predict prices. Our dataset consists of quite 10,000+ records of information associated with flights and costs. A number of thefeaturesof the dataset are source, destination, departure date, point, and number of stops, point in time, prices. Within the exploratory data analysis step, we cleaned the dataset by removing the duplicate values and null values. If the null values aren't removed, the accuracy of the model are affected. Next step is data pre-processing where we observed that almost all of the information was present in string format. Data from each feature is extracted i.e., day and month is extracted from date of journey in integer format, hours and minutes is extracted from time of departure. Features like sourceanddestinationneededtobe converted into values as they were of categorical type. For this One hot-encoding and label encoding techniques are wont to convert categorical data into the integer data. Feature selection step is involved in selecting important features that are more correlated to the value. So, some features to be selected and passed to the group of models. Random forest is an ensemblelearningmethodthatbasically uses group of decision trees as group of models. Random amount of knowledge is passed to decision trees and every decision tree predicts values in line with the datasetgiven to that. From the predictions made by the choice trees the features like extra information and route which are unnecessary features which can affect the accuracy of the model and so, they have to be removed before getting our model ready for prediction.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2449 After selecting the features which are more correlated to cost the following step involves applying machine algorithm and creating a model. As our dataset incorporates labeled data, we've got to use supervised machine learning algorithms also in supervised we are going to be using regression algorithms as our dataset contains continuous values within the features. Regression models are wont to describe relationship between dependent and independent variables. The machinelearningalgorithmsthatwearegoing to be using in our project are: 1.2 Linear Regression In simple linear regression there's only 1 independent and one dependent feature but as our dataset consists of the many independent features on which the worthmayrely on, we are going to be using multiple linear regression which estimates relationship between two or more independent variables and one dependent variable. The multiple linear regression models are represented by: C Where, y = the predicted value of the dependent variable = the independent variables m= independent variables coefficients C= y-intercept x 1.3 Decision Tree There are basically of two type’s Decision tree i.e., classification and regression tree where classification is employed for categorical values and regression is employed for continuous values. Decision tree chooses independent variable from dataset as decision nodes for decision making.It divides the entire dataset in several sub-section and when test data is passed to the model the output is determined by checking the section to which the info point belongs to. And to whichever section the info point belongs to the choice tree will give output because the average value of all the information points within the sub-section. 1.4 Random Forest Random Forest is an ensemble learning technique where training model uses multiple learning algorithms then combine individual results to urge a final predicted result. Under ensemble learning random forest falls into bagging category where random number of featuresandrecordswill average value of the expected values if considered because the output of the random forest model. 2. PERFORMANCE METRICS Performance metrics are statistical models which is able to be accustomed compare the accuracy of the machine learning models trained by different algorithms. The sklearn.metrics module are accustomed implement the functions to live the errors from each model using the regression metrics. Following metricsareaccustomedcheck the error measure of every model. 2.1 MAE (Mean Absolute Error) Mean Absolute Error is basically the sum of average of the absolute difference between the expected and actual values. MAE = 1/n [∑(y-ý)] y = actual output values, ý = predicted output values n = Total number of data points Lesser the value of MAE better the performance of your model. 2.2 MSE (Mean Square Error) Mean Square Error squares the difference of actual and predicted output values before summing all rather than using absolute value. MSE = 1/n [∑ ( ] y=actual output values y’=predicted output values n = Total number of data points Lower the value of MSE better the performanceofthemodel. 2.2 RMSE (Root Mean Square Error) RMSE is measured by taking the square root of the average of the squared difference between the prediction and also the actual value. RMSE = √1/n [∑ ( ] y=actual output values y’=predicted output values n = Total number of data points RMSE is greater than MAE and lesser the value of RMSE between different models the better the performanceofthat model.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2450 2.3 (Coefficient of determination) It helps you to grasp how well the independent variable adjusted with the variance in your model. = 1 − ∑ ( / ∑ ( The worth of R-square lies between 0 to 1. The closer its value to at least one, the higher your model is when comparing with other model values. There are different cross-validation techniques like GridsearchCV and RandomizedsearchCV which can be used for improving the accuracy of the model.Parametersofthemodelslikenumber of trees in random forest or max depth of decision tree will be changed using this method which can help us in further enhancement of the accuracy. The last three steps of the life cycle model are involved within the deployment of the trained machine learning model. Therefore, after getting the model with the most effective accuracy we store that model in a very file using pickle module. The back-end of the applying are going to be created using Flask Framework where API end-points such and acquire and POST are going to be created to perform operations associated with fetching and displaying data on the front-end of the appliance. The front-end oftheapplying are going to be created using the bootstrap framework where user will have the functionality of enteringtheirflight data. This data is sent to the back-end service where the model will predict the output consistent with the provided data. the anticipated value is distributedtothefront-endand displayed. 3. RESULTS We had used many algorithms for prepared our ML model i.e., Linear Regression, Decision tree, Random Forest. From all of these, Random Forest gives us the most accurate predictions. Models' performance using a few metrics are given below: A) LINEAR REGRESSION: R2 SCORE : 0.6220180540708748 MAE: 1866.1170014687825 MSE: 6974960.920111607 RMSE: 2641.0151306101234 B) DECISION TREE: R2 SCORE : 0.74098856555086889 MAE : 1270.242692247699 MSE : 4779579.164036292 RMSE : 2186.224865844383 C) RANDOM FOREST: R2 SCORE : 0.8598264560438941 MAE : 1055.5042333744152 MSE : 2586644.6841106885 RMSE : 1608.3049101804945 By observing all of the performancemetricswecanconclude that Random Forest will give us accurate and better results. So, we use Random Forest model for the deployment. 4. CONCLUSION This project can result in saving money of inexperienced people by providing them the information related to trends of the flight prices and also give them a predicted value of the price which they use to decide whether to book ticket now or later. On working with different models, itwasfound out that Random Forest algorithmgivesthehighestaccuracy in predicting the output. REFERENCES [1] K. Tziridis T. Kalampokas G.Papakostas and K. Diamantaras "Airfare price prediction using machine learning techniques" in European Signal Processing Conference (EUSIPCO), DOI: 10.23919/EUSIPCO .2017.8081365L. Li Y. Chen and Z. Li” Yawning detection for monitoring driver fatigue based on two cameras” Proc. 12th Int. IEEE Conf. Intell. Transp. Syst. pp. 1-6 Oct. 2009. [2] William Groves and Maria Gini "An agent for optimizing airline ticket purchasing" in proceedings of the 2013 international conference on autonomous agents and multi- agent systems. [3] Supriya Rajankar, Neha sakhrakar and Omprakash rajankar “Flight fare prediction using machine learning algorithms” International journal of Engineering Research and Technology (IJERT) June 2019. [4] Tianyi wang, samira Pouyanfar,haimanTianandYudong Tao "A Framework for airline price prediction: A machine learning approach" [5] T. Janssen "A linear quantile mixed regression model for prediction of airline ticket prices" [6] medium.com/analytics-vidhya/mae-mse-rmse- coefficient-ofdetermination-adjusted-r-squared-which- metric-is-bettercd0326a5697e article on performance metrics
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 2451 [7] www.keboola.com/blog/random-forest-regression article on random forest [8] https://towardsdatascience.com/machine-learning- basics-decisiontree-regression-1d73ea003fda article on decision tree regression BIOGRAPHIES KARTIK RATHI MIET,MEERUT(STUDENT) AKTU ANUBHAV KUMAR MIET,MEERUT(STUDENT) AKTU MANISH YADAV MIET,MEERUT(STUDENT) AKTU