Repository logo
 
Publication

Big data and analytics: the case study of students mobility

datacite.subject.fosCiências Naturais::Ciências da Computação e da Informaçãopt_PT
dc.contributor.advisorPereira, João Paulo
dc.contributor.authorMukhin, Dmitrii
dc.date.accessioned2022-03-24T14:04:15Z
dc.date.available2022-03-24T14:04:15Z
dc.date.issued2022
dc.descriptionMestrado em Informaticapt_PT
dc.description.abstractIt's no secret that the most important thing in our world is information. Nowadays, almost every action leaves a trace. And if we use this data correctly, we will get new knowledge and predictions. But this requires new specialized technologies such as Big Data. The work described in this dissertation focuses on three methods of Big Data analysis: descriptive analysis, correlation and predictive analysis. The purpose of the work is to explore these methods for practical application to a dataset containing information about IPB and Erasmus students. The following tasks were performed: collecting data from international students about their university practices and mobility, conducting descriptive analysis on general characteristics by year, course, gender, place of residence, degree, number of subjects studied and their grade point average. Correlation heat charts were constructed between the values in the dataset and dependencies were analyzed. The most important contribution of this paper is the practical application of three machine learning algorithms (Linear regression, Ridge regression, and Random forest) to predict the number of Erasmus students for the next year. The machine learning algorithms build a model from sample data, known as "training data," to make predictions or decisions without being explicitly programmed to do so.pt_PT
dc.identifier.tid202975550pt_PT
dc.identifier.urihttp://hdl.handle.net/10198/25282
dc.language.isoengpt_PT
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/pt_PT
dc.subjectBig datapt_PT
dc.subjectMachine learningpt_PT
dc.subjectDescriptive analysispt_PT
dc.subjectGraphpt_PT
dc.subjectTablept_PT
dc.subjectCorrelationpt_PT
dc.subjectPredictive analysispt_PT
dc.subjectLinear regressionpt_PT
dc.subjectComb regressionpt_PT
dc.subjectRandom forestpt_PT
dc.titleBig data and analytics: the case study of students mobilitypt_PT
dc.typemaster thesis
dspace.entity.typePublication
rcaap.rightsopenAccesspt_PT
rcaap.typemasterThesispt_PT
thesis.degree.nameInformáticapt_PT

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Dmitrii Mukhin.pdf
Size:
21.99 MB
Format:
Adobe Portable Document Format