Machine Learning for Fluid Property Correlations: Classroom Examples with MATLAB

Research output: Contribution to journalArticle

Abstract

Recent advances in computer hardware and algorithms are spawning an explosive growth in the use of computer-based systems aimed at analyzing and ultimately correlating large amounts of experimental and synthetic data. As these machine learning tools become more widespread, it is becoming imperative that scientists and researchers become familiar with them, both in terms of understanding the tools and the current limitations of artificial intelligence, and more importantly being able to critically separate the hype from the real potential. This article presents a classroom exercise aimed at first-year science and engineering college students, where a task is set to produce a correlation to predict the normal boiling point of organic compounds from an unabridged data set of >6000 compounds. The exercise, which is fully documented in terms of the problem statement and the solution, guides the students to initially perform a linear correlation of the boiling point data with a plausible relevant variable (the molecular weight) and to further refine it using multivariate linear fitting employing a second descriptor (the acentric factor). Finally, the data are processed through an artificial neural network to eventually provide an engineering-quality correlation. The problem statements, data files for the development of the exercise, and solutions are provided within a MATLAB environment but are general in nature.

Bibliographical metadata

Original languageEnglish
JournalJournal of Chemical Education
DOIs
Publication statusE-pub ahead of print - 8 Jan 2019