A chemiresistive sensor array based on polyaniline nanocomposites and machine learning classification

Jiri Kroutil; Alexandr Laposa; Ali Ahmad; Jan Voves; Vojtech Povolny; Ladislav Klimsa; Marina Davydova; Miroslav Husak

doi:10.3762/bjnano.13.34

NANOSCIENCE – THEORY TO APPLICATION

/ E-Alerts

A chemiresistive sensor array based on polyaniline nanocomposites and machine learning classification

^¹ ,
^{^‡,1} ,
^{^‡,1} ,
^{^‡,1} ,
^{^‡,1} ,
^{^‡,2} ,
^{^‡,2} and
^{^‡,1}

¹Department of Microelectronics, Czech Technical University in Prague, Technicka 2,166 27 Prague, Czech Republic

²FZU - Institute of Physics, of the Czech Academy of Sciences, Na Slovance 1999/2, 182 21 Prague, Czech Republic

Corresponding author email
‡ Equal contributors

Associate Editor: N. Motta
Beilstein J. Nanotechnol. 2022, 13, 411–423. https://doi.org/10.3762/bjnano.13.34
Received 14 Dec 2021, Accepted 25 Mar 2022, Published 27 Apr 2022

Full Research Paper

PDF

Album

Cite

Abstract

The selective detection of ammonia (NH₃), nitrogen dioxide (NO₂), carbon oxides (CO₂ and CO), acetone ((CH₃)₂CO), and toluene (C₆H₅CH₃) is investigated by means of a gas sensor array based on polyaniline nanocomposites. The array composed by seven different conductive sensors with composite sensing layers are measured and analyzed using machine learning. Statistical tools, such as principal component analysis and linear discriminant analysis, are used as dimensionality reduction methods. Five different classification methods, namely k-nearest neighbors algorithm, support vector machine, random forest, decision tree classifier, and Gaussian process classification (GPC) are compared to evaluate the accuracy of target gas determination. We found the Gaussian process classification model trained on features extracted from the data by principal component analysis to be a highly accurate method reach to 99% of the classification of six different gases.

Keywords: feature extraction; gas sensor; pattern recognition; sensor array

Introduction

The control and monitoring of toxic gaseous substances, such as ammonia, nitrogen oxides, and various volatile organic compounds, is crucial in automotive, defense, aviation, chemical, medicine, and food industries [1,2]. Research on chemical sensors is currently focused on the fabrication of multisensor arrays for enhanced detection and identification of various chemical compounds. In most cases, the cross-sensitivity toward different chemical analytes is unavoidable, regardless of their oxidizing or reducing nature. Many authors have suggested a number of ways to overcome the drawbacks of cross-sensitivity/selectivity and reliability of the sensor arrays [3-5]. One powerful tool to address the abovementioned drawbacks is the implementation of a multisensor array combined with appropriate pattern recognition and classification tools [6]. Recently, classification in gas sensing applications has been carried out by principal component analysis to identify the difference between VOCs, supported by a vector machine to distinguish between acetone, nitrogen dioxide, and ammonia, and by a neutral network model to distinguish between ammonia and formaldehyde gas [7-9].

In our previous work [10], we demonstrated a combination of organic (polyaniline, PANI) and inorganic (carbon nanotubes (CNT), SnO₂, TiO₂) materials in a gas sensors based on nanocomposite layers with good sensitivity, temperature stability, reversibility, which was operating at room temperature. Herein, we extended our study by applying other nanocomposite sensing layers, namely PANI/ZnO, PANI/WO₃ (nanopowder), PANI/WO₃ (nanotubes), PANI/In₂O₃, PANI/C₆₀ (fullerene), PANI/nanocrystalline diamond (NCD), and PANI/BaTiO₃, deposited on a flexible sensor array platform with a new design. Seven different nanocomposite sensing layers deposited on the array were exposed to six different gases (ammonia, carbon dioxide, nitrogen dioxide, carbon monoxide, acetone, and toluene). Moreover, the obtained data were used for machine learning classification.

Many pattern recognition models based on intuitive, linear and nonlinear supervised techniques have been explored in E-nose data [11,12]. A considerable number of studies have been implemented in recent years using different statistical analysis algorithms, like principal component analysis (PCA), linear discriminant analysis (LDA), k-nearest neighbors algorithm (KNN), support vector machine (SVM), decision tree classifier (DT), random forest (RF), and Gaussian process classification (GPC) in order to enhance the discrimination of gases and get better selectivity [13]. In this work we suggest a new method in our classification system by combining the abovementioned methods and using the output of the two most powerful techniques in dimensionality reduction and increasing interpretability. We apply PCA and LDA as input data for five machine learning algorithms with a 10-fold cross-validation method.

The preprocessing stage was implemented by applying PCA and LDA on the extracted dataset [14,15]. Five different kinds of flexible pattern recognition algorithms have been used for the classification of gas sensor data using a 10-fold cross-validation to reach the highest classification rate.

Results and Discussion

The sensors layers were investigated by scanning electron microcopy (SEM), Raman spectroscopy, current–voltage and temperature analysis, and gas sensing analysis. Further, statistical classification analysis was implemented for the evaluation of target gases.

Scanning electron microscopy and Raman spectroscopy

The surface morphology and uniformity of additives in PANI of the deposited active layers were examined by scanning electron microscopy (TESCAN FERA3 GM), as shown in Figure 1a–h. All nanocomposite active layers, that is, PANI/ZnO, PANI/WO₃ (nanopowder), PANI/WO₃ (nanowires), PANI/In₂O₃, PANI/C₆₀, PANI/NCD, and PANI/BaTiO₃ have similar morphological features with a uniform distribution of additives in polyaniline. We can observe small ZnO flakes and WO₃ nanowires homogeneously distributed in the layers.

[2190-4286-13-34-1] — **Figure 1:** SEM micrographs of deposited layers on an interdigital transducer structure: (a) PANI/ZnO, (b) PANI/WO₃ (nanopowder), (c) PANI/WO₃ (nanowire), (d) PANI/In₂O₃, (e) PANI/C₆₀, (f) PANI/NCD, (g) PANI/BaTiO₃, and (h) pristine PANI.

**Figure 1:** SEM micrographs of deposited layers on an interdigital transducer structure: (a) PANI/ZnO, (b) PANI...

Jump to Figure 1

Pristine PANI was examined by SEM (Figure 1h) and Raman spectroscopy (Raman spectrometer Renishaw inVia Qontor) at room temperature with 633 nm excitation wavelength (Figure 2). The spectrum of pristine PANI is typical of the emeraldine salt, showing the following main bands: (1) 748 cm⁻¹ (Q ring bending, C–C ring deformation); (2) 810 and 870 cm⁻¹ (out-of-plane C–H vibrations in the aromatic rings); (3) 1169 cm⁻¹ (C–H bending of the quinoid rings); (4) 1221 and 1260 cm⁻¹ (C–N in benzene diamine units); (5) 1336 cm⁻¹ (C–N⁺, characteristic band of the polaron radical cation); (6) 1412 cm⁻¹ (phenazine structures); (7) 1498 cm⁻¹ (C=N of the quinoid nonprotonated diimine units); (8) 1590 cm⁻¹ (C=C stretching vibration of the quinonoid ring) [16,17].

[2190-4286-13-34-2] — **Figure 2:** The Raman spectra of the PANI.

**Figure 2:** The Raman spectra of the PANI.

Jump to Figure 2

Current–voltage and temperature analysis

Figure 3 shows current–voltage characteristics of active layers. These characteristics were examined for currents up to 200 mA and exhibit an almost linear character.

[2190-4286-13-34-3] — **Figure 3:** Current–voltage characteristics of active layers.

**Figure 3:** Current–voltage characteristics of active layers.

Jump to Figure 3

Figure 4 shows the temperature dependence of the relative resistance of PANI/nanocomposite layers for different temperatures. All layers exhibit a decrease in resistance with increasing temperature and composite layers exhibit lower temperature dependences in comparison to a pristine PANI layer. The resistance R₀ and the average temperature coefficient of resistance (TCRs) values of the prepared sensing layers are given in Table 1. All active layers have a negative temperature coefficient. The resistivity of these nanocomposites depends on the p–n depletion layer width on the interface between the n-type nanoparticles and the surrounding p-type PANi molecules.

[2190-4286-13-34-4] — **Figure 4:** Temperature dependence characteristics of active layers.

**Figure 4:** Temperature dependence characteristics of active layers.

Jump to Figure 4

Table 1: Average TCRs of prepared sensitive layers in range from 20 to 80 °C.

Layer	R₀ (Ω)	TCR (K⁻¹)

PANI+ZnO	85123	−0.012
PANI+WO₃ (nanopowder)	1764	−0.009
PANI+WO₃ (nanowires)	25906	−0.009
PANI+In₂O₃	50750	−0.011
PANI+C₆₀	3305	−0.01
PANI+NCD	3549	−0.01
PANI+BaTiO₃	4338	−0.0096
PANI	4240	−0.012

Gas sensing analysis

The gas sensing characterizations of sensitive layers were performed using a custom-built apparatus (Figure 5). The characterization system consists of mass flow controllers (Bronkhorst High-Tech) for setting the required gas concentration, a source-meter (Keithley, Model 2400) for resistance measurement of active layers, an air-tight chamber with electrical feedthroughs for the sensor array, a relay multiplexor for switching four sensor elements, and a Labview-based data acquisition system.

[2190-4286-13-34-5] — **Figure 5:** Schematic diagram of the gas sensing characterizations apparatus.

**Figure 5:** Schematic diagram of the gas sensing characterizations apparatus.

Jump to Figure 5

The sensing layers were tested towards carbon dioxide (250 ppm CO₂), carbon monoxide (25 ppm), ammonia (25 ppm), nitrogen dioxide (25 ppm), acetone (6%), toluene (500 ppm) and humid air (RH) in synthetic air (SA) at room temperature. The response of the sensing layers was calculated by the relative resistance change:

where R₀ is the resistance at room temperature in synthetic air and R_g is the resistance of the sensor in the presence of the specific gas.

The dynamic responses of sensing layers for gases are displayed in Figure 6. The experiment consisted of 5 min of sensor exposure to a certain gas concentration and 5 min of purging at a flow rate of 100 mL·s⁻¹. It is evident that all active layers have the highest sensitivity and clear response to NH₃. All active composite layers except PANI+ZnO exhibit a higher sensitivity toward NH₃ in comparison to a pristine PANI layer. PANI+ZnO composite shows the lowest sensitivity to all gases. Moreover, the resistance decreases when the polyaniline composite sensing layers are exposed to toluene. In addition, the sensor responses of all sensing layers to NO₂ gas are nearly three times lower than those to NH₃.

[2190-4286-13-34-6] — **Figure 6:** Gas characterization of active layers towards (a) 25 ppm of NH₃, (b) 25 ppm of NO₂, (c) 25 ppm of CO, (d) 250 ppm of CO₂, (e) 6000 ppm of acetone, and (f) 500 ppm of toluene at room temperature.

**Figure 6:** Gas characterization of active layers towards (a) 25 ppm of NH₃, (b) 25 ppm of NO₂, (c) 25 ppm of C...

Jump to Figure 6

Due to the high sensitivity to NH₃ gas, the sensing layers were also tested towards different concentrations of NH₃ at room temperature (Figure 7). It is evident that all sensing layers demonstrate an enhancement in sensitivity to the highest gas concentration (50 ppm) as well as incomplete reversibility.

[2190-4286-13-34-7] — **Figure 7:** The responses of sensing layers for different concentrations of NH₃.

**Figure 7:** The responses of sensing layers for different concentrations of NH₃.

Jump to Figure 7

When the operating temperature increased up to 80 °C, the sensing layers showed an at least three times lower response to ammonia in contrast to the room temperature measurements (Figure 7), but with almost complete reversibility (Figure 8).

[2190-4286-13-34-8] — **Figure 8:** Gas characterization of active layers to different concentrations of NH₃ at 80 °C.

**Figure 8:** Gas characterization of active layers to different concentrations of NH₃ at 80 °C.

Jump to Figure 8

The gas sensing mechanism of nanocomposite layers based on PANI was discussed in [10]. Polyaniline is known as one of the most famous p-type conductive polymers. During the exposure to a reducing gas (NH₃), the emeraldine salt form of polyaniline is converted to the emeraldine base form leading to an increase in resistance due to the decrease of hole density within the p-type film [18]. In the case of the hybrid structures, a p–n heterojunction is formed between polyaniline and n-type nanostructures such as ZnO, WO₃, In₂O₃, or fullerene [19]. The protons from polyaniline are transferred to the NH₃ molecules. This results in a widening of the depletion layers on p–n junctions and, thus, the resistance increases [18]. All these effects are reversible when the reducing gas is replaced by air. Beside these effects, nanostructures added into polyaniline increase the initial resistance due to larger disorder and deformation of the polyaniline conjugation chains. The summary of the gas sensor responses for all active layers is shown in Figure 9. All layers show an increasing resistivity as a clear response to CO, CO₂, NH₃, and NO₂.

[2190-4286-13-34-9] — **Figure 9:** The summary of the gas sensor responses for all active layers.

**Figure 9:** The summary of the gas sensor responses for all active layers.

Jump to Figure 9

Additionally, to exclude the influence of humidity on the sensor signal, the sensor arrays were exposed to various relative humidity levels ranging from 20% to 80% (Figure 10). The obtained results show a decreasing sensor response with increasing RH% value. This change in response toward relative humidity is due to the adsorption of water molecules and an increase of charge concentration due to PANI doping and the formation of charge transfer complexes [20]. The decrease of electrical resistance is caused by the greater mobility of the dopant ions, related to the development of PANI chains. Furthermore, the swelling effect contributes to the change in resistivity [21].

[2190-4286-13-34-10] — **Figure 10:** RH dependences of the sensing layers.

**Figure 10:** RH dependences of the sensing layers.

Jump to Figure 10

Statistical classification analysis

A data preprocessing stage was applied on the gas sensor responses to improve the overall pattern analysis performance. It was implemented by applying PCA and LDA on the extracted dataset of exposing six different gases to sensor arrays with seven different PANI/nanocomposite sensing layers. Figure 11 presents two-dimensional LDA and PCA projections of the extracted features. Regarding the LDA results, LDA1 describes approximately 72% and LDA2 describes 20.6% of data variation. By using the LDA method, a good classification with clearly separated clusters is obtained for NO₂, CO₂, acetone, and NH₃, but an overlap (overlaps are highlighted by oval-shaped drawings) was detected between CO and toluene. Applying the PCA method, we found that PCA1 describes approximately 94%, while PCA2 only 3.6% of data variation. These results demonstrate that PCA2 could be neglected in comparison with PCA1. Therefore, by analyzing the data projection on PCA1 to evaluate the behavior of the sensors, a high classification in clearly separated clusters is obtained for all gases via PCA. However, PCA and LDA results should be evaluated by machine learning algorithms for high accuracy. The generated feature set by PCA and LDA was provided to the classification algorithms as input vectors, Then, the training/testing process of SVM, KNN, DT, RF, and GPC classifiers was executed by using the 10-fold and 2-fold cross-validation approach. Table 2 shows the classification rate percentage of the sensor array after applying different classification algorithms.

[2190-4286-13-34-11] — **Figure 11:** Two-dimensional (a) LDA and (b) PCA projections of extracted features.

**Figure 11:** Two-dimensional (a) LDA and (b) PCA projections of extracted features.

Jump to Figure 11

Table 2: Gas classification rate.

K- fold cross-validation	K = 10		K = 2
Classifier	PCA (%)	LDA (%)	PCA (%)	LDA (%)

KNN	89	69	68	55
SVM	73	77	57	61
RF	97	78	71	57
DT	96	72	65	52
GPC	99	74	85	58

For visualization, Figure 12 illustrates the classification accuracy of the test samples based on the presented five classifiers. The result shows that, by applying 10-fold cross-validation method, 10–90% of training and test samples were analyzed. KNN, RF, DT, and GPC classifiers yielded a higher classification rate when the PCA method was used as input vector. A different behavior was detected for SVM, where a higher classification rate was achieved by the LDA method as input vector for SVM. The highest rate with 99% classification of the target gases was achieved by using PCA as input vector to train the GPC classifier. Results show that using a smaller amount of training samples, such as the 2-fold cross-validation method in which 50% of training and test samples were analyzed, could strongly decrease the classification rate. In the case of the RF classifier, classification decreased by 30%. The same result of a 30% decrease was obtained for KNN. However, GPC still shows a very high classification rate (85%) even with 50% less training samples, which proves the high capacity and powerful discrimination of using the PCA–GPC combination for E-nose and gas sensor applications.

[2190-4286-13-34-12] — **Figure 12:** Total classification accuracy using the five classifiers SVM, KNN, DT, RF, and GPC with K-fold cross-validation (K = 2, K = 10).

**Figure 12:** Total classification accuracy using the five classifiers SVM, KNN, DT, RF, and GPC with K-fold cros...

Jump to Figure 12

The usage of a powerful classification system based on statistical analysis and machine learning algorithms is a prime need for sensing applications in different fields, such as gas detection and monitoring [22,23], food industry [24], and agriculture [25]. However, the potential of such electronic nose systems varies according to the implemented classification system. Only between two and four different statistical analysis and classification algorithms were used in the cited literature [22-25]. In this work, we developed a robust system with seven different mathematical algorithms and we examined the interaction between these algorithms. The seven algorithms were studied in deep using different combinations and the highest classification rate was obtained by PCA/GPC.

Conclusion

We presented the fabrication and characterization of seven different composite sensors based on polyaniline. These sensors were used in gas sensing arrays. The sensitivity and selectivity of the gas sensing system was successfully tested on a set of six different gases. The improvement of the selectivity was analyzed by implementation of a complex statistical classification system. The strength of statistical analysis and classification algorithms was determined and was based on several factors such as data field application, parameters, behavior of the system, and the convenience of data behavior and the data correlation regarding the mathematical algorithms and calculations. For different data patterns the algorithms show different strength and reliability. For our system it was possible to perform a suitable preprocessing and feature selection of the dataset by PCA for powerful predictions with uncertainty determined by GPC, which summarizes the distribution of random variables and define the covariance function of the data, a crucial ingredient for our system predictions. The highest classification rate of about 99% was obtained for our classification system using PCA and GPC.

Experimental

Sensor array fabrication

The sensor array with four interdigitated electrode systems was manufactured as a flexible printed circuit board (DuPont Pyralux AP8535 with 75 µm thickness, double-sided, copper-clad laminate in an all-polyimide composite of polyimide film bonded to copper foil). It contains the heating elements and the temperature sensors for the temperature controlling of individual sensing layers. The heating element can also be used for the desorption of measured gases or heating slightly above the ambient temperature in order to reduce temperature fluctuation. The 18 µm thick Cu interdigitated electrodes were covered with a 12 µm thick gold layer to improve the corrosion resistance. The width and spacing of interdigitated electrodes are 100 µm. The heating element (surface-mounted device SMD0402 resistor with an electric resistance of 50 Ω) and a platinum temperature sensor (SMD0603 Pt1000) are placed on the bottom side of each sensor by soldering with Sn_0.63Pb_0.37. The sensor size array is 16.2 mm (width) × 16.2 mm (length) with the sensing layer size of 1.7 mm × 1.7 mm. A pin header connector is used for connection of the sensor array.

Sensor arrays with PANI/nanocomposite (PANI/ZnO, PANI/WO₃ (nanopowder), PANI/WO₃ (nanowires), PANI/In₂O₃, PANI/C₆₀, PANI/NCD, and PANI/BaTiO₃) sensing layers were fabricated similarly as reported in [10]. The whole fabrication process of the sensor array is described in Figure 13.

[2190-4286-13-34-13] — **Figure 13:** Sensor array fabrication.

**Figure 13:** Sensor array fabrication.

Jump to Figure 13

First, the protonated emeraldine salt form of PANI was prepared by oxidation of 0.2 M aniline hydrochloride with 0.25 M ammonium persulfate at room temperature (25 °C). The precipitate obtained after the polymerization was filtered and purified by 0.2 M hydrochloric acid and acetone. Subsequently, pure polyaniline was dried over silica gel in a desiccator for 24 h.

Next, the dispersion solutions were prepared by mixing 24 mg PANI and 5 mg additives (zinc oxide, two forms of tungsten oxide, indium oxide, fullerene, NCD, and barium titanate) in 2 mL xylene. Table 3 shows properties of the used additives. The prepared solutions were mixed in a shaker for 30 min and subsequently ultrasonicated for 30 min. Finally, the obtained dispersion solutions were deposited by a micropipette on the interdigitated electrode arrays. After that, the deposited sensor layers were dried using the integrated heating elements at 60 °C for 2 h and whole sensor array was subsequently dried in a desiccator over silica gel for 24 h. Before the deposition, the sensor array platforms were cleaned in acetone and isopropyl alcohol for 15 min and then exposed to hexamethyldisilazane (HMDS) for 2 h to improve the adhesion of sensing layers.

Table 3: Properties of the used additives.

Additive	Properties

zinc oxide (ZnO)	nanopowder, particle size < 50 nm, surface area > 10.8 m²·g⁻¹
tungsten oxide (WO₃)	nanopowder, particle size < 100 nm
tungsten oxide (WO₃)	nanowires, diameter 50 nm, length 10 µm
indium oxide (In₂O₃)	nanopowder, particle size < 100 nm, 99.9% trace metals basis
fullerene (C₆₀)	sublimed form
nanocrystalline diamond (NCD)	nanopowder, particle size < 5 nm
barium(IV) titanate (BaTiO₃)	nanopowder (cubic crystalline phase), particle size < 100 nm, ≥99% trace metals basis

Figure 14 demonstrates the top view of sensor array with nanocomposite sensing layers and the bottom view with heating elements and temperature sensors [26].

[2190-4286-13-34-14] — **Figure 14:** Sensor array with nanocomposite sensing layers with dimensions and bottom view with heating elements and temperature sensors.

**Figure 14:** Sensor array with nanocomposite sensing layers with dimensions and bottom view with heating element...

Jump to Figure 14

Statistical classification system

The classification capability for the gas sensor arrays was analyzed and studied to achieve the highest classification rate by constructing a classification system according to the block diagram in Figure 15. PCA and LDA were implemented and applied in order to extract the data measured from seven sensing layers, namely PANI/ZnO, PANI/WO₃ (nanopowder), PANI/WO₃ (nanowires), PANI/In₂O₃, PANI/C₆₀, PANI/NCD, and PANI/BaTiO₃, and the result was used to train/test five different classification algorithms. In this work, KNN, SVM, RF, DT, and GPC were used to achieve the highest classification rate.

[2190-4286-13-34-15] — **Figure 15:** Smart sensing system block diagram.

**Figure 15:** Smart sensing system block diagram.

Jump to Figure 15

This paragraph and the following paragraph about PCA and LDA follow closely the corresponding content in [27]. PCA is a linear transformation that preserves as much data variance as possible. PCA chooses a matrix T that minimizes the mean squared distance between original data and those reconstructed from reduced data. It has been shown that

where U and Λ are the eigenvector matrix and the diagonal eigenvalues matrix of the data covariance matrix, respectively. PCA has been extensively used for gas sensor applications [28-30].

LDA provides a linear projection of the data with (c – 1) dimensions, by considering the scatter of data within each class and across classes. Projection directions are those that maximize the inter-class separation of the projected data. The LDA transformation matrix is given by

where S_W and Λ_W are, respectively, the eigenvectors matrix and the diagonal eigenvalues matrix of the within-class scatter W. S_B is the eigenvectors matrix of the between-class scatter B. LDA was previously used for gas detection applications [31,32].

KNN is a supervised algorithm that is used for classification and regression, KNN is based on the minimum distance of the unknown sample to the training samples, by selecting the specified number of points (k) that are closest to the unknown sample [33]. Then the unknown sample can be identified by the majority votes.

SVM is an algorithm capable of performing classification and regression. The SVM algorithm enhances the classification function by finding the hyperplane in an N-dimensional space that has the widest margin between training data and class boundaries. Many possible hyperplanes could be chosen, but SVM aims to find a plane with the maximum margin, that is, the maximum distance between data points of different classes in feature space. A linear decision boundary is constructed in form of a hyperplane:

A support vector machine takes the input vectors and outputs a hyperplane that separates different classes. Hyperplanes are decision boundaries that help to classify different classes [34,35]. Samples with an unknown class will be classified by the decision boundary, samples falling on either side of the hyperplane can be attributed to different classes.

DT is a learning algorithm. It is a tree-structured classifier where the classification model is performed by a series of test questions and conditions with finite depth based on the features of input data in a tree structure. After the decision tree has been built, sorting of an unknown sample is simple. It begins with the top node of the tree and goes to the leaf. It undergoes distribution channels of the nodes and it follows to the next node, for which a new condition is subjected to the sample till it reaches the leaf node. The unknown sample is labeled and sorted, and a new and unknown sample is classified [36,37].

RF is an ensemble method for classification and regression tasks that uses multiple models of several DT to achieve a better output classification. The concept of the algorithm to build the structure is to select random samples from the dataset and create a decision tree from each sample and to get prediction results from each decision tree created [38]. The RF consists of a large number of individual decision trees. Each tree votes for a class prediction in the random forest, and the prediction with the most votes becomes the prediction of random forest.

GPC is a supervised learning method and a generalization of the Gaussian distribution of probability, which can be used for advanced non-parametric machine learning algorithms for classification problems [39]. GPC puts a Gaussian prior over a latent function, which is then squashed by a logistic function to obtain the probabilistic classification. In GPC, the posterior of the latent function is not Gaussian [40]. In GPC, for a given dataset, the training data x₁,..., x_n are chosen with the corresponding class labels y = (y₁,..., y_n), to predict the class of a test sample:

where Φ denotes the standard normal cumulative distribution function, that is,

[Graphic 1]

Gaussian process classification is not a parametric model. Instead, training is carried out to get hyperparameters that are needed for the covariance function and posterior. The target of GPC is to obtain the distribution of a latent variable corresponding to test sample x*:

where p(f|x₁, …, x_n, y) is the posterior over the latent variables, and this distribution subsequently will be used to predict the probabilistic prediction:

Acknowledgements

The classification system was processed by Scikit-learn, which is one of the most popular machine learning libraries of python. Anaconda installer for Python 3.8 was used to run all the libraries and Jupyter Notebooks. This work contains parts from the thesis of J. Kroutil, "Gas sensor array with nanocomposite films", Czech Technical University in Prague, 2019.

Funding

This work was supported by the Czech Science Foundation project No. GA22-04533S „Printed heterogeneous gas sensor arrays with enhanced sensitivity and selectivity“, by the grant of CTU No. SGS20/176/OHK3/3T/13, by the project Centre of the Advanced Applied Natural Sciences No. CZ.02.1.01/0.0/0.0/16_019/0000778 supported by the Operation Programme Research, Development and Education co-financed by European Comunity and by Ministry of Education Czech Republic and by the grant CzechNanoLab Research Infrastructure supported by MEYS CR (LM2018110).

References

Hosseini, S. H.; Khalkhali, R. A.; Noor, P. Monatsh. Chem. 2010, 141, 1049–1053. doi:10.1007/s00706-010-0374-5
Return to citation in text: [1]
Akbar, S.; Dutta, P.; Lee, C. Int. J. Appl. Ceram. Technol. 2006, 3, 302–311. doi:10.1111/j.1744-7402.2006.02084.x
Return to citation in text: [1]
Korotcenkov, G.; Han, S. D.; Cho, B. K.; Brinzari, V. Crit. Rev. Solid State Mater. Sci. 2009, 34, 1–17. doi:10.1080/10408430902815725
Return to citation in text: [1]
Korotcenkov, G.; Cho, B. K. Sens. Actuators, B 2013, 188, 709–728. doi:10.1016/j.snb.2013.07.101
Return to citation in text: [1]
Mao, Y.; Park, T. J.; Wong, S. S. Chem. Commun. 2005, 46, 5721–5735. doi:10.1039/b509960a
Return to citation in text: [1]
Zee, F.; Judy, J. W. Sens. Actuators, B 2001, 72, 120–128. doi:10.1016/s0925-4005(00)00638-9
Return to citation in text: [1]
Liu, S. F.; Moh, L. C. H.; Swager, T. M. Chem. Mater. 2015, 27, 3560–3563. doi:10.1021/acs.chemmater.5b00153
Return to citation in text: [1]
Pugh, D. C.; Newton, E. J.; Naik, A. J. T.; Hailes, S. M. V.; Parkin, I. P. J. Mater. Chem. A 2014, 2, 4758–4764. doi:10.1039/c3ta15049f
Return to citation in text: [1]
Zhang, D.; Liu, J.; Jiang, C.; Liu, A.; Xia, B. Sens. Actuators, B 2017, 240, 55–65. doi:10.1016/j.snb.2016.08.085
Return to citation in text: [1]
Kroutil, J.; Laposa, A.; Voves, J.; Davydova, M.; Nahlik, J.; Kulha, P.; Husak, M. IEEE Sens. J. 2018, 18, 3759–3766. doi:10.1109/jsen.2018.2811461
Return to citation in text: [1] [2] [3]
Zhang, L.; Liu, Y.; Deng, P. IEEE Trans. Instrum. Meas. 2017, 66, 1679–1692. doi:10.1109/tim.2017.2669818
Return to citation in text: [1]
Zhang, L.; Zhang, D.; Yin, X.; Liu, Y. IEEE Sens. J. 2016, 16, 4919–4931. doi:10.1109/jsen.2016.2551743
Return to citation in text: [1]
Zhang, L.; Tian, F.; Nie, H.; Dang, L.; Li, G.; Ye, Q.; Kadri, C. Sens. Actuators, B 2012, 174, 114–125. doi:10.1016/j.snb.2012.07.021
Return to citation in text: [1]
Kiselev, I.; Sommer, M.; Kaur Mann, J.; Sysoev, V. V. IEEE Sens. J. 2010, 10, 849–855. doi:10.1109/jsen.2009.2036441
Return to citation in text: [1]
Ahluwalia, A.; De Rossi, D. Encyclopedia of Materials: Science and Technology; 2001; pp 344–347. doi:10.1016/b0-08-043152-6/00071-1
Return to citation in text: [1]
Salvatierra, R. V.; Moura, L. G.; Oliveira, M. M.; Pimenta, M. A.; Zarbin, A. J. G. J. Raman Spectrosc. 2012, 43, 1094–1100. doi:10.1002/jrs.3144
Return to citation in text: [1]
Trchová, M.; Morávková, Z.; Bláha, M.; Stejskal, J. Electrochim. Acta 2014, 122, 28–38. doi:10.1016/j.electacta.2013.10.133
Return to citation in text: [1]
Nie, Q.; Pang, Z.; Lu, H.; Cai, Y.; Wei, Q. Beilstein J. Nanotechnol. 2016, 7, 1312–1321. doi:10.3762/bjnano.7.122
Return to citation in text: [1] [2]
Kulkarni, S. B.; Navale, Y. H.; Navale, S. T.; Stadler, F. J.; Ramgir, N. S.; Patil, V. B. Sens. Actuators, B 2019, 288, 279–288. doi:10.1016/j.snb.2019.02.094
Return to citation in text: [1]
Chani, M. T. S.; Karimov, K. S.; Khalid, F. A.; Moiz, S. A. Solid State Sci. 2013, 18, 78–82. doi:10.1016/j.solidstatesciences.2013.01.005
Return to citation in text: [1]
Anisimov, Y. A.; Evitts, R. W.; Cree, D. E.; Wilson, L. D. Polymers (Basel, Switz.) 2021, 13, 2722. doi:10.3390/polym13162722
Return to citation in text: [1]
Robin, Y.; Goodarzi, P.; Baur, T.; Schultealbert, C.; Schutze, A.; Schneider, T. Machine Learning based calibration time reduction for Gas Sensors in Temperature Cycled Operation. In 2021 IEEE International Instrumentation and Measurement Technology Conference, I2MTC, Glasgow, UK, May 17–20, 2021; IEEE, 2021; pp 1–6.
Return to citation in text: [1] [2]
Brahim-Belhouari, S.; Bermak, A. Pattern Recognit. Lett. 2005, 26, 699–706. doi:10.1016/j.patrec.2004.09.020
Return to citation in text: [1] [2]
Tan, J.; Balasubramanian, B.; Sukha, D.; Ramkissoon, S.; Umaharan, P. J. Food Process Eng. 2019, 42. doi:10.1111/jfpe.13175
Return to citation in text: [1] [2]
Amkor, A.; Maaider, K.; El Barbri, N. Sens. Actuators, A 2021, 328, 112787. doi:10.1016/j.sna.2021.112787
Return to citation in text: [1] [2]
Kroutil, J. Gas sensor array with nanocompositefilms. Ph.D. Thesis, Czech Technical University in Prague, 2019.
Return to citation in text: [1]
Bermak, A.; Belhouari, S. B.; Shi, M.; Martinez, D. Pattern recognition techniques for odor discrimination in gas sensor array; Encyclopedia of Sensors, Vol. X; 2006; pp 1–17.
Return to citation in text: [1]
Kermit, M.; Tomic, O. IEEE Sens. J. 2003, 3, 218–228. doi:10.1109/jsen.2002.807488
Return to citation in text: [1]
McEntegart, C. M.; Penrose, W. R.; Strathmann, S.; Stetter, J. R. Sens. Actuators, B 2000, 70, 170–176. doi:10.1016/s0925-4005(00)00561-x
Return to citation in text: [1]
Penza, M.; Cassano, G.; Tortorella, F.; Zaccaria, G. Sens. Actuators, B 2001, 73, 76–87. doi:10.1016/s0925-4005(00)00687-0
Return to citation in text: [1]
Aishima, T. J. Agric. Food Chem. 1991, 39, 752–756. doi:10.1021/jf00004a027
Return to citation in text: [1]
Brahim-Belhouari, S.; Bermak, A.; Wei, G.; Chan, P. C. H. A comparative study of density models for gas identification using microelectronic gas sensor. In Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2003, Darmstadt, Germany, Dec 17, 2003; IEEE, 2003; pp 138–141. doi:10.1109/isspit.2003.1341079
Return to citation in text: [1]
Petersson, H. Multivariate Exploration and Processing of Sensor Data-applications with multidimensional sensor systems. Ph.D. Thesis, Linköpings Universitet, Sweden, 2008.
Return to citation in text: [1]
Boser, B. E.; Guyon, I. M.; Vapnik, V. N. A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, PA, USA; Association for Computing Machinery: New York, NY, USA, 1992; pp 144–152. doi:10.1145/130385.130401
Return to citation in text: [1]
Cortes, C.; Vapnik, V. Mach. Learn. 1995, 20, 273–297. doi:10.1023/a:1022627411411
Return to citation in text: [1]
Cho, J.; Li, X.; Gu, Z.; Kurup, P. U. IEEE Sens. J. 2012, 12, 2384–2391. doi:10.1109/jsen.2011.2182042
Return to citation in text: [1]
Li, Q.; Bermak, A. J. Low Power Electron. Appl. 2011, 1, 45–58. doi:10.3390/jlpea1010045
Return to citation in text: [1]
Shi, T.; Horvath, S. J. Comput. Graph. Stat. 2006, 15, 118–138. doi:10.1198/106186006x94072
Return to citation in text: [1]
Yuan, F.; Xia, X.; Shi, J.; Li, H.; Li, G. IEEE Access 2017, 5, 6833–6841. doi:10.1109/access.2017.2697408
Return to citation in text: [1]
Rasmussen, C. E.; Williams, C. K. I. Gaussian Processes for Machine Learning; The MIT Press, 2006. doi:10.7551/mitpress/3206.001.0001
Return to citation in text: [1]

© 2022 Kroutil et al.; licensee Beilstein-Institut.
This is an open access article licensed under the terms of the Beilstein-Institut Open Access License Agreement (https://www.beilstein-journals.org/bjnano/terms), which is identical to the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0). The reuse of material under this license requires that the author(s), source and license are credited. Third-party material in this article could be subject to other licenses (typically indicated in the credit line), and in this case, users are required to obtain permission from the license holder to reuse the material.

Other Beilstein-Institut Open Science Activities

[R1] Hosseini, S. H.; Khalkhali, R. A.; Noor, P. Monatsh. Chem. 2010, 141, 1049–1053. doi:10.1007/s00706-010-0374-5
Return to citation in text: [1]

[R2] Akbar, S.; Dutta, P.; Lee, C. Int. J. Appl. Ceram. Technol. 2006, 3, 302–311. doi:10.1111/j.1744-7402.2006.02084.x
Return to citation in text: [1]

[R3] Korotcenkov, G.; Han, S. D.; Cho, B. K.; Brinzari, V. Crit. Rev. Solid State Mater. Sci. 2009, 34, 1–17. doi:10.1080/10408430902815725
Return to citation in text: [1]

[R4] Korotcenkov, G.; Cho, B. K. Sens. Actuators, B 2013, 188, 709–728. doi:10.1016/j.snb.2013.07.101
Return to citation in text: [1]

[R5] Mao, Y.; Park, T. J.; Wong, S. S. Chem. Commun. 2005, 46, 5721–5735. doi:10.1039/b509960a
Return to citation in text: [1]

[R6] Zee, F.; Judy, J. W. Sens. Actuators, B 2001, 72, 120–128. doi:10.1016/s0925-4005(00)00638-9
Return to citation in text: [1]

[R7] Liu, S. F.; Moh, L. C. H.; Swager, T. M. Chem. Mater. 2015, 27, 3560–3563. doi:10.1021/acs.chemmater.5b00153
Return to citation in text: [1]

[R8] Pugh, D. C.; Newton, E. J.; Naik, A. J. T.; Hailes, S. M. V.; Parkin, I. P. J. Mater. Chem. A 2014, 2, 4758–4764. doi:10.1039/c3ta15049f
Return to citation in text: [1]

[R9] Zhang, D.; Liu, J.; Jiang, C.; Liu, A.; Xia, B. Sens. Actuators, B 2017, 240, 55–65. doi:10.1016/j.snb.2016.08.085
Return to citation in text: [1]

[R10] Kroutil, J.; Laposa, A.; Voves, J.; Davydova, M.; Nahlik, J.; Kulha, P.; Husak, M. IEEE Sens. J. 2018, 18, 3759–3766. doi:10.1109/jsen.2018.2811461
Return to citation in text: [1] [2] [3]

[R11] Zhang, L.; Liu, Y.; Deng, P. IEEE Trans. Instrum. Meas. 2017, 66, 1679–1692. doi:10.1109/tim.2017.2669818
Return to citation in text: [1]

[R12] Zhang, L.; Zhang, D.; Yin, X.; Liu, Y. IEEE Sens. J. 2016, 16, 4919–4931. doi:10.1109/jsen.2016.2551743
Return to citation in text: [1]

[R13] Zhang, L.; Tian, F.; Nie, H.; Dang, L.; Li, G.; Ye, Q.; Kadri, C. Sens. Actuators, B 2012, 174, 114–125. doi:10.1016/j.snb.2012.07.021
Return to citation in text: [1]

[R14] Kiselev, I.; Sommer, M.; Kaur Mann, J.; Sysoev, V. V. IEEE Sens. J. 2010, 10, 849–855. doi:10.1109/jsen.2009.2036441
Return to citation in text: [1]

[R15] Ahluwalia, A.; De Rossi, D. Encyclopedia of Materials: Science and Technology; 2001; pp 344–347. doi:10.1016/b0-08-043152-6/00071-1
Return to citation in text: [1]

[R16] Salvatierra, R. V.; Moura, L. G.; Oliveira, M. M.; Pimenta, M. A.; Zarbin, A. J. G. J. Raman Spectrosc. 2012, 43, 1094–1100. doi:10.1002/jrs.3144
Return to citation in text: [1]

[R17] Trchová, M.; Morávková, Z.; Bláha, M.; Stejskal, J. Electrochim. Acta 2014, 122, 28–38. doi:10.1016/j.electacta.2013.10.133
Return to citation in text: [1]

[R18] Nie, Q.; Pang, Z.; Lu, H.; Cai, Y.; Wei, Q. Beilstein J. Nanotechnol. 2016, 7, 1312–1321. doi:10.3762/bjnano.7.122
Return to citation in text: [1] [2]

[R19] Kulkarni, S. B.; Navale, Y. H.; Navale, S. T.; Stadler, F. J.; Ramgir, N. S.; Patil, V. B. Sens. Actuators, B 2019, 288, 279–288. doi:10.1016/j.snb.2019.02.094
Return to citation in text: [1]

[R20] Chani, M. T. S.; Karimov, K. S.; Khalid, F. A.; Moiz, S. A. Solid State Sci. 2013, 18, 78–82. doi:10.1016/j.solidstatesciences.2013.01.005
Return to citation in text: [1]

[R21] Anisimov, Y. A.; Evitts, R. W.; Cree, D. E.; Wilson, L. D. Polymers (Basel, Switz.) 2021, 13, 2722. doi:10.3390/polym13162722
Return to citation in text: [1]

[R22] Robin, Y.; Goodarzi, P.; Baur, T.; Schultealbert, C.; Schutze, A.; Schneider, T. Machine Learning based calibration time reduction for Gas Sensors in Temperature Cycled Operation. In 2021 IEEE International Instrumentation and Measurement Technology Conference, I2MTC, Glasgow, UK, May 17–20, 2021; IEEE, 2021; pp 1–6.
Return to citation in text: [1] [2]

[R23] Brahim-Belhouari, S.; Bermak, A. Pattern Recognit. Lett. 2005, 26, 699–706. doi:10.1016/j.patrec.2004.09.020
Return to citation in text: [1] [2]

[R24] Tan, J.; Balasubramanian, B.; Sukha, D.; Ramkissoon, S.; Umaharan, P. J. Food Process Eng. 2019, 42. doi:10.1111/jfpe.13175
Return to citation in text: [1] [2]

[R25] Amkor, A.; Maaider, K.; El Barbri, N. Sens. Actuators, A 2021, 328, 112787. doi:10.1016/j.sna.2021.112787
Return to citation in text: [1] [2]

[R26] Kroutil, J. Gas sensor array with nanocompositefilms. Ph.D. Thesis, Czech Technical University in Prague, 2019.
Return to citation in text: [1]

[R27] Bermak, A.; Belhouari, S. B.; Shi, M.; Martinez, D. Pattern recognition techniques for odor discrimination in gas sensor array; Encyclopedia of Sensors, Vol. X; 2006; pp 1–17.
Return to citation in text: [1]

[R28] Kermit, M.; Tomic, O. IEEE Sens. J. 2003, 3, 218–228. doi:10.1109/jsen.2002.807488
Return to citation in text: [1]

[R29] McEntegart, C. M.; Penrose, W. R.; Strathmann, S.; Stetter, J. R. Sens. Actuators, B 2000, 70, 170–176. doi:10.1016/s0925-4005(00)00561-x
Return to citation in text: [1]

[R30] Penza, M.; Cassano, G.; Tortorella, F.; Zaccaria, G. Sens. Actuators, B 2001, 73, 76–87. doi:10.1016/s0925-4005(00)00687-0
Return to citation in text: [1]

[R31] Aishima, T. J. Agric. Food Chem. 1991, 39, 752–756. doi:10.1021/jf00004a027
Return to citation in text: [1]

[R32] Brahim-Belhouari, S.; Bermak, A.; Wei, G.; Chan, P. C. H. A comparative study of density models for gas identification using microelectronic gas sensor. In Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2003, Darmstadt, Germany, Dec 17, 2003; IEEE, 2003; pp 138–141. doi:10.1109/isspit.2003.1341079
Return to citation in text: [1]

[R33] Petersson, H. Multivariate Exploration and Processing of Sensor Data-applications with multidimensional sensor systems. Ph.D. Thesis, Linköpings Universitet, Sweden, 2008.
Return to citation in text: [1]

[R34] Boser, B. E.; Guyon, I. M.; Vapnik, V. N. A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, PA, USA; Association for Computing Machinery: New York, NY, USA, 1992; pp 144–152. doi:10.1145/130385.130401
Return to citation in text: [1]

[R35] Cortes, C.; Vapnik, V. Mach. Learn. 1995, 20, 273–297. doi:10.1023/a:1022627411411
Return to citation in text: [1]

[R36] Cho, J.; Li, X.; Gu, Z.; Kurup, P. U. IEEE Sens. J. 2012, 12, 2384–2391. doi:10.1109/jsen.2011.2182042
Return to citation in text: [1]

[R37] Li, Q.; Bermak, A. J. Low Power Electron. Appl. 2011, 1, 45–58. doi:10.3390/jlpea1010045
Return to citation in text: [1]

[R38] Shi, T.; Horvath, S. J. Comput. Graph. Stat. 2006, 15, 118–138. doi:10.1198/106186006x94072
Return to citation in text: [1]

[R39] Yuan, F.; Xia, X.; Shi, J.; Li, H.; Li, G. IEEE Access 2017, 5, 6833–6841. doi:10.1109/access.2017.2697408
Return to citation in text: [1]

[R40] Rasmussen, C. E.; Williams, C. K. I. Gaussian Processes for Machine Learning; The MIT Press, 2006. doi:10.7551/mitpress/3206.001.0001
Return to citation in text: [1]

aromatic	the word “aromatic”
aromatic aldehyde	the word “aromatic” OR “aldehyde”
+aromatic +aldehyde	both words “aromatic” AND “aldehyde”
+aromatic -aldehyde	the word “aromatic” but NOT “aldehyde”
“aromatic aldehyde”	the exact phrase “aromatic aldehyde”
benz*	words which begin with “benz”, such as “benzene” or “benzyl”
benz*yl	words that begin with “benz” and end with “yl”, such as “benzyl” or “benzoyl”
benzyl~	words that are close to the word “benzyl”, such as “benzoyl” (i.e., fuzzy search)

1.	Hosseini, S. H.; Khalkhali, R. A.; Noor, P. Monatsh. Chem. 2010, 141, 1049–1053. doi:10.1007/s00706-010-0374-5
2.	Akbar, S.; Dutta, P.; Lee, C. Int. J. Appl. Ceram. Technol. 2006, 3, 302–311. doi:10.1111/j.1744-7402.2006.02084.x

7.	Liu, S. F.; Moh, L. C. H.; Swager, T. M. Chem. Mater. 2015, 27, 3560–3563. doi:10.1021/acs.chemmater.5b00153
8.	Pugh, D. C.; Newton, E. J.; Naik, A. J. T.; Hailes, S. M. V.; Parkin, I. P. J. Mater. Chem. A 2014, 2, 4758–4764. doi:10.1039/c3ta15049f
9.	Zhang, D.; Liu, J.; Jiang, C.; Liu, A.; Xia, B. Sens. Actuators, B 2017, 240, 55–65. doi:10.1016/j.snb.2016.08.085

22.	Robin, Y.; Goodarzi, P.; Baur, T.; Schultealbert, C.; Schutze, A.; Schneider, T. Machine Learning based calibration time reduction for Gas Sensors in Temperature Cycled Operation. In 2021 IEEE International Instrumentation and Measurement Technology Conference, I2MTC, Glasgow, UK, May 17–20, 2021; IEEE, 2021; pp 1–6.
23.	Brahim-Belhouari, S.; Bermak, A. Pattern Recognit. Lett. 2005, 26, 699–706. doi:10.1016/j.patrec.2004.09.020