SIMULTANEOUS DETERMINATION OF TOTAL POLYPHENOLS AND CAFFEINE

Microchemical Journal 83 (2006) 42 – 47 www.elsevier.com/locate/microc

Simultaneous determination of total polyphenols and caffeine contents of green tea by near-infrared reflectance spectroscopy Quansheng Chen a,⁎, Jiewen Zhao a,⁎, Xingyi Huang a , Haidong Zhang b , Muhua Liu c a

School of Biological and Environmental Engineering, Jiangsu University, 212013 Zhenjiang, P.R. China Faculty of Engineering and Technology, Yunnan Agricultural University, 650201 Kunming, P.R. China c College of Engineering, Jiangxi Agricultural University, 330045 Nanchang, P.R. China

b

Received 9 December 2005; received in revised form 15 January 2006; accepted 15 January 2006 Available online 9 March 2006

Abstract This paper indicates the possibility to use near infrared (NIR) spectroscopy as a rapid method to predict quantitatively the content of caffeine and total polyphenols in green tea. A partial least squares (PLS) algorithm is used to perform the calibration. To decide upon the number of PLS factors included in the PLS model, the model is chosen according to the lowest root mean square error of cross-validation (RMSECV) in training. The correlation coefficient R between the NIR predicted and the reference results for the test set is used as an evaluation parameter for the models. The result showed that the correlation coefficients of the prediction models were R = 0.9688 for the caffeine and R = 0.9299 for total polyphenols. The study demonstrates that NIR spectroscopy technology with multivariate calibration analysis can be successfully applied as a rapid method to determine the valid ingredients of tea to control industrial processes. © 2006 Elsevier B.V. All rights reserved. Keywords: Near infrared spectroscopy; PLS; Green tea; Caffeine; Total polyphenols

1. Introduction Tea is one of the most popular beverages worldwide, which is of great interest due to its beneficial medicinal properties [1]. There is increasing evidence that specific substances found in certain foods can enhance general healthy. Recent research suggests that antioxidants found in tea may play an important role to prevent cardiovascular disease [2], chronic gastritis [3,4] and some cancers [5,6]. Moreover, an observational study in Japan found that the regular consumption of green tea (more than 3 cups a day) might be protective against recurrence of breast cancer in the early stages [7]. With the increasing consumption of the tea, quality control of tea becomes more and more important nowadays, for example, many national and international authorities are setting criteria for quality factors. In generally, total polyphenols and caffeine content are analyzed as the important tealeaves quality factors. Total polyphenols content account for more than 30% of the dry weight of ⁎ Corresponding authors. Tel.: +86 511 8790308; fax: +86 511 8780201. E-mail address: [email protected] (Q. Chen). 0026-265X/$ - see front matter © 2006 Elsevier B.V. All rights reserved. doi:10.1016/j.microc.2006.01.023

tealeaves. These compounds are mainly responsible for the characteristic astringent and bitter taste of tea brews [8]. In addition, caffeine in tea, known for their stimulative effect, has to be recognized as important quality factors in tealeaves. In contrast to the catechins in polyphenols, caffeine can enhance observably tea flavor. In the past few years, different methods of analysis had been employed to determine the contents of the compounds in question. Some approaches such as high performance liquid chromatography (HPLC) [9] and capillary electrophoresis [10] were applied to determine the caffeine content in tea. Some other approaches have also been described to estimate the content of total polyphenols using colorimetric measurements and the titration using potassium permanganate [11]. However, all of the methods mentioned above are time-consuming. Near infrared reflectance spectroscopy is a fast, accurate and nondestructive technique that can be employed as a replacement of time-consuming chemical method. Near-infrared (NIR) spectroscopy has proved to be a powerful analytical tool for analyzing quantitative caffeine content in coffee [12–14]. Some studies on analyzing tea by

Q. Chen et al. / Microchemical Journal 83 (2006) 42–47

NIR spectroscopy are reported, for example, it was used for measuring the theaflavin and moisture contents as well as for the prediction of black tea quality by Hall [15]. The prediction of quality parameters (like catechins, gallic acid, caffeine and theobromine) in green tealeaves by NIR was also reported by Schulz [16]. Recently, Luypaert and Zhang et al. [17,18] attempted the feasibility for prediction of total antioxidant capacity in green tea using NIR. Although they gave some better results for tea using NIR, they had no details in discussing the prediction models even not use an independent test set to test the robustness of the model, such as Schulz [16]. The prerequisite of NIR application for quantitative purpose is building a reliable calibration model. In this paper, our aim is to prove the applicability of multivariate calibration to NIR data. We systematically study the different steps that have to be gone through in multivariate calibration. PLS model is used and focused on the effect on the principal component factor and the method of spectra preprocessing. The robustness of the final PLS model is evaluated according to the root mean square error of cross-validation (RMSECV), the root mean square error of prediction (RMSEP) and the correlation coefficient (R). 2. Materials and methods 2.1. Sample preparation All tea samples come from different provinces in China, and they have been all already on stock within 4 months period. Taking into consideration the heterogeneity of tea samples, major attention is paid to the sampling stage, and the samples would be grinded before analysis. For the grinding, the whole tealeaves are put into a small electric coffee mill and ground

43

during 10 s. After this procedure, the powders are sieved with a mesh width 500 μm and these sieved powders are used for the further analysis. 2.2. Chemical analysis 2.2.1. High performance liquid chromatography (HPLC) Approximately 2.0 g of the powdered material, accurately weighed, is extracted twice with 80 mL of 70% aqueous methanol each for 30min at a temperature of 80 °C. After cooling, the extracts are centrifuged at 3000rpm for 10 min. The liquid phases of both extracts are collected in a 250-mL volumetric flask and made up to volume by 70% aqueous methanol. The tea brew is filtered through a 0.45-μm membrane filter, diluted 5 times with Millipore water and analyzed immediately. To determine the content of caffeine, RP-HPLC method is applied in the Agilent 1100 series (Aligent, USA). The used column is a deactivated monomeric ultrapure silica Zorbax RxC18 column with 4.6 mm × 250 mm (i.d. × length) and 5μm nominal particle size. The flow rate is set at 1.0ml/min and the injected volume is 50μl. The column temperature is kept at 35°C using a column oven. Eluents are water/acetonitrile (9:1, v/v). The caffeine of the separation is checked by its spectra recorded using the DAD and the UV-detector is set at 276nm. The HPLC separation of the caffeine is shown in Fig. 1. 2.2.2. Colorimetric measurements [11] Total polyphenols are estimated by a photometric FolinCiocalteu assay according to a proposed international standard method. Absorbance (E) at 540 nm of the reaction solution is determined in a 1-cm light-path cell by a Lengguang-752

Fig. 1. HPLC separation of caffeine: (a) Chromatogram of tea samples; (b) Chromatogram of caffeine as calibration standard.

44


spectrophotometer (Lengguang Optical Instrument Ltd. Co., Shanghai, China). The calibration standard is gallic acid.

2.5. Software All methods were performed in Matlab (V.6.5) (Mathworks, Natick, USA) for windows XP. For the spectral acquisition OMNIC 5.2a (NEXUS 670 FT-IR Systems) is used.

2.3. Spectra collection The NIR spectra are collected in the reflectance mode using a NEXUS 670 FT-IR spectrophotometer (Nicollet, USA) with an optical fiber in the range from 11,000 to 3800 cm− 1. Each spectrum is the average spectrum of 64 runs. The spectra used for the data analysis goes from 11,000 to 3800 cm− 1, and the data are measured in 1.928-cm− 1 intervals, which results in 3735 variables. The standard sample cup is used for performing the tea spectra collection. For each tea sample respectively, 10 ± 0.1 g of tea powder is filled into the cup in the standard procedure depending upon the bulk density of materials. The corresponding amount of powder is densely packed into the cup and compresses by closing it. Each tea sample is collected three times after rotation of the 120°. The mean of three spectra which are collected from same tea sample is used in the following analyze step. The temperature and humidity are kept a steady level in the laboratory. 2.4. Preprocessing methods In this study, three data preprocessing method are applied comparatively, which are standard normal variate transformation (SNV), first derivative and second derivative, etc. SNV is a mathematical transformation method of the log (1/R) spectra used to remove slope variation and to correct for scatter effects. Compared to SNV, first and second derivatives eliminate baseline drifts and small spectral differences are enhanced. To avoid enhancing the noise, which is a consequence of derivative, spectra are first smoothed. This smoothing is done by using the Savitzky-Golay algorithm, which is a moving window averaging method: a window is selected where the data are fitted by a polynomial of a certain degree. The central point in the window is replaced by the value of the polynomial.

3. Results and discussion 3.1. Spectra investigation Fig. 2(a) shows the spectra for the original data. The spectra after first preprocessing are presented in Fig. 2(b). As seen from Fig. 2(a,b), the water absorption band around 5155cm− 1 and 7000cm− 1 corresponding to O–H stretching + O–H deformation is excluded from analysis, and some regions exhibiting a high noise level (e.g. 11,000–9000 cm− 1) should be also excluded as Fig. 2(b). Also seen from Fig. 2 (b) is the most intensive band in the spectrum that belongs to the vibration of the 2nd overtone of the carbonyl group (5352cm− 1), followed by the C–H stretch and C–H deformation vibration (7212cm− 1), the –CH2 (5742cm− 1), and the –CH3 overtone (5808cm− 1). The vibration of the carbonyl group, the –C–H and –CH2 vibrations are caused by ingredients such as polyphenols, alkaloids, protein, volatile and non-volatile acid and some aroma compounds. According to the investigation of spectrum, we select 4500– 9000cm− 1 spectral region to build PLS model, but the water absorption band around 5155 cm− 1, and 7000 cm− 1 corresponding to O–H stretching + O–H deformation, is excluded from the analysis. 3.2. Quantitative analysis of the PLS models Fifty samples are select to build PLS model in the experiment. All 50 spectra are divided into a training set and a test set. To avoid bias in subset selection, this division is made as follows: all samples have been sorted according to their respective y-value (viz. the reference measurement value of caffeine and total polyphenols content). In order to come to a 3/2 division of

Fig. 2. Spectra of tea obtained from (a) raw data and (b) first derivative data.


45

0.35

Table 1 The reference measurements and sample numbers in train set

No preprocess SNV First deviation Second deviation

Components

Units (%)

S.N.

Range

Mean

S.D.

Caffeine Total polyphenols

g/g g/g

30 30

2.2611–3.7616 19.1543–30.2329

2.9381 19.3802

0.3338 2.8151

S.N., sample number; S.D., standard deviation.

RMSECV(%)

0.3 0.25 0.2 0.15 0.1 0.05 0 1

training/test spectra, the two spectra of every five samples are selected into the test set, so that finally the training set contains 30 spectra (see Table 1), the remaining 20 spectra constitute the test set (see Table 2). As seen from Tables 1 and 2, the range of yvalue in training set covers the range in the test set, therefore the distribution of the samples is appropriate in training and test set. The performance of the final PLS model is evaluated in terms root mean square error of cross-validation (RMSECV), the root mean square error of prediction (RMSEP) and the correlation coefficient (R). For RMSECV, a leave-one-sampleout cross-validation is performed: the spectrum of one sample of the training set is deleted from this set and a PLS model is built with the remaining spectra of the training set. The left-out sample is predicted with this model and the procedure is repeated with leaving out each of the samples of the training set. The RMSECV is calculated as follows, vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi uX n u u ðyˆq i −yi Þ2 t i¼1 RMSECV ¼ ð1Þ n where n is the number of samples in the training set, yi is the reference measurement result for sample i, and ŷi is the estimated result for sample i when the model is constructed with sample i removed. The number of PLS factors included in the model is chosen according to the lowest RMSECV. This procedure is repeated for each of the preprocessed spectra. For the test set, the root mean square error of prediction (RMSEP) is calculated as follows, vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi uX n u u ðyi −yî Þ2 t i¼1 RMSEP ¼ ð2Þ n where n is the number of samples in the test set, yi is the reference measurement result for test set sample i, and ŷi is the estimated result of the model for test sample i. Finally the model with the overall lowest RMSECV will be selected as final model. Correlation coefficients between the predicted and the measured value are calculated for both the training and the test set, which are calculated as follows Eq. (3),

Table 2 The reference measurements and sample numbers in test set Components

Units (%)

S.N.

Range

Mean

S.D.

Caffeine Total polyphenols

g/g g/g

20 20

2.3538–3.4813 19.3802–29.1771

2.9177 24.5466

0.3008 2.7320

S.N., sample number; S.D., standard deviation.

2

3

4

5 6 7 8 9 Number of PLS factors

10

11

12

Fig. 3. Effect of number of PLS factors on RMSECV for caffeine calibration model.

where y¯ is the mean of the reference measurement results for all samples in the train and test sets. vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi u P n u ðyî −yi Þ2 u u i¼1 ð3Þ R ¼ u1− P n t 2 ðy −y Þ i

i

i¼1

3.2.1. Caffeine In the application of PLS algorithm, it is generally known that the spectral preprocessing methods and the number of PLS factors is critical parameters. The optimum number of factors is determined by the lowest root mean square error cross validation (RMSECV). Fig. 3 shows RMSECV plotted as a function of PLS factors for determining caffeine content with the different spectral preprocessing methods. As seen from Fig. 3, SNV spectral preprocessing method is obviously superior to others, and for every spectral preprocessing method, RMSECV decreases sharply with initial factors, however, gradually decreases as more PLS factors. Table 3 shows the best results of the calibration models by different spectral preprocessing method for determining caffeine content. Compared with others, the lowest RMSECV equals to 0.0742% obtained after the SNV spectral preprocess. This model only needs 3 PLS factors, which is obviously simpler than others. In this application, SNV performs better than other preprocesses. Fig. 4 is the scatter plot showing a correlation between NIR prediction value and reference measurement for caffeine content by the SNV spectral preprocessing method. Red circles and blue plus sign represent calibration and prediction data, respectively. The calibration and prediction data have good correlation with reference measurement data and many points fall on or close to Table 3 Best results for each of the processing method for the prediction model of caffeine Preprocessing method

PLS factors

RMSECV (%)

RMSEP (%)

R (train)

R (test)


5 3 9 8

0.1617 0.0742 0.1021 0.1231

0.1643 0.0836 0.1019 0.1246

0.9216 0.9743 0.9578 0.9483

0.9117 0.9688 0.9467 0.9436

46

Q. Chen et al. / Microchemical Journal 83 (2006) 42–47 Table 4 Best results for each of the processing method for the prediction model of total polyphenols

Fig. 4. Reference determination versus NIR prediction for caffeine of train (○) and test (+) set data.

the unity line. Caffeine content in the test set is predicted with the root mean square error prediction (RMSEP) value of 0.0836%. The correlation coefficients for this calibration model equal to 0.9743 and 0.9688 for the training and test set, respectively. 3.2.2. Total polyphenols Fig. 5 shows RMSECV plotted as a function of PLS factors for determining total polyphenols content by the different spectral preprocessing methods. As seen from Fig. 5, the RMSECV values decrease sharply with initial factors, however, they decrease very slowly even trend to increasing slightly as more PLS are include. Table 4 shows the best results of the calibration models by different spectral preprocessing method for determining total polyphenols content. Compared with other preprocessing methods, the results for SNV preprocess are better, which could be expected since is a preprocessing method that is usually used for powders. In this application, the calibration model only needs 3 PLS factors, which is also simpler than others. The values of RMSECV and RMSEP equal to 1.0858% and 1.1138%, respectively. The correlation coefficients for the training and test set are 0.9382 and 0.9299, respectively.

3.5

PLS factors

RMSECV (%)

RMSEP (%)

R (train)

R (test)


5 3 7 10

1.5211 1.0858 1.2543 1.0667

1.6482 1.1138 1.2446 1.7987

0.8976 0.9382 0.9134 0.9747

0.8473 0.9299 0.9107 0.8024

Also seen from Table 4, the correlation coefficient for the second derivation spectral preprocessing method equals to 0.9747 for the training set, but, only 0.8024 for the test set. RMSECV and RMSEP are 1.0667% and 1.7987% respectively with 10 PLS factors in this model. This high number of PLS factors can explain the difference between the test and training set, because too high PLS factors might include specific information when training, which will result in a worse generalization performance of the PLS model. This phenomenon is also called ‘over-fitting’ of the model that specific information related to the training samples is included in the model, but when unknown samples are predicted by this model, this specific information will lead to ‘bad’ results for the ‘untrained’ samples. The scatter plot in Fig. 6 shows the model for determining total polyphenols content by the SNV spectral preprocessing method. Compared the caffeine model, the total polyphenols model is worse. Therefore, many points in Fig. 6 falloff the unity line compared with Fig. 4. Pure caffeine and impure total polyphenols might be explained the differences between them. 4. Conclusion The overall results sufficiently demonstrate that caffeine and total polyphenols contents in tealeaves can be determined simultaneously by NIR spectroscopy. The PLS calibration models in determining caffeine and total polyphenols contents are all achieved with 3PLS factors under SNV preprocesses,

No preprocess

3 RMSECV (%)

Preprocessing method

SNV First deviation

2.5

Second deviation

2 1.5 1 0.5 0

1

2

3

4

5 6 7 8 9 Number of PLS factors

10

11

12

Fig. 5. Effect of number of PLS factors on RMSECV for total polyphenols calibration model.

Fig. 6. Reference determination versus NIR prediction for total polyphenols of train (○) and test (+) set data.


and the correlation coefficients between the NIR prediction results and reference measurement results follow as: R = 0.9688 for caffeine, and R = 0.9299 for total polyphenols. It can be concluded that many valid components in tea can be analyzed fast and simultaneously by NIR spectroscopy coupled with the appropriate chemometrics methods, and this real-time, at-site measurement will significantly improve the efficiency of quality control and assurance. Acknowledgements This work has been financially supported by the National High Technology Research and Development Program of China (863 Project, No. 2002AA248051) and the National Natural Science Foundation of China (No. 30370813). References [1] C.S. Yang, P. Maliakal, X. Meng, Inhibition of carcinogenesis by tea, Annu. Rev. Pharmacol. Toxicol 42 (2002) 25–54. [2] K. Nakachi, S. Matsuyama, S. Miyake, M. Suganuma, K. Imai, Preventive effects of drinking green tea on cancer and cardiovascular disease: epidemiological evidence for multiple targeting prevention, BioFactors 13 (2000) 49–54. [3] V.W. Setiawan, Z.F. Zhang, G.P. Yu, Q.Y. Lu, Y.L. Li, M.L. Lu, M.R. Wang, C.H. Guo, S.Z. Yu, R.C. Kurtz, C.C. Hsieh, Protective effect of green tea on the risks of chronic gastritis and stomach cancer, Int. J. Cancer 92 (2001) 600–604. [4] K. Shibata, M. Moriyama, T. Fukushima, A. Kaetsu, M. Miyazaki, H. Une, Green tea consumption and chronic atrophic gastritis: a cross-sectional study in a green tea production village, J. Epidemiol. 10 (2000) 310–316. [5] L. Jian, L.P. Xie, A.H. Lee, C.W. Binns, Protective effect of green tea against prostate cancer: a case-control study in southeast China, Int. J. Cancer 108 (2004) 130–135. [6] H. Fujiki, M. Suganuma, S. Okabe, E. Sueoka, N. Sueoka, N. Fujimoto, Y. Goto, S. Matsuyama, K. Imai, K. Nakachi, Cancer prevention with green tea and monitoring by a new biomarker, hnRNP B1, Mutat. Res. 480–481 (2001) 299–304.

47

[7] M. Inoue, K. Tajima, M. Mizutani, H. Iwata, T. Iwase, S. Miura, K. Hirose, N. Hamajima, S. Tominaga, Regular consumption of green tea and the risk of breast cancer recurrence: follow-up study from the Hospital-based Epidemiologic Research Program at Aichi Cancer Center (HERPACC), Japan, Cancer Lett. 167 (2001) 175–182. [8] D. Zhang, S. Kuhr, U.H. Engelhardt, Influence of catechins and theaflavins on astringent taste of black tea brews, Z. Lebensm.-Unters. Forsch. 195 (1992) 108–111. [9] Y.G. Zuo, H. Chen, Y.W. Deng, Simultaneous determination of catechins, caffeine and gallic acids in green, Oolong, black and pu-erh teas using HPLC with a photodiode array detector, Talanta 57 (2002) 307–316. [10] H. Hideki, M. Toshihiro, K. Katsunori, Simultaneous determination of qualitatively important components in green tea infusions using capillary electrophoresis, J. Chromatogr. A 758 (1997) 332–335. [11] ISO (International Standard Organization). Determination of Individual Catechins and Total Polyphenols in Tea, ISO TC 34/SC 8 N 444; 1994. [12] C.W. Huck, W. Guggenbichler, G.K. Bonn, Analysis of caffeine, theobromine and theophylline in coffee by near infrared spectroscopy compared to high-performance liquid chromatography coupled to mass spectrometry, Anal. Chim. Acta 538 (2005) 195–203. [13] M. Laasonen, T. Harmia-Pulkkinen, C. Simard, M. Räsänen, H. Vuorela, Development and validation of near-infrared method for quantitation of caffeine intact single tablets, Anal Chem. 75 (2003) 754–760. [14] Luis E. Rodriguez-Saona, Fred S. Fry, Elizabeth M. Calvery, Use of Fourier transform near-infrared spectroscopy rapid quantification of castor bean meal in a selection of flour-based products, J. Agric. Food Chem. 48 (2000) 5169–5177. [15] M.N. Hall, A. Robertson, C.N.G. Scotter, Near-infrared reflectance prediction of quality, theaflavin content and moisture content of black tea, Food Chem. 27 (1988) 61–75. [16] H. Schulz, U.H. Engelhardt, A. Wengent, H.H. Drews, S. Lapczynski, Application of NIRS to the simultaneous prediction alkaloids and phenolic substance in green tea leaves, J. Agric. Food Chem. 475 (1999) 5064–5067. [17] J. Lupaert, M.H. Zhang, D.L. Massart, Feasibility study for the using near infrared spectroscopy in the qualitative and quantitative of green tea, Camellia sinensis (L), Anal. Chim. Acta 487 (2003) 303–312. [18] M.H. Zhang, J. Lupaert, Q.S. Xu, D.L. Massart, Determination of total antioxidant capacity in green tea by NIRS and multivariate calibration, Talanta 62 (2004) 25–35.

本文献由“学霸图书馆-文献云下载”收集自网络，仅供学习交流使用。

学霸图书馆（www.xuebalib.com）是一个“整合众多图书馆数据库资源，提供一站式文献检索和下载服务”的24 小时在线不限IP 图书馆。图书馆致力于便利、促进学习与科研，提供最强文献下载服务。

图书馆导航：图书馆首页

文献云下载

图书馆入口

外文数据库大全

疑难文献辅助工具

SIMULTANEOUS DETERMINATION OF TOTAL POLYPHENOLS AND CAFFEINE

Recommend Documents