Transcript Document
Quick guide to PLS
Use [Alt-Tab] to go to LatentiX (if running) Press [Page Down] or [Enter] to continue Press [ESC] to end the show
Dioxin and fatty acid data • •
Problem
To develop a PLS model that can predict the dioxin concentration (y) in ng/(kg fat) in fish meal samples from fatty acids profiles (X).
In total 64 samples were analysed and gas chromatography was used for the determination of 32 fatty acids (including an unidentified group). The results are given in % of all 32 fatty acids.
The very expensive and complex dioxin analyses were performed by a German laboratory.
Data
The dimension of the data structure is 64 objects x 34 variables.
The first two variables are the species code (herring, sprat, blue whiting, sand eel etc. in total 15 species) and the dioxin content.
Variables 3 to 34 are the chromatographic variables.
Source:
Rapid dioxin assessment in fish products by fatty acid pattern recognition, Marc Bassompierre, Lars Munck, Rasmus Bro and Søren Balling Engelsen, Analyst, 129, 553-558, 2004.
2
Get the data 3
The data in the WorkBench 4
Select the fatty acid variables Write 3:34 and press [
ENTER
] or select the fatty acid variables in the Variables listbox 5
Define a variable set 6
Give the set a name, e.g. X 7
Now select dioxin in the listbox The plot window always show the actual selection in the Objects and Variables listboxes 8
Plot dioxin as a function of the objects Click on this icon 9
Define a variable set containing only the dioxin variable 10
Give the set a name, e.g. y 11
Autoscale the X data Then select autoscale First select X in the Sets box 12
The effect is shown instantly in the plot Click Transform to see how the transformation affects the data 13
Make a PLS model Select PLS in the dropdown menu 14
Select the dependent variable(s) Then select y (or click on Dioxin in the listbox) First select y in the Sets box 15
Autoscale y 16
Choose cross validation Select CV: Syst123 (Venetian blinds) 17
Choose the number of segments In this case we use eight segments 18
Calculate the PLS model Then press OK Press Calculate 19
Plot the error 20
RMSEC & RMSECV Select both using CTRL 21
U vs T plot 22
U vs T plot
TIP: use the arrows keys to change the number of components
Change the markers & labels 23
Plot regression coefficicents 24
Regression coefficients
TIP: use the arrows keys to change the number of components
Change the markers & labels 25
Plot Actual vs Predicted 26
Actual vs Predicted for six components
TIP: use the arrows keys to change the number of components
THE END Change the markers & labels 27