Silva Fennica

Fig. 1. Stoniness classification process of forest soil, based on the excavator IMU data.

Fig. 2. A drawing of the measurement setup and the coordinate axes in the excavator. The location of the measurement device is marked with the “IMU” label (IMU = inertial measurement unit).

Fig. 3. Manual stoniness measurements at four different sites. The circles are the center points for the series of 10 bar insertions and the color indicates the year of the measurements. (Aerial photo © National Land Survey of Finland 2018).

Table 1. Initial features before feature selection.
Features for IMU signals
Basic descriptive statistics	Mean	Minimum
	Median	Maximum
	Standard deviation (STD)	25th percentile
	Variance	75th percentile
	Median absolute deviation (MAD)	Skewness
	Interquartile range (IQR)	Kurtosis
Signal metrics	Peak to peak
	Root-mean-square level (RMS)
	Signal power
	1-lag autocorrelation
Frequency domain properties	Dominant frequency
	Dominant frequency magnitude
Correlations	Correlation coefficients between axes

Table 2. Features ranked highest by the ReliefF algorithm.
Ranked features for each axis by the ReliefF algorithm (Rank order)
	Activity recognition	Stoniness classification
Accelerometer X	-	Minimum (10) Maximum (18) Peak to peak (20) Dominant frequency magnitude (21) Autocorrelation (22)
Accelerometer Y	Kurtosis (4) Maximum (5) MAD (11) IQR (14) 75-percentile (24)	Max (2) Peak to peak (3) Autocorrelation (5) 75-percentile (12) STD (14) Minimum (16) Dominant frequency (23) Power (24) Variance (25)
Accelerometer Z	Dominant frequency magnitude (7) Autocorrelation (12) Mean (15) 75-percentile (22) Median (25)	Autocorrelation (13) MAD (15) IQR (19)
Gyroscope X	MAD (1) Dominant frequency magnitude (10) 75-percentile (21)	Autocorrelation (6)
Gyroscope Y	-	Autocorrelation (8)
Gyroscope Z	Minimum (3) 75-percentile (6) Median (8) 25-percentile (13) Mean (20) MAD (23)	-
Correlation coeff.	Acc. X – Acc. Y (2) Acc. Y – Acc. Z (9) Gyro. Y – Gyro. Z (19)	Gyro. X – Gyro. Y (1) Acc. Z – Gyro. Y (4) Acc. X – Acc. Y (7) Acc. Y – Gyro. X (9) Acc. Z – Gyro. X (11) Acc. X – Gyro. Z (17)
STD = standard deviation, MAD = Median absolute deviation, IQR = Interquartile range.

Fig. 4. The misclassification rate with an increasing number of features selected by the ReliefF algorithm in (A) activity recognition and (B) stoniness classification.

Table 3. Selected features in activity recognition by the SFS algorithm.
Selected features for each axis by the SFS algorithm (activity recognition)
	SVM	BDTree	KNN	NaïveBayes	LDA	QDA
Accelerometer X	RMS Peak to peak	Minimum	-	-	-	-
Accelerometer Y	Variance	STD	Maximum STD Peak to peak	STD	Variance	Variance
Accelerometer Z	-	-	-	-	-	-
Gyroscope X	-	-	25-percentile	Median	MAD	MAD
Gyroscope Y	-	Peak to peak	-	-	-	-
Gyroscope Z	-	-	-	Mean	Mean	Mean
SVM = Support Vector Machine, BDTree = Binary Decision Tree, KNN = K-Nearest Neighbors, LDA = Linear Discriminant Analysis, QDA = Quadratic Discriminant Analysis. RMS = Root-mean-square level, STD = standard deviation, MAD = Median absolute deviation.

Table 4. Selected features in stoniness classification by the SFS algorithm.
Selected features for each axis by the SFS algorithm (stoniness classification)
	SVM	BDTree	KNN	NaïveBayes	LDA	QDA
Accelerometer X	Dom. freq.	-	-	-	RMS 75-percentile Dom. freq. mag.	RMS 75-percentile Dom. freq. mag.
Accelerometer Y	Autocorrelation	-	-	Mean Skewness Autocorrelation	Median Maximum Variance Power Autocorrelation	Median Maximum Variance Power Autocorrelation
Accelerometer Z	STD 75-percentile	-	-	Median 75-percentile	Mean	Mean
Gyroscope X	Maximum	-	-	-	Maximum Peak to peak	Maximum Peak to peak
Gyroscope Y	Maximum	-	-	MAD	MAD Kurtosis Peak to peak	MAD Kurtosis Peak to peak
Gyroscope Z	Minimum	Dom. freq.	Dom. freq.	-	Minimum Autocorrelation	Minimum Autocorrelation
Correlation coeff.	-	-	-	Acc. X – Acc. Y Acc. X – Gyro. Y	Acc. X – Gyro. Y	Acc. X – Gyro. Y
SVM = Support Vector Machine, BDTree = Binary Decision Tree, KNN = K-Nearest Neighbors, LDA = Linear Discriminant Analysis, QDA = Quadratic Discriminant Analysis. RMS = Root-mean-square level, STD = standard deviation, MAD = Median absolute deviation.

Fig. 5. The accuracy and percentage of false-positive predictions in the activity recognition using two feature-selection algorithms, ReliefF and Sequential Feature Selection (SFS).

Table 5. The stoniness prediction accuracy.
Stoniness prediction accuracy (%)
	SVM	BDTree	KNN	NaïveBayes	LDA	QDA	ANN
Point prediction (ReliefF)	43.3	43.8	42.6	42.6	47.6	44.6	43.0
Point prediction (SFS)	45.0	37.7	45.6	44.6	44.6	40.8	43.0
Grid prediction (ReliefF)	66.1	69.6	37.5	64.3	35.7	57.1	62.5
Grid prediction (SFS)	75.0	26.8	46.4	50.0	28.6	53.6	62.5
SVM = Support Vector Machine, BDTree = Binary Decision Tree, KNN = K-Nearest Neighbors, LDA = Linear Discriminant Analysis, QDA = Quadratic Discriminant Analysis, ANN = a neural network model.

Fig. 6. Predicted stoniness classes and manual measurements for the first site.

Fig. 7. Predicted stoniness classes and manual measurements for the second site.

Fig. 8. Predicted stoniness classes and manual measurements for the third site.

Fig. 9. Predicted stoniness classes and manual measurements for the fourth site.

Fig. 10. Entropy calculated for each grid cell in the first site.