A predictive model for precision tree measurements using applied machine learning

Date
2022-04
Journal Title
Journal ISSN
Volume Title
Publisher
Stellenbosch : Stellenbosch University
Abstract
ENGLISH ABSTRACT: Accurately determining biological asset values is of great importance for forestry enterprises — the process ought to be characterised by the proper collection of tree data by means of utilising appropriate enumeration practices conducted at managed forest compartments. Currently, only between 5–20% of forest areas are enumerated which serve as a representative sample for the entire enclosing compartment. For forestry companies, timber volume estimations and future growth projections are based on these statistics, which may be accompanied by numerous unintentional errors during the data collection process. Many alternative methods towards estimating and inferring tree data accurately are available in the literature — the most popular characteristic is the so-called diameter at breast height (DBH), which can also be measured by means of remote sensing techniques. The advancements in laser scanning measurement apparatuses are significant in recent decades, however, these approaches are notably expensive and require specialised and technical skills for their operation. One of the main drawbacks associated with the measurement of DBH by means of laser scanning is the lack of scalability — equipment setup and data capture are arduous processes that take a significant amount of time to complete. Algorithmic breakthroughs in the domain of data science, predominantly spanning machine learning (ML) and deep learning (DL) approaches, warrant the selection and practical application of computer vision (CV) procedures. More specifically, an algorithmic approach towards monocular depth estimation (MDE) techniques was employed for the extraction of tree data features from video recordings (captured using no more than an ordinary smartphone device) and are investigated in this thesis. Towards this end, a suitable forest study area was identified to conduct the experiment and the industry partner of the project, i.e. the South African Forestry Company SOC Limited (SAFCOL) granted the necessary plantation access. The research methodology adopted for this thesis includes fieldwork at the given site, which involved first performing data collection steps according to accepted and standardised operating procedures developed for tree enumerations. This data set is regarded as the “ground truth” and comprises the target feature (i.e. actual DBH measurements) later used for modelling purposes. The video files were processed in a structured manner in order to extract tree segment patterns from the correspond ing imagery. Various ML models are then trained and tested in respect of the basic input feature data file, which produced a relative root mean squared error (RMSE %) between 14.1 and 18.3% for the study. The relative bias yields a score between −0.08% and 1.13% indicating that the proposed workflow solution exhibits a consis tent prediction result, but at an undesirable error rate (i.e. RMSE) deviation from the target output. Additionally, the suggested CV/ML workflow model is capable of generating a dis cernibly similar spatial representation upon visual inspection (when compared with the ground truth data set — i.e. tree coordinates captured during fieldwork). In the pursuit of precision forestry, the proposed predictive model developed for accurate tree measurements produce DBH estimations that approximate real-world values with a fair degree of accuracy.
AFRIKAANSE OPSOMMING: Die akkurate bepaling van biologiese batewaardes is baie belangrik vir groot bos bou ondernemings — die proses word gekenmerk deur die korrekte versameling van boomdata, deur gebruik te maak van gepaste opsommingspraktyke wat in bestuurde bosbou kompartemente uitgevoer word. Tans word slegs tussen 5 en 20% van die bosareas opgesom wat as ‘n verteenwoordigende steekproef van die hele omhulde kompartement dien. Vir bosbou ondernemings is die beraming van houtvolumes en toekomstige groeiprojeksies gebaseer op hierdie statistieke, wat moontlik gepaard gaan met talle onbedoelde foute tydens die data-insamelingsproses. Baie alternatiewe metodes om boomdata akkuraat te bereken is in die literatuur beskikbaar — die gewildste data punt (kenmerkend in bosbou) is die sogenaamde diameter op borshoogte (DBH), wat selfs ook gemeet kan word deur middel van af standswaarnemings tegnieke. Die vooruitgang in meetapparate vir laserskandering is die afgelope dekades aansienlik verbeter, maar hierdie benaderings is veral duur en vereis gespesialiseerde en tegniese vaardighede vir die werking daarvan. Een van die belangrikste nadele verbonde aan die meting van DBH deur middel van hierdie laserskandering is die gebrek aan skaalbaarheid — die opstel van toerusting en die opneem van data is moeisame prosesse wat aansienlik baie Algoritmiese deurbrake op die gebied van data wetenskap, wat oorwegend masjien leer (ML) en diep leer (DL) benaderings bevat, regverdig die keuse en praktiese toepassing van rekenaarvisie (CV) prosedures. Meer spesifiek is die algoritmiese benadering ten opsigte van monokulˆere diepte skatting (MDE) tegnieke vir die ont trekking van boomdatafunksies vanuit video opnames (met nie meer as ‘n gewone slimfoonapparaat nie) en word in hierdie tesis deeglik ondersoek. Hiervoor is ‘n geskikte bosstudiegebied ge¨ıdentifiseer om die eksperiment uit te voer en die bedryfs vennoot van die projek, South African Forestry Company SOC Limited (SAFCOL) het die nodige toegang tot die plantasie verleen. Die navorsingsmetodologie wat vir hierdie proefskrif aangeneem is, bevat veldwerk op die gegewe terrein en die eerste stap van die uitgevoerde data insameling was volgens aanvaarde en gestandaardiseerde werkingsprosedures wat vir boomtellings neem om te voltooi. ontwikkel is. Hierdie opgawe en datastel word beskou as die “grondwaarheid” en be vat die teikenfunksie (werklike DBH metings), wat later vir modelleringsdoeleindes gebruik is. Die videolˆeers is op ‘n gestruktureerde manier verwerk om boomseg ment patrone uit die ooreenstemmende beelde te onttrek. Verskeie ML modelle word dan opgelei en getoets ten opsigte van die basiese invoerfunksiedatalˆeer, wat ‘n relatiewe wortel gemiddelde kwadraatfout (RMSE %) tussen 14.1% en 18.3% vir die studie opgelewer het. Die relatiewe vooroordeel lewer ‘n telling tussen −0.08% en 1.13% wat aandui dat die voorgestelde werkstroom oplossing ‘n konstante voor spellings resultaat toon, maar met ‘n ongewenste foutkoers (RMSE) afwyking vanaf die teikenuitset wat verlang word. Verder kan die voorgestelde CV/ML werkstroom model ook ‘n waarneembare en soortgelyke ruimtelike voorstelling genereer onder meer visuele inspeksie (in verge lyking met die grondwaarheids data stel — m.a.w. boomko¨ordinate wat tydens veld werk vasgelˆe is). In die strewe na presiese bosbou lewer hierdie voorspellingsmodel wat ontwikkel is vir boommetings (i.t.m. DBH beramings), die werklike waardes verteenwoordigend tot ‘n redelike mate van akkuraatheid.
Description
Thesis (MEng)--Stellenbosch University, 2022.
Keywords
Machine learning, Deep learning, Computer vision, UCTD, Model based predictive control
Citation