A Lightweight Method for Grape Berry Counting based on Automated3D Bunch Reconstruction from a Single Image
Scarlett Liu
1
, Mark Whitty
2
and Steve Cossell
3
Abstract
—Berry counting is an integral step towards grapevine yield estimation. As a traditional yield estimation step,counting berry by human hand is tedious and time consuming.Recent methods have approached this using specialized stereocameras and lighting rigs which are impractical for a large scalefield application. This paper presents a lightweight method forgenerating a representative 3D reconstruction of an individualgrape bunch from a single image from one side of the bunch.The results were poor prior to the application of a sparsityfactor to compensate for bunches of varying sparsity, with thefinal result being an absolute average accuracy of 87.6% andaverage error of 4.6%, with an
R
2
value of 0.85. These resultsshow promise for
in vivo
counting of berry numbers in a non-computationally expensive manner.
Keywords
: Grape, Berry, Viticulture, Image Processing, 3DBunch ReconstructionI. INTRODUCTIONYield estimation in viticulture is notorious for producingpoor estimates due to range of sampling factors and de-pendency on subjective interpretation of the state of vinematurity. This poor estimation costs hundreds of millionsof dollars each year in contract adjustments, harvest logisticmanagement, oak barrel purchases and tank space allocationamongst others. The structure of vineyards means aerialimagery is only able to contribute a small amount to theyield estimation, and other on ground estimation methods aretime consuming. Recent work by Nuske [1] in the US hasshown the potential for image processing to speed up thisanalysis as well as generate unbiased estimates which areorders of magnitude smaller than manual estimates, leadingto substantial cost savings.As to traditional yield estimation in vineyards, berry num-ber is a critical parameter for early forecasting productionsince the number of berries remains stable after fruit setting[2]. Also the ratio between of berry number per bunch andbunch size is one of many factors governing the quality of the fruit at harvest. At current vineyards, counting berry isaccomplished by hand, which is work intensive and timeconsuming. [3], [4], [5] demonstrated the advantages of image processing on yield components analysis for the sakeof saving time and energy for grape production forecast. [6],
1
Scarlett Liu is with School of Mechanical and Manufacturing,University of New South Wales, 2052 Sydney, Australia
sisi.liu@unsw.edu.au
2
Mark Whitty is with School of Mechanical and Manufacturing,University of New South Wales, 2052 Sydney, Australia
m.whitty@unsw.edu.au
3
Steve Cossell is with School of Mechanical and Manufac-turing, University of New South Wales, 2052 Sydney, Australia
scos506@gmail.com
[4] applied image processing techniques for berry countingone side of a bunch, achieving average
R
2
value of 0.92 and0.82 between actual berries and detected berries per bunch.However, the image processing algorithm proposed in paper[6] can not be utilised after v´eraison since the reflection onberry skin is affected by pruine (which causes matte surfaceon berries on both green and purple grapes). As the work presented by Diago [4], a dataset with 70 bunches from 7 va-rieties was tested, with a
R
2
value varying from 0.62 to 0.95based on 10 bunches for each variety (0.817 for 7 caltivarsin average). Leaving the image techniques described by theauthor alone, 10 bunches is not representative for validatingimage processing procedure in one cultivar. Especially forCabernet Sauvignon as well as Shiraz which are famous fornon-uniform bunch shape, [4] obtained the lowest
R
2
valuewith 0.62 based a single image of Cabernet Sauvignon from7 cultivars.Except detecting berries from one side by processing oneimage, other work [5], [7] showed the advantages of perform-ing 3D reconstruction of grape bunches for the purpose of estimating the number of grapes in a bunch by stereo images.Their accuracy improved achieved an
R
2
value of 0.78 op-posed to more traditional 2D estimation techniques [3] whichhave been a staple for the image processing community [8],[9]. Their 3D reconstruction relies on substantial manualinput (semi-automatic) for each bunch, which is tediouseven given an impressive user interface and thus cannotbe applied on a large scale for reliable yield estimation.As to the scope of experiment, data sets in paper [5], [7]are small, 10 bunches from one cultivar (10 cultivars) and20 bunches from 14 vines in one block, respectively. Also
R
2
achieved in both paper are 0.71 and 0.78, which is notsatisfied for practical implementation in current vineyards.In addition, a specialized stereo camera arrangement wasrequired, along with controlled lighting conditions, limitingthe applicability to
ex vivo
analysis. Stereo cameras alsohave a minimum range which restricts the level of detailwhich may be achieved by moving closer, meaning in fieldapplication within the confines of a sprawling canopy isimpractical.In order to increase of these image processing methods,low cost and simpler solutions are needed that can be appliedby farmers on the ground. Thus objective of this paper is toform a representative 3D reconstruction of grape bunchesfrom a single image for the purpose of accurate berrycounting. The use of a single image only is a key feature,which simplifies the data capture process and keeps thecost manageable, to the point where cameras such as those
ad
ad
ad
Reward Your Curiosity
Everything you want to read.
Anytime. Anywhere. Any device.
No Commitment. Cancel anytime.