Artigo Revisado por pares

Human Visual System vs Convolution Neural Networks in food recognition task: An empirical comparison

2019; Elsevier BV; Volume: 191; Linguagem: Inglês

10.1016/j.cviu.2019.102878

ISSN

1090-235X

Autores

Pedro Furtado, Manuel Caldeira, Pedro Martins,

Tópico(s)

Nutritional Studies and Diet

Resumo

Automated food recognition from food plate is useful for smartphone-based applications promoting healthy lifestyles and for automated carbohydrate counting, e.g. targeted at type I diabetic patients, but the variation of appearance of food items makes it a difficult task. Convolution Neural Networks (CNNs) raised to prominence in recent years, and they will enable those applications if they are able to match HVS accuracy at least in meal classification. In this work we run an experimental comparison of accuracy between CNNs and HVS based on a simple meal recognition task. We set up a survey for humans with two phases, training and testing, and also give the food dataset to state-of-the-art CNNs. The results, considering some relevant variations in the setup, allow us to reach conclusions regarding the comparison, characteristics and limitations of CNNs, which are relevant for future improvements.

Referência(s)