The search functionality is under construction.
The search functionality is under construction.

Open Access
Simultaneous Estimation of Dish Locations and Calories with Multi-Task Learning

Takumi EGE, Keiji YANAI

  • Full Text Views

    76

  • Cite this
  • Free PDF (2.6MB)

Summary :

In recent years, a rise in healthy eating has led to various food management applications which have image recognition function to record everyday meals automatically. However, most of the image recognition functions in the existing applications are not directly useful for multiple-dish food photos and cannot automatically estimate food calories. Meanwhile, methodologies on image recognition have advanced greatly because of the advent of Convolutional Neural Network (CNN). CNN has improved accuracies of various kinds of image recognition tasks such as classification and object detection. Therefore, we propose CNN-based food calorie estimation for multiple-dish food photos. Our method estimates dish locations and food calories simultaneously by multi-task learning of food dish detection and food calorie estimation with a single CNN. It is expected to achieve high speed and small network size by simultaneous estimation in a single network. Because currently there is no dataset of multiple-dish food photos annotated with both bounding boxes and food calories, in this work we use two types of datasets alternately for training a single CNN. For the two types of datasets, we use multiple-dish food photos annotated with bounding boxes and single-dish food photos with food calories. Our results showed that our multi-task method achieved higher accuracy, higher speed and smaller network size than a sequential model of food detection and food calorie estimation.

Publication
IEICE TRANSACTIONS on Information Vol.E102-D No.7 pp.1240-1246
Publication Date
2019/07/01
Publicized
2019/04/25
Online ISSN
1745-1361
DOI
10.1587/transinf.2018CEP0004
Type of Manuscript
Special Section PAPER (Special Section on Multimedia for Cooking and Eating Activities)
Category

Authors

Takumi EGE
  The University of Electro-Communications
Keiji YANAI
  The University of Electro-Communications

Keyword