Visual Learning of Arithmetic Operation

Abstract

A simple Neural Network model is presented for end-to-end visual learning of arithmetic operations from pictures of numbers. The input consists of two pictures, each showing a 7-digit number. The output, also a picture, displays the number showing the result of an arithmetic operation (e.g., addition or subtraction) on the two input numbers. The concepts of a number, or of an operator, are not explicitly introduced. This indicates that addition is a simple cognitive task, which can be learned visually using a very small number of neurons. Other operations, e.g., multiplication, were not learnable using this architecture. Some tasks were not learnable end-to-end (e.g., addition with Roman numerals), but were easily learnable once broken into two separate sub-tasks: a perceptual Character Recognition and cognitive Arithmetic sub-tasks. This indicates that while some tasks may be easily learnable end-to-end, other may need to be broken into sub-tasks.

Cite

Text

Hoshen and Peleg. "Visual Learning of Arithmetic Operation." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.9882

Markdown

[Hoshen and Peleg. "Visual Learning of Arithmetic Operation." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/hoshen2016aaai-visual/) doi:10.1609/AAAI.V30I1.9882

BibTeX

@inproceedings{hoshen2016aaai-visual,
  title     = {{Visual Learning of Arithmetic Operation}},
  author    = {Hoshen, Yedid and Peleg, Shmuel},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2016},
  pages     = {3733-3739},
  doi       = {10.1609/AAAI.V30I1.9882},
  url       = {https://mlanthology.org/aaai/2016/hoshen2016aaai-visual/}
}