14When an arbitrary number of hidden layers is allowed, one can devise many alternative schemes. For instance, a squaring function x x2 can be approximated on a small interval via linear combinations of an arbitrary nonlinear function F, since a Taylor expansion around a constant c gives x2 = 2[F(c + x) -F(c) - xF(c)]F′′(c) + O(x3). The only provision here is that F is at least three times differentiable (or at least four times differentiable if we would have used the more accurate alternative x2 = [F(c + x) - 2F(c) + F(c-x)]F′′(c) + O(x4)). These requirements are satisfied by our C functions F 0, F1 and F2. A multiplication xy can subsequently be constructed as a linear combination of squaring functions through xy = 1
4[(x + y)2 - (x-y)2], xy = 1
2[(x + y)2 -x2 -y2] or xy = -1
2[(x-y)2 -x2 -y2]. A combination of additions and multiplications can then be used to construct any multidimensional polynomial, which in turn can be used to approximate any continuous multidimensional function up to arbitrary accuracy. See also [33].