keras/examples/antirectifier.py

'''The example demonstrates how to write custom layers for Keras.

We build a custom activation layer called 'Antirectifier',
which modifies the shape of the tensor that passes through it.
We need to specify two methods: `compute_output_shape` and `call`.

Note that the same result can also be achieved via a Lambda layer.

Because our custom layer is written with primitives from the Keras
backend (`K`), our code can run both on TensorFlow and Theano.
'''
from __future__ import print_function
import keras
from keras.models import Sequential
from keras import layers
from keras.datasets import mnist
from keras import backend as K


class Antirectifier(layers.Layer):
    '''This layer combines sample-wise L2 normalization with the
    concatenation of the positive part of the input and the
    negative part of the input. The result is a tensor of samples
    that are twice as large as the input samples.

    It can be used in place of a ReLU.

    # Input shape
        2D tensor of shape (samples, n)

    # Output shape
        2D tensor of shape (samples, 2*n)

    # Theoretical justification
        When applying ReLU, assuming that the distribution
        of the previous output is approximately centered around 0.,
        you are discarding half of your input. This is inefficient.

        Antirectifier returns all-positive outputs, like ReLU,
        but without discarding any data.

        Tests on MNIST show that Antirectifier can train networks
        with half as many parameters, yet with classification accuracy
        comparable to an equivalent ReLU-based network.
    '''

    def compute_output_shape(self, input_shape):
        shape = list(input_shape)
        assert len(shape) == 2  # only valid for 2D tensors
        shape[-1] *= 2
        return tuple(shape)

    def call(self, inputs):
        # center each sample, then L2-normalize it
        inputs -= K.mean(inputs, axis=1, keepdims=True)
        inputs = K.l2_normalize(inputs, axis=1)
        # concatenate the positive and negative parts,
        # doubling the feature dimension
        pos = K.relu(inputs)
        neg = K.relu(-inputs)
        return K.concatenate([pos, neg], axis=1)
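

# As the module docstring notes, the same result can also be achieved with a
# Lambda layer instead of a custom Layer subclass. A minimal, commented-out
# sketch for illustration only (the script below keeps using the
# Antirectifier class defined above):
#
#     def antirectifier(x):
#         x -= K.mean(x, axis=1, keepdims=True)
#         x = K.l2_normalize(x, axis=1)
#         return K.concatenate([K.relu(x), K.relu(-x)], axis=1)
#
#     def antirectifier_output_shape(input_shape):
#         shape = list(input_shape)
#         shape[-1] *= 2
#         return tuple(shape)
#
#     layers.Lambda(antirectifier, output_shape=antirectifier_output_shape)
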
# global parameters
batch_size = 128
num_classes = 10
epochs = 40

# the data, split between train and test sets
(x_train, y_train), (x_test, y_test) = mnist.load_data()

x_train = x_train.reshape(60000, 784)
x_test = x_test.reshape(10000, 784)
x_train = x_train.astype('float32')
x_test = x_test.astype('float32')
x_train /= 255
x_test /= 255
print(x_train.shape[0], 'train samples')
print(x_test.shape[0], 'test samples')

# convert class vectors to binary class matrices
y_train = keras.utils.to_categorical(y_train, num_classes)
y_test = keras.utils.to_categorical(y_test, num_classes)

# build the model
model = Sequential()
model.add(layers.Dense(256, input_shape=(784,)))
model.add(Antirectifier())
model.add(layers.Dropout(0.1))
model.add(layers.Dense(256))
model.add(Antirectifier())
model.add(layers.Dropout(0.1))
model.add(layers.Dense(num_classes))
model.add(layers.Activation('softmax'))

# compile the model
model.compile(loss='categorical_crossentropy',
              optimizer='rmsprop',
              metrics=['accuracy'])

# train the model
model.fit(x_train, y_train,
          batch_size=batch_size,
          epochs=epochs,
          verbose=1,
          validation_data=(x_test, y_test))

# next, compare with an equivalent network
# with 2x bigger Dense layers and ReLU
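
# A minimal sketch of what that comparison might look like. This block is an
# illustration, not part of the original script: the same architecture with
# 512-unit Dense layers and ReLU activations in place of 256 units followed
# by Antirectifier. Note that the ReLU version has roughly twice the
# parameters (e.g. its first layer holds 784*512 + 512 = 401,920 parameters
# vs 784*256 + 256 = 200,960 above).
#
#     reference_model = Sequential()
#     reference_model.add(layers.Dense(512, activation='relu',
#                                      input_shape=(784,)))
#     reference_model.add(layers.Dropout(0.1))
#     reference_model.add(layers.Dense(512, activation='relu'))
#     reference_model.add(layers.Dropout(0.1))
#     reference_model.add(layers.Dense(num_classes, activation='softmax'))
#     reference_model.compile(loss='categorical_crossentropy',
#                             optimizer='rmsprop',
#                             metrics=['accuracy'])
#     reference_model.fit(x_train, y_train,
#                         batch_size=batch_size,
#                         epochs=epochs,
#                         verbose=1,
#                         validation_data=(x_test, y_test))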