Issue
I tried the code below in TensorFlow, with several variations, for a simple regression problem. I synthesized data as y = 10.0*x: one input and one outcome variable. But TensorFlow gives me a loss between ~2,000 and ~200,000. I am using MSE as the loss function. I also tried relu activations, without success.
- What should the appropriate model be for such a simple regression problem?
- What should the model be for a problem like y = x**3?
from tensorflow.keras import layers, models

def PolynomialModel():
    inp = layers.Input((1,))
    l = layers.Dense(16, activation='tanh')(inp)
    l = layers.Dense(8, activation='tanh')(l)
    l = layers.Dropout(.5)(l)
    l = layers.Dense(4, activation='tanh')(l)
    l = layers.Dropout(.5)(l)
    output = layers.Dense(1, activation='tanh')(l)
    return models.Model(inp, output)
Solution
In fact, you don't need any hidden layers, just an input of size degree + 1 (one feature per power of x) feeding a single linear Dense(1) unit: the model then only has to learn the polynomial's coefficients. (Your original model cannot fit this data in any case: the final tanh activation bounds the output to [-1, 1], and Dropout(0.5) injects noise into a deterministic regression problem.) Check the documentation of PolynomialFeatures from sklearn for more information.
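For instance, here is a quick sketch of the equivalent feature expansion with scikit-learn (assuming scikit-learn is available; include_bias=True keeps the x**0 column, matching the input built below):

import numpy as np
from sklearn.preprocessing import PolynomialFeatures

# Expand a single feature x into [x**0, x**1, x**2, x**3]
x = np.array([[2.0], [3.0]])
poly = PolynomialFeatures(degree=3, include_bias=True)
print(poly.fit_transform(x))
# [[ 1.  2.  4.  8.]
#  [ 1.  3.  9. 27.]]

The full TensorFlow version: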
import tensorflow as tf
from tensorflow.keras import layers, models, optimizers, activations
def PolynomialModel(degree):
    # One input per power of x (x**0 .. x**degree), one linear output unit
    inp = layers.Input((degree + 1,), name='Input')
    out = layers.Dense(1, name='Output')(inp)  # activation='linear' by default
    return models.Model(inp, out, name='PolynomialRegressor')
# Suppose you want to fit a polynomial function of degree 3
degree = 3
model = PolynomialModel(degree)
model.compile(loss='mean_squared_error', optimizer=optimizers.Adam(0.1))
# You need an input vector of degree+1 (here: x**0, x**1, x**2 and x**3)
x = tf.linspace(-20.0, 20.0, 1000)  # float bounds keep the features floating-point
X = tf.transpose(tf.convert_to_tensor([x**p for p in range(degree+1)]))
y = x**3
model.fit(X, y, epochs=200)
model.predict([[4**0, 4**1, 4**2, 4**3]])
Output:
Epoch 1/200
32/32 [==============================] - 0s 974us/step - loss: 891936.5000
Epoch 2/200
32/32 [==============================] - 0s 945us/step - loss: 25650.7812
Epoch 3/200
32/32 [==============================] - 0s 897us/step - loss: 1188.4584
Epoch 4/200
32/32 [==============================] - 0s 1ms/step - loss: 75.3480
Epoch 5/200
32/32 [==============================] - 0s 2ms/step - loss: 21.2651
...
Epoch 196/200
32/32 [==============================] - 0s 1ms/step - loss: 1.2130e-11
Epoch 197/200
32/32 [==============================] - 0s 1ms/step - loss: 1.5810e-11
Epoch 198/200
32/32 [==============================] - 0s 1ms/step - loss: 1.2358e-11
Epoch 199/200
32/32 [==============================] - 0s 1ms/step - loss: 1.2775e-11
Epoch 200/200
32/32 [==============================] - 0s 1ms/step - loss: 1.4443e-11
Test:
>>> model.predict([[4**0, 4**1, 4**2, 4**3]])
1/1 [==============================] - 0s 63ms/step
array([[64.]], dtype=float32)
>>> model.predict([[3**0, 3**1, 3**2, 3**3]])
1/1 [==============================] - 0s 51ms/step
array([[27.000002]], dtype=float32)
>>> model.summary()
Model: "PolynomialRegressor"
_________________________________________________________________
Layer (type)                 Output Shape              Param #
=================================================================
 Input (InputLayer)          [(None, 4)]               0
 Output (Dense)              (None, 1)                 5
=================================================================
Total params: 5 (20.00 Byte)
Trainable params: 5 (20.00 Byte)
Non-trainable params: 0 (0.00 Byte)
_________________________________________________________________
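Those 5 parameters are exactly the polynomial's coefficients: 4 kernel weights (one per power of x) plus the bias. As a sketch, you can read them back after training with the standard Keras get_weights API (the expected values below assume a perfect fit of y = x**3):

# The kernel holds one weight per power of x, plus a separate bias
kernel, bias = model.get_weights()
print(kernel.ravel())  # ~[0. 0. 0. 1.] for y = x**3
print(bias)            # ~[0.]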
Note: x**0 is not strictly necessary (it is a column of ones, and the Dense layer's bias already plays that role), but I want the same behavior as PolynomialFeatures. So if your degree is 3, you can also consider an input of size 3 (and not 4).
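For the first question (y = 10.0*x), the same idea reduces to plain linear regression: no hidden layers and no feature expansion at all. A minimal sketch (this variant is not from the original answer; linear_model is an illustrative name):

# Linear regression for y = 10*x: a single Dense(1) on the raw input
inp = layers.Input((1,))
out = layers.Dense(1)(inp)  # should learn weight ~10 and bias ~0
linear_model = models.Model(inp, out)
linear_model.compile(loss='mean_squared_error', optimizer=optimizers.Adam(0.1))

x = tf.linspace(-20.0, 20.0, 1000)
y = 10.0 * x
linear_model.fit(tf.reshape(x, (-1, 1)), y, epochs=200, verbose=0)
linear_model.predict([[4.0]])  # ~[[40.]]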
Answered By - Corralien