TensorFlow Variables
Introduction
In TensorFlow, variables are special tensors that maintain state across multiple executions of your model. Unlike regular tensors, variables persist their values during the lifetime of a program and can be modified. This makes them essential for storing and updating model parameters like weights and biases during training.
Variables are one of the fundamental building blocks in TensorFlow that allow you to build trainable models. In this tutorial, we'll explore how to create, initialize, use, and update TensorFlow variables.
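To make the contrast with regular tensors concrete, here is a minimal sketch showing that a variable can be updated in place while a tensor cannot (the names t and v are illustrative):
import tensorflow as tf
t = tf.constant([1.0, 2.0])  # a regular tensor: its value is fixed
v = tf.Variable([1.0, 2.0])  # a variable: mutable state
v.assign([3.0, 4.0])         # in-place update; only variables support this
print(v.numpy())             # [3. 4.]
# t has no assign method; tensors are immutable, so representing a
# changed value requires creating a new tensor.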
Creating TensorFlow Variables
Basic Variable Creation
You can create a TensorFlow variable using tf.Variable(). Let's start with a simple example:
import tensorflow as tf
# Creating a scalar variable initialized with a value of 3.0
x = tf.Variable(3.0)
# Print the variable
print(x)
print("Type:", type(x))
print("Shape:", x.shape)
print("Value:", x.numpy())
Output:
<tf.Variable 'Variable:0' shape=() dtype=float32, numpy=3.0>
Type: <class 'tensorflow.python.ops.resource_variable_ops.ResourceVariable'>
Shape: ()
Value: 3.0
Creating Variables with Different Shapes and Types
You can create variables with various shapes and data types:
# Create a vector variable
v = tf.Variable([1.0, 2.0, 3.0])
# Create a matrix variable
m = tf.Variable([[1.0, 2.0], [3.0, 4.0]])
# Create an integer variable
i = tf.Variable(42, dtype=tf.int32)
print("Vector:", v.numpy())
print("Matrix:", m.numpy())
print("Integer:", i.numpy())
Output:
Vector: [1. 2. 3.]
Matrix: [[1. 2.]
[3. 4.]]
Integer: 42
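If you don't pass dtype explicitly, TensorFlow infers it from the initial value: Python floats become float32 and Python integers become int32. A quick sketch to illustrate:
a = tf.Variable([1, 2, 3])        # dtype inferred as int32
b = tf.Variable([1.0, 2.0, 3.0])  # dtype inferred as float32
print(a.dtype)  # <dtype: 'int32'>
print(b.dtype)  # <dtype: 'float32'>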
Creating Variables from Existing Tensors
You can also create variables from existing tensors:
tensor = tf.constant([[1.0, 2.0], [3.0, 4.0]])
var_from_tensor = tf.Variable(tensor)
print(var_from_tensor)
Output:
<tf.Variable 'Variable:0' shape=(2, 2) dtype=float32, numpy=
array([[1., 2.],
[3., 4.]], dtype=float32)>
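Note that the variable stores its own copy of the tensor's initial value; updating the variable afterwards does not affect the original tensor. A small sketch:
var_from_tensor.assign([[0.0, 0.0], [0.0, 0.0]])
print(tensor.numpy())           # unchanged: [[1. 2.] [3. 4.]]
print(var_from_tensor.numpy())  # [[0. 0.] [0. 0.]]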
Variable Operations
Reading Variable Values
There are several ways to access the value of a TensorFlow variable:
x = tf.Variable(3.0)
# Method 1: Using numpy()
print("Value using numpy():", x.numpy())
# Method 2: Using read_value()
print("Value using read_value():", x.read_value())
# Method 3: Using tf.convert_to_tensor()
print("Value as tensor:", tf.convert_to_tensor(x))
Output:
Value using numpy(): 3.0
Value using read_value(): tf.Tensor(3.0, shape=(), dtype=float32)
Value as tensor: tf.Tensor(3.0, shape=(), dtype=float32)
Updating Variables
TensorFlow variables can be updated in several ways:
Assign Method
x = tf.Variable(3.0)
x.assign(10.0)
print("After assign:", x.numpy())
# Add and assign
x.assign_add(2.0)
print("After assign_add:", x.numpy())
# Subtract and assign
x.assign_sub(1.0)
print("After assign_sub:", x.numpy())
Output:
After assign: 10.0
After assign_add: 12.0
After assign_sub: 11.0
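Keep in mind that assign expects a value whose shape matches the variable's shape; assigning an incompatible shape raises an error. A minimal sketch:
v = tf.Variable([1.0, 2.0])
try:
    v.assign([1.0, 2.0, 3.0])  # shape (3,) does not match (2,)
except ValueError as e:
    print("Assignment failed:", e)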
Variables in Practice
Variables for Model Parameters
Variables are essential for storing model parameters. Here's an example of using variables as the weights and biases of a simple linear model:
# Create model parameters as variables
W = tf.Variable(tf.random.normal([3, 2]), name='weights')
b = tf.Variable(tf.zeros([2]), name='biases')
# Input data
x = tf.constant([[1.0, 2.0, 3.0]])
# Simple linear model
def linear_model(x):
    return tf.matmul(x, W) + b
# Compute the output
y = linear_model(x)
print("Model output:", y.numpy())
print("Weights:", W.numpy())
print("Biases:", b.numpy())
The output will vary due to random initialization, but might look like:
Model output: [[-0.23040158 -1.6147798 ]]
Weights: [[-0.5087863 0.21997929]
[ 0.01292515 -0.8250246 ]
[ 0.03260326 -0.03783466]]
Biases: [0. 0.]
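Not every variable needs to be trained. Variables are created with trainable=True by default, which makes them visible to gradient computation and to trainable_variables collections; passing trainable=False is useful for bookkeeping state such as a step counter. A small sketch (the variable names are illustrative):
step = tf.Variable(0, dtype=tf.int64, trainable=False, name='step')
print(W.trainable)     # True
print(step.trainable)  # False
step.assign_add(1)     # update the counter without affecting training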
Variables in Optimization
Variables are updated during training. Here's a simplified gradient descent example:
# Create a variable to optimize
x = tf.Variable(2.0)
# Define a simple function: f(x) = x^2
def f(x):
    return x**2
# Learning rate
learning_rate = 0.1
# Run 5 steps of gradient descent
for i in range(5):
    with tf.GradientTape() as tape:
        y = f(x)
    # Compute gradient of f(x) with respect to x
    grad = tape.gradient(y, x)
    # Update x using gradient descent: x = x - learning_rate * gradient
    x.assign_sub(learning_rate * grad)
    print(f"Step {i+1}: x = {x.numpy():.4f}, f(x) = {f(x).numpy():.4f}")
Output:
Step 1: x = 1.6000, f(x) = 2.5600
Step 2: x = 1.2800, f(x) = 1.6384
Step 3: x = 1.0240, f(x) = 1.0486
Step 4: x = 0.8192, f(x) = 0.6711
Step 5: x = 0.6554, f(x) = 0.4295
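In practice you rarely write the update rule by hand; an optimizer applies it for you. The following sketch reproduces the same five steps with tf.keras.optimizers.SGD, which for plain SGD should produce the same values given the same starting point:
x = tf.Variable(2.0)
optimizer = tf.keras.optimizers.SGD(learning_rate=0.1)
for i in range(5):
    with tf.GradientTape() as tape:
        y = x**2
    grad = tape.gradient(y, x)
    # The optimizer performs x.assign_sub(learning_rate * grad) internally
    optimizer.apply_gradients([(grad, x)])
    print(f"Step {i+1}: x = {x.numpy():.4f}")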
Variable Scope and Sharing
TensorFlow 2.x encourages object-oriented variable management, for example through tf.Module, rather than the variable scope mechanism of TensorFlow 1.x. Understanding how to organize variables is still important:
# Creating a simple neural network layer with variables
class SimpleLayer(tf.Module):
    def __init__(self, input_dim, output_dim, name=None):
        super().__init__(name=name)
        self.weight = tf.Variable(
            tf.random.normal([input_dim, output_dim]), name='weight'
        )
        self.bias = tf.Variable(tf.zeros([output_dim]), name='bias')

    def __call__(self, x):
        return tf.matmul(x, self.weight) + self.bias
# Create a layer and use it
layer1 = SimpleLayer(3, 2, name='layer1')
input_data = tf.constant([[1.0, 2.0, 3.0]])
output = layer1(input_data)
print("Layer output:", output.numpy())
print("Layer variables:", [var.name for var in layer1.variables])
Output:
Layer output: [[some values based on random initialization]]
Layer variables: ['layer1/weight:0', 'layer1/bias:0']
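tf.Module also tracks the variables of nested modules, so a model built from several layers exposes all of its parameters through a single variables property. A minimal sketch reusing SimpleLayer from above (SimpleMLP is an illustrative name):
class SimpleMLP(tf.Module):
    def __init__(self, name=None):
        super().__init__(name=name)
        self.layer1 = SimpleLayer(3, 4, name='layer1')
        self.layer2 = SimpleLayer(4, 2, name='layer2')

    def __call__(self, x):
        return self.layer2(self.layer1(x))

mlp = SimpleMLP(name='mlp')
output = mlp(tf.constant([[1.0, 2.0, 3.0]]))
print(len(mlp.variables))  # 4: two weights and two biases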
Saving and Loading Variables
Variables can be saved and restored using TensorFlow's checkpoint system:
# Create some variables
x = tf.Variable(10.0)
y = tf.Variable([1.0, 2.0, 3.0])
# Create a checkpoint object
checkpoint = tf.train.Checkpoint(x=x, y=y)
# Save the checkpoint
save_path = checkpoint.save('./checkpoints/variables')
print(f"Checkpoint saved at: {save_path}")
# Change the values
x.assign(11.0)
y.assign([4.0, 5.0, 6.0])
print("After modification - x:", x.numpy(), "y:", y.numpy())
# Restore the saved values
checkpoint.restore(save_path)
print("After restoration - x:", x.numpy(), "y:", y.numpy())
Output:
Checkpoint saved at: ./checkpoints/variables-1
After modification - x: 11.0 y: [4. 5. 6.]
After restoration - x: 10.0 y: [1. 2. 3.]
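For training jobs that save repeatedly, tf.train.CheckpointManager keeps a rolling set of checkpoints and remembers the most recent one. A minimal sketch continuing from the checkpoint above (the directory path is illustrative):
manager = tf.train.CheckpointManager(checkpoint, './checkpoints/managed', max_to_keep=3)
save_path = manager.save()
print("Saved to:", save_path)
# Later, restore the most recent checkpoint
checkpoint.restore(manager.latest_checkpoint)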
Real-World Example: Linear Regression
Let's implement a complete linear regression model using TensorFlow variables:
import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt
# Generate synthetic data
np.random.seed(42)
# Cast to float32 so the data matches the variables' default dtype
x_data = np.random.rand(100, 1).astype(np.float32)
y_data = (2 * x_data + 1 + 0.1 * np.random.randn(100, 1)).astype(np.float32)  # y = 2x + 1 + noise
# Create variables for model parameters
W = tf.Variable(tf.random.normal([1, 1], stddev=0.01))
b = tf.Variable(tf.zeros([1]))
# Define the model
def linear_regression(x):
    return tf.matmul(x, W) + b

# Define loss function
def loss_fn(y_pred, y_true):
    return tf.reduce_mean(tf.square(y_pred - y_true))
# Create an optimizer
optimizer = tf.keras.optimizers.SGD(learning_rate=0.1)
# Training loop
losses = []
for epoch in range(100):
    with tf.GradientTape() as tape:
        y_pred = linear_regression(x_data)
        loss = loss_fn(y_pred, y_data)
    # Compute gradients and update variables
    gradients = tape.gradient(loss, [W, b])
    optimizer.apply_gradients(zip(gradients, [W, b]))
    losses.append(loss.numpy())
    if epoch % 10 == 0:
        print(f"Epoch {epoch}, Loss: {loss.numpy():.6f}")
print(f"Final parameters - W: {W.numpy()}, b: {b.numpy()}")
# Plot the results
plt.figure(figsize=(10, 6))
plt.subplot(1, 2, 1)
plt.scatter(x_data, y_data)
x_test = np.linspace(0, 1, 10, dtype=np.float32).reshape(-1, 1)
y_pred = linear_regression(x_test).numpy()
plt.plot(x_test, y_pred, 'r-', linewidth=3)
plt.title('Linear Regression Fit')
plt.xlabel('x')
plt.ylabel('y')
plt.subplot(1, 2, 2)
plt.plot(losses)
plt.title('Loss over time')
plt.xlabel('Epoch')
plt.ylabel('Loss')
plt.tight_layout()
plt.savefig('linear_regression_result.png')
plt.close()
The code will train a simple linear regression model and should output something like:
Epoch 0, Loss: 1.323452
Epoch 10, Loss: 0.015666
Epoch 20, Loss: 0.010727
Epoch 30, Loss: 0.010381
Epoch 40, Loss: 0.010316
Epoch 50, Loss: 0.010303
Epoch 60, Loss: 0.010300
Epoch 70, Loss: 0.010299
Epoch 80, Loss: 0.010299
Epoch 90, Loss: 0.010299
Final parameters - W: [[1.9978735]], b: [1.0068734]
The trained parameters (W ≈ 2 and b ≈ 1) match our synthetic data generation formula (y = 2x + 1 + noise).
Summary
In this tutorial, we've covered:
- How to create TensorFlow variables with different shapes and types
- Reading and updating variable values
- Using variables to store model parameters
- Implementing gradient-based optimization with variables
- Variable scope and organization
- Saving and restoring variable values
- A complete linear regression example using TensorFlow variables
Variables are essential building blocks in TensorFlow that allow you to create stateful, trainable models. Understanding how to properly create and manage variables is crucial for effective deep learning model development.
Exercises
- Create a simple neural network model with two hidden layers using TensorFlow variables and train it on the MNIST dataset.
- Implement a custom training loop with variables for a logistic regression model.
- Experiment with different variable initializations (e.g., zeros, random normal, Xavier/Glorot) and observe their effects on training.
- Create a custom layer class that uses variables and train it within a model.
- Implement a saving and loading mechanism for your model variables using the checkpoint system.