Deep Learning - Part 2: Programming

Artificial Intelligence
Deep Learning
Tutorial

20. October 2017

Team statworx

Building on the theoretical introduction to neural networks and deep learning from the last blog post, Part 2 of the "Deep Learning" series will provide a hands-on demonstration of implementing a simple neural network (feedforward network) in Python. Various frameworks are available to users for this purpose. In this post, we will use Keras, one of the most important Python libraries for programming neural networks and deep learning models.

Overview of Deep Learning Frameworks

In recent years, the deep learning software ecosystem has seen many new additions. Numerous frameworks offering predefined building blocks for constructing deep learning networks have been released to the open-source community. These include Torch, used by Facebook, whose high-level interface is written in the scripting language Lua; Caffe, which enjoys great popularity in academic settings; Deeplearning4j, which provides a Java-based deep learning environment; and Theano, which focuses on mathematically efficient computations. Beyond these frameworks, many other libraries allow users to program both simple and complex deep learning models, notably Apache MXNet and Intel Nervana neon. However, the largest and most resource-rich deep learning framework at present is TensorFlow, originally developed by the Google Brain Team and now released as open-source software. TensorFlow is implemented in C++ and Python but can also be integrated and used in other languages such as R, Julia, or Go, with varying degrees of effort. More recently, Python has become the lingua franca of deep learning model programming, primarily due to TensorFlow and its high-level library, Keras.

Google TensorFlow

TensorFlow is a software library for Python that allows mathematical operations to be modeled as a graph. This graph serves as a framework in which data is mathematically transformed at each node and then passed on to subsequent nodes. The data is stored and processed in so-called tensors. A tensor, simply put, is a container that holds the values computed within the graph. The following illustration and the Python code snippet below aim to illustrate this with a simple example:

The graph defines a simple mathematical operation—an addition in this case. The values (tensors) a and b are added at the square node of the graph, forming the value c. In TensorFlow, this looks as follows:

# Load TensorFlow (this example uses the TensorFlow 1.x API)
import tensorflow as tf
# Define a and b as constants
a = tf.constant(5)
b = tf.constant(4)
# Define the addition
c = tf.add(a, b)
# Initialize the graph (a TensorFlow 1.x session)
graph = tf.Session()
# Execute the graph at node c
graph.run(c)

The result of the computation is 9. This simple example reveals the fundamental logic behind TensorFlow. First, an abstract description of the model to be computed is created (the graph). It is then populated with tensors and executed and evaluated at a specific node. Of course, graphs for deep learning models are far more complex than this minimal example. TensorFlow ships with TensorBoard, a tool that visualizes the programmed graph in great detail. An example of a more complex graph can be seen in the figure below.
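The "define first, run later" logic described above can be mimicked in a few lines of plain Python. The following is only a toy sketch and not TensorFlow's actual implementation; the names `Node`, `constant`, `add`, and `run` are hypothetical and chosen purely for illustration.

```python
class Node:
    """A node in a toy computation graph. Creating a node computes nothing."""

    def __init__(self, op, inputs=(), value=None):
        self.op = op          # name of the operation, e.g. "const" or "add"
        self.inputs = inputs  # parent nodes whose results feed this node
        self.value = value    # only set for constants


def constant(value):
    """Create a node that simply holds a fixed value."""
    return Node("const", value=value)


def add(x, y):
    """Create a node that describes the addition of two other nodes."""
    return Node("add", inputs=(x, y))


def run(node):
    """Evaluate the graph at `node` by recursively evaluating its inputs."""
    if node.op == "const":
        return node.value
    if node.op == "add":
        return sum(run(parent) for parent in node.inputs)
    raise ValueError(f"unknown operation: {node.op}")


a = constant(5)
b = constant(4)
c = add(a, b)    # only describes the addition, nothing is computed yet
result = run(c)  # the computation happens here
```

As in the TensorFlow snippet, `c` is merely a description of the addition until the graph is explicitly evaluated at that node.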

Since TensorFlow allows the definition of arbitrary mathematical operations, it is not strictly a pure deep learning framework. Other machine learning models can also be represented as graphs and computed using TensorFlow. However, TensorFlow was primarily developed with a focus on deep learning and comes with numerous predefined building blocks for implementing neural networks (e.g., prebuilt layers for MLPs or CNNs). After the release of version 1.0 in February 2017 and significant progress in the TensorFlow API, a certain level of stability and code consistency can be expected in the coming years. This will further support the widespread adoption of TensorFlow and its use in production systems. In addition to the Python library, there is also TensorFlow Serving, a server designed for production use that makes finished TensorFlow models available as a service.

Introduction to Keras

Despite the prebuilt blocks mentioned earlier, TensorFlow remains a tool for experts and has a steep learning curve. Compared with other tools, the development time from the initial idea to a fully functional deep learning model is very long in TensorFlow. In return, it is highly flexible.

Keras aims to mitigate the lengthy exploration time common in most deep learning frameworks by providing an easy-to-use interface for building deep learning models. By using a simpler syntax, Keras streamlines the design, training, and evaluation of deep learning models. Fundamentally, Keras operates at the abstraction level of individual neural network layers and typically connects them automatically. This eliminates most concerns regarding network architecture details or their implementation. As a result, standard models such as MLPs, CNNs, and RNNs can be quickly and efficiently prototyped. What makes Keras unique is that it does not provide its own computation backend; instead, it simplifies access to underlying libraries such as TensorFlow, Theano, or CNTK. One advantage of this approach is that the code specifying network architectures remains the same across all backends and is automatically "translated" by Keras. This enables extremely fast development in different frameworks without requiring deep familiarity with their complex syntax. Consequently, Keras code is significantly shorter and more readable compared to its equivalent in the native syntax of the respective backend framework.

Example: Implementing a Neural Network

The following code examples use the Python API of Keras to implement deep learning models. The example focuses on predicting the S&P 500 price for the next trading minute based on the prices of the individual stocks in the index. This example is solely intended to demonstrate the implementation of a neural network and is not optimized for performance. (It is worth noting here that forecasting stock and index prices remains extremely challenging, particularly as time intervals become shorter. An interesting approach to ensemble-based stock prediction is taken by the AI fund NUMERAI.)

The dataset for model training consists of minute-by-minute trading prices of the underlying stocks and the index itself from April to July 2017. Each row of the training dataset contains the prices of all index components as features for the prediction, along with the index price for the next minute as the target. The test dataset consists of the same type of data for August 2017.
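The data layout described above can be sketched with NumPy. This is a minimal illustration under stated assumptions, not the preprocessing used for the original dataset: the helper names `make_supervised` and `minmax_scale` are hypothetical, the prices are made-up toy numbers, and min-max scaling is only one plausible choice, since the post does not say how the scaled training data was produced.

```python
import numpy as np


def make_supervised(stock_prices, index_prices):
    """Pair the stock prices at minute t with the index price at minute t + 1."""
    X = stock_prices[:-1]  # features: every minute except the last one
    y = index_prices[1:]   # target: the index price one minute later
    return X, y


def minmax_scale(X_train, X_test):
    """Scale both sets with statistics from the training data only (assumed method)."""
    lo = X_train.min(axis=0)
    hi = X_train.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)  # avoid division by zero for flat columns
    return (X_train - lo) / span, (X_test - lo) / span


# Made-up toy data: 5 trading minutes, 2 stocks
stock_prices = np.array([[10.0, 20.0],
                         [11.0, 19.0],
                         [12.0, 21.0],
                         [13.0, 22.0],
                         [14.0, 20.0]])
index_prices = np.array([100.0, 101.0, 102.0, 103.0, 104.0])

X, y = make_supervised(stock_prices, index_prices)

# Split by time, analogous to training on April to July and testing on August
X_train, X_test = X[:3], X[3:]
y_train, y_test = y[:3], y[3:]
X_train_scaled, X_test_scaled = minmax_scale(X_train, X_test)
```

Splitting by time rather than at random keeps later observations out of the training set, and taking the scaling statistics from the training data alone avoids leaking information from the test period.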

The following section explains the Python model specification in Keras, assuming that the DataFrames containing the training and test data are already available:

# Load the layer and model classes from the Keras library
from keras.layers import Dense
from keras.models import Sequential

First, the necessary components are imported from Keras: the class for a fully connected layer (Dense) and the model type (Sequential).

# Number of input features (columns of the training data)
ncols = stockdata_train_scaled.shape[1]
# Initialize an empty network
model = Sequential()
# Add 2 feedforward layers
model.add(Dense(512, activation="relu", input_shape=(ncols,)))
model.add(Dense(256, activation="relu"))
# Output layer
model.add(Dense(1))

Next, the structure of the model is defined in an object (model). After the model container is instantiated, two hidden layers with 512 and 256 neurons, respectively, are added, each using the ReLU activation function. The final layer, the output layer, consists of a single neuron without an activation function: it computes a weighted sum of the outputs of the preceding layer, which serves as the predicted index price.
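To make clear what these layers compute, the forward pass of such a network can be written out by hand in NumPy. This is a rough sketch, not what Keras does internally: the helper names, the random initialization scheme, and the value `ncols = 500` are assumptions made only for illustration.

```python
import numpy as np


def relu(z):
    """ReLU activation: negative values are set to zero."""
    return np.maximum(z, 0.0)


def init_dense(rng, n_in, n_out):
    """Random weights and zero biases for one fully connected layer (assumed scheme)."""
    W = rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_out))
    b = np.zeros(n_out)
    return W, b


def forward(x, params):
    """Apply the hidden layers with ReLU, then a linear output layer."""
    h = x
    for W, b in params[:-1]:
        h = relu(h @ W + b)
    W_out, b_out = params[-1]
    return h @ W_out + b_out  # weighted sum plus bias, no activation


ncols = 500  # hypothetical number of input features
rng = np.random.default_rng(0)
sizes = [ncols, 512, 256, 1]
params = [init_dense(rng, n_in, n_out)
          for n_in, n_out in zip(sizes[:-1], sizes[1:])]

x = rng.normal(size=(4, ncols))  # a batch of 4 made-up observations
y_hat = forward(x, params)       # one prediction per observation

# Number of trainable parameters: weights plus biases of every layer
n_params = sum(W.size + b.size for W, b in params)
```

Each Dense layer therefore contributes `n_in * n_out` weights and `n_out` biases, which is what training later adjusts.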

# Compile the model
model.compile(optimizer="adam", loss="mean_squared_error")
# Train the model on the training data
model.fit(x=stockdata_train_scaled,
          y=stockdata_train_target,
          epochs=100, batch_size=128)
# Evaluate the fitted model on the test data
results = model.evaluate(x=stockdata_test_scaled, y=stockdata_test_target)

After defining the network architecture, the model is compiled along with the training parameters. Since this is a regression problem, the Mean Squared Error (MSE) is used as the loss function. The MSE is the average squared deviation between the actual observed values and the values predicted by the network. During training, the MSE is iteratively minimized using Adam, an adaptive gradient method. Through backpropagation, the weights between neurons are adjusted so that the MSE decreases (or at least should) with each iteration. With batch_size=128, one such iteration processes 128 observations before the weights are updated. In this example, 100 epochs were chosen as the training duration, but in a real-world application, this parameter should also be optimized through extensive testing. One epoch corresponds to a complete pass through the data, meaning that the network has "seen" every data point in the training set once.
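The interplay of loss, batches, and epochs can be illustrated with a small NumPy sketch. Note the assumptions: it fits a simple linear model to made-up data and uses plain mini-batch gradient descent rather than Adam, so it shows only the general training loop, not what Keras runs for the network above.

```python
import math

import numpy as np


def mse(y_true, y_pred):
    """Mean squared error: average squared deviation between target and prediction."""
    return float(np.mean((y_true - y_pred) ** 2))


# Made-up regression data: 1000 observations, 3 features
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 3))
true_w = np.array([1.5, -2.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=1000)

w = np.zeros(3)  # weights of the linear model, starting at zero
batch_size, epochs, learning_rate = 128, 5, 0.05

# Number of weight updates needed to see every observation once
steps_per_epoch = math.ceil(len(X) / batch_size)

history = [mse(y, X @ w)]  # loss before training
for epoch in range(epochs):
    order = rng.permutation(len(X))  # shuffle the data every epoch
    for step in range(steps_per_epoch):
        idx = order[step * batch_size:(step + 1) * batch_size]
        error = X[idx] @ w - y[idx]
        grad = 2.0 * X[idx].T @ error / len(idx)  # gradient of the batch MSE
        w -= learning_rate * grad                 # one weight update
    history.append(mse(y, X @ w))  # loss after each full pass over the data
```

Here `history` holds the loss before training and after each epoch, which makes it easy to check whether the loss actually falls as training proceeds.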

Results and Outlook

As shown in the following figure, the result is not ideal. The blue line represents the actual S&P 500 index values, while the model’s prediction is shown in orange.

Interestingly, even the non-optimized network has already learned the overall structure of the trend, although it significantly overestimates fluctuations and effects.

In the next post in the "Deep Learning" series, we will build on this work and further improve our model’s performance by applying various tuning approaches.
