Loading Events

« All Events

Reinforcement Learning

November 3 @ 8:00 am December 15 @ 5:00 pm

Reinforcement Learning

In this course, “Reinforcement Learning”, you will learn about various reinforcement learning (RL) algorithms, a branch of machine learning and AI. This course covers everything you need to know about RL, including an overview of the basic concepts of RL, value-based methods, policy-based methods, and actor-critic algorithms. You will also learn how to leverage these algorithms to address specific real-world problems.  

In this course, you will engage in hands-on activities, homework, and instructor consulting to make learning RL enjoyable and rewarding. You will also be able to tackle real-world problems in discrete and continuous controls, and sequential decision-making. By the end of this course, you’ll have the skills and confidence to tackle any machine-learning challenge with RL. 

Course At a Glance

Course Hours: 16 hours

Instructional Period: November 3 – December 15, 2025

Time to Complete Badge: 60 days

Last Dy to Earn Badge: December 15, 2025

Expertise Level: Intermediate/Advanced

Advanced AI Track Course – This course is part of the Advanced AI track in the TrAC Micro-Credential pathway at Iowa State University.

Graph Neural Networks

Scientific Machine Learning

Reinforcement Learning

Self-Supervised Learning

Parallelism in Deep Learning

3D Vision

Prerequisites & Audience

Prerequisites

  • Basic Python programming
  • Basic understanding of deep learning
  • Basic understanding of control and sequential decision-making
  • Basic PyTorch programming

Audience: The course is intended for a broad audience within the spectrum of the software and technology industry, including software engineers, data scientists, data engineers, data analysts, research scientists, and software developers. We have had past students from Corteva, Bayer, John Deere, Principal, Jord Biotech, Ag Leader, and other leading organizations. The course is designed to provide comprehensive understanding of reinforcement learning and how to use these methods for a broad range of audiences and applications.

Learning Outcomes

By the end of the course, you should be able to:

Develop policy gradient algorithms with deep neural networks implementing advanced policy gradient methods and training both value and policy networks for complex control tasks

Formulate reinforcement learning problems into Markov Decision Processes by defining state and action spaces, reward functions, and transition dynamics for specific tasks

Develop basic value-based and policy-based reinforcement learning algorithms using deep learning packages such as PyTorch to solve decision-making tasks

Develop Q-learning algorithms with deep neural networks including custom Double Deep Q Networks for discrete and continuous control problems such as CarPole, Pendulum, and Atari Games

Assessments

  • 2 Quizzes covering basic and advanced definitions and concepts of reinforcement learning, including Markov Decision Processes and advanced RL methods
  • 2 Coding Exercise Assignments in which you will implement Python codes based on hands-on activities, including: Coding value-based & policy-based RL algorithms to solve classic control problems

Course Outline

Module 1: Introduction to Markov Decision Process and Reinforcement Learning

  • Fundamentals: state, action, transition dynamics, reward function, and MDP formulation
  • Overview of reinforcement learning framework and problem formulation techniques
  • Implementation of GridWorld problem using tabular RL methods

Module 2: Develop Value-based Reinforcement Learning Algorithms

  • State value and state-action value functions and their approximations
  • Traditional value-based methods: Q-learning and SARSA algorithms
  • Deep Q-learning and advanced value-based deep RL algorithms for continuous control

Module 3: Develop Policy-based Reinforcement Learning Algorithms

  • Policy gradient methods and deep policy optimization algorithms
  • Vanilla policy gradient and Proximal Policy Optimization (PPO) implementation
  • Application to continuous control tasks using PyTorch

Module 4: Advanced Deep Reinforcement Learning Methods

  • Model-based reinforcement learning methods and applications
  • Offline reinforcement learning techniques and their practical applications
  • Advanced optimization and best practices for deep RL systems

Course Procedures

The course starts on November 3, 2025. All coursework must be completed by December 15, 2025 (possible extension till December 31, 2025 for industry professionals), in order to earn the micro-credential badge. You will continue to have access to the course materials until January 31, 2026. The approximate time to complete this course is 16 hours.

This course has an instructional period from November 3 to December 7, 2025. During this instructional period, course materials will be released weekly and live synchronous sessions will be held. You may complete the course materials at your own pace. Live Zoom meetings will be conducted for interactive coding sessions. A suitable time for these live sessions will be determined through a group poll. The recordings of those sessions will be available soon after each meeting.

You will receive the micro-credential badge upon successful completion of the course assessments.

Course Materials

Course materials are provided within the course. No additional purchase is required.

Registration

Students


Register for the course as a 1-credit independent study course with a maximum of 3 such TrAC courses per semester.

Students must register for the independent study by emailing benearl@iastate.edu and cc-ing baditya@iastate.edu. Also, fill this Google form for our records: https://forms.gle/mQRefUJ4qpa29s5w6

Industry Professionals/ISU Staff/Post Docs


$ 500 .00

ISU Professionals/Staff and Government Employees: $300

About the Instructor

Zhanhong Jiang, Data Scientist

Zhanhong Jiang is a data scientist in the Translational AI Center (TrAC) at Iowa State University. His research interests lie in decentralized deep learning, reinforcement learning, time-series prediction, and applications to cyber-physical systems. Prior to that, he was a senior AI scientist at Johnson Controls and worked on smart and healthy building solutions using AI/ML technologies. He has numerous publications in prestigious journals and conferences and more than 10 patents.