Teaching CVAIAC
Computer Vision and Artificial Intelligence for Autonomous Cars
Autumn 2026 · ETH Zurich · 6 ECTS
About the course
This course introduces the core computer vision techniques and algorithms that autonomous cars use to perceive the semantics and geometry of their driving environment, localize themselves in it, and predict its dynamic evolution. Emphasis is placed on techniques tailored for real-world settings, such as multi-modal fusion, domain-adaptive and outlier-aware architectures, and multi-agent methods.
- Lecturer
- Christos Sakaridis
- When
- TBA
- Where
- TBA
Slides, exercise sheets, recordings, and exam materials are password-protected. Access credentials are announced in the lecture and on the course forum.
Lecture team
Lecturer
Teaching Assistants
Lectures
| Date | Time | Room | Topic | Slides | Video |
|---|---|---|---|---|---|
| TBA | TBA | TBA | Fundamentals of Autonomous Cars | TBA | TBA |
| TBA | TBA | TBA | Fundamental Computer Vision Architectures and Algorithms for Autonomous Cars | TBA | TBA |
| TBA | TBA | TBA | Fundamental Computer Vision Architectures and Algorithms (continued) | TBA | TBA |
| TBA | TBA | TBA | Semantic Segmentation | TBA | TBA |
| TBA | TBA | TBA | Depth Estimation | TBA | TBA |
| TBA | TBA | TBA | Object Detection | TBA | TBA |
| TBA | TBA | TBA | Instance Segmentation and Panoptic Segmentation | TBA | TBA |
| TBA | TBA | TBA | Unimodal 3D Object Detection | TBA | TBA |
| TBA | TBA | TBA | 3D Reconstruction and Localization | TBA | TBA |
| TBA | TBA | TBA | Domain Adaptation | TBA | TBA |
| TBA | TBA | TBA | Multi-modal 2D and 3D Object Detection | TBA | TBA |
| TBA | TBA | TBA | Visual Grounding, Anomaly Segmentation and Vehicle-to-Vehicle Communication | TBA | TBA |
| TBA | TBA | TBA | Multiple Object Tracking and Motion Prediction | TBA | TBA |
Practical sessions
| Date | Time | Room | Topic | Slides | Video |
|---|---|---|---|---|---|
| TBA | TBA | TBA | Getting Started with Python and SLURM | TBA | TBA |
| TBA | TBA | TBA | Project 1 Introduction: Semantic Segmentation and Depth Estimation | TBA | TBA |
| TBA | TBA | TBA | Project 1: Attention Mechanisms and Transformers | TBA | TBA |
| TBA | TBA | TBA | Project 1 Q&A | TBA | TBA |
| TBA | TBA | TBA | Project 1 Q&A | TBA | TBA |
| TBA | TBA | TBA | Project 1 Q&A | TBA | TBA |
| TBA | TBA | TBA | Project 2 Introduction: 3D Detection from Point Clouds | TBA | TBA |
| TBA | TBA | TBA | Project 2 Q&A | TBA | TBA |
| TBA | TBA | TBA | Project 2 Q&A | TBA | TBA |
| TBA | TBA | TBA | Project 2 Q&A | TBA | TBA |
| TBA | TBA | TBA | Project 2 Q&A | TBA | TBA |
| TBA | TBA | TBA | Project 2 Q&A | TBA | TBA |
Projects
-
Semantic segmentation and depth estimation
- Starts
- TBA
- Due
- TBA
Implement and train multi-task models for dense visual perception on real-world driving data.
Handout: TBA
-
3D object detection using LiDARs
- Starts
- TBA
- Due
- TBA
Detect vehicles and other agents in LiDAR point clouds from a multi-modal driving dataset.
Handout: TBA
Prerequisites
- Linear algebra, multivariate calculus, and probability theory.
- Basic computer vision and machine learning.
- Programming proficiency in Python, PyTorch, scikit-learn, and scikit-image.
Exam & grading
Exam
- Examiner
- Christos Sakaridis
- Format
- Written session examination
- Duration
- 120 minutes
- Language
- English
- Permitted
- One A4 sheet of paper and a simple non-programmable calculator.
- Offered only in the session following the end of the course.
- A mock exam (short version) is provided, with and without solutions.
Grading
Projects 50% · Exam 50%
- Projects are mandatory and conducted in groups.
- Failing the projects results in failing the final grade.
- Students who do not pass the projects must de-register from the session exam.
Learning objectives
- Understand the principles of visual sensors used in autonomous cars.
- Distinguish the dominant architectural paradigms in modern visual perception models.
- Categorize visual perception tasks and explain the algorithms that solve them.
- Critically analyze current research in computer vision for autonomous driving.
- Reproduce and adapt state-of-the-art methods for visual perception.
- Independently develop new visual perception models for autonomous cars.