Pachyderm Deep Dive(en)
This course is designed to provide students with a thorough and practical understanding of the use of Pachyderm, a platform for data versioning and data pipeline management on Kubernetes. Through hands-on lectures and labs, students will learn how to use Pachyderm to efficiently manage, version, and process data.
The course also includes theoretical sessions on Artificial Intelligence, enabling students to apply the knowledge learned to Machine Learning and Data Science projects.
CODE: DSAI106
Category: Artificial Intelligence
Teaching methodology
The course includes educational laboratories in which each student will be able to work in order to complete training exercises that will provide practical experience in using the instrument, for each of the topics covered during the course.
Prerequisites
- Understanding of basic Linux commands for file management, system navigation, and package installation.
- Knowledge of containerization and image creation concepts.
- Understanding of basic Python concepts.
The following is an overview of course content:
- Installation of Pachyderm
- Basic Concepts of Pachyderm
- Introduction to Pachyderm Pipelines
- Working with Pipelines
- Machine Learning on Pachyderm
- Image and Video Processing on Pachyderm
- CI/CD Workflow on Pachyderm
At the end of the course, participants will be able to:
- Install and configure Pachyderm on a Kubernetes cluster.
- Understand the operation and structure of the Pachyderm File System (PFS).
- Create and manage Pipelines using Pachyderm.
- Integrate OpenCV for image and video processing using Pachyderm.
- Create Machine Learning Pipelines on Pachyderm.
- Understand the CI/CD mechanism of Pachyderm.
Duration – 1 day
Delivery – in Classroom, On Site, Remote
PC and SW requirements:
- Internet connection
- Web browser, Google Chrome
- Zoom
Language
- Instructor: English
- Workshops: English
- Slides: English