iCog Labs is East Africa's first private AI R&D company, based in Addis Ababa. They contributed to programming Sophia the Robot and build cognitive architectures toward AGI. Vision AI is at the core of their robotics and perception work — a robot needs to classify what it sees, locate objects, segment regions of interest, describe scenes in language, and even generate visual plans. Beyond robotics, computer vision is transforming African agriculture: PlantVillage has helped millions of farmers identify crop diseases from phone photos using the same classification techniques you'll learn in this module.
Learn vision intelligence step by step with narration, interactive theory, and hands-on pipeline activities. Everything you need is inside the lesson — no coding required.
Already know the basics? Go straight to building vision intelligence pipelines visually.
Learning Objectives
What you'll learn in this module
- Classify images into categories using pre-trained CNNs (ResNet)
- Detect and localize multiple objects in an image with YOLOv8
- Segment objects at pixel level using SAM (Segment Anything Model)
- Caption images and answer visual questions with BLIP
- Generate images from text prompts using diffusion models or Gemini
Sector Applications
How this technology is used across African sectors
PlantVillage uses image classification to detect crop diseases from leaf photos — farmers snap a picture and get a diagnosis in seconds
Segment tumors in medical imaging, count cells in microscopy, and classify skin lesions for dermatology screening
YOLO detects and counts wildlife in camera trap images, enabling automated biodiversity monitoring across African national parks
Detect road damage, classify building conditions from drone imagery, and map land use for urban planning
Register to track your progress.