Artificial Vision and Language Processing for Robotics

Course Description Overview

035459

Artificial Vision and Language Processing for Robotics will help you train and deploy models that are built with advanced deep learning techniques and integrate them in a unified end-to-end application. With this course, you will learn all about the three hottest topics of artificial intelligence: convolutional neural networks, recurrent neural networks, and robotics.

Artificial Vision and Language Processing for Robotics begins by discussing the theory behind robots. You'll compare different methods used to work with robots and explore computer vision, its algorithms, and limits. You'll then learn how to control the robot with natural language processing commands. You'll study Word2Vec and GloVe embedding techniques, non-numeric data, recurrent neural network (RNNs), and their advanced models. You'll create a simple Word2Vec model with Keras, as well as build a convolutional neural network (CNN) and improve it with data augmentation and transfer learning. You'll study the ROS and build a conversational agent to manage your robot. You'll also integrate your agent with the ROS and convert an image to text and text to speech. You'll learn to build an object recognition system using a video.

By the end of this course, you'll have the skills you need to build a functional application that can integrate with a ROS to extract useful information about your environment.

Course Objectives:

After completing this course, you will be able to:

Explore the ROS and build a basic robotic system
Identify conversation intents with NLP techniques
Learn and use the embedding with Word2Vec and GloVe
Use deep learning to implement artificial intelligence (AI) and object recognition
Develop a simple object recognition system using CNNs
Integrate AI with ROS to enable your robot to recognize objects

Artificial Vision and Language Processing for Robotics takes a practical approach to equip robotics developers with tools for creating systems that integrate computer vision and NLP to control a robot. The course is divided into three parts: NLP, computer vision, and robotics. It introduces the advanced topics after a detailed introduction to the basics. It contains multiple activities for you to practice and apply your new skills in a highly relevant context.

Target Student:

Artificial Vision and Language Processing for Robotics is for robotics engineers who want to learn how to integrate computer vision and deep learning techniques to create complete robotic systems. It will prove beneficial to you if you have working knowledge of Python and a background in deep learning. Knowledge of the ROS is a plus.

Prerequisites:

Course-specific Technical Requirements Software:

OS: Windows 7 SP1 64-bit, Windows 8.1 64-bit or Windows 10 64-bit, Linux (Ubuntu, Debian, Red Hat, or Suse), or the latest version of OS X

Browser: Google Chrome/Mozilla Firefox Latest Version

Google Colab

Course-specific Technical Requirements Hardware:

For the optimal student experience, we recommend the following hardware configuration: