Note: This tutorial has been written using OpenVINO™ Toolkit for Linux with FPGA Support version 2018 R4 and is for use with this version only. Using this tutorial with any other version may not be correct.
The OpenVINO™ toolkit runs inference models using the CPU to process images. As an option, OpenVINO can use GPU and VPU devices, if available. You can use the inference models to process video from an optional USB camera, an existing video file, or still image files.
This tutorial examines a sample application that was created with the OpenVINO™ toolkit. The tutorial steps will guide you through downloading the latest Face Detection Tutorial from GitHub, walk you through the sample code and then compile and run the code on the the available hardware. During the process, you will become familiar with key OpenVINO™ concepts.
This tutorial starts with a base application that reads image data and outputs the image to a window. The steps build on each other by adding deep learning models that process image data and make inferences. In the final step, you will use the completed application to detect faces, report the age and gender of the detected face, and draw a 3D axis representing the head pose for each face.
To download all the files for this tutorial, you will need to access GitHub on the Internet.
To run the application in this tutorial, the OpenVINO™ toolkit for Linux with FPGA and its dependencies must already be installed and verified using the included demos. Installation instructions can be found here: https://software.intel.com/en-us/articles/OpenVINO-Install-Linux-FPGA If any optional hardware is to be used, it must be installed and verified including:
- Intel® Arria® 10 GX FPGA Development Kit or the Intel® Programmable Acceleration Card with Intel® Arria® 10 GX FPGA.
- GPU - normally embedded with supported Intel® CPUs; requires drivers and updated Linux kernel to run.
- VPU - USB Intel® Movidius™ Neural Compute Stick with Myriad. The GitHub download or clone will include the dx_face_detection zip file.
Target and development platforms meeting the requirements described in the "System Requirements" section of the OpenVINO™ toolkit documentation which can be found here: https://software.intel.com/openvino-toolkit
Note: For this tutorial, an Intel® i7-7700 (CPU with GPU & Intel® Arria® 10 GX FPGA Development Kit) was used as both the development and target platform.
Optional:
- Intel® Movidius™ Neural Compute Stick
- GPU support
- OpenVINO™ toolkit supported Linux operating system. This tutorial was run on 64-bit Ubuntu 16.04.3 LTS updated to kernel 4.14.20 following the OpenVINO™ toolkit installation instructions.
- The latest OpenVINO™ toolkit with FPGA support installed and verified. This tutorial was written using version 2018 R4.
- At least one utility for downloading from the GitHub repository: Subversion (svn), Git (git), or both.
By now you should have completed the Linux installation guide for the OpenVINO™ toolkit with FPGA support (link). Before continuing, please ensure:
- That you have run the supplied demo samples.
- If you have and intend to use an FPGA: You have installed and tested the Quartus® Programmer (link?) and are able to program bitstreams.
- If you have and intend to use a GPU: You have installed and tested the GPU drivers.
- If you have and intend to use a Myriad: You have connected and tested the USB Intel® Movidius™ Neural Compute Stick.
- Your development platform is connected to a network and has Internet access.
- You have Github access.
In these steps you create a directory for the files you download from the “OpenVINO FPGA Hello World Face Detection” GitHub repository.
- Open a command shell prompt (such as xterm).
- If you have not installed Git, install it now:
sudo apt-get update
sudo apt-get install git
- Create a directory named "tutorials":
mkdir tutorials
- Go to the tutorials directory:
cd tutorials
- Clone the repository:
git clone https://github.com/fritzboyle/openvino-with-fpga-hello-world-face-detection
- Change to the face detection tutorial folder:
cd openvino-with-fpga-hello-world-face-detection
You now have the Face Detection Tutorial files. The next section shows the directory structure of the files you extracted.
The "tutorial" directory contains:
- Images\ - directory of images
- Videos\ - directory of videos
- cmake\ - directory of common CMake files
- dx_face_detection\ - directory of code to help run the scripts
- Readme.md - This document
The OpenVINO™ toolkit enables the quick deployment of Convolutional Neural Networks (CNN) for heterogeneous execution on Intel® hardware while maximizing performance. Deployment is accomplished through the the Intel® Deep Learning Deployment Toolkit (Intel® DL Deployment Toolkit) included within the OpenVINO™ toolkit.
The CNN workflow is:
- Create and train the CNN inference model in a framework, such as Caffe*.
- Use the Model Optimizer on the trained model to produce an optimized Intermediate Representation (IR), stored .bin and .xml files for use with the Inference Engine.
- Use the Inference Engine with your application to load and run the model on your devices.
This tutorial focuses on the last step in the workflow; using the user application the Inference Engine to run models on a CPU, GPU, FPGA, and Movidius™ Neural Compute Stick.
Below is a more detailed view of the User Application and Inference Engine:
The Inference Engine includes a plugin library for each supported device that has been optimized for the Intel® hardware device CPU, GPU, FPGA and Myriad. We will use the terms "device" and “plugin” with the assumption that one infers the other (e.g. CPU device infers the CPU plugin and vice versa). As part of loading the model, the User Application tells the Inference Engine which device to target which in turn loads the associated plugin library to later run on the associated device. The Inference Engine uses “blobs” for all data exchanges which are arrays in memory arranged according to the input and output data of the model.
In this Face Detection sample, the model estimates the head pose based on the face image it is given. The face detection model estimates the age, gender and head pose. After the head pose model has processed the face, the application will draw a set of axes over the face, indicating the Yaw, Pitch, and Roll orientation of the head. A sample output showing the results with the three axes appears below. The metrics reported also include the time to run the head pose model.
In the image above, the three axes intersect in the center of the head. The blue line represents Roll, and it extends from the center of the head to the front and the back of the head. The red line represents Pitch, and is drawn from the center of the head to the left ear. The green line represents Yaw, and is drawn from the center of the head to the top of the head.
For details about the models see the Full Tutorial: https://github.com/intel-iot-devkit/inference-tutorials-generic/blob/openvino_toolkit_r3_0/face_detection_tutorial/step_4/Readme.md#introduction
In the next section, you build and run the application and see how it runs the three analysis models.
-
Use a terminal window to access a command shell prompt.
-
Go the directory containing the Hello World files:
cd dx_face_detection
- Source the variables:
source /home/<user>/Downloads/fpga_support_files/setup_env.sh
- Create a directory to build the tutorial:
mkdir build
- Go to the build directory:
cd build
- Run CMake to set the build target and file locations:
cmake -DCMAKE_BUILD_TYPE=Release ..
- Build the executable:
make
Alternative make command: Run make across multiple pieces of hardware to speed the process
make -j $(nproc)
- Load a bitstream that works well for object detection. The OpenVINO toolkit with support for FPGA includes bitstreams for Both Arria 10 FPGA cards.
- For the Arria 10 GX Development Kit, use this command:
aocl program acl0 /opt/intel/computer_vision_sdk_fpga_2018.3.343/a10_devkit_bitstreams/2-0-1_A10DK_FP11_ResNet50-101.aocx
- For the Intel® Vision Accelerator Design with Intel® Arria® 10 FPGA (IEI Mustang-F100-A10), use this command:
aocl program acl0 /opt/intel/computer_vision_sdk_2018.4.420/bitstreams/a10_vision_design_bitstreams/4-0_PL1_FP11_MobileNet_ResNet_VGG_Clamp.aocx
- Go to the main level directory:
cd ..
- Turn the script into an executable file:
chmod +x run_fd.sh
This tutorial includes a script to select the media file, models and hardware. The script commands are provided under each step to provide the opportunity to explore other possibilities.
run_fd.sh
requires at least one hardware target, and supports up to 3.
./run_fd.sh (face detection hardware) (age/gender hardware) (head pose hardware)
EXAMPLE: ./run_fd.sh fpga fpga gpu
Choose a hardware component for each argument you use
- cpu
- gpu
- fpga
Targets (in order on command line):
- 1st argument is required, for face detection
- 2nd argument, optional, for age & gender recognition
- 3rd argument, optional, requires face detection + age/gender recognition, for head pose
You will see rectangles and the head pose axes that follow the faces around the image (if the faces move), accompanied by age and gender results for the faces, and the timing statistics for processing each frame of the video.
When the video finishes, press any key to close the window and see statistics of the trial.
./run_fd.sh cpu
Full command
build/intel64/Release/face_detection_tutorial -i ../Videos/head-pose-face-detection-female-and-male.mp4 -m /opt/intel/computer_vision_sdk/deployment_tools/intel_models/face-detection-retail-0004/FP32/face-detection-retail-0004.xml -d CPU
./run_fd.sh fpga
Full command
build/intel64/Release/face_detection_tutorial -i ../Videos/head-pose-face-detection-female-and-male.mp4 -m /opt/intel/computer_vision_sdk/deployment_tools/intel_models/face-detection-retail-0004/FP16/face-detection-retail-0004.xml -d HETERO:FPGA,CPU
./run_fd.sh fpga gpu
Full command
build/intel64/Release/face_detection_tutorial -i ../Videos/head-pose-face-detection-female-and-male.mp4 -m /opt/intel/computer_vision_sdk/deployment_tools/intel_models/face-detection-retail-0004/FP16/face-detection-retail-0004.xml -m_ag /opt/intel/computer_vision_sdk/deployment_tools/intel_models/age-gender-recognition-retail-0013/FP16/age-gender-recognition-retail-0013.xml -d HETERO:FPGA,CPU -d_ag GPU
Example 4 - Run face detection on FPGA, age/gender recognition on a GPU, and head pose estimation on a CPU
./run_fd.sh fpga gpu cpu
Full command
build/intel64/Release/face_detection_tutorial -i ../Videos/head-pose-face-detection-female-and-male.mp4 -m /opt/intel/computer_vision_sdk/deployment_tools/intel_models/face-detection-retail-0004/FP16/face-detection-retail-0004.xml -m_ag /opt/intel/computer_vision_sdk/deployment_tools/intel_models/age-gender-recognition-retail-0013/FP16/age-gender-recognition-retail-0013.xml -m_hp /opt/intel/computer_vision_sdk/deployment_tools/intel_models/head-pose-estimation-adas-0001/FP32/head-pose-estimation-adas-0001.xml -d HETERO:FPGA,CPU -d_ag GPU -d_hp CPU
./run_fd.sh cpu cpu cpu
Full command
build/intel64/Release/face_detection_tutorial -i ../Videos/head-pose-face-detection-female-and-male.mp4 -m /opt/intel/computer_vision_sdk/deployment_tools/intel_models/face-detection-retail-0004/FP32/face-detection-retail-0004.xml -m_ag /opt/intel/computer_vision_sdk/deployment_tools/intel_models/age-gender-recognition-retail-0013/FP32/age-gender-recognition-retail-0013.xml -m_hp /opt/intel/computer_vision_sdk/deployment_tools/intel_models/head-pose-estimation-adas-0001/FP32/head-pose-estimation-adas-0001.xml -d CPU -d_ag CPU -d_hp CPU
NOTE: The FPGA plugin does NOT support the head pose model. If specified, it will be replaced with CPU.
This completes the application tutorial. For more information on using the OpenVINO™ toolkit in your environment, see the documentation here: https://software.intel.com/openvino-toolkit