Using Esp32cam as a video streaming device and with the help of yolo v3 classifying objects and then converting to speech using gtts.