Those clips consisted of still images and ambient sound taken from YouTube videos of urban and rural streets in North America, Asia and Europe. Using deep learning algorithms, the system learned ...