Current IssuePrevious Issue  

Volume 6 No. 3
05 September 2020

Rui Zeng,Yuhui Wen,Wang Zhao,Yong-Jin Liu

2020, 6(3): 225-245.   doi:10.1007/s41095-020-0179-3
Abstract ( 10 HTML ( 0   PDF(912KB) ( 12 )   Save

Rapid development of artificial intelligence motivates researchers to expand the capabilities of intelligent and autonomous robots. In many robotic applications, robots are required to make planning decisions based on perceptual information to achieve diverse goals in an efficient and effective way. The planning problem has been investigated in active robot vision, in which a robot analyzes its environment and its own state in order to move sensors to obtain more useful information under cert...

Abderrazak Iazzi,Mohammed Rziza,Rachid Oulad Haj Thami

2020, 6(3): 247-263.   doi:10.1007/s41095-020-0183-7
Abstract ( 7 HTML ( 0   PDF(1196KB) ( 7 )   Save

This paper presents a vision-based system for recognizing when elderly adults fall. A fall is characterized by shape deformation and high motion. We represent shape variation using three features, the aspect ratio of the bounding box, the orientation of an ellipse representing the body, and the aspect ratio of the projection histogram. For motion variation, we extract several features from three blocks corresponding to the head, center of the body, and feet using optical flow. For each block,...

Meng-Yao Cui,Shao-Ping Lu,Miao Wang,Yong-Liang Yang,Yu-Kun Lai,Paul L. Rosin

2020, 6(3): 265-277.   doi:10.1007/s41095-020-0180-x
Abstract ( 12 HTML ( 0   PDF(893KB) ( 2 )   Save

Humans have the ability to perceive kinetic depth effects, i.e., to perceived 3D shapes from 2D projections of rotating 3D objects. This process is based on a variety of visual cues such as lighting and shading effects. However, when such cues are weak or missing, perception can become faulty, as demonstrated by the famous silhouette illusion example of the spinning dancer. Inspired by this, we establish objective and subjective evaluation models of rotated 3D objects by taking ...

Congyue Deng,Jiahui Huang,Yong-Liang Yang

2020, 6(3): 279-289.   doi:10.1007/s41095-019-0153-0
Abstract ( 25 HTML ( 0   PDF(774KB) ( 17 )   Save

Modeling the complete geometry of general shapes from a single image is an ill-posed problem. User hints are often incorporated to resolve ambiguities and provide guidance during the modeling process. In this work, we present a novel interactive approach for extracting high-quality freeform shapes from a single image. This is inspired by the popular lofting technique in many CAD systems, and only requires minimal user input. Given an input image, the user only needs to sketch several projecte...

Fang-Lue Zhang,Connelly Barnes,Hao-Tian Zhang,Junhong Zhao,Gabriel Salas

2020, 6(3): 291-306.   doi:10.1007/s41095-020-0187-3
Abstract ( 15 HTML ( 0   PDF(2883KB) ( 4 )   Save

For many social events such as public performances, multiple hand-held cameras may capture the same event. This footage is often collected by amateur cinematographers who typically have little control over the scene and may not pay close attention to the camera. For these reasons, each individually captured video may fail to cover the whole time of the event, or may lose track of interesting foreground content such as a performer. We introduce a new algorithm that can synthesize a single smoo...

Aman Chadha,John Britto,M. Mani Roja

2020, 6(3): 307-317.   doi:10.1007/s41095-020-0175-7
Abstract ( 24 HTML ( 0   PDF(727KB) ( 16 )   Save

Recently, learning-based models have enhanced the performance of single-image super-resolution (SISR). However, applying SISR successively to each video frame leads to a lack of temporal coherency. Convolutional neural networks (CNNs) outperform traditional approaches in terms of image quality metrics such as peak signal to noise ratio (PSNR) and structuralsimilarity (SSIM). On the other hand, generative adversarial networks (GANs) offer a competitive advantage by being able to mitigate the i...

Xiaoce Wu,Bingyin Zhou,Qingyun Ren,Wei Guo

2020, 6(3): 319-331.   doi:10.1007/s41095-020-0176-6
Abstract ( 17 HTML ( 0   PDF(783KB) ( 9 )   Save

Multispectral image denoising is a basic problem whose results affect subsequent processes such as target detection and classification. Numerous approaches have been proposed, but there are still many challenges, particularly in using prior knowledge of multispectral images, which is crucial for solving the ill-posed problem of noise removal. This paper considers both non-local self-similarity in space and global correlation in spectrum. We propose a novel low-rank Tucker decomposition model ...

Songye Han,Shaojie Ye,Hongxin Zhang

2020, 6(3): 333-347.   doi:10.1007/s41095-020-0178-4
Abstract ( 19 HTML ( 0   PDF(923KB) ( 8 )   Save

Analyzing and understanding Internet news are important for many applications, such as market sentiment investigation and crisis management. However, it is challenging for users to interpret a massive amount of unstructured text, to dig out its accurate meaning, and to spot noteworthy news events. To overcome these challenges, we propose a novel visualization-driven approach for analyzing news text. We first collect Internet news from different sources and encode sentences into a vector repre...

Aizeng Wang,Chuan He,Fei Hou,Zhanchuan Cai,Gang Zhao

2020, 6(3): 349-354.   doi:10.1007/s41095-020-0182-8
Abstract ( 11 HTML ( 0   PDF(873KB) ( 6 )   Save