World Scientific
  • Search
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More

System Upgrade on Tue, May 28th, 2024 at 2am (EDT)

Existing users will be able to log into the site and access content. However, E-commerce and registration of new users may not be available for up to 12 hours.
For online purchase, please visit us again. Contact us at [email protected] for any enquiries.


    In this work we present a framework for on-the-fly video transcoding that exploits computer vision-based techniques to adapt the Web access to the user requirements. The proposed transcoding approach aims at coping with both user bandwidth and resources capabilities, and with user interests in the video's content. We propose an object-based semantic transcoding that, according to the user-defined classes of relevance, applies different transcoding techniques to the objects segmented in a scene. Object extraction is provided by on-the-fly video processing, without manual annotation. Multiple transcoding policies are reviewed and a performance evaluation metric based on the Weighted Mean Square Error (and corresponding PSNR), that takes into account the perceptual user requirements by means of classes of relevance, is defined. Results are analyzed by varying transcoding techniques, bandwidth requirements and video types (with indoor and outdoor scenes), showing that the use of semantics can dramatically improve the bandwidth to distortion ratio.


    • R. Mohan, J. R. Smith and S.-S. Li, IEEE Trans. Multimedia 1(1), 104 (1999), DOI: 10.1109/6046.748175. Crossref, Web of ScienceGoogle Scholar
    • A. W. Huang and N. Sundaresan, A semantic transcoding system to adapt web services for users with disabilities, Proc. ACM SIGCAPH Conf. Assistive Technol. pp. 156–163. Google Scholar
    • N. Jayant, J. Johnston and R. Safranek, Proc. IEEE 81(10), 1385 (1993), DOI: 10.1109/5.241504. Crossref, Web of ScienceGoogle Scholar
    • J. R. Smith, R. Mohan and C.-S. Li, Content-based transcoding of images in the internet, Proc. IEEE Int. Conf. Image Processing3 (1998) pp. 7–11. Google Scholar
    • Y. Yu and C. W. Chen, SNR tarnscoding for video over wireless channels, Proc. Wireless Commun. Networking Conf. (WCNC)3 (2000) pp. 1396–1402. Google Scholar
    • K. Nagao, Y. Shirai and K. Squire, IEEE Multimedia 8(2), 69 (2001), DOI: 10.1109/93.917973. Crossref, Web of ScienceGoogle Scholar
    • ISO/IEC 15938-3:CD. Information Technology — Multimedia Content Description Interface. Part 3 : Visual . Google Scholar
    • IBM research , . Google Scholar
    • S.-F.   Chang et al. , Development of advanced image/video servers in a video on demand testbed , Proc. IEEE Visual Signal Processing and Commun. Workshop . Google Scholar
    • J. Youn, M.-T. Sun and C.-M. Lin, IEEE Trans. Multimedia 1(1), 30 (1999). Crossref, Web of ScienceGoogle Scholar
    • N.   Bjork and C.   Christopoulos , Video transcoding for universal multimedia access , Proc. ACM Multimedia MM2000 . Google Scholar
    • R. Cucchiara, C. Grana and A. Prati, Detecting moving objects and their shadows: an evaluation with the pets2002 dataset, Proc. Third IEEE Int. Workshop on Performance Evaluation of Tracking and Surveillance (PETS 2002) pp. 18–25. Google Scholar
    • Axis Communication: Web site , . Google Scholar
    • A. Vetro, H. Sun and Y. Wang, IEEE Trans. Circuits and System for Video Technol. 11(3), 387 (2001), DOI: 10.1109/76.911163. Crossref, Web of ScienceGoogle Scholar
    • S.   Chandra et al. , Transcoding characteristics of web images , Proc. SPIE Multimedia Computing and Networking Conf. . Google Scholar
    • J.-N. Hwang, T.-D. Wu and C.-W. Lin, Dynamic frame-skipping in video transcoding, Proc. IEEE Second Workshop on Multimedia Signal Processing pp. 616–621. Google Scholar
    • A.   Vetro and H.   Sunt , Encoding and transcoding multiple video-objects with variable temporal resolution , Proc. Symp. Circuit and System . Google Scholar
    • I. Haritaoglu, D. Harwood and L. S. Davis, IEEE Trans. Pattern Anal. Machine Intelligence 22(8), 809 (2000), DOI: 10.1109/34.868683. Crossref, Web of ScienceGoogle Scholar
    • R.   Cucchiara et al. , Video-based Surveillance Syst. — Computer Vision and Distributed Processing , The Sakbot system for moving object detection and tracking ( Kluwer Academic , 2001 ) . Google Scholar
    • R. Cucchiaraet al., Detecting objects, shadows and ghosts in video streams by exploiting color and motion information, Proc. Int. Conf. Image Anal. Processing (ICIAP 2001) pp. 360–365. Google Scholar
    • A. Pratiet al., Shadow detection algorithms for traffic flow analysis: a comparative study, Proc. IEEE Intelligent Transportation Syst. Conf. (ITSC 2001) pp. 340–345. Google Scholar
    • F. Cavalliet al., Performance analysis of MPEG-4 decoder and encoder, Proc. Int. Symp. Video/Image Processing and Multimedia Commun. (VIPromCom-2002) pp. 227–231. Google Scholar
    • A.   Divakaran and H.   Sun , A descriptor for spatial distribution of motion activity , Proc. Storage and Retrieval from Image and Video Databases . Google Scholar
    • J. M. Shapiro, IEEE Trans. Signal Processing 41(12), 3445 (1993), DOI: 10.1109/78.258085. Crossref, Web of ScienceGoogle Scholar
    Remember to check out the Check out our Most Cited Articles!

    Check out these titles on Image Analysis