IEICE global.ieice.org Site

Keyword Search Result

[Keyword] media(541hit)

1-20hit(541hit)

Improving Sliced Wasserstein Distance with Geometric Median for Knowledge Distillation Open Access
Hongyun LU Mengmeng ZHANG Hongyuan JING Zhi LIU

LETTER-Fundamentals of Information Systems

Pubricized:
2024/03/08
Vol:
E107-D No:7
Page(s):
890-893
Currently, the most advanced knowledge distillation models use a metric learning approach based on probability distributions. However, the correlation between supervised probability distributions is typically geometric and implicit, causing inefficiency and an inability to capture structural feature representations among different tasks. To overcome this problem, we propose a knowledge distillation loss using the robust sliced Wasserstein distance with geometric median (GMSW) to estimate the differences between the teacher and student representations. Due to the intuitive geometric properties of GMSW, the student model can effectively learn to align its produced hidden states from the teacher model, thereby establishing a robust correlation among implicit features. In experiment, our method outperforms state-of-the-art models in both high-resource and low-resource settings.
Federated Deep Reinforcement Learning for Multimedia Task Offloading and Resource Allocation in MEC Networks Open Access
Rongqi ZHANG Chunyun PAN Yafei WANG Yuanyuan YAO Xuehua LI

PAPER-Network

Vol:
E107-B No:6
Page(s):
446-457
With maturation of 5G technology in recent years, multimedia services such as live video streaming and online games on the Internet have flourished. These multimedia services frequently require low latency, which pose a significant challenge to compute the high latency requirements multimedia tasks. Mobile edge computing (MEC), is considered a key technology solution to address the above challenges. It offloads computation-intensive tasks to edge servers by sinking mobile nodes, which reduces task execution latency and relieves computing pressure on multimedia devices. In order to use MEC paradigm reasonably and efficiently, resource allocation has become a new challenge. In this paper, we focus on the multimedia tasks which need to be uploaded and processed in the network. We set the optimization problem with the goal of minimizing the latency and energy consumption required to perform tasks in multimedia devices. To solve the complex and non-convex problem, we formulate the optimization problem as a distributed deep reinforcement learning (DRL) problem and propose a federated Dueling deep Q-network (DDQN) based multimedia task offloading and resource allocation algorithm (FDRL-DDQN). In the algorithm, DRL is trained on the local device, while federated learning (FL) is responsible for aggregating and updating the parameters from the trained local models. Further, in order to solve the not identically and independently distributed (non-IID) data problem of multimedia devices, we develop a method for selecting participating federated devices. The simulation results show that the FDRL-DDQN algorithm can reduce the total cost by 31.3% compared to the DQN algorithm when the task data is 1000 kbit, and the maximum reduction can be 35.3% compared to the traditional baseline algorithm.
Hierarchical Detailed Intermediate Supervision for Image-to-Image Translation
Jianbo WANG Haozhi HUANG Li SHEN Xuan WANG Toshihiko YAMASAKI

PAPER-Image Processing and Video Processing

Pubricized:
2023/09/14
Vol:
E106-D No:12
Page(s):
2085-2096
The image-to-image translation aims to learn a mapping between the source and target domains. For improving visual quality, the majority of previous works adopt multi-stage techniques to refine coarse results in a progressive manner. In this work, we present a novel approach for generating plausible details by only introducing a group of intermediate supervisions without cascading multiple stages. Specifically, we propose a Laplacian Pyramid Transformation Generative Adversarial Network (LapTransGAN) to simultaneously transform components in different frequencies from the source domain to the target domain within only one stage. Hierarchical perceptual and gradient penalization are utilized for learning consistent semantic structures and details at each pyramid level. The proposed model is evaluated based on various metrics, including the similarity in feature maps, reconstruction quality, segmentation accuracy, similarity in details, and qualitative appearances. Our experiments show that LapTransGAN can achieve a much better quantitative performance than both the supervised pix2pix model and the unsupervised CycleGAN model. Comprehensive ablation experiments are conducted to study the contribution of each component.
Gradient Descent Direction Random Walk MIMO Detection Using Intermediate Search Point
Naoki ITO Yukitoshi SANADA

PAPER-Wireless Communication Technologies

Pubricized:
2023/07/24
Vol:
E106-B No:11
Page(s):
1192-1199
In this paper, multi-input multi-output (MIMO) signal detection with random walk along a gradient descent direction using an intermediate search point is presented. As a low complexity MIMO signal detection schemes, a gradient descent algorithm with Metropolis-Hastings (MH) methods has been proposed. Random walk along a gradient descent direction speeds up the MH based search using the gradient of a least-squares cost function. However, the gradient vector may be discarded through QAM constellation quantization in some cases. For further performance improvement, this paper proposes an improved search scheme in which the gradient vector is stored for the next search iteration to generate an intermediate search point. The performance of the proposed scheme improves with higher order modulation symbols as compared with that of a conventional scheme. Numerical results obtained through computer simulation show that a bit error rate (BER) performance improves by 5dB at a BER of 10-3 for 64QAM symbols in a 16×16 MIMO system.
Digital Rights Management System of Media Convergence Center Based on Ethereum and IPFS
Runde YU Zhuowen LI Zhe CHEN Gangyi DING

PAPER-Multimedia Pattern Processing

Pubricized:
2023/05/02
Vol:
E106-D No:8
Page(s):
1275-1282
In order to solve the problems of copyrights infringement, high cost and complex process of rights protection in current media convergence center, a digital rights management system based on blockchain technology and IPFS (Inter Planetary File System) technology is proposed. Considering that large files such as video and audio cannot be stored on the blockchain directly, IPFS technology is adopted as the data expansion scheme for the data storage layer of the Ethereum platform, IPFS protocol is further used for distributed data storage and transmission of media content. In addition, smart contract is also used to uniquely identify digital rights through NFT (Non-fungible Tokens), which provides the characteristics of digital rights transferability and traceability, and realizes an open, transparent, tamper-proof and traceable digital rights management system for media convergence center. Several experimental results show that it has higher transaction success rate, lower storage consumption and transaction confirmation delay than existing scheme.
How Many Tweets Describe the Topics on TV Programs: An Investigation on the Relation between Twitter and Mass Media
Jun IIO

PAPER

Pubricized:
2022/11/11
Vol:
E106-D No:4
Page(s):
443-449
As the Internet has become prevalent, the popularity of net media has been growing, to a point that it has taken over conventional mass media. However, TWtrends, the Twitter trends visualization system operated by our research team since 2019, indicates that many topics on TV programs frequently appear on Twitter trendlines. This study investigates the relationship between Twitter and TV programs by collecting information on Twitter trends and TV programs simultaneously. Although this study provides a rough estimation of the volume of tweets that mention TV programs, the results show that several tweets mention TV programs at a constant rate, which tends to increase on the weekend. This tendency of TV-related tweets stems from the audience rating survey results. Considering the study outcome, and the fact that many TV programs introduce topics popular in social media, implies codependency between Internet media (social media) and mass media.
AlGaN/GaN HEMT on 3C-SiC/Low-Resistivity Si Substrate for Microwave Applications Open Access
Akio WAKEJIMA Arijit BOSE Debaleen BISWAS Shigeomi HISHIKI Sumito OUCHI Koichi KITAHARA Keisuke KAWAMURA

INVITED PAPER

Pubricized:
2022/04/21
Vol:
E105-C No:10
Page(s):
457-465
A detailed investigation of DC and RF performance of AlGaN/GaN HEMT on 3C-SiC/low resistive silicon (LR-Si) substrate by introducing a thick GaN layer is reported in this paper. The hetero-epitaxial growth is achieved by metal organic chemical vapor deposition (MOCVD) on a commercially prepared 6-inch LR-Si substrate via a 3C-SiC intermediate layer. The reported HEMT exhibited very low RF loss and thermally stable amplifier characteristics with the introduction of a thick GaN layer. The temperature-dependent small-signal and large-signal characteristics verified the effectiveness of the thick GaN layer on LR-Si, especially in reduction of RF loss even at high temperatures. In summary, a high potential of the reported device is confirmed for microwave applications.
Modeling Polarization Caused by Empathetic and Repulsive Reaction in Online Social Network
Naoki HIRAKURA Masaki AIDA Konosuke KAWASHIMA

PAPER-Multimedia Systems for Communications

Pubricized:
2022/02/16
Vol:
E105-B No:8
Page(s):
990-1001
While social media is now used by many people and plays a role in distributing information, it has recently created an unexpected problem: the actual shrinkage of information sources. This is mainly due to the ease of connecting people with similar opinions and the recommendation system. Biased information distribution promotes polarization that divides people into multiple groups with opposing views. Also, people may receive only the seemingly positive information that they prefer, or may trigger them into holding onto their opinions more strongly when they encounter opposing views. This, combined with the characteristics of social media, is accelerating the polarization of opinions and eventually social division. In this paper, we propose a model of opinion formation on social media to simulate polarization. While based on the idea that opinion neutrality is only relative, this model provides new techniques for dealing with polarization.
CMOS Image Sensor with Pixel-Parallel ADC and HDR Reconstruction from Intermediate Exposure Images Open Access
Shinnosuke KURATA Toshinori OTAKA Yusuke KAMEDA Takayuki HAMAMOTO

LETTER-Image

Pubricized:
2021/07/26
Vol:
E105-A No:1
Page(s):
82-86
We propose a HDR (high dynamic range) reconstruction method in an image sensor with a pixel-parallel ADC (analog-to-digital converter) for non-destructively reading out the intermediate exposure image. We report the circuit design for such an image sensor and the evaluation of the basic HDR reconstruction method.
Image Based Coding of Spatial Probability Distribution on Human Dynamics Data
Hideaki KIMATA Xiaojun WU Ryuichi TANIDA

PAPER

Pubricized:
2021/06/24
Vol:
E104-D No:10
Page(s):
1545-1554
The need for real-time use of human dynamics data is increasing. The technical requirements for this include improved databases for handling a large amount of data as well as highly accurate sensing of people's movements. A bitmap index format has been proposed for high-speed processing of data that spreads in a two-dimensional space. Using the same format is expected to provide a service that searches queries, reads out desired data, visualizes it, and analyzes it. In this study, we propose a coding format that enables human dynamics data to compress it in the target data size, in order to save data storage for successive increase of real-time human dynamics data. In the proposed method, the spatial population distribution, which is expressed by a probability distribution, is approximated and compressed using the one-pixel one-byte data format normally used for image coding. We utilize two kinds of approximation, which are accuracy of probability and precision of spatial location, in order to control the data size and the amount of information. For accuracy of probability, we propose a non-linear mapping method for the spatial distribution, and for precision of spatial location, we propose spatial scalable layered coding to refine the mesh level of the spatial distribution. Also, in order to enable additional detailed analysis, we propose another scalable layered coding that improves the accuracy of the distribution. We demonstrate through experiments that the proposed data approximation and coding format achieve sufficient approximation of spatial population distribution in the given condition of target data size.
Mutual Information Approximation Based Polar Code Design for 4Tb/in² 2D-ISI Channels
Lingjun KONG Haiyang LIU Jin TIAN Shunwai ZHANG Shengmei ZHAO Yi FANG

LETTER-Coding Theory

Pubricized:
2021/02/16
Vol:
E104-A No:8
Page(s):
1075-1079
In this letter, a method for the construction of polar codes based on the mutual information approximation (MIA) is proposed for the 4Tb/in2 two-dimensional inter-symbol interference (2D-ISI) channels, such as the bit-patterned magnetic recording (BPMR) and two-dimensional magnetic recording (TDMR). The basic idea is to exploit the MIA between the input and output of a 2D detector to establish a log-likelihood ratio (LLR) distribution model based on the MIA results, which compensates the gap caused by the 2D ISI channel. Consequently, the polar codes obtained by the optimization techniques previously developed for the additive white Gaussian noise (AWGN) channels can also have satisfactory performances over 2D-ISI channels. Simulated results show that the proposed polar codes can outperform the polar codes constructed by the traditional methods over 4Tb/in2 2D-ISI channels.
Generation and Detection of Media Clones Open Access
Isao ECHIZEN Noboru BABAGUCHI Junichi YAMAGISHI Naoko NITTA Yuta NAKASHIMA Kazuaki NAKAMURA Kazuhiro KONO Fuming FANG Seiko MYOJIN Zhenzhong KUANG Huy H. NGUYEN Ngoc-Dung T. TIEU

INVITED PAPER

Pubricized:
2020/10/19
Vol:
E104-D No:1
Page(s):
12-23
With the spread of high-performance sensors and social network services (SNS) and the remarkable advances in machine learning technologies, fake media such as fake videos, spoofed voices, and fake reviews that are generated using high-quality learning data and are very close to the real thing are causing serious social problems. We launched a research project, the Media Clone (MC) project, to protect receivers of replicas of real media called media clones (MCs) skillfully fabricated by means of media processing technologies. Our aim is to achieve a communication system that can defend against MC attacks and help ensure safe and reliable communication. This paper describes the results of research in two of the five themes in the MC project: 1) verification of the capability of generating various types of media clones such as audio, visual, and text derived from fake information and 2) realization of a protection shield for media clones' attacks by recognizing them.
Preventing Fake Information Generation Against Media Clone Attacks Open Access
Noboru BABAGUCHI Isao ECHIZEN Junichi YAMAGISHI Naoko NITTA Yuta NAKASHIMA Kazuaki NAKAMURA Kazuhiro KONO Fuming FANG Seiko MYOJIN Zhenzhong KUANG Huy H. NGUYEN Ngoc-Dung T. TIEU

INVITED PAPER

Pubricized:
2020/10/19
Vol:
E104-D No:1
Page(s):
2-11
Fake media has been spreading due to remarkable advances in media processing and machine leaning technologies, causing serious problems in society. We are conducting a research project called Media Clone aimed at developing methods for protecting people from fake but skillfully fabricated replicas of real media called media clones. Such media can be created from fake information about a specific person. Our goal is to develop a trusted communication system that can defend against attacks of media clones. This paper describes some research results of the Media Clone project, in particular, various methods for protecting personal information against generating fake information. We focus on 1) fake information generation in the physical world, 2) anonymization and abstraction in the cyber world, and 3) modeling of media clone attacks.
Analysis of Rescue Request and Damage Report Tweets Posted during 2019 Typhoon Hagibis Open Access
Keisuke UTSU Osamu UCHIDA

LETTER-Human Communications

Pubricized:
2020/05/20
Vol:
E103-A No:11
Page(s):
1319-1323
The 2019 Typhoon Hagibis (No. 19) caused widespread destruction in eastern Japan. During the disaster, many tweets including rescue request hashtags such as #救助 (meaning #Rescue) and #救助要請 (meaning #Rescue_request) were posted on Twitter. An official disaster information account of the Nagano Prefectural Government asked the public to provide information in the form of damage reports and rescue requests using the hashtag #台風19号長野県被害 (#Typhoon_No.19_Nagano_Prefecture_damage). As a result, many tweets were posted using this hashtag. Moreover, the account contacted the posters of tweets requesting rescue and delivered the information to the Fire Department. In this study, we analyze the circumstances of the above tweets.
An MMT-Based Hierarchical Transmission Module for 4K/120fps Temporally Scalable Video
Yasuhiro MOCHIDA Takayuki NAKACHI Takahiro YAMAGUCHI

PAPER

Pubricized:
2020/06/22
Vol:
E103-D No:10
Page(s):
2059-2066
High frame rate (HFR) video is attracting strong interest since it is considered as a next step toward providing Ultra-High Definition video service. For instance, the Association of Radio Industries and Businesses (ARIB) standard, the latest broadcasting standard in Japan, defines a 120 fps broadcasting format. The standard stipulates temporally scalable coding and hierarchical transmission by MPEG Media Transport (MMT), in which the base layer and the enhancement layer are transmitted over different paths for flexible distribution. We have developed the first ever MMT transmitter/receiver module for 4K/120fps temporally scalable video. The module is equipped with a newly proposed encapsulation method of temporally scalable bitstreams with correct boundaries. It is also designed to be tolerant to severe network constraints, including packet loss, arrival timing offset, and delay jitter. We conducted a hierarchical transmission experiment for 4K/120fps temporally scalable video. The experiment demonstrated that the MMT module was successfully fabricated and capable of dealing with severe network constraints. Consequently, the module has excellent potential as a means to support HFR video distribution in various network situations.
Machine Learning-Based Approach for Depression Detection in Twitter Using Content and Activity Features
Hatoon S. ALSAGRI Mourad YKHLEF

PAPER-Data Engineering, Web Information Systems

Pubricized:
2020/04/24
Vol:
E103-D No:8
Page(s):
1825-1832
Social media channels, such as Facebook, Twitter, and Instagram, have altered our world forever. People are now increasingly connected than ever and reveal a sort of digital persona. Although social media certainly has several remarkable features, the demerits are undeniable as well. Recent studies have indicated a correlation between high usage of social media sites and increased depression. The present study aims to exploit machine learning techniques for detecting a probable depressed Twitter user based on both, his/her network behavior and tweets. For this purpose, we trained and tested classifiers to distinguish whether a user is depressed or not using features extracted from his/her activities in the network and tweets. The results showed that the more features are used, the higher are the accuracy and F-measure scores in detecting depressed users. This method is a data-driven, predictive approach for early detection of depression or other mental illnesses. This study's main contribution is the exploration part of the features and its impact on detecting the depression level.
Simultaneous Estimation of Object Region and Depth in Participating Media Using a ToF Camera
Yuki FUJIMURA Motoharu SONOGASHIRA Masaaki IIYAMA

PAPER-Image Recognition, Computer Vision

Pubricized:
2019/12/03
Vol:
E103-D No:3
Page(s):
660-673
Three-dimensional (3D) reconstruction and scene depth estimation from 2-dimensional (2D) images are major tasks in computer vision. However, using conventional 3D reconstruction techniques gets challenging in participating media such as murky water, fog, or smoke. We have developed a method that uses a continuous-wave time-of-flight (ToF) camera to estimate an object region and depth in participating media simultaneously. The scattered light observed by the camera is saturated, so it does not depend on the scene depth. In addition, received signals bouncing off distant points are negligible due to light attenuation, and thus the observation of such a point contains only a scattering component. These phenomena enable us to estimate the scattering component in an object region from a background that only contains the scattering component. The problem is formulated as robust estimation where the object region is regarded as outliers, and it enables the simultaneous estimation of an object region and depth on the basis of an iteratively reweighted least squares (IRLS) optimization scheme. We demonstrate the effectiveness of the proposed method using captured images from a ToF camera in real foggy scenes and evaluate the applicability with synthesized data.
Trust, Perceived Useful, Attitude and Continuance Intention to Use E-Government Service: An Empirical Study in Taiwan
Hau-Dong TSUI

PAPER-Office Information Systems, e-Business Modeling

Pubricized:
2019/09/24
Vol:
E102-D No:12
Page(s):
2524-2534
According to the official TDOAS 2009~2017 survey, the penetration rate of social media in Taiwan has reached a record 96.8%, while the Internet access rate is as high as 99.7%. However, people using government online services access to relevant information has continued to decline over the years, from 50.8% in 2009 to 35.4% in 2017. At the same time, the proportion of e-transaction users has also dropped simultaneously from 30.3% to 27.7%. In particular, only 1.1% of them are interested in government online forums, while the remaining 97.2% are more willing to engage in social media as a source of personal reference. The study aims to explore why are users not interested in accessing e-government services? Are they affected by the popularity of social networking applications? What are the key factors for users to continue to use e-government service? The research framework was adapted from expectation confirmation theory and model (ECT/ECM), technology acceptance model (TAM) with trust theories, in validating attitude measures for a better understanding of continuance intention of using e-government service. In terms of measurement, the assessment used the structural equation modeling method (SEM) to explore the views and preferences of 400 college students on e-government service. The study results identified that perceived usefulness not only plays a full mediating role, it is expected to be the most important ex-post factor influencing user's intention to continue using e-government service. It also clarifies that the intent to continue to use e-government services is not related to use any alternative means such as social media application.
Emergence of an Onion-Like Network in Surface Growth and Its Strong Robustness
Yukio HAYASHI Yuki TANAKA

LETTER-Graphs and Networks

Vol:
E102-A No:10
Page(s):
1393-1396
We numerically investigate that optimal robust onion-like networks can emerge even with the constraint of surface growth in supposing a spatially embedded transportation or communication system. To be onion-like, moderately long links are necessary in the attachment through intermediations inspired from a social organization theory.
Multimodal Interface for Drawing Diagrams that Does not Interfere with Natural Talking and Drawing
Xingya XU Hirohito SHIBATA

PAPER-Electronic Displays

Vol:
E102-C No:5
Page(s):
408-415
The aim of this research is to support real-time drawingin talking by using multimodal user interface technologies. In this situation, if talking and drawing are considered as commands by mistake during presentation, it will disturb users' natural talking and drawing. To prevent this problem, we introduce two modes of a command mode and a free mode, and explore smooth mode switching techniques that does not interfere with users' natural talking and drawing. We evaluate four techniques. Among them, a technique that specifies the command mode after actions using a pen gesture was the most effective. In this technique, users could quickly draw diagrams, and specifying mode switching didn't interfere with users' natural talk.

1-20hit(541hit)

Keyword Search Result

[Keyword] media(541hit)

Improving Sliced Wasserstein Distance with Geometric Median for Knowledge Distillation Open Access

Federated Deep Reinforcement Learning for Multimedia Task Offloading and Resource Allocation in MEC Networks Open Access

Hierarchical Detailed Intermediate Supervision for Image-to-Image Translation

Gradient Descent Direction Random Walk MIMO Detection Using Intermediate Search Point

Digital Rights Management System of Media Convergence Center Based on Ethereum and IPFS

How Many Tweets Describe the Topics on TV Programs: An Investigation on the Relation between Twitter and Mass Media

AlGaN/GaN HEMT on 3C-SiC/Low-Resistivity Si Substrate for Microwave Applications Open Access

Modeling Polarization Caused by Empathetic and Repulsive Reaction in Online Social Network

CMOS Image Sensor with Pixel-Parallel ADC and HDR Reconstruction from Intermediate Exposure Images Open Access

Image Based Coding of Spatial Probability Distribution on Human Dynamics Data

Mutual Information Approximation Based Polar Code Design for 4Tb/in² 2D-ISI Channels

Generation and Detection of Media Clones Open Access

Preventing Fake Information Generation Against Media Clone Attacks Open Access

Analysis of Rescue Request and Damage Report Tweets Posted during 2019 Typhoon Hagibis Open Access

An MMT-Based Hierarchical Transmission Module for 4K/120fps Temporally Scalable Video

Machine Learning-Based Approach for Depression Detection in Twitter Using Content and Activity Features

Simultaneous Estimation of Object Region and Depth in Participating Media Using a ToF Camera

Trust, Perceived Useful, Attitude and Continuance Intention to Use E-Government Service: An Empirical Study in Taiwan

Emergence of an Onion-Like Network in Surface Growth and Its Strong Robustness

Multimodal Interface for Drawing Diagrams that Does not Interfere with Natural Talking and Drawing

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles