Keyword Search Result

[Keyword] form (3161 hits)

1-20 of 3161 hits

  • TDEM: Table Data Extraction Model Based on Cell Segmentation Open Access

    Zhe WANG  Zhe-Ming LU  Hao LUO  Yang-Ming ZHENG  

     
    LETTER-Artificial Intelligence, Data Mining

      Publicized:
    2024/05/30
      Vol:
    E107-D No:10
      Page(s):
    1376-1379

    To accurately extract tabular data, we propose a novel cell-based tabular data extraction model (TDEM). The key idea of TDEM is to use the grayscale projection of row separation lines, together with table masks and column masks generated by the VGG-19 neural network, to segment each individual cell from the input table image. The text content is then extracted cell by cell, which greatly improves the accuracy of table recognition.
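
The row-separation step described above can be illustrated with a small sketch. This is not the authors' implementation: it assumes dark ruling lines on a light background, ignores the VGG-19 table and column masks entirely, and the function names and the threshold value are hypothetical.

```python
def row_projection(image):
    """Return the mean gray level of each pixel row (horizontal projection)."""
    return [sum(row) / len(row) for row in image]


def find_separator_rows(image, dark_threshold=64):
    """Return indices of rows whose mean gray level is below dark_threshold.

    Assumption: ruling lines are dark and the background is light. The
    threshold is a hypothetical value chosen for this toy example.
    """
    profile = row_projection(image)
    return [y for y, mean in enumerate(profile) if mean < dark_threshold]


def split_rows(image, separators):
    """Return (start, end) pixel-row ranges lying between separator rows."""
    separator_set = set(separators)
    bands, start = [], None
    for y in range(len(image)):
        if y in separator_set:
            if start is not None:
                bands.append((start, y))  # end index is exclusive
                start = None
        elif start is None:
            start = y
    if start is not None:
        bands.append((start, len(image)))
    return bands


# Toy 7x4 "table": rows 0, 3 and 6 are dark ruling lines, the rest is light.
LINE, BLANK = [0, 0, 0, 0], [255, 255, 255, 255]
toy = [LINE, BLANK, BLANK, LINE, BLANK, BLANK, LINE]
separators = find_separator_rows(toy)
bands = split_rows(toy, separators)
print(separators, bands)
```

In a fuller pipeline, each row band would presumably be intersected with the column mask to isolate single cells before text recognition.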

  • MISpeller: Multimodal Information Enhancement for Chinese Spelling Correction Open Access

    Jiakai LI  Jianyong DUAN  Hao WANG  Li HE  Qing ZHANG  

     
    PAPER-Natural Language Processing

      Publicized:
    2024/06/07
      Vol:
    E107-D No:10
      Page(s):
    1342-1352

    Chinese spelling correction is a foundational task in natural language processing that aims to detect and correct spelling errors in text. Most Chinese spelling correction methods use multimodal information to model the relationship between incorrect and correct characters. However, because the features come from different sources, feature information is mismatched during fusion and the relative importance of the different modalities is ignored, which in turn prevents the model from learning efficiently. To this end, this paper proposes a multimodal language model-based Chinese spelling corrector named MISpeller. The method, which uses ChineseBERT as its base model, captures and fuses character semantic, phonetic, and graphic information comprehensively in a single model without constructing additional neural networks, and achieves unequal fusion of multi-feature information. In addition, to address overcorrection, a replication mechanism is introduced, and the replication factor is used as a dynamic weight to fuse the multimodal information efficiently. The model can control the proportion of original and predicted characters according to the input text, and it can learn more specifically where errors occur. Experiments on the SIGHAN benchmark show that the proposed model achieves state-of-the-art performance, improving the correction-level F1 score by an average of 4.36%, which validates the effectiveness of the model.

  • Advancements in Terahertz Communication: Harnessing the 300 GHz Band for High-Efficiency, High-Capacity Wireless Networks Open Access

    Minoru FUJISHIMA  

     
    INVITED PAPER

      Publicized:
    2024/03/08
      Vol:
    E107-C No:10
      Page(s):
    366-375

    In this paper, we delve into wireless communications in the 300 GHz band, focusing in particular on the continuous bandwidth of 44 GHz from 252 GHz to 296 GHz, positioning it as a pivotal element in the trajectory toward 6G communications. While terahertz communications have traditionally been praised for the high speeds they can achieve using their wide bandwidth, focusing the beam has also shown the potential to achieve high energy efficiency and support numerous simultaneous connections. To this end, new performance metrics, EIRPλ and EINFλ, are introduced as important benchmarks for transmitter and receiver performance, and their consistency is discussed. We then show that, assuming conventional bandwidth and communication capacity, the communication distance is independent of carrier frequency. Located between radio waves and light in the electromagnetic spectrum, terahertz waves promise to usher in a new era of wireless communications characterized not only by high-speed communication, but also by convenience and efficiency. Improvements in antenna gain, beam focusing, and precise beam steering are essential to its realization. As these technologies advance, the paradigm of wireless communications is expected to be transformed. The synergistic effects of antenna gain enhancement, beam focusing, and steering will not only push high-speed communications to unprecedented levels, but also lay the foundation for a wireless communications landscape defined by unparalleled convenience and efficiency. This paper will discuss a future in which terahertz communications will reshape the contours of wireless communications as the realization of such technological breakthroughs draws near.

  • Japanese Institutionalization and Global Standardization of Wireless Power Transmission, and Recently R&D Trend in Japan Open Access

    Takuya FUJIMOTO  

     
    INVITED PAPER

      Publicized:
    2024/04/23
      Vol:
    E107-C No:10
      Page(s):
    299-306

    In Japan, research on spatial transmission Wireless Power Transfer/Transmission (WPT) for long-distance power transmission has been conducted ahead of the rest of the world; however, until 2022 there was no category for it under the Radio Law, and it was treated as an experimental station. The authors are working on Japanese institutionalization (revision of ministerial ordinances) and global standardization of spatial transmission WPT for social implementation. This paper describes the Japanese and international institutionalization and standardization trends. In addition, as the latest R&D trend and the next step of institutionalization, the author introduces two national projects being carried out by industry, academia, and government toward Step 2, which relaxes the scope of use and the restrictions of Step 1 so that the technology can be used in a wider range of applications. The first is the Cross-ministerial Strategic Innovation Promotion Program (SIP) Phase 2. In SIP Phase 2, we conducted R&D on “WPT system for sensor networks and mobile devices”. This R&D covers detecting and avoiding people so that radio exposure does not exceed protection guidelines, and detecting incumbent radios and avoiding harmful interference, so that more power can be transmitted under coexistence conditions. The other is “Research and Development for Expansion of Radio Resources”, to be conducted by the Ministry of Internal Affairs and Communications (MIC), which is scheduled for four years from FY2022. Along with the results of the SIP mentioned above, this is a more concrete research and development project for Step 2 institutionalization.

  • Throughput Maximization-Based AP Clustering Methods in Downlink Cell-Free MIMO Under Partial CSI Condition Open Access

    Daisuke ISHII  Takanori HARA  Kenichi HIGUCHI  

     
    PAPER-Wireless Communication Technologies

      Vol:
    E107-B No:10
      Page(s):
    653-660

    In this paper, we investigate a method for clustering user equipment (UE)-specific transmission access points (APs) in downlink cell-free multiple-input multiple-output (MIMO) assuming that the APs distributed over the system coverage know only part of the instantaneous channel state information (CSI). As a beamforming (BF) method based on partial CSI, we use a layered partially non-orthogonal zero-forcing (ZF) method based on channel matrix muting, which is applicable to the case where different transmitting AP groups are selected for each UE under partial CSI conditions. We propose two AP clustering methods. Both proposed methods first tentatively determine the transmitting APs independently for each UE and then iteratively update the transmitting APs for each UE based on the estimated throughput considering the interference among the UEs. One of the two proposed methods introduces a UE cluster for each UE into the iterative updates of the transmitting APs to balance throughput performance and scalability. Computer simulations show that the proposed methods achieve higher geometric-mean and worst user throughput than those for the conventional methods.

  • SLNR-Based Joint Precoding for RIS Aided Beamspace HAP-NOMA Systems Open Access

    Pingping JI  Lingge JIANG  Chen HE  Di HE  Zhuxian LIAN  

     
    PAPER-Antennas and Propagation

      Vol:
    E107-B No:10
      Page(s):
    645-652

    High altitude platforms (HAPs), whose communications are dominated by line-of-sight links, effectively enhance the spectral efficiency of wireless networks. However, the line-of-sight links, particularly in urban areas, may be severely degraded by the complex communication environment. A reconfigurable intelligent surface (RIS) is employed to establish a cascaded link and improve the quality of communication service by smartly reflecting the signals received from the HAP to users without a direct link. Motivated by this, a joint precoding scheme for a novel RIS-aided beamspace HAP with non-orthogonal multiple access (HAP-NOMA) system is investigated to maximize the minimum user signal-to-leakage-plus-noise ratio (SLNR) with user fairness taken into account. Specifically, the SLNR is used as the metric for designing the joint precoding algorithm with lower complexity, because decoupling the precoding design from the power allocation allows the two parts to be obtained iteratively. To deal with the formulated non-convex problem, we first derive a statistical upper bound on the SLNR based on random matrix theory for large-scale antenna arrays. Then, closed-form expressions of the power matrix and the passive precoding matrix are given by introducing auxiliary variables based on the derived upper bound on the SLNR. The proposed joint precoding depends only on statistical channel state information (SCSI) instead of instantaneous channel state information (ICSI). NOMA serves multiple users simultaneously in the same group to compensate for the loss of spectral efficiency resulting from the beamspace HAP. Numerical results show the effectiveness of the derived statistical upper bound on the SLNR and the performance enhancement of the proposed joint precoding algorithm.

  • Pool-Unet: A Novel Tongue Image Segmentation Method Based on Pool-Former and Multi-Task Mask Learning Open Access

    Xiangrun LI  Qiyu SHENG  Guangda ZHOU  Jialong WEI  Yanmin SHI  Zhen ZHAO  Yongwei LI  Xingfeng LI  Yang LIU  

     
    PAPER-Image

      Publicized:
    2024/05/29
      Vol:
    E107-A No:10
      Page(s):
    1609-1620

    Automated tongue segmentation plays a crucial role in computer-aided tongue diagnosis. The challenge lies in developing algorithms that achieve higher segmentation accuracy while using less memory and keeping inference fast. To address this issue, we propose a novel Pool-unet that integrates Pool-former and Multi-task mask learning for tongue image segmentation. First, we collected 756 tongue images taken in various shooting environments and from different angles, and accurately labeled the tongue under the guidance of a medical professional. Second, we propose the Pool-unet model, which combines a hierarchical Pool-former module with a U-shaped symmetric encoder-decoder with skip connections. It uses a patch expanding layer for up-sampling and a patch embedding layer for down-sampling to maintain spatial resolution, and captures global and local information effectively with fewer parameters and faster inference. Finally, a Multi-task mask learning strategy is designed, which improves the generalization and anti-interference ability of the model through Multi-task pre-training and self-supervised fine-tuning stages. Experimental results on the tongue dataset show that, compared to the state-of-the-art method (OET-NET), our method has 25% fewer model parameters, achieves 22% faster inference, and improves Mean Intersection Over Union (MIOU) and Mean Pixel Accuracy (MPA) by 0.91% and 0.55%, respectively.

  • Watermarking Method with Scaling Rate Estimation Using Pilot Signal Open Access

    Rinka KAWANO  Masaki KAWAMURA  

     
    PAPER-Information Network

      Publicized:
    2024/05/22
      Vol:
    E107-D No:9
      Page(s):
    1151-1160

    Watermarking methods require robustness against various attacks. Conventional watermarking methods use error-correcting codes or spread spectrum to correct watermarking errors. Errors can also be reduced by embedding the watermark into the frequency domain and by using SIFT feature points. If the type and strength of the attack can be estimated, the errors can be further reduced. There are several types of attacks, such as scaling, rotation, and cropping, and it is necessary to aim for robustness against all of them. Focusing on the scaling tolerance of watermarks, we propose a watermarking method using SIFT feature points and DFT, and introduce a pilot signal. The proposed method estimates the scaling rate using the pilot signal in the form of a grid. When a stego-image is scaled, the grid interval of the pilot signal also changes, and the scaling rate can be estimated from the amount of change. The accuracy of estimating the scaling rate by the proposed method was evaluated in terms of the relative error of the scaling rate. The results show that the proposed method could reduce errors in the watermark by using the estimated scaling rate.
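
The relationship between the grid interval and the scaling rate can be sketched in one dimension. This is only an illustration under simplifying assumptions, not the paper's method: the pilot is modeled as ideal unit impulses on a regular grid, no SIFT or DFT processing is involved, and all function names are hypothetical.

```python
def make_grid(length, interval):
    """Build a 1-D pilot signal with a unit impulse every `interval` samples."""
    return [1.0 if i % interval == 0 else 0.0 for i in range(length)]


def peak_positions(signal, threshold=0.5):
    """Return the sample indices where the signal reaches the threshold."""
    return [i for i, value in enumerate(signal) if value >= threshold]


def mean_interval(positions):
    """Return the average spacing between consecutive peak positions."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return sum(gaps) / len(gaps)


def estimate_scaling_rate(observed_signal, original_interval):
    """Scaling rate = observed grid interval / embedded grid interval."""
    observed = mean_interval(peak_positions(observed_signal))
    return observed / original_interval


def relative_error(estimate, true_value):
    """Relative error of an estimate with respect to the true value."""
    return abs(estimate - true_value) / true_value


original_interval = 8
# Simulate enlargement by 1.5: the grid interval grows from 8 to 12 samples.
scaled_pilot = make_grid(120, 12)
rate = estimate_scaling_rate(scaled_pilot, original_interval)
error = relative_error(rate, 1.5)
print(rate, error)
```

Once the rate is known, a detector could rescale the stego-image back to its original size before extracting the watermark.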

  • Characterization for a Generic Construction of Bent Functions and Its Consequences Open Access

    Yanjun LI  Jinjie GAO  Haibin KAN  Jie PENG  Lijing ZHENG  Changhui CHEN  

     
    LETTER-Cryptography and Information Security

      Publicized:
    2024/05/07
      Vol:
    E107-A No:9
      Page(s):
    1570-1574

    In this letter, we give a characterization for a generic construction of bent functions. This characterization enables us to obtain another efficient construction of bent functions and to give a positive answer to a problem on bent functions.

  • Method for Estimating Scatterer Information from the Response Waveform of a Backward Transient Scattering Field Using TD-SPT Open Access

    Keiji GOTO  Toru KAWANO  Munetoshi IWAKIRI  Tsubasa KAWAKAMI  Kazuki NAKAZAWA  

     
    PAPER-Electromagnetic Theory

      Publicized:
    2024/01/23
      Vol:
    E107-C No:8
      Page(s):
    210-222

    This paper proposes a scatterer information estimation method using numerical data for the response waveform of a backward transient scattering field for both E- and H-polarizations when a two-dimensional (2-D) coated metal cylinder is selected as a scatterer. It is assumed that a line source and an observation point are placed at different locations. The four types of scatterer information covered in this paper are the relative permittivity of a surrounding medium, the relative permittivity of a coating medium layer and its thickness, and the radius of a coated metal cylinder. Specifically, a time-domain saddle-point technique (TD-SPT) is used to derive scatterer information estimation formulae from the amplitude intensity ratios (AIRs) of adjacent backward transient scattering field components. The estimates are obtained by substituting the numerical data of the response waveforms of the backward transient scattering field components into the estimation formulae and performing iterative calculations. Furthermore, a minimum thickness of a coating medium layer for which the estimation method is valid is derived, and two kinds of applicable conditions for the estimation method are proposed. The effectiveness of the scatterer information estimation method is verified by comparing the estimates with the set values. The noise tolerance and convergence characteristics of the estimation method and the method of controlling the estimation accuracy are also discussed.

  • Sum Rate Maximization for Multiuser Full-Duplex Wireless Powered Communication Networks Open Access

    Keigo HIRASHIMA  Teruyuki MIYAJIMA  

     
    PAPER-Wireless Communication Technologies

      Vol:
    E107-B No:8
      Page(s):
    564-572

    In this paper, we consider an orthogonal frequency division multiple access (OFDMA)-based multiuser full-duplex wireless powered communication network (FD WPCN) system with beamforming (BF) at an energy transmitter (ET). The ET performs BF to efficiently transmit energy to multiple users while suppressing interference to an information receiver (IR). Multiple users operating in full-duplex mode harvest energy from the signals sent by the ET while simultaneously transmitting information to the IR using the harvested energy. We analytically demonstrate that the FD WPCN is superior to its half-duplex (HD) WPCN counterpart in the high-SNR regime. We propose a transmitter design method that maximizes the sum rate by determining the BF at the ET, power allocation at both the ET and users, and sub-band allocation. Simulation results show the effectiveness of the proposed method.

  • Waveguide Slot Array with Code-Division Multiplexing Function for Single RF Chain Digital Beamforming Open Access

    Narihiro NAKAMOTO  Kazunari KIHIRA  Toru FUKASAWA  Yoshio INASAWA  Naoki SHINOHARA  

     
    PAPER-Antennas and Propagation

      Vol:
    E107-B No:8
      Page(s):
    541-551

    This study presents a novel waveguide slot array with a code-division multiplexing function for single RF chain digital beamforming. The proposed antenna is comprised of a rectangular metallic waveguide’s bottom part and a multilayer printed circuit board (PCB) with the rectangular waveguide’s top wall and slot apertures. Multiple pairs of two symmetric longitudinal slots are etched on the metal surface of the PCB, and a PIN diode is mounted across each slot. The received signals of each slot pair are multiplexed in a code-division multiplexing fashion by switching the diodes’ bias according to the Walsh Hadamard code, and the original signals are then recovered through a despreading process in the digital domain for digital beamforming. A prototype antenna with eight slot pairs has been fabricated and tested for proof of concept. The measured results show the feasibility of the proposed antenna.
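
The multiplexing and despreading idea can be shown with a minimal numerical sketch. It assumes that each slot pair's received sample is a real number that stays constant over one code period and that the channel is noiseless; the hardware switching of the PIN diodes is not modeled, and the function names are hypothetical.

```python
def hadamard(n):
    """Build an n x n Walsh-Hadamard matrix by the Sylvester construction.

    n must be a power of two. Rows are mutually orthogonal +/-1 codes.
    """
    h = [[1]]
    while len(h) < n:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def multiplex(samples, codes):
    """Combine all channels into one chip stream (single RF chain).

    Chip t carries the sum of every channel sample weighted by its code chip.
    """
    n = len(codes)
    return [sum(codes[k][t] * samples[k] for k in range(n)) for t in range(n)]


def despread(chips, codes):
    """Recover each channel by correlating the chip stream with its code."""
    n = len(codes)
    return [sum(codes[k][t] * chips[t] for t in range(n)) / n for k in range(n)]


codes = hadamard(8)  # eight codes for eight slot pairs
samples = [0.5, -1.0, 2.0, 0.0, 1.5, -0.25, 3.0, -2.0]
chips = multiplex(samples, codes)
recovered = despread(chips, codes)
print(recovered)
```

Because the code rows are orthogonal, correlating with one code cancels every other channel, so the recovered values match the original samples and can then be weighted digitally to form beams.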

  • A Dual-Branch Algorithm for Semantic-Focused Face Super-Resolution Reconstruction Open Access

    Qi QI  Liuyi MENG  Ming XU  Bing BAI  

     
    LETTER-Image

      Publicized:
    2024/03/18
      Vol:
    E107-A No:8
      Page(s):
    1435-1439

    In face super-resolution reconstruction, the interference caused by the texture and color of the hair region on the details and contours of the face region can negatively affect the reconstruction results. This paper proposes a semantic-based, dual-branch face super-resolution algorithm to address the issue of varying reconstruction complexities and mutual interference among different pixel semantics in face images. The algorithm clusters pixel semantic data to create a hierarchical representation, distinguishing between facial pixel regions and hair pixel regions. Subsequently, independent image enhancement is applied to these distinct pixel regions to mitigate their interference, resulting in a vivid, super-resolution face image.

  • Video Reflection Removal by Modified EDVR and 3D Convolution Open Access

    Sota MORIYAMA  Koichi ICHIGE  Yuichi HORI  Masayuki TACHI  

     
    LETTER-Image

      Publicized:
    2023/12/11
      Vol:
    E107-A No:8
      Page(s):
    1430-1434

    In this paper, we propose a method for video reflection removal using a video restoration framework with enhanced deformable networks (EDVR). We examine the effect of each module in EDVR on video reflection removal and modify the models using 3D convolutions. The performance of each modified model is evaluated in terms of the RMSE between the structural similarity (SSIM) and the smoothed SSIM representing temporal consistency.

  • An Efficiency-Enhancing Wideband OFDM Dual-Function MIMO Radar-Communication System Design Open Access

    Yumeng ZHANG  

     
    LETTER-Communication Theory and Signals

      Publicized:
    2024/03/04
      Vol:
    E107-A No:8
      Page(s):
    1421-1424

    Integrated Sensing and Communication in the terahertz band (ISAC-THz) is considered one of the promising technologies for future 6G. However, in phase-shifter (PS)-based massive multiple-input multiple-output (MIMO) hybrid precoding systems, the ultra-large bandwidth of the terahertz band means that subcarrier channels at different frequencies have different equivalent spatial directions. Hybrid beamforming at the transmitter therefore causes a serious beam split problem. In this letter, we propose a dual-function radar communication (DFRC) precoding method that builds on the recently proposed delay-phase precoding structure for THz massive MIMO. By adding delay-phase components between the radio frequency chain and the frequency-independent PSs, the beam is aligned with the target physical direction over the entire bandwidth, reducing the loss caused by the beam split effect. Furthermore, we employ a hardware structure that uses true-time-delayers (TTDs) to realize frequency-dependent phase shifts. Theoretical analysis and simulation results show that the method can improve communication performance and, to a certain extent, make up for the performance loss caused by the dual-function trade-off between communication and radar.

  • Dynamic Hybrid Beamforming-Based HAP Massive MIMO with Statistical CSI Open Access

    Pingping JI  Lingge JIANG  Chen HE  Di HE  Zhuxian LIAN  

     
    LETTER-Communication Theory and Signals

      Publicized:
    2023/12/25
      Vol:
    E107-A No:8
      Page(s):
    1417-1420

    In this letter, we study dynamic antenna grouping and hybrid beamforming for high altitude platform (HAP) massive multiple-input multiple-output (MIMO) systems. We first exploit the fact that the ergodic sum rate depends only on statistical channel state information (SCSI) in the large-scale array regime, and then use it to perform the dynamic antenna grouping and design the RF beamformer. By applying the Gershgorin Circle Theorem, the dynamic antenna grouping is realized based on a novel statistical distance metric instead of the values of the instantaneous channels. The RF beamformer is designed from the singular value decomposition of the statistical correlation matrix of the obtained dynamic antenna group. With dynamic subarrays, each RF chain is linked to a dynamic antenna subset. The baseband beamformer is derived using zero forcing (ZF). Numerical results demonstrate the performance enhancement of our proposed dynamic hybrid precoding (DHP) algorithm.

  • New Constructions of Approximately Mutually Unbiased Bases by Character Sums over Galois Rings Open Access

    You GAO  Ming-Yue XIE  Gang WANG  Lin-Zhi SHEN  

     
    LETTER-Information Theory

      Publicized:
    2024/02/07
      Vol:
    E107-A No:8
      Page(s):
    1386-1390

    Mutually unbiased bases (MUBs) are widely used in quantum information processing and play an important role in quantum cryptography, quantum state tomography and communications. It is difficult to construct MUBs, and it remains unknown whether complete MUBs exist for any non-prime-power dimension. Therefore, researchers have proposed constructing approximately mutually unbiased bases (AMUBs) by weakening the inner product conditions. This paper constructs q AMUBs of ℂ^q, (q + 1) AMUBs of ℂ^(q-1) and q AMUBs of ℂ^(q-1) by using character sums over Galois rings and finite fields, where q is a power of a prime. The first construction, of q AMUBs of ℂ^q, is new and illustrates that K AMUBs of ℂ^K can be achieved. The second and third constructions in this paper include the partial results on AMUBs constructed by W. Wang et al. in [9].

  • Conflict Management Method Based on a New Belief Divergence in Evidence Theory Open Access

    Zhu YIN  Xiaojian MA  Hang WANG  

     
    PAPER-Office Information Systems, e-Business Modeling

      Publicized:
    2024/03/01
      Vol:
    E107-D No:7
      Page(s):
    857-868

    Highly conflicting evidence, which may lead to counter-intuitive results, is one of the challenges for information fusion in Dempster-Shafer evidence theory. To deal with this issue, evidence conflict is investigated based on belief divergence, which measures the discrepancy between pieces of evidence. In this paper, the pignistic probability transform belief χ² divergence, named the BBχ² divergence, is proposed. By introducing the pignistic probability transform, the proposed BBχ² divergence can accurately quantify the difference between pieces of evidence while taking multi-element sets into account. Compared with several existing belief divergences, the new divergence is more precise. Based on this divergence, a new multi-source information fusion method is devised. The proposed method considers both credibility weights and information volume weights to determine the overall weight of each piece of evidence. Finally, the proposed method is applied to target recognition and fault diagnosis, where comparative analysis indicates that it achieves the highest accuracy in managing evidence conflict.
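
The pignistic probability transform mentioned above has a standard definition: each focal set's mass is shared equally among its elements. The sketch below implements that transform and then compares two bodies of evidence with a symmetric χ²-style distance. The distance is an assumption chosen for illustration only; the abstract does not give the exact form of the proposed divergence, and the function names and example masses are hypothetical.

```python
def pignistic(mass):
    """Pignistic transform: BetP(x) = sum over focal sets A containing x
    of m(A) / |A|.

    `mass` maps frozensets of hypotheses to masses summing to 1.
    Assumption: no mass is assigned to the empty set.
    """
    betp = {}
    for focal_set, m in mass.items():
        share = m / len(focal_set)
        for element in focal_set:
            betp[element] = betp.get(element, 0.0) + share
    return betp


def chi2_distance(p, q):
    """Symmetric chi-squared-style distance: sum of (p - q)^2 / (p + q).

    Illustrative stand-in only, not the divergence defined in the paper.
    """
    total = 0.0
    for key in set(p) | set(q):
        a, b = p.get(key, 0.0), q.get(key, 0.0)
        if a + b > 0:
            total += (a - b) ** 2 / (a + b)
    return total


# Two hypothetical bodies of evidence over the hypotheses {A, B}.
m1 = {frozenset("A"): 0.6, frozenset("AB"): 0.4}
m2 = {frozenset("B"): 0.5, frozenset("AB"): 0.5}
p1 = pignistic(m1)  # the mass of {A, B} is split equally between A and B
p2 = pignistic(m2)
distance = chi2_distance(p1, p2)
print(p1, p2, distance)
```

Because the transform spreads the mass of multi-element sets over their members, two bodies of evidence that differ only in how they commit mass to such sets still yield comparable probability vectors.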

  • Power Peak Load Forecasting Based on Deep Time Series Analysis Method Open Access

    Ying-Chang HUNG  Duen-Ren LIU  

     
    PAPER-Artificial Intelligence, Data Mining

      Publicized:
    2024/03/21
      Vol:
    E107-D No:7
      Page(s):
    845-856

    The prediction of peak power load is a critical factor directly impacting the stability of power supply, characterized significantly by its time series nature and intricate ties to the seasonal patterns in electricity usage. Despite its crucial importance, the current landscape of power peak load forecasting remains a multifaceted challenge in the field. This study aims to contribute to this domain by proposing a method that leverages a combination of three primary models - the GRU model, self-attention mechanism, and Transformer mechanism - to forecast peak power load. To contextualize this research within the ongoing discourse, it’s essential to consider the evolving methodologies and advancements in power peak load forecasting. By delving into additional references addressing the complexities and current state of the power peak load forecasting problem, this study aims to build upon the existing knowledge base and offer insights into contemporary challenges and strategies adopted within the field. Data preprocessing in this study involves comprehensive cleaning, standardization, and the design of relevant functions to ensure robustness in the predictive modeling process. Additionally, recognizing the necessity to capture temporal changes effectively, this research incorporates features such as “Weekly Moving Average” and “Monthly Moving Average” into the dataset. To evaluate the proposed methodologies comprehensively, this study conducts comparative analyses with established models such as LSTM, Self-attention network, Transformer, ARIMA, and SVR. The outcomes reveal that the models proposed in this study exhibit superior predictive performance compared to these established models, showcasing their effectiveness in accurately forecasting electricity consumption. The significance of this research lies in two primary contributions. 
    Firstly, it introduces an innovative prediction method combining the GRU model, self-attention mechanism, and Transformer mechanism, aligning with the contemporary evolution of predictive modeling techniques in the field. Secondly, it introduces and emphasizes the utility of “Weekly Moving Average” and “Monthly Moving Average” methodologies, crucial in effectively capturing and interpreting seasonal variations within the dataset. By incorporating these features, this study enhances the model’s ability to account for seasonal influencing factors, thereby significantly improving the accuracy of peak power load forecasting. This contribution aligns with the ongoing efforts to refine forecasting methodologies and addresses the pertinent challenges within power peak load forecasting.
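
The moving-average features can be sketched as follows. The abstract does not specify the window lengths or the alignment, so this example assumes daily load values, trailing windows of 7 and 30 days, and a shortened window at the start of the series; the function and field names are hypothetical.

```python
def trailing_moving_average(values, window):
    """Return the mean of the last `window` values at each position.

    Early positions, where fewer than `window` values exist, average over
    whatever history is available (an assumption of this sketch).
    """
    averages = []
    running_sum = 0.0
    for i, value in enumerate(values):
        running_sum += value
        if i >= window:
            running_sum -= values[i - window]  # drop the value leaving the window
        averages.append(running_sum / min(i + 1, window))
    return averages


def add_moving_average_features(daily_load):
    """Attach weekly and monthly moving averages to each daily load value."""
    weekly = trailing_moving_average(daily_load, 7)
    monthly = trailing_moving_average(daily_load, 30)
    return [
        {"load": load, "weekly_ma": w, "monthly_ma": m}
        for load, w, m in zip(daily_load, weekly, monthly)
    ]


demo = trailing_moving_average([1, 2, 3, 4, 5], 3)
features = add_moving_average_features([100.0 + day for day in range(40)])
print(demo, features[-1])
```

Using trailing windows keeps each feature dependent only on past values, which matters when the features are fed to a forecasting model.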

  • VMD-Informer-DCC for Photovoltaic Power Prediction Open Access

    Yun WU  Xingyu PAN  Jieming YANG  

     
    PAPER-Fundamental Theories for Communications

      Vol:
    E107-B No:7
      Page(s):
    487-494

    Photovoltaic power is an important part of sustainable development. Accurate prediction of photovoltaic power can improve energy utilization and prevent resource waste. However, the volatility and uncertainty of photovoltaic power make power prediction difficult. Although Informer has achieved good prediction results in the field of time series prediction, it does not put forward a good solution for the volatility of series and the leakage of future information when stacking. Therefore, this paper proposes a photovoltaic power prediction model based on VMD-Informer-DCC. Firstly, Spearman’s feature selector was used to screen the sequence features. Then, the VMD layer was added to the encoder of Informer to decompose the feature sequence to reduce the volatility of the feature sequence. Finally, the dilated causal convolutional layer was used to replace the Self-attention distilling of Informer, which expanded the receptive field of Informer information extraction and ensured the causality of time series prediction. To verify the effectiveness of the model, this paper uses the dataset of a photovoltaic power plant in Jilin Province in 2021 to conduct a large number of experiments. The results show that the VMD-Informer-DCC model has high prediction accuracy and wide applicability.
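
The property that a dilated causal convolution provides, namely that each output depends only on current and past inputs, can be checked with a small pure-Python sketch. It covers a single channel and a single layer with zero padding on the left, and it is not the VMD-Informer-DCC model itself; the function name and the example kernel are hypothetical.

```python
def dilated_causal_conv1d(series, kernel, dilation=1):
    """Compute y[t] = sum_k kernel[k] * x[t - k * dilation].

    Inputs before the start of the series are treated as zero, so no output
    ever uses a future value.
    """
    output = []
    for t in range(len(series)):
        acc = 0.0
        for k, weight in enumerate(kernel):
            index = t - k * dilation
            if index >= 0:
                acc += weight * series[index]
        output.append(acc)
    return output


x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = dilated_causal_conv1d(x, kernel=[0.5, 0.5], dilation=2)

# Causality check: changing the last input must not alter earlier outputs.
x_changed = x[:-1] + [100.0]
y_changed = dilated_causal_conv1d(x_changed, kernel=[0.5, 0.5], dilation=2)
print(y, y_changed)
```

With a two-tap kernel and dilation 2, each output averages the current sample and the one two steps back; stacking layers with growing dilation widens the receptive field without looking ahead.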
